On an almost pure sieve by

(1)

LXVI.4 (1994)

On an almost pure sieve

by

C. Hooley (Cardiff)

Combinatorial sieve methods can be classified according to the extent they depart from the exact exclusion principle of Legendre. Such a proce- dure, for instance, was adopted in the introductory chapter of our tract [5], in which a discussion of the basic elimination process was followed succes- sively by descriptions of our simple asymptotic sieve, the first Brun sieve introduced in [1], and the second and more powerful sieve initiated by Brun in his seminal paper of 1920 [2]. But the more picturesque and elegant description of this classification in terms of purity or impurity of sieves is suggested to us by the name Brun’s Pure Sieve that Halberstam and Richert assign to the earlier Brun sieve in their monograph [3]. Thus the purer the sieve the more closely it approximates to the fundamental exclusion process, while the more powerful it is the impurer and more complicated it is likely to be; in particular, sieves of substantial impurity ap- plied to problems of difficulty normally yield only upper and lower positive bounds for what is sought. Naturally, under such an arrangement the so-called pure sieve of Brun is not actually completely unmixed because only the exclusion process of Legendre can have this status, albeit this qualification is merely a trifling matter of semantics that need not here detain us.

All versions or derivatives of Brun’s method normally used are consider- ably more complex than his pure sieve. It is therefore not a little surprising that it has not been previously noticed that the addition of only a minor in- gredient serves to convert the pure sieve into a much stronger instrument. So effective is the outcome that the resulting slightly impure sieve achieves upper and lower bounds of the same general power as those attained by much impurer versions of Brun’s method, while retaining a simplicity that enables it to be used in circumstances where other methods of comparable strength are unworkable. Such a situation, for example, occurs when the constituents in upper or lower sieving bounds P

d|n

%(d) for P

d|n

µ(d) happen to appear

in sums of the form

(2)

X

n≤x

ψ(n) X

d|n

%(d) 2

^ω(d)

₂

,

which possibly somewhat unexpectedly are also untreatable in terms of the elegant Selberg methods that fall outside our present classification of sieves.

Estimates of optimal orders of magnitude for sums of this general type being needed in our forthcoming work [6] on integers in small intervals that are sums of two squares, we devote the present paper to the definition of our almost pure sieve and to an explanation of its mechanism by an application to a problem of prime twins type; in so doing, we supply an easy demon- stration of the infinitude of pairs of integers differing by 2 that do not have more than fourteen prime factors—a result that is not markedly inferior to that first obtained by Brun in [2]. To the array of sieve methods we thus add a simple, versatile, and transparent tool that has much of the power enjoyed by more complicated procedures.

The almost pure sieve method used in our present application to almost prime twins has been deliberately described in such a way that only very minor modifications are needed for its indispensable operation in the above mentioned paper [6], in which the following new theorem is proved.

Let M (m, h) be the number of integers n in the interval m ≤ n < m + h that are expressible as the sum of two squares. Then, if h ≤ x and h/ √

log x

→ ∞ as x → ∞, we have A

₁

h

√ log x < M (m, h) < A

₂

h

√ log x

for all m not exceeding x save for at most o(x) exceptional values, where A

₁

, A

₂

denote positive absolute constants.

Essentially best possible as a result involving almost all values of x, this theorem improves upon previous work of Harman’s [4].

We follow the precedent set in [5] and describe the method in a general

context, since the applications are not necessarily always confined to the

divisibility properties of integers. Here, appropriately modifying our nota-

tion in [5] to suit the present occasion, we are presented with a finite set

A of N elements denoted generically by m, each of which may or may not

possess one or more of the properties α

1

, . . . , α

n

in a family S indexed by

1, . . . , n. This family is partitioned into h mutually exclusive sub-families

S

_j

of cardinality n

_j

in order that we may set up the present machinery for

estimating the number N

1,...,n

of elements in A that possess none of the

properties α

_i

. A typical choice (possibly vacuous) of s

_j

elements from the

indexing set of S

j

is then denoted by ι

j,s_j

so that a sub-set of 1, . . . , n of car-

dinality s can be expressed as ι

_s

= (ι

_1,s₁

, . . . , ι

_h,s_h

), where s = s

₁

+ . . . + s

_h

;

(3)

also N (ι

s

) = N (ι

1,s₁

, . . . , ι

h,s_h

) is to indicate the number of elements in A having all the properties indexed by ι

_s

.

Ere we delineate the present sieve it is helpful to digress momentarily by remarking that traditionally there have been two ways of approaching such a task. Some sieves lend themselves to an explanation via bounds for N

_1,...,n

in terms of the sums N (ι

_s

), while others are better elucidated by the bounds provided for the characteristic function f (m) of the non-excluded or sifted set. Formulae of the first type, however, are normally substantiated by considering their applicability to the special case N = 1 and hence by establishing results of the second type. Thus the difference in presentations is merely one of emphasis, corresponding in fact to a reversal of the order of summations in a double sum.

Although the more elementary sieve methods are usually expressed through the former method, the nature of our slightly impure sieve is more easily unveiled in terms of the characteristic function of the sifted set. This is bounded by means of the inequalities

(1) f (m) ≤ Y

1≤j≤h

X

sj≤rj

(−1)

^s^j

X

ι_j,sj∈m

1 and

f (m) ≥ Y

1≤j≤h

X

s_j≤r_j

(−1)

^s^j

X

ι_j,sj∈m

1 (2)

− X

1≤k≤h

X

ι_k,rk+1∈m

1 Y

j6=k

X

sj≤rj

(−1)

^s^j

X

ι_j,sj∈m

1 ,

in which r

₁

, . . . , r

_h

are suitably chosen non-negative even integers and in which the symbolism ι ∈ m means that m is to possess all the properties indexed by a sub-set ι of 1, . . . , n. The upper inequality has a right-hand side that is a product of expressions occurring in Brun’s pure sieve and therefore follows, for example, from equation (5A) in [5]. The lower inequality is per- force a little more complicated, since the total number of negative factors on the right of (1) might well be even when the numbers r

_j

were taken to be odd in accordance with the lower pure Brun sieve for each set of properties S

j

. To prove (2) we first note that its right-hand side is unchanged when the variables j, k of multiplication and summation are restricted to values l for which a positive number u

_l

of properties in S

_l

are possessed by m. Hence, also now using the Legendre formula, we see that (2) is certainly true either when m has none of the properties in S or when there is an exponent l such that 1 ≤ u

_l

≤ r

_l

. In the remaining case, if j

⁰

, k

⁰

denote indices for which u

_l

6= 0 and for which therefore

(3) u

_l

> r

_l

,

(4)

the substitution of j

⁰

, k

⁰

for j, k in the right of (2) does not affect its value and gives rise to a multiplicand and summand that are seen to be equal, respectively, to

X

s_j0≤r_j0

(−1)

^s^j0

u

_j⁰

s

_j⁰

=

u

_j⁰

− 1 r

_j⁰

and

u

_k⁰

r

k⁰

+ 1

Y

j⁰6=k⁰

u

_j⁰

− 1 r

j⁰

in virtue of the identity

(1 − y)

^u−1

= (1 − y)

^u

(1 − y)

⁻¹

= X

r

X

_r

s=0

(−1)

^s

u s

y

^r

. The right-hand side of (2) is thus

1 − X

k⁰

u

k⁰

r

_k⁰

+ 1

u

k⁰

− 1 r

_k⁰

Y

j⁰

u

j⁰

− 1 r

_j⁰

,

in which the first factor equals 1 − X

k⁰

u

k⁰

r

_k⁰

+ 1 ≤ 0 by (3); the proof of (2) is therefore complete.

Having concluded our discussion of the sieve in a general context, we illustrate its relevance to more familiar situations in the theory of numbers by considering its application to problems of twin primes type. We shall therefore be involved with the production of upper and lower bounds for the number π

2

(x, ξ) of positive integers m not exceeding x which are to have the feature that m(m + 2) is to be indivisible by any prime p not exceeding a certain limit ξ, concentrating almost entirely on the lower bound because only a small amount of the reasoning used for this is needed for the other. The primes p not exceeding ξ now describe the unwanted properties, wherefore the integers m not exceeding x with properties appertaining to p

1

, . . . , p

s

are just those for which m(m + 2) is divisible by d = p

1

. . . p

s

and are thus N (x, d) in number when the previous notation N (ι) is appropriately adapted. Consequently,

(4) N (x, d) = xν(d)

d + O(ν(d)) = xν(d)

d + O(d

^ε

) ,

where ν(d) is the multiplicative function of square-free numbers d defined by ν(p) = 1 if p = 2 and ν(p) = 2 otherwise.

Considerable latitude in the apportionment of the properties to sub-fam-

ilies is permissible provided that it be made within a framework having cer-

(5)

tain prescribed features. Yet some care should be taken in the specification of the numerical parameters defining the structure lest we stray too far from the theoretical limits of the method. First, having written ξ = x

^1/u

, we let

(5) η = log log x ,

which function tends to infinity so slowly as x → ∞ that π

₂

(x, ξ) can be directly evaluated asymptotically by Legendre’s principle whenever ξ ≤ η.

In the contrary situation ξ > η to which we may now confine ourselves, we bring in parameters a, a

₁

that are selected here to be 3.99, 4, respectively, and use the former to define the sequence ξ

0

, . . . , ξ

R

, ξ

R+1

by

(6) ξ

_j

=

 



ξ

^1/a^j

for j ≤ R − 1, η for j = R, 1 for j = R + 1,

where R is the least exponent j for which ξ

^1/a^j

≤ η. For j = 1, . . . , R + 1 the family S

j

of unwanted properties is then to appertain to the primes p

j

satisfying ξ

_j

< p

_j

≤ ξ

_j−1

, a typical square-free product (possibly empty) of which is to be denoted by d

_j

. The restricting agent r

_j

on the number s

_j

of properties in this family S

j

—in other words, the number ω(d

j

) of distinct prime factors of d

_j

—to be used in (2) is determined by those of the equations (7) r

₁

= r

₂

= r

₃

= 10; r

_j

= 14(j − 3) (j ≥ 4)

that appertain to subscripts not exceeding R, while r

_R+1

is formally taken to be ∞ in reflection of the fact that s

_R+1

is only circumscribed by the cardinality of S

R+1

.

Since, in the current circumstances, formula (2) becomes

f (m) ≥ Y

1≤j≤R+1

X

dj|m(m+2) ω(d_j)≤r_j

µ(d

_j

) (8)

− X

1≤k≤R+1

X

d_k|m(m+2) ω(d_k)=r_k+1

1 Y

j6=k

X

dj|m(m+2) ω(dj)≤rj

µ(d

j

)

in the notation described above, we have π

₂

(x, ξ) ≥ X

ω(dj)≤rj(j=1,...,R+1)

µ(d

₁

) . . . µ(d

_R+1

)N (x, d

₁

. . . d

_R+1

)

− X

1≤k≤R+1

X

ω(dj)≤rj(j6=k) ω(dk)=rk+1

µ(d

₁

) . . . µ(d

_R+1

)

µ(d

k

) N (x, d

₁

. . . d

_R+1

)

(6)

by substitution in

π

₂

(x, ξ) = X

m≤x

f (m)

and a change in the orders of summation in the resulting multiple sums.

Hence, recalling our convention about ω(d

_R+1

) and letting θ

_d₁_,...,d_R

denote the condition that there be R−1 indices i from 1, . . . , R for which ω(d

_i

) ≤ r

_i

and that the remaining one satisfy ω(d

i

) ≤ r

i

+ 1, we infer from (4) that (9) π

₂

(x, ξ)

≥ x X

ω(d_j)≤r_j(j=1,...,R+1)

µ(d

₁

) . . . µ(d

_R+1

)ν(d

₁

. . . d

_R+1

) d

₁

. . . d

_R+1

− x X

1≤k≤R

X

d1,...,dR+1

ω(d_j)≤r_j(j6=k) ω(d_k)=r_k+1

µ(d

₁

) . . . µ(d

_R+1

)ν(d

₁

, . . . , d

_R+1

) µ(d

_k

)d

₁

. . . d

_R+1

+ O

X

d1,...,dR+1

θ_d1,...,dR

(d

1

. . . d

R+1

)

^ε

= x Y

1≤j≤R+1

X

ω(dj)≤rj

µ(d

_j

)ν(d

_j

) d

j

− x X

1≤k≤R

X

ω(dk)=rk+1

ν(d

_k

) d

k

Y

j6=k

X

ω(dj)≤rj

µ(d

_j

)ν(d

_j

) d

j

+ O

X

d1,...,dR+1

θ_d1,...,dR

(d

1

. . . d

R+1

)

^ε

= x

1 − X

1≤k≤R

X

ω(d_k)=r_k+1

ν(d

k

) d

_k

X

ω(d_k)≤r_k

µ(d

k

)ν(d

k

) d

_k

× Y

1≤j≤R+1

X

ω(dj)≤rj

µ(d

_j

)ν(d

_j

) d

j

+ O

X

d₁,...,d_R+1 θ_d1,...,dR

(d

₁

. . . d

_R+1

)

^ε

= x

1 − X

1≤j≤R

X

(j) 2

. X

(j) 1

Y

1≤j≤R+1

X

(j)

1

+O X

3

, say,

since we shall see that P

_(j)

1

is certainly non-zero as soon as we begin to

develop this initial inequality.

(7)

Our exploitation of (9) is speeded by using the following lemma, whose proof is not unconnected with the principles of Brun’s pure sieve but whose existence is usually ignored in the treatment thereof.

Lemma. Let σ

_r

denote the r-th elementary symmetric function of numbers a

1

, . . . , a

n

lying between 0 and 1. Then

X

0≤r≤s

(−1)

^r

σ

_r

− Y

1≤i≤n

(1 − a

_n

)

is non-negative or non-positive according as s is even or odd.

Immediately, we have

(10) X

(j)

1

≥ Y

ξ_j<p≤ξ_j−1

1 − ν(p) p

> 0 , thus vindicating the assertion made after (9). Indeed, since

Y

p≤y

1 − ν(p) p

∼ c

log

²

y

1 + O

1 log y

, we even have

X

_(j)

1

≥ log

²

ξ

j

log

²

ξ

j−1

1 + O

1 log ξ

_j

(11)

≥ 1

a

²

+ o(1) ≥ 1

a

²₁

(j = 1, . . . , R) by (6). Also, in like manner,

X

ξ_j<p≤ξ_j−1

1 p = log log ξ

j−1

− log log ξ

j

+ O

1 log ξ

j

≤ log a + o(1) ≤ log a

1

(j = 1, . . . , R) so that, by (9) and the elementary inequality N ! ≥ (N/e)

^N

,

X

_(j)

2

≤ 1

(r

j

+ 1)!

2 X

ξ_j<p≤ξ_j−1

1 p

_r_j₊₁

(12)

≤

2e log a

1

r

_j

+ 1

_r_j₊₁

≤

2e log a

1

t

_j

_t_j

(j = 1, . . . , R) under the sufficient condition

(13) 2e log a

₁

< t

_j

< r

_j

+ 1 .

The indices r

_j

were chosen in (7) so that favourable estimates for the sums P

_(j)

2

are consistent with a satisfactory value of P

3

when u is a small

(8)

constant. To attend to the former sums, we set t

j

= 2eα

j

log a

1

in conformity with (13), where α

_j

> 1 is a suitably close lower approximation to the number (r

_j

+ 1)/2e log a

₁

that can be verified, for example, by the use of mathematical tables. Then, considering the values of r

j

in (7), we can display the appropriate values of α

_j

and lower bounds β

_j

for 2eα

_j

log α

_j

in the following table:

j = 1, 2, 3 j ≥ 4

α

_j

= 1.455 (1.85)(j − 3) β

j

= 2.95 6(j − 3) . Therefore, by (11) and (12),

X

(j) 2

. X

(j) 1

< a

²₁

1 α

_j

_2eα_j_{log a}₁

= 1

a

^2eα₁ ^j^{log α}^j⁻²

< 1

a

^β₁^j⁻²

(j = 1, . . . , R) , whence

1 − X

1≤j≤R

X

(j) 2

. X

(j) 1

> 1 − X

1≤j≤R

1 a

^β₁^j⁻²

(14)

> 1 − 3 4

^19/20

−

X

∞ k=1

1 4

^4k

= 1 − 3

4

^19/20

− 1 255 > 3

20 by a relatively crude calculation.

An examination of the conditions of summation in P

3

shews that the size of any number d

₁

. . . d

_R+1

corresponding to a contributing set d

₁

, . . . , d

_R+1

is circumscribed by x

^γ

η

^η

, where

γ = 1 u

1 + X

1≤j≤R

r

_j

a

^j−1

< 1 u

1 + 10 + 10 a + 10

a

²

+ 14 a

³

X

∞ k=0

k + 1 a

^k

= 1 u

1 + 10a

a − 1 + 14 a(a − 1)

²

< 74 5u . Hence

(15) X

3

= O

x

log

³

x

whenever u ≥ 74/5.

(9)

The first result attained by the method follows from (9), (10), (14), and (15), which give

(16) π

2

(x, ξ) > 3x 20

Y

p≤ξ

1 − ν(p) p

= 3x 40

Y

2<p≤ξ

1 − 2

p

when ξ ≤ x

^5/74

. In particular, we have established the existence of infinitely many pairs of natural numbers differing by 2 neither of which has more than fourteen prime factors.

A slightly shorter account would have been possible if we had been merely content to derive an inequality of type (16) for values of ξ not exceeding a smaller limit of the type x

^A¹

. On the other hand, some further fine tun- ing of the procedures might have resulted in some minor improvements, which, however, are not worth seeking through this avenue because stronger and more complicated methods are available for this particular purpose.

In like but much simpler manner, the formula (1) yields the upper bound O(x/ log

²

x) for the number of prime twins not exceeding x.

The development adopted above was chosen to suit the needs of a theorem about pseudo-prime twins. Yet, if we look for an improvement in (16) that is suitable for large values of u, then we must vary the previous process by selecting the parameters in terms of u. This is done easily by setting b to be the least even integer exceeding u/2 and then redefining r

_j

and α

_j

as r

j

= bj and α

j

= (r

j

+ 1)/(2e log a

1

), it being easily verified that (15) continues to hold. Since an examination of the consequential change to (14) reveals that now

1 − X

1≤j≤R

X

(j) 2

. X

(j) 1

> 1 − e

^−A²^{u log u}

,

we obtain the lower bound that is latent in the formula π

2

(x, x

^1/u

) = x

1 + O(e

^{−Au log u}

) + O

1 log x

Y

p≤x^1/u

1 − ν(p) p

, the upper bound aspect of which is obtainable via (1). Our method thus produces fundamental lemmata, keener versions of which have been derived by Halberstam, Richert [3], and others by the more usual forms of Brun’s method.

References

[1] V. B r u n, ¨Uber das Goldbachsche Gesetz und die Anzahl der Primzahlpaare, Arch.

Math. Naturvidenskab 34 (1915), no. 8.

[2] —, Le crible d’Erathosthène et le théorème de Goldbach, Videnskapsselskapets Skrifter Mat.-nat. Kl. Kristiania 1920, no. 3.

(10)

[3] H. H a l b e r s t a m and H. E. R i c h e r t, Sieve Methods, Academic Press, 1975.

[4] G. H a r m a n, Sums of two squares in short intervals, Proc. London Math. Soc. (3) 62 (1991), 225–241.

[5] C. H o o l e y, Applications of Sieve Methods to the Theory of Numbers, Cambridge Univ. Press, 1976.

[6] —, On the intervals between numbers that are sums of two squares: IV , J. Reine Angew. Math., to appear.

SCHOOL OF MATHEMATICS

UNIVERSITY OF WALES COLLEGE OF CARDIFF CARDIFF, U.K.

Received on 26.1.1993

and in revised form on 17.11.1993 (2369)

On an almost pure sieve by