XC.3 (1999)
Numbers representable by five prime squares with primes in an arithmetic progression
by
Yonghui Wang (Beijing)
1. Introduction. A classical result in additive number theory is the five prime squares theorem proved by L. K. Hua: the diophantine equation (1.1) N = p
21+ p
22+ p
23+ p
24+ p
25is solvable for large odd N satisfying N ≡ 5 (mod 24).
This theorem can be regarded as a nonlinear extension of the Goldbach ternary theorem (Goldbach–Vinogradov Theorem), it also gives a deep in- sight into the Lagrange four square theorem. In this paper we study the equation (1.1) with prime variables in an arithmetic progression, i.e. the prime variables satisfy p
i≡ b
i(mod d), i = 1, . . . , 5, and b = (b
1, . . . , b
5) ∈ B(N, d), where
(1.2) B(N, d)
= {b ∈ N
5: 1 ≤ b
i≤ d, (b
i, d) = 1, b
21+ . . . + b
25≡ N (mod σ(d)d)}, with σ(d) = 1, 4, 2 for 2 - d, 2 k d and 4 | d respectively. We will use this notation in the rest of the paper.
Our main result is
Theorem. There exists an effective positive constant δ such that the diophantine equation
(1.3)
N = p
21+ p
22+ p
23+ p
24+ p
25, p
i≡ b
i(mod d), i = 1, . . . , 5,
with prime variables is solvable for all positive integers d ≤ N
δprovided N ≡ 5 (mod 24) is a large odd integer with B(N, d) nonempty.
It should be mentioned that this result implies the famous Linnik theo- rem on the least prime in an arithmetic progression.
1991 Mathematics Subject Classification: Primary 11P55, 11P32, 11L07, 11L05.
[217]
In the corresponding linear case, i.e. for the Goldbach ternary theorem with prime variables in an arithmetic progression, M. C. Liu and Tao Zhan [7] proved that there exists an effective positive constant δ > 0 such that, for all positive integers d ≤ N
δ, the diophantine equation
N = p
1+ p
2+ p
3,
p
i≡ b
j(mod d), j = 1, 2, 3, where (b
j, d) = 1, P
3j=1
b
j≡ N (mod d), is solvable for large odd N . Their result improved the work of Rademacher, Ayoub and Zulauf, the previous results holding only for a fixed positive integer d or d ≤ (log N )
C. Hua’s method with minor modifications actually gives that the equation (1.3) is solvable for large N with the set (1.2) nonempty and d log
AN , but it fails when we want to enlarge the scope of d to d ≤ N
δ, where δ is an absolute positive constant.
The difficulty lies in two respects. First, in the case of d ≤ N
δ, we cannot use the Siegel–Walfisz theorem as usual to estimate the major arcs. Second, the restriction to an arithmetic progression requires finding a way to deal with exponential sums and Gauss sums over an arithmetic progression.
The second difficulty was overcome by Jianya Liu and Tao Zhan in [4, Lemma 2]. By using multiplicativity ingeniously, they transform the expo- nential sum over an arithmetic progression to the usual exponential sum with Dirichlet characters and Gauss sums. The starting point of this pa- per is a similar result for the quadratic case, i.e. Lemma 3.2. But we use a different method to deal with the first difficulty.
In 1975, Montgomery and Vaughan [8] diminished the exceptional set of the Goldbach problem from O(x log
−Ax) to O(x
1−δ). Their difficulty also was wider major arcs and the fact that the Siegel–Walfisz theorem could not be used. They solved it by using the Deuring–Heilbronn phenomenon and Gallagher’s theorem. But if we use their method for our quadratic problem, we will face too many cases and need to do lots of calculation. This is also the reason why we do not apply the method of M. C. Liu and Tao Zhan [7]. Hence we shall apply a modification of the method by Liu and Tsang [6]. The point is that we only need to estimate the singular series and singular integral separately and only once. But as we are concerned with the quadratic case and the restriction to an arithmetic progression, we have to work harder from the beginning to estimate the complicated singular series (Lemma 4.8).
2. Notations and the minor arcs. Define
(2.1) d := N
δ, Q := N
21δ, T := N
√δ, L := N/50, τ := N
−1T
1/4,
where δ is a small computable positive constant. Then Q < T < L < N. We
write
(2.2) L
2= L
1/2and N
2= N
1/2.
In the following, ε > 0 is a comparable very small constant, and the implied constants in the symbols O and are computable, positive and depend at most on δ, ε. We write e(y) for e
2πiyand e
q(y) for e(y/q).
For any a, q such that (a, q) = 1, 1 ≤ a ≤ q ≤ Q, let m(a, q) :=
a − τ q , a + τ
q
.
We can easily see that these intervals are mutually disjoint and all lie in [τ, 1 + τ ]. We call the union of these m(a, q) the major arcs M and [τ, 1 + τ ] \ M the minor arcs M
0.
Define
S
i(α) := S(α, d, b
i) := X
n≤N2
n≡bi(mod d)
Λ(n)e(αn
2), (2.3)
R(N ) :=
1+τ
\
τ
Y
5 i=1S
i(α)e(−N α) dα.
(2.4)
Then the Theorem holds if R(N ) > 0. By interval dissection we get (2.5) R(N ) =
n \
M
+ \
M0
o Y
5i=1
S
i(α)e(−N α) dα =: R
1(N ) + R
2(N ), say. The integral over the minor arcs contributes the error term R
2(N ). We now estimate it by the following lemma from [11].
Lemma 2.1. For α satisfying |α − a/q| ≤ 1/q
2, (a, q) = 1 and h = (q, d), we have
(2.6) S
i(α) N
21+εh
dq
1/2+ N
211/14d
1/2+ N
26/7h
3/4d
3/4q
1/4+
q
1/2h
1/2+ q
1/4h
1/4N
21/7N
ε. For any α ∈ M
0, by Dirichlet’s lemma we see that there exist q, a satis- fying Q ≤ q ≤ τ
−1, (a, q) = 1, 1 ≤ a ≤ q such that |α − a/q| ≤ τ /q ≤ 1/q
2. Notice that d = N
δ, Q = N
21δand δ is sufficiently small. We have
Corollary 2.2. For any α ∈ M
0, we have
S
i(α) N
1/2Q
−1/2provided N ≥ N
0(δ).
Lemma 2.3. We have
R
2(N ) N
3/2Q
−1/2d
−2log
4+cN whenever N ≥ N
1(δ, c).
P r o o f. By Corollary 2.2 for S
5(α), we have R
2(N ) N
1/2Q
−1/21+τ
\
τ
Y
4 i=1S
i(α)
dα N
1/2Q
−1/2X
4 i=11+τ
\
τ
|S
i(α)|
4dα.
Clearly, the inner integral is equal to X
nj≤N2
n21+n22=n23+n24 nj≡bi(mod d)
Λ(n
1)Λ(n
2)Λ(n
3)Λ(n
4) ≤ log
4N X
nj≤N2
n21+n22=n23+n24 nj≡bi(mod d)
1
= log
4N
1
\
0
X
n≤N2 n≡bi(mod d)
e(αn)
4dα.
Following the arguments in Hua’s lemma [9], it is seen that there is a constant c such that
1
\
0
X
n≤N2
n≡bi(mod d)
e(αn)
4dα N
d
2log
cN, whence our lemma follows.
From (2.5), to obtain our Theorem it remains to find a lower bound for R
1(N ) such that R
1(N ) > |R
2(N )|.
3. Notations and the major arcs. We shall use χ (mod q) and χ
0(mod q) to denote a Dirichlet character and the principal character modulo q respectively. It is known ([1], Chapter 14) that there exists a small c
1such that there is at most one primitive character e χ to a modulus e r ≤ T for which the corresponding L-function L(s, e χ) has a zero in the region σ >
1 − c
1(log T )
−1, |t| ≤ T ; and if there is such an exceptional character, it is quadratic and the corresponding zero e β, called the exceptional zero, is real, simple and unique. Furthermore we have
(3.1) c
2(e r
1/2log
2r) e
−1≤ 1 − e β ≤ c
1/log T.
We write P
q0a=1
or P
(a,q)=1
for a sum over integers a satisfying 1 ≤ a ≤ q
and (a, q) = 1. For any character χ (mod dq/h), h = (d, q) define
(3.2)
S(χ, y) := S(χ, y, d, q) := X
L2≤n≤N2
χ(n)Λ(n)e(n
2y),
I(y) :=
N
\
2L2
e(x
2y) dx,
I(y) := e
N
\
2L2
x
β−1ee(x
2y) dx,
I(χ, y) := X
0 γ≤TN
\
2L2
x
%−1e(x
2y) dx,
where Λ(n) is the von Mangoldt function and P
0γ≤T
denotes the summation over all zeros % = β + iγ of the function L(s, χ) lying in the region 1/2 ≤ β ≤ 1 − c
1(log T )
−1, |γ| ≤ T (hence excluding e β, if it exists). By ([1], p. 120) we can easily deduce that
Lemma 3.1. For any real y and any χ (mod dq/h) with dq/h ≤ T, we have
(3.3) S(χ, y) = δ
0χI(y) − δ
χI(y) − I(χ, y) + O((1 + |y|N )N e
2T
−1log
2N ), where
δ
χ0=
1 if χ = χ
0(mod dq/h), 0 otherwise;
δ
χ=
1 if χ = e χχ
0(mod dq/h), 0 otherwise.
We next transform the exponential sum S
i(α) into character sums or integrals of the above forms. To do this we need some more notations.
For positive integers d, q and h(q) := (d, q), i.e. the largest common divisor of d, q, define positive integers α
i, β
i, γ
iaccording to
d = p
α11. . . p
αssd
0, q = p
β11. . . p
βssq
0, (d
0, q
0) = 1, (3.4)
h(q) = p
γ11. . . p
γss, (3.5)
hence γ
i= min(α
i, β
i), i = 1, . . . , s. Define h
1(q) := p
δ11. . . p
δss, δ
i=
α
iif β
i> α
i, 0 otherwise, (3.6)
h
2(q) := h(q)/h
1(q).
(3.7)
For brevity, we write h = h(q), h
1= h
1(q) and h
2= h
2(q). It is easily seen
that (h
1, h
2) = 1 and (d/h
1, q/h
2) = 1.
Lemma 3.2. For α = a/q + λ, we have (3.8) S
i(α) = ϕ
−1(d/h
1)ϕ
−1(q/h
2)
× X
η (mod d/h1)
ζ(b
i) X
η (mod q/h2)
G
i(a, η, q)S(ζη, λ) + O(log
2N ), where
(3.9) G
i(a, η, q) := G(h, b
i, a, η, q) := X
(c,q)=1 c≡bi(mod h)
e(ac
2/q)η(c),
and η, ζ are characters modulo q/h
2and d/h
1respectively.
P r o o f. We have S
i(α) = X
n≤N2
n≡bi(mod d)
Λ(n)e(αn
2)
= X
n≤N2 n≡bi(mod d)
(n,q)=1
Λ(n)e(αn
2) + O X
pk≤N2
p|q
log pe(p
2kα)
= X
(c,q)=1
e
ac
2q
X
n≤N2
n≡bi(mod d) n≡c (mod q)
Λ(n)e(n
2λ) + O
log N X
p|q
log p
= X
(c,q)=1 c≡bi(mod h)
e
ac
2q
X
n≤N2 n≡bi(mod d)
n≡c (mod q)
Λ(n)e(n
2λ) + O(log
2N ).
The inner sum of the main term is empty unless c ≡ b
i(mod h), we can therefore add the restriction c ≡ b
i(mod h) to the sum over c. On the other hand, under the condition c ≡ b
i(mod h), the congruences
n ≡ b
i(mod d), n ≡ c (mod q) are equivalent to
n ≡ b
i(mod d/h
1), n ≡ c (mod q/h
2).
Then
S
i(α) = X
(c,q)=1 c≡bi(mod h)
e
ac
2q
X
n≤N2 n≡bi(mod d/h1)
n≡c (mod q/h2)
Λ(n)e(n
2λ) + O(log
2N )
= ϕ
−1(d/h
1)ϕ
−1(q/h
2) X
(c,q)=1 c≡bi(mod h)
e
ac
2q
× X
ζ (mod d/h1)
ζ(b
i) X
η (mod q/h2)
η(c)
× X
n≤N2
ζη(n)Λ(n)e(n
2λ) + O(log
2N ).
Hence we get the assertion.
By using the above lemmas, we now simplify R
1(N ) as follows.
For any α = a/q + λ ∈ m(a, q), we have |λ| < τ /q and q ≤ Q. By Lemmas 3.1 and 3.2,
S
i(α) = ϕ
−1(d/h
1)ϕ
−1(q/h
2)
G
i(a, η
0, q)I(λ) − δ
qζζ e
0(b
i)G
i(a, e ηη
0, q)e I(λ)
− X
ζ (mod d/h1) η (mod q/h2)
ζ(b
i)G
i(a, η, q)I(ζη, λ)
+ O
ϕ
−1(q/h
2) X
η (mod q/h2)
|G
i(a, η, q)|(1 + |λ|N )N
1/2T
−1log
2N
+ O(log
2N ),
where e ζζ
0(mod d/h
1) e ηη
0(mod q/h
2) = e χχ
0(mod dq/h), e ζ, e η are primi- tive characters, and
(3.10) δ
q:=
n 1 if e χ (mod e r) exists and e r | dq/h, 0 otherwise.
Since |λ| τ /q and |λ|N < T
1/4q
−1, the trivial bound X
η (mod q/h2)
|G
i(a, η, q)| ϕ(q/h
2)ϕ(q) shows that the first O-term above is
ϕ(q)T
1/4q
−1N
2T
−1log
2N ≤ N
2T
−3/4log
2N.
Hence, for α = a/q + λ ∈ m(a, q) we obtain
(3.11) S
i(α) = ϕ
−1(d/h
1)ϕ
−1(q/h
2)H
i(a, q, λ) + O(N
2T
−3/4log
2N ) where
H
i(a, q, λ) := G
i(a, η
0, q)I(λ) − δ
qζζ e
0(b)G
i(a, e ηη
0, q)e I(λ) (3.12)
− F
i(a, q, λ),
F (a, q, λ) := X
ζ (mod d/h1) η (mod q/h2)
ζ(b
i)G
i(a, η, q)I(ζη, λ).
(3.13)
To estimate H
i(a, q, λ), we need the following lemma which can be de- duced similarly to Lemma 3.3 of [5].
Lemma 3.3. Let I(λ), e I(λ) and I(χ, λ) be defined as in (3.2).
(a) For any real y, we have
I(y) min(N
2, L
−12|y|
−1), I(y) min(N e
2βe, L
β−12e|y|
−1) and
I(χ, y)
N
2for any real y, N
2(L|y|)
−1/2for |y| > L
−1, L
2(L|y|)
−1for |y| > T /(πL).
(b) We have
∞
\
−∞
|I(y)|
2dy N
2L
−12,
∞
\
−∞
|e I(y)|
2dy N
2βeL
β−22eand
∞
\
−∞
|I(χ, y)|
2dy N L
−1log N.
By the trivial estimates for I(λ), e I(λ) and I(χ, λ) in Lemma 3.3(a), ϕ
−1(d/h
1)ϕ
−1(q/h
2)H
i(a, q, λ) ϕ(q)N
2.
Then
R
1(N ) = X
q≤Q
ϕ
−5(d/h
1)ϕ
−5(q/h
2)
× X
(a,q)=1 τ /q
\
−τ /q
e(−N (a/q + λ)) Y
5 i=1H
i(a, q, λ) dλ
+ O X
q≤Q
X
(a,q)=1
τ q
X
4 k=0(ϕ(q)N
2)
k(N
2T
−3/4log
2N )
5−k= X
q≤Q
ϕ
−5(d/h
1)ϕ
−5(q/h
2) X
(a,q)=1
e
q(−N a)
×
τ /q
\
−τ /q
e(−N λ) Y
5 i=1H
i(a, q, λ) dλ + O(N
3/2T
−1/2Q
5).
The product Q
5k=1
H
k(a, q, λ) is a sum of at most (φ(dq/h)+2)
5terms, each
of the form Q
5i=1
E
iwhere E
iis either G
i(q)I(λ), −δ
qζ(b e
i)G(a, e ηη
0, q)e I(λ) or −ζ(b
i)G(a, η, q)I(ζη, λ).
By comparing the estimates for I(λ), e I(λ) and I(χ, λ) in Lem- ma 3.3(a) with |λ| > τ /q > L
−1, it is easily seen that the weakest one among them is N
1/2(L|λ|)
−1/2, since τ = N
−1T
1/4, L = N/25 and T = Q
1/√δ. Then
\
R\[−τ /q,τ /q]
Y
5 i=1E
idλ φ
3(q)|τ /q|
−3/2∞
\
−∞
|E
1E
2| dλ
φ
5(q)|τ /q|
−3/2, by Cauchy’s inequality and Lemma 3.3(b). Hence
X
q≤Q
φ
−5d h
1φ
−5q h
2X
(a,q)=1
\
R\[−τ /q,τ /q]
e(−N λ) Y
5 i=1H
i(a, b, λ) dλ
X
q≤Q
φ
−5d h
1φ
−5q h
2φ(q)
φ
dq h
+ 2
5φ
5(q)q
3/2N
3/2T
−3/8Q
9N
3/2T
−3/8N
3/2Q
−1. Therefore
R
1(N ) = X
q≤Q
φ
−5(d/h
1)φ
−5(q/h
2) X
(a,q)=1
e
q(−N a) (3.14)
×
∞
\
−∞
e(−N λ) Y
5 i=1H
i(a, q, λ) dλ + O(N
3/2Q
−1).
4. Some lemmas for singular series and singular integrals Lemma 4.1. Let χ (mod p
β/h
2) be any character with β ≥ 0, let h
2= h
2(p
β) be defined as in (3.7), and let α be such that p
αk d. We have
(a) G
i(a, χ, p
β) = 0 if χ (mod p
β) is primitive and p | a, β > α.
(b) G
i(a, χη
0, p
t) = 0 if η
0is modulo p
t/h
2(p
t), p - a and t ≥ θ + max{θ, α, β} where θ = 1 + [1/p].
(c) G
i(a, χ, p
β) ≤ 2(2, p)p
β/2if p - a.
P r o o f. (a) Let a
0= a/p. For 1 ≤ c ≤ p
β, write c = u + vp
β−1. Since
β > α, h(p
β) = p
αand the restriction c ≡ b
i(mod p
α) is equivalent to
u ≡ b
i(mod p
α). Hence
G
i(a, χ, p
β) =
pβ
X
c≡bic=1(mod pα)
e
a
0c
2p
β−1χ(c)
=
p
X
β−1u≡biu=1(mod pα)
e
a
0u
2p
β−1X
p v=1χ(u + vp
β−1) = 0.
For a primitive character χ (mod p
β), the inner sum over v is zero.
(b) For t > α, we see from (3.7) that h(p
t) = p
α, h
2(p
t) = 1, so η
0is modulo p
t.
For 1 ≤ c ≤ p
t, write c = u + vp
t−θ. The restriction c ≡ b
i(mod p
α) is equivalent to u ≡ b
i(mod p
α), since t ≥ θ + max{θ, α, β}. Moreover, we have c
2≡ u
2+ 2uvp
t−θ(mod p
t), c ≡ u (mod p
β). Hence
G(a, χ, p
t) =
p
X
t−θu≡biu=1(mod pα)
χη
0(u)e
au
2p
tX
pθ v=1e
2auv p
θ.
For each u coprime with p, in view of p - a, the inner sum over v is zero.
Hence (b) is proved.
(c) For β ≤ α, |G(a, χ, p
β)| ≤ 1. For β > α, we have G(a, χ, p
β) =
pβ
X
c≡bic=1(mod pα)
e
ac
2p
βχ
pβ(c)
= ϕ
−1(p
α) X
χpα(mod pα)
χ
pα(b
i)
pβ
X
c=1
e
ac
2p
βχ
pαχ
pβ(c).
By Exercise 14 in Chapter 6 of [10], the inner sum over c is less than 2(2, p)p
β/2for p - a. Hence we get (c).
We shall use the following sums to form the singular series:
Z(q) := Z(q; η
1, . . . , η
5) := X
(a,q)=1
e
q(−N a) Y
5 i=1G
i(a, η
i, q), (4.1)
Y (q) := Y (q; η
1, . . . , η
5) :=
X
q a=1e
q(−N a) Y
5 i=1G
i(a, η
i, q), (4.2)
where η
iis modulo q/h
2(q). We can also write (4.3) Y (q; η
1, . . . , η
5) = q X
(q)
η
1(c
1) . . . η
5(c
5),
where P
(q)
denotes the sum over c
1, . . . , c
5satisfying
(4.4)
1 ≤ c
1, . . . , c
5≤ q, c
i≡ b
i(mod (d, q)), (c
i, q) = 1,
X
5 i=1c
2i≡ N (mod q).
Denote by N (q) the number of solutions of the above congruence equation.
By Hua’s work on Tarry’s problem [3, p. 162] and M. C. Liu, K. M. Tsang [6, (1.5)], we see that if N ≡ 5 (mod 24) and N satisfies (1.2) then N (q) ≥ 1 for all q. In the case where η
iare all principal characters, we see that (4.5) Y (q; η
0, . . . , η
0) = qN (q).
Furthermore we put
(4.6) A(q) := ϕ
−5(q(d, q)
∞/h)Z(q; η
0, . . . , η
0),
where (d, q)
∞has the same prime factors of (d, q), and (d, q)
∞k d which means that if p
αk (d, q)
∞then p
αk d.
Lemma 4.2. Both Z(q) and Y (q) are multiplicative in the sense that if q = q
1. . . q
twith (q
i, q
j) = 1 for i 6= j, then for each i = 1, . . . , 5, the decompositions η
i(mod q/h
2(q)) = Q
tj=1
η
ij(mod q
j/h
2(q
j)) are unique.
We have
Z(q; η
1, . . . , η
5) = Y
t j=1Z(q
j; η
1j, . . . , η
5j) and
Y (q; η
1, . . . , η
5) = Y
t j=1Y (q
j; η
1j, . . . , η
5j).
In particular , N (q) and A(q) are multiplicative functions of q.
P r o o f. It suffices to consider the case t = 2, and then use induction.
Let q = q
1q
2with (q
1, q
2) = 1. It is easily seen from (3.7) that
h(q) = h(q
1)h(q
2), h
i(q) = h
i(q
1)h
i(q
2), (h
i(q
1), h
i(q
2)) = 1, q
h
i(q) = q
1h
i(q
1) · q
2h
i(q
2) ,
q
1h
i(q
1) , q
2h
i(q
2)
= 1, i = 1, 2.
Then η
i(mod q/h
2(q)) = Q
2j=1
η
ij(mod q
j/h
2(q
j)) are uniquely deter- mined.
Let a = a
1q
2+ a
2q
1. If a
1, a
2run over reduced residue systems modulo q
1, q
2respectively, then a will run over a similar system modulo q
1q
2. So (4.7) Z(q
1q
2) = X
(a,q)=1
e
q(−N a) Y
5 i=1G
i(a, η
i, q
1q
2)
= X
(a1,q1)=1
e
q1(−N a
1) X
(a2,q2)=1
e
q2(−N a
2)
× Y
5 i=1G
i(a
1q
2+ a
2q
1, η
i, q
1q
2).
In the same way, let c = c
1q
2+ c
2q
1. We see that the restriction c ≡ b
i(mod h(q)) is equivalent to c
1q
2≡ b
i(mod h(q
1)) and c
2q
1≡ b
i(mod h(q
2)), so we have
G
i(a
1q
2+ a
2q
1, η
i, q
1q
2)
=
q1q2
X
c≡bi(mod h(q))c=1
e
(a
1q
2+ a
2q
1)c
2q
1q
2η(c)
= X
(c1q2,q1)=1 c1q2≡bi(mod h(q1))
X
(c2q1,q2)=1 c2q1≡bi(mod h(q2))
e
a
1(c
1q
2)
2q
1η
i1(c
1q
2)
× e
a
2(c
2q
1)
2q
2η
i2(c
2q
1)
= G
i(a
1, η
i1, q
1)G
i(a
2, η
i2, q
2).
Hence the right hand side of (4.7) is equal to Z(q
1)Z(q
2) as desired. The proof for the multiplicativity of Y (q) is similar.
Lemma 4.3. For any positive integer q, we have ϕ
−5dq h(q)
Z(q) h
5(q)
d
5q
−3/2L.
P r o o f. Let q = Q
p|q
p
βpbe the prime factorization of q. By Lemma 4.1(c) and multiplicativity of Z(q), we have
|Z(q)| = Y
p|q
|Z(p
βp)| Y
p|q
p
βpY
5 i=1p
βp/2q
7/2.
Hence by ϕ(q) q log log q, we get the assertion.
Lemma 4.4. For i = 1, . . . , 5, let χ
i(mod p
βi) be primitive characters and β = max{β
1, . . . , β
5}. Define α := α(p) such that p
α(p)k d. For brevity, write Z(p
t) = Z(p
t; χ
1χ
0, . . . , χ
5χ
0), where χ
0is modulo p
t. Then
(a) Z(p
β) = Y (p
β) if β > α.
(b) Z(p
t) = 0 if t ≥ θ + max{θ, β, α}, where θ = 1 + [2/p].
(c) P
tv=β
ϕ
−5(p
v)Z(p
v) = ϕ
−5(p
t)Y (p
t) for β>α or P
αv=0
ϕ
−5(p
α)Z(p
v) + P
tv=α+1
ϕ
−5(p
v)Z(p
v) = ϕ
−5(p
t)Y (p
t) for β = 0 and t > α.
P r o o f. (a) Lemma 4.1(a) asserts that G
i(a, χ, p
β) = 0 for p - a and β > α. By comparing (4.1) and (4.2), the sum over a is empty if p | a, hence (a) is proved.
(b) This follows readily from (4.1) and Lemma 4.1(b).
(c) The sum for Z(p
t) can be written as P
pva=1
− P
pva=1, p|a
. The first sum is Y (p
α). By setting a = pa
0, we see that the second sum is equal to p
5Y (p
v−1) when v ≥ max(β + 1, α + 1, 2). Thus we have
ϕ
−5(p
v)Z(p
v) = ϕ
−5(p
v)Y (p
v) − ϕ
−5(p
v−1)Y (p
v−1).
For β > α, summing both sides for v = β + 1, . . . , t and using (a), we get the first equality. In order to consider the condition β = 0, t > α, we still need to sum over those t ≤ α and prove that this yields ϕ
−5(p
α)Y (p
α). For t ≤ α, h(p
t) = p
t, h
2(p
t) = p
twe have
Z(p
t) = X
(a,pt)=1
e
−N a p
tY
5i=1
X
(ci,pt)=1 ci≡bi(mod pt)
e
ac
2ip
t= X
(a,pt)=1
e
a( P
5i=1
b
2i− N ) p
t= ϕ(p
t).
Since, by (1.2), P
b
2i≡ N (mod d), we have (4.8)
X
α v=0ϕ
−5(p
α)Z(p
v) = ϕ
−5(p
α)p
α, which is equal to ϕ
−5(p
α)Y (p
α). Hence we get (c).
Taking χ
1= . . . = χ
5= χ
0and β = 0 in Lemma 4.4, we obtain
Corollary 4.5. Let A(q), N (q), α = α(p) be defined as in (4.6), (4.5), and Lemma 4.4 respectively. Then
(a) A(p
t) = 0 for p ≥ 3, t ≥ 1 + α, and A(2
t) = 0 for t ≥ 2 + max{2, α}.
(b) p
tϕ
−5(p
t)N (p
t) = p
αϕ
−5(p
α)N (p
α) for p ≥ 3, t ≥ α.
(c) 2
tϕ
−5(2
t)N (2
t) = 2
α0ϕ
−5(2
α0)N (2
α0) for t ≥ α
0, α
0= 1+max{2, α}.
In view of the above corollary, we now define
s(p) := X
0≤t<θ+max{θ,α(p)}
A(p
t) (4.9)
= ϕ
−5(σ(p
α(p))p
α(p))N (σ(p
α(p))p
α(p))σ(p
α(p))p
α(p),
where σ(q) is defined in (1.2). We now simplify s(p).
Lemma 4.6. (a) s(p) = ϕ
−5(p
α)p
αif p 6= 2, α = α(p) ≥ 1.
(b) s(2) =
2
3if α(2) = 1,
ϕ
−5(2
α(2))2
α(2)+1if α(2) ≥ 2, hence we can also write s(2) = ϕ
−5(2
α)2
ασ(d).
(c) s(p) = 1 + A(p) if p 6= 2, p - d, s(2) = 1 + A(2) + A(2
2) + A(2
3) if 2 - d.
P r o o f. (a) Use a similar argument to Lemma 4.4(c), (4.8).
(b) By a similar argument, we have A(2
t) = ϕ
−5(2
α)ϕ(2
t) for t ≤ α.
If α = 1, it remains to consider t = 2, 3. Since P
(a,pt)=1
= P
pta=1
− P
pta=1, p|a
, we have
A(2
t) = ϕ
−5(2
t) X
(a,2t)=1
e
−N a 2
tY
5i=1
X
2tci=1 ci≡bi(mod 2)
e
ac
2i2
t= ϕ
−5(2
t)
X
2ta=1
2t
X
ci=1 ci≡bi(mod 2)
e
a( P
5i=1
c
2i− N ) 2
t−
2
X
t−1a0=1 2t
X
ci=1 ci≡bi(mod 2)
e
a
0( P
5i=1
c
2i− N ) 2
t−1= ϕ
−5(2
t)(2
tN (2
t) − 2
t−12
5N (2
t−1))
= ϕ
−5(2
t)2
tN (2
t) − ϕ
−5(2
t−1)2
t−1N (2
t−1)
where N (2
t) is the number of solutions of the following congruence equation:
1 ≤ c
i≤ 2
t, (c
i, 2) = 1,
c
21+ . . . + c
25≡ N (mod 2
t), c
i≡ b
i(mod 2).
By an easy calculation, we see that N (2
3) = 2
10, N (2) = 1. Then s(2) = 1 + A(2) + A(2
2) + A(2
3)
= 1 + 1 − ϕ
−5(2)2N (2) + ϕ
−5(2
3)2
3N (2
3) = 2
3. If α > 1, it remains to consider t = α + 1. We have
A(2
α+1) = ϕ
−5(2
α+1) X
(a,2α+1)=1
e
−N a 2
α+1Y
5i=1
2X
α+1ci=1 ci≡bi(mod 2α)
e
ac
2i2
α+1= ϕ
−5(2
α+1)
2X
α+1a=1
2
X
α+1ci=1 ci≡bi(mod 2α)
e
a( P
5i=1
c
2i− N ) 2
α+1−
2α
X
a0=1
e
−N a
02
αY
5i=1
2X
α+1ci=1 ci≡bi(mod 2α)
e
a
0c
2i2
α= ϕ
−5(2
α+1)2
α+1N (2
α+1) − ϕ
−5(2
α)2
α,
where N (2
α+1) is the number of solutions of the following congruence equa- tion:
1 ≤ c
i≤ 2
α+1, (c
i, 2) = 1,
c
21+ . . . + c
25≡ N (mod 2
α+1), c
i≡ b
i(mod 2
α).
Obviously N (2
α+1) = 2
5. Then
s(2) = ϕ
−5(2
α) + A(2) + . . . + A(2
α) + A(2
α+1)
= ϕ
−5(2
α+1)2
α+1N (2
α+1) = ϕ
−5(2
α)2
α+1. Part (c) follows immediately from (4.9) and Corollary 4.5.
Lemma 4.7. We have:
(a) A(p) < 30p
−2for all p - d.
(b) Q
p
s(p) converges absolutely and Q
p
s(p) ϕ
−5(d)dσ(d).
X
∞ q=1 (q,r)=1ϕ
−5(dq/h)Z(q; η
0, . . . , η
0) = Y
p-r
s(p) (c)
= σ(d/(d, r))d/(d, r) ϕ
−5(d/(d, r))
Y
p-r p-d
s(p).
(d) X
q≥y
ϕ
−5(dq/h)Z(q; η
0, . . . , η
0) y
−1d
−3log
30(y + 1).
P r o o f. (a) Since p - d, we have h(p) = 1; let g be a quadratic nonresidue modulo p. Then
A(p) = ϕ
−5(p)
p−1
X
a=1
e
−N a p
Y
5 i=1 p−1X
ci=1
e
ac
2ip
= 1
2 ϕ
−5(p)
p−1
X
a=1
e
−N a
2p
Y
5i=1
C
p(a
2) + e
−N ga
2p
Y
5i=1
C
p(ga
2)
, where C
p(a) = P
p−1c=1
e(ac
2/p). It is well known [1] that C
p(1) = λ − 1, λ =
√ p if p ≡ 1 (mod 4), i √
p if p ≡ −1 (mod 4).
Furthermore C
p(1) + C
p(g) = 2 P
p−1c=1
e
p(c) = −2, hence C
p(g) = −λ − 1.
Plainly
C
p(a) =
C
p(1) if a is a quadratic residue modulo p, C
p(g) otherwise.
We get
A(p) =
12ϕ
−5(p)
×
(p − 1)((λ − 1)
5+ (−λ − 1)
5) if p | N,
(λ − 1)
6+ (−λ − 1)
6if p - N and
Np= 1, (−λ − 1)(λ − 1)
5+ (λ − 1)(−λ − 1)
5if p - N and
Np= −1,
N p
being the Legendre symbol. Hence |A(p)| < 30p
−2for p - d. The 30 comes from explicit computation, but it is unimportant for our application.
(b) By Lemmas 4.6 and 4.7(a), we have Y
p
s(p) = Y
p|d
s(p) Y
p-d
(1 + A(p))
ϕ
−5(2
α(2))σ(d)2
α(2)Y
pα(p)kd p6=2
ϕ
−5(p
α(p))p
α(p)Y
p-d
(1 − 30p
−2)
σ(d)ϕ
−5(d)d.
The proof of the convergence is similar.
(c) Let q = q
0q
00, (q
0, q
00) = 1 and q
0| d
∞, (q
00, d) = 1. By the multiplica- tivity of Z(q), we have
X
∞ q=1 (q,r)=1ϕ
−5dq h
Z(q)
=
X
∞q0=1 (q0,r)=1
q0|d∞
ϕ
−5dq
0h
Z(q
0)
X
∞q00=1 (q00,r)=1 (q00,d)=1
ϕ
−5(q
00)Z(q
00)
.
Hence by Corollary 4.5, (4.9) and Lemma 4.6, we obtain the result.
(d) Let δ = (log(y + 1))
−1. We have X
q≥y
ϕ
−5(dq/h)Z(q) ≤ X
q≥y
|A(q)| ≤ y
−1+δX
∞ q=1q
1−δ|A(q)|
y
−1Y
p|d
θ+max{θ,α(p)}X
t=0
p
t(1−δ)|A(p
t)| Y
p-d
(1 + 30p
−1−δ)
y
−1d ϕ
5(d)
Y
p|d
p
α(p)Y
p