XCIV.2 (2000)
Kloosterman sums and primitive elements in Galois fields
by
Stephen D. Cohen (Glasgow)
1. Introduction. Let F
qdenote the finite (Galois) field of order q, a power of a prime p. The multiplicative group F
∗qof F
qis cyclic of order q −1:
a generator is known as a primitive element of F
q. Hence F
qcontains φ(q−1) primitive elements, where φ is Euler’s function. Generally, primitivity is a fragile property that may be destroyed when the element in question is modified through multiplication or addition. Nevertheless, if ξ is a primitive element, then so is 1/ξ.
When q = 2, H. Niederreiter [Ni] has expressed the number of irreducible polynomials of degree n (≥ 3) over the binary field F
2, having the coefficients of x
n−1and x both equal to 1, as a formula involving Kloosterman sums over F
2n. Thereby, this number is shown to be positive, except when n = 3.
An alternative formulation of this conclusion is that, except when n = 3, F
2ncontains an element ξ such that F
2n= F
2(ξ), and both ξ and 1/ξ have (F
2n, F
2)-trace equal to 1. In this paper we consider extensions F
qnof a general finite field F
q. The aim is to show that Kloosterman sums are adequate for the stiffer task of generalising the above result (when n ≥ 5) to yield the existence of a primitive element ξ of F
qnsuch that T
n(ξ) = a and T
n(1/ξ) = b, where a and b are (arbitrary) given elements of F
qand T
n(ξ) := ξ + ξ
q+ . . . + ξ
qn−1denotes the (F
qn, F
q)-trace of ξ. The result to be proved is as follows.
Theorem 1.1. Let q be a prime power and n (≥ 5) be an integer. Suppose that arbitrary elements a and b of F
qare given. Then there exists a primitive element ξ of F
qnsuch that T
n(ξ) = a and T
n(1/ξ) = b, except when a = b = 0 and (q, n) = (4, 5), (2, 6), or (3, 6).
Theorem 1.1 is consistent with the pattern that, as n increases, one can expect to guarantee the existence of a primitive element satisfying additional
2000 Mathematics Subject Classification: 11L05, 11T24, 11T30.
[173]
constraints. Let it be stressed that what are sought are complete results listing all exceptions. For example, prior to Theorem 1.1 is the theorem of Cohen [Co1] (see also [JuVa]) that, given a ∈ F
qand n ≥ 2 (n ≥ 3, if a = 0), there exists a primitive element ξ of F
qnwith T
n(ξ) = a, except when n = 3, q = 4, and a = 0. Another stage in the scheme is described later in the introduction.
The proof of Theorem 1.1 derives from careful estimates in respect of expressions that combine Kloosterman sums over F
qnand over F
q. Next, some properties of Kloosterman sums (and Gauss sums) will be developed.
For example, whereas the absolute value of a Kloosterman sum over F
qis bounded by 2 √
q, on average, it is less than √
q (Corollary 3.2). By means of a sieving process, the proof is completed theoretically, without direct verification, except for a few small values of q (≤ 16) and n = 5 or 6, plus q = 2 when n = 8 (when a and b are not both zero). In fact, when a and b are both zero, then Theorem 1.1 follows from the work of W.-S. Chou and the author [ChCo] and is “best possible” in the sense that it fails for all pairs (q, n) with n < 5. Otherwise, the method will succeed, in principle, also when n = 4. We defer the study of this case to a further paper, because of the difficulty of identifying efficiently those values of q for which direct verification is required. The question addressed is sensible even when n = 3, but the method fails, and it may be difficult to resolve that case.
For n ≥ 2, the associated (irreducible) minimal polynomial (of degree n) over F
qof a primitive element ξ of F
qis itself referred to as primitive. Since the (F
qn, F
q)-norm of such an element ξ is necessarily a primitive element of F
q, and so, when q = 2 or 3, is uniquely determined, we have the following consequence of Theorem 1.1.
Corollary 1.2. Suppose that q is a prime power , n ≥ 5, and a
n−1and a
1are given elements of F
q. Then, if either a
n−1= a
1= 0 or q ≤ 3, there exists a primitive polynomial of the form
(1.1) x
n+ a
n−1x
n−1+ . . . + a
1x + a
0.
More generally, Theorem 1.1 implies that there is a primitive polynomial of the form (1.1) for n ≥ 5 with both a
n−1and the ratio a
1/a
0prescribed.
The Kloosterman sum technique should be instrumental in delivering the next stage in the programme beyond Theorem 1.1, namely, that for given values of a
n−1, a
1, and a
0(with a
0a primitive element of F
qand n ≥ 6 or, perhaps, n ≥ 7), there is a primitive polynomial (1.1) with prescribed values of a
n−1, a
1and a
0. For other results, conjectures, and data on the existence of primitive elements of F
qnsatisfying further constraints, see, for example, [HaMu], [Mu], [MoMu], [Ha], [CoHa1,2], [Co2]. I also acknowledge some preliminary notes by Dr Wun-Seng Chou (Taipei) on the “fixed traces”
question, out of which the considerations of this paper arose.
From the results of [ChCo] we can exclude the case a = b = 0 of Theorem 1.1 in what follows. Indeed, by symmetry, we shall assume, without loss of generality, that a 6= 0.
2. Character sum formulation. The number of elements ξ of F
∗qfor which T
n(ξ) = a and T
n(1/ξ) = b can be expressed in terms of (standard) Kloosterman sums over F
qn. The further constraint that ξ can be primitive heralds the introduction of multiplicative characters and more intricate ex- pressions involving generalised (or twisted) Kloosterman sums. Obtaining the relevant formulae in their most transparent form is the object of this section. For the background on Kloosterman sums and Gauss sums for this section and Section 3, Chapter 5 of [LiNi] (including the Exercises) or some other source may be consulted.
Let M be a divisor of q
n− 1. If ξ ∈ F
∗qnis such that ξ = α
d, where d | M and α ∈ F
qn, implies d = 1, we shall say that ξ is not any kind of Mth power in F
qn. Given q, n, a, b as in Section 1, define N
q,n(M ; a, b) (= N
q,n(M ) = N (M )) to be the number of elements ξ of F
∗qn, scaled (multiplied) by a factor q
2(for convenience), such that T
n(ξ) = a, T
n(1/ξ) = b, and ξ is not any kind of M th power in F
qn. To establish Theorem 1.1 in respect of q, n, a, b, it is necessary to show that N
q,n(M ; a, b) for general divisors M of q
n− 1.
Note, in particular, that the value of N (M ) depends only on the distinct prime divisors of M , i.e., on the square-free part of M .
Next, we lay down the basic material on characters. It will be amplified later. Let χ be the canonical additive character of F
q. Thus, for x ∈ F
q, χ(x) = exp(2πiT (x)/p), where here T denotes the absolute trace (from F
qto F
p). Moreover, every additive character b χ of F
qis such that b χ(x) = χ(cx) (x ∈ F
q) for some c ∈ F
q; take c = 0 to obtain the trivial character χ
0. Further, let χ
0= χ(T
n) denote the lift of χ to F
qn. Passing to multiplicative characters of F
qn, we shall reserve the symbol ψ for such; more precisely, for any divisor d of q
n− 1, ψ
dwill denote a typical character of F
qnof exact order d. Thus, ψ
1is the trivial character.
Now, for any α, β ∈ F
qnand any multiplicative character ψ, we define the generalised Kloosterman sum K
n(α, β; ψ) (= K
q,n(α, β; ψ)) by
K
n(α, β; ψ) = X
ξ∈F∗qn
χ
0(αξ + βξ
−1)ψ(ξ).
In particular, we write K
n(α, β) for K
n(α, β; ψ
1), the (standard) Klooster- man sum.
As a final preliminary to the basic formula, we describe some further notation. In a sum P
u
(or double sum P
u,v
), the variable(s) will be assumed to run over all members of the ground field F
q. If u runs over F
∗qwe will write P
u6=0
, etc. For any divisor M of q
n− 1, abbreviate to T
d|M
a weighted
sum of the form P
d|M
(µ(d)/φ(d)) P
(d)
ψ
d, where µ is the M¨obius function, and the inner sum ranges over all φ(d) characters of order d. Further for any positive integer h, set
Θ(h) = φ(h)/h = Y
l|h, l prime
(1 − l
−1).
Moreover, a bar over a symbol signifies complex conjugation.
Proposition 2.1. Let q be a prime power and n a positive integer. Sup- pose that elements a, b of F
qare given. Then, for any divisor M of q
n− 1, we have
(2.1) N
q,n(M ; a, b) = Θ(M ) X
u,v
χ(au + bv) \
d|M
K
n(u, v; ψ
d).
P r o o f. The characteristic function for the subset of F
∗qncomprising elements that are not any kind of M th power can be expressed as an ex- tension of the Vinogradov formula, [Ju], Lemma 7.5.3. It takes the form Θ(M ) T
d|M
ψ
d(ξ). Moreover, that for the subset of F
qncomprising elements with T
n(ξ) = a is (1/q) P
c
χ(c(T
n(ξ) − a)). Hence, taking account of the scaling factor q
2, we have
N
q,n(M ; a, b) = X
ξ∈F∗qn
Θ(M ) X
u,v
χ(u(T
n(ξ) − a))χ(v(T
n(ξ
−1) − b)) \
d|M
ψ
d(ξ)
= Θ(M ) X
u,v
χ(au + bv) \
d|M
χ(T
n(uξ + vξ
−1))ψ
d(ξ), and the result follows from the definition of the Kloosterman sum.
From Proposition 2.1, N (M ) can immediately be estimated using the standard bound for Kloosterman sums that follows.
Lemma 2.2. Let ψ be a multiplicative character of F
qn. Then K
n(0, 0; ψ) =
n q
n− 1 if ψ = ψ
1, 0 otherwise.
Further , if either ψ 6= ψ
1or α, β ∈ F
qnare not both zero, then
|K
n(α, β; ψ)| ≤ 2q
n/2.
Nevertheless, before applying Lemma 2.2, it is profitable to develop the formula (2.1). This is the aim in the next sequence of lemmas. They also involve Gauss sums, since some Kloosterman sums reduce to these. The Gauss sum G
n(ψ) is defined by
G
n(ψ) = X
ξ∈F∗qn
χ
0(ξ)ψ(ξ).
Lemma 2.3. (i) If α (6= 0), β ∈ F
qn, then
K
n(α, β; ψ) = ψ(α)K
n(1, αβ; ψ).
(ii) If β 6= 0, then K
n(0, β; ψ) = ψ(β)G
n(ψ).
(iii) If α 6= 0, then K
n(α, 0; ψ) = ψ(α)G
n(ψ).
Lemma 2.4. (i) G
n(ψ
1) = −1.
(ii) If ψ 6= ψ
1, then |G
n(ψ)| = q
n/2.
(iii) If q is odd and n = 1, then G
21(ψ
2) = (−1)
(q−1)/2q.
(iv) If q is odd and n is even, then G
n(ψ
2) = −(−1)
n(q−1)/4q
n/2. (v) If q and n are odd and λ
2is the quadratic character on F
∗q, then
G
1(λ
2)G
n(ψ
2) = (−1)
(n−1)(q−1)/4q
(n+1)/2.
Note that Lemma 2.4 contains information relating to the specific char- acters ψ
1and (when q is odd) ψ
2, the quadratic character. Similarly, we can specialise Kloosterman sums to these cases, though the results are more complicated. First, when ψ = ψ
1and u, v are in the ground field F
q, then K
n(u, v) can be expressed in terms of Kloosterman sums over F
q. For t ∈ F
∗q, set k
t:= K
1(1, t). Further, let D
n(X, c) be the Dickson polynomial of the first kind of degree n. Thus, D
n(X + c/X, c) = X
n+ c
n/X
n.
Lemma 2.5. Suppose t ∈ F
q∗. Then
K
n(1, t) = (−1)
n−1D
n(k
t, q).
Define δ
q,n:=
1 q − 1
X
t6=0
K
n(1, t)
q
n/2, δ
q,n∗:=
1
q − 1 X
t6=0
|K
n(1, t)|
q
n/2. Then, from Lemma 2.2, it follows that
(2.2) |δ
q,n| ≤ δ
q,n∗≤ 2.
The next result indicates that the numbers k
tare essentially uniformly distributed.
Lemma 2.6. For a given prime power q, P
t6=0
k
t= 1.
P r o o f.
X
t6=0
k
t= X
t,u6=0
χ
u + t
u
= χ(0) = 1,
since, evidently, for any c ∈ F
q, the equation u + t/u = c (i.e., t = u(c − u)) has q − 2 solution pairs (t, u) ∈ (F
∗q)
2, unless c = 0, when there are q − 1 solutions.
When q is odd, Kloosterman sums with ψ = ψ
2(the quadratic character)
are apparently easier to evaluate than standard sums. The following two
results derive, in essence, from Lemma 3.5 and Corollary 3.6 of [ChCo]. For clarity, for any divisor e of q − 1, we denote a multiplicative character of order e of F
qby λ
e.
Lemma 2.7. Let q be an odd prime power. If n is odd and t ∈ F
∗qis such that λ
2(t) = −1, then K
n(1, t; ψ
2) = 0. If λ
2(t) = 1 with s
2= t (s ∈ F
q) and p - n, then
K
n(1, t; ψ
2) = (χ(2ns) + χ(−2ns))G
n(ψ
2).
Otherwise, if n is odd, p | n, and λ
2(t) = 1, or n is even and either p | n or λ
2(t) = −1, then
K
n(1, t; ψ
2) = 2G
n(ψ
2).
Lemma 2.8. Let q be an odd prime power and n be an even integer. Then X
t6=0
K
n(1, t; ψ
2) = −(−1)
n(q−1)/4(εq − 2)q
n/2, where
ε =
2 if p | n, 1 if p - n.
Given q, n, define
m = m(q, n) := q
n− 1 q − 1 .
The significance of this number is that there are occasions when it is useful to distinguish characters ψ
dfor which d | m. This arises as follows. For any divi- sor d of q
n−1, the restriction to F
∗qof a character ψ
d(of F
∗qn) will be denoted by b ψ
d. Then, in fact, b ψ
d= λ
d∗, where d
∗= (q − 1)/gcd((q
n− 1)/d, q − 1).
In particular, b ψ
d= λ
1if and only if d | m. Moreover, if q is odd, then
(2.3) ψ b
d=
λ
2if n is odd, λ
1if n is even.
To present refinements of Proposition 2.1, we modify the notation T
d|M
used in its statement. First, for a divisor M
0of M , write T
d|M, d
-
M0to indicate a similar sum that excludes terms corresponding to divisors d of M
0. This is modified further to T
d|M, d
-
M0, 6=2to signify that the terms with d = 2 are excluded (if there remain any). This makes a difference only if q is odd, M is even, and M
0is odd. A frequent choice of M
0is M
∗:= gcd(M, m).
We come to the main theorems of this section; their statements depend
heavily on declared notation. Because the formulae have a different shape
according to whether a, b are zero or not, it is from this point on we insist
that a 6= 0. In the first case, we also suppose that b 6= 0.
Theorem 2.9. Let q be a prime power and n (≥ 4) an integer. Suppose that a, b ∈ F
∗q. Then, for any divisor M of q
n− 1, we have
N
q,n(M ; a, b) = Θ(M ) n
q
n+ 1 + ∆
2+ (−1)
n−1X
t6=0
k
tabD
n(k
t, q) (2.4)
+ \
d|M d
-
2X
t6=0
K
1(1, tab; b ψ
d)K
n(1, t; ψ
d) − 2 \
d|M∗ d
-
2G
n(ψ
d)
+ \
d|M d
-
M∗, 6=2( b ψ
d(a) + b ψ
d(b))G
1( b ψ
d)G
n(ψ
d) o
,
where ∆
2= 0, unless q is odd and M is even. In the latter event, if n is odd,
∆
2= (−1)
(n−1)(q−1)/4γq
(n+1)/2, where
γ =
4 if λ
2(a) = λ
2(b) = −1, but ab 6= n
2, 2(1 − λ
2(a)) − q if ab = n
2,
0 otherwise,
and, if n is even, ∆
2= (−1)
n(q−1)/4γq
n/2, where γ =
(−1)
(q+1)/2+
12q −
12if ab = n
2,
(−1)
(q+1)/2λ
2(ab) −
12λ
2(ab − n
2)
q −
12otherwise.
P r o o f. We consider the terms on the right hand side of (2.1) (taking for granted the factor Θ(M )). First, by Lemma 2.2, the contribution of the terms with u = v = 0 is exactly q
n− 1. Next, by Lemma 2.3(i), the contribution of the terms with uv 6= 0, on replacement of uv by t, becomes
X
t6=0
X
u6=0
χ
au + bt u
ψ b
d(u)K
n(1, t; ψ
d).
This yields the first two sums on the right hand side of (2.4) (with T
16=d|M
rather than T
d|M , d
-
2) on replacing au by u and separating, with the aid of Lemma 2.5, the contribution arising from ψ
1.
Next, by Lemma 2.3(ii), the contribution of the terms of (2.1) with u = 0,
v 6= 0, is \
d|M
n X
v6=0
χ(bv) b ψ
d(v) o
G
n(ψ
d).
On interchanging ψ
dwith its conjugate and replacing bv by v, we obtain
\
d|M
ψ b
d(b)G
1( b ψ
d)G
N(ψ
d) = 1 − \
16=d|M∗
G
n(ψ
d) + \
d|M d
-
M∗ψ b
d(b)G
1(ψ
d)G
n(ψ
d),
by Lemma 2.4(i). There is a similar contribution from the terms with u 6= 0, v = 0.
As a consequence of the above, it suffices to assume that q is odd and M is even, and prove that the net contribution of the terms in (2.1) corresponding to ψ
2is the designated expression for ∆
2.
To this end, suppose first that n is odd; thus M
∗is odd. Observing that the weighting factor µ(2)/φ(2) = −1, from (2.4) we have
∆
2= −λ
2(a)(λ
2(ab) + 1)G
1(λ
2)G
n(ψ
2) − X
t6=0
K
1(1, tab; λ
2)K
n(1, t; ψ
2).
Now, for t ∈ F
∗q, λ
2(a) = 1 if and only if ψ
2(a) = 1. Consequently, from Lemma 2.7, the product K
1(1, tab; λ
2)K
n(1, t; ψ
2) is zero unless λ
2(t) = λ
2(tab) = 1. Evidently, therefore, ∆
2= 0, unless λ
2(ab) = 1. Moreover, if λ
2(ab) = 1 and c
2= ab, c ∈ F
∗q, then, by Lemma 2.7 again, we have
X
t6=0
K
1(1, tab; λ
2)K
n(1, t; ψ
2)
= G
1(λ
2)G
n(ψ
2) X
s6=0
χ(2ns)(χ(2sc) + χ(−2sc))
= G
1(λ
2)G
n(ψ
2) X
s6=0
{χ(2s(n + c)) + χ(2s(n − c))}.
The sum in this expression is q − 2 if c = ±n (i.e., ab = n
2). Otherwise, it is −2. The result for n odd now follows from Lemma 2.4(v).
Finally, suppose n is even, so that M
∗is even. Then, by (2.3), b ψ
2= λ
1, and
∆
2= 2G
n(ψ
2) − X
t6=0
K
1(1, tab)K
n(1, t; ψ
2) (2.5)
= G
n(ψ
2)
2 − X
t6=0
(1 − λ
2(t))K
1(1, tab) − X
s6=0
χ(2ns)K
1(1, s
2ab)
= G
n(ψ
2)
1 + X
t6=0
λ
2(t)K
1(1, tab) − X
s6=0
χ(2ns)K
1(1, s
2ab)
, by Lemmas 2.7 and 2.6. Now, in (2.5), we have
X
t6=0
λ
2(t)K
1(1, tab) = X
t,u6=0
χ
u + tab u
λ
2(t) = X
u6=0
χ(u) X
t6=0
χ
abt u
λ
2(t)
= λ
2(ab) X
u6=0
χ(u)λ
2(u)G
1(λ
2)
= λ
2(ab)G
21(λ
2) = (−1)
(q−1)/2λ
2(ab)q,
by Lemma 2.4(iii). Also, in (2.5), we have S := X
s6=0
χ(2ns)K
1(1, s
2ab) = 1 + X
s
X
u6=0
χ
u + s
2ab u + 2ns
= 1 + X
u6=0
χ
u
1 − n
2ab
X
s
χ
ab u
s − nu
ab
2= 1 + X
u6=0
χ
u
1 − n
2ab
1 2
X
v6=0
χ
ab u v
(1 + λ
2(v))
+ 1
.
Now, if ab = n
2, it follows that S = 1+q−1+ 1
2 X
v6=0
(1+λ
2(v)) X
u6=0
χ(abvu) = q− 1 2
X
v6=0
(1+λ
2(v)) = 1 2 (q+1).
Hence, in this case,
∆
2= G
n(ψ
2)
1
2 + (−1)
(q−1)/2q − 1 2 q
,
as required (by Lemma 2.4(iv)). On the other hand, if ab 6= n
2, then S = 1
2 X
u6=0
χ
u
1 − n
2ab
−1 + X
v6=0
χ
ab u v
λ
2(v)
= 1 2
1 + G
1(λ
2) X
u6=0
χ
u
1 − n
2ab
λ
2(abu)
= 1
2 (1 + λ
2(ab − n
2)G
21(λ
2)) = 1
2 (1 + (−1)
(q−1)/2λ
2(ab − n
2)q).
Again, by Lemma 2.4(iv), this yields the stated expression for ∆
2. Here now is the corresponding result with b = 0.
Theorem 2.10. Let q be a prime power and n (≥ 4) an integer. Suppose that a ∈ F
∗q. Then, for any divisor M of q
n− 1, we have
N
q,n(M ; a, 0) = Θ(M ) n
q
n− q + 1 + ∆
2+ (−1)
nX
t6=0
D
n(k
t, q) (2.6)
+ \
d|M∗ d
-
2h
(q − 2)G
n(ψ
d) − X
t6=0
K
n(1, t; ψ
d) i
+ \
d|M d
-
M∗, 6=2ψ
d(a)G
1( b ψ
d) h
G
n(ψ
d) + X
t6=0
K
n(1, t; ψ
d) io
,
where ∆
2= 0, unless q is odd, M is even and p | n. In the latter event, we have
∆
2=
−(−1)
n(q−1)/4q
n/2+1if n is even,
−λ
2(a)(−1)
(n−1)(q−1)/4q
(n+3)/2if n is odd.
P r o o f. Again the contribution of the terms in (2.1) with u = v = 0 is q
n− 1. That from the terms with u = 0, v 6= 0 is
\
d|M
G
n(ψ
d) X
v6=0
ψ b
d(v) = −(q − 1) + (q + 1) \
16=d|M∗
G
n(ψ
d),
by Lemmas 2.3 and 2.4, plus the fact that b ψ
dis non-trivial if and only if d - M
∗. The contribution from the terms with u 6= 0, v = 0 is
\
d|M
G
n(ψ
d) X
u6=0
χ(au) b ψ
d(u) = 1− \
16=d|M∗
G
n(ψ
d)+ \
d|M d|M∗
ψ
d(a)G
1( b ψ
d)G
n(ψ
d).
This, again, has used the fact that b ψ
dis trivial whenever d | M
∗. The con- tribution of the terms with uv 6= 0 is
\
d|M
X
t,u6=0
χ(au) b ψ
d(u)K
n(1, t; ψ
d)
= − \
d|M∗
K
n(1, t; ψ
d) + \
d|M d
-
M∗ψ
d(a)G
1( b ψ
d)K
n(1, t; ψ
d).
This yields the remaining terms on the right-hand side of (2.6), apart from the evaluation of the terms arising from ψ
2.
Indeed, we now describe the contribution of the terms involving ψ
2(when q is odd and M is even). If n is even, then M
∗is even and
∆
2= −(q − 2)G
n(ψ
2) + X
t6=0
K
n(1, t; ψ
2)
=
−(−1)
n(q−1)/4q
n/2+1if p | n,
0 otherwise,
by Lemma 2.8. On the other hand, if n is odd and so M
∗is odd, then
∆
2= −λ
2(a)G
1(λ
2)
G
n(ψ
2) + X
t6=0
K
n(1, t; ψ
d)
= −λ
2(a)G
1(λ
2)
G
n(ψ
2) + X
s6=0
χ(2ns)G
n(ψ
d)
, by Lemma 2.7. Thus, λ
2= 0, unless p | n, in which event,
∆
2= −qλ
2(a)G
1(λ
2)G
n(ψ
2) = −λ
2(a)(−1)
(n−1)(q−1)/4q
(n+3)/2,
by Lemma 2.4(v). Hence, everything is proved.
3. Inequalities. The sums in the identities of Theorems 2.9 and 2.10 can obviously be estimated by means of the fundamental bounds for Kloost- erman sums and Gauss sums stated in Lemmas 2.2 and 2.4(ii). Thus, for example, with regard to the main error terms in Theorem 2.9, we have
X
t6=0
K
1(1, tab; b ψ
d)K
n(1, t; ψ
d)
≤ 4(q − 1)q
(n+1)/2, d > 1.
Crucially, we are able to halve this estimate; it then becomes similar to the corresponding estimate for the main error terms in Theorem 2.10, namely
G
1( b ψ
d) X
t6=0
K
n(1, t; ψ
d)
≤ 2(t − 1)q
(n+1)/2, d > 1.
Lemma 3.1. For any multiplicative character λ of F
∗q, we have X
t6=0
|K
1(1, t; λ)|
2=
q(q − 1) − 1 if λ = λ
1, q(q − 2) if λ 6= λ
1. P r o o f.
X
t6=0
|K
1(1, t; λ)|
2= X
t,u,v6=0
χ
u + t
u − v − t v
λ
u v
= (q − 1)
2+ X
t6=u
X
u,v6=0 u6=v
χ
u − v − t(u − v) uv
λ
u v
= (q − 1)
2+ X
t6=u
X
y6=0 z6=0,1
χ
y − t(z − 1)
2yz
λ(z)
= (q − 1)
2+ X
y6=0 z6=0,1
λ(z) X
t6=0
χ
y − t(z − 1)
2yz
= (q − 1)
2− X
z6=0,1
λ(z) X
y6=0
χ(y) = (q − 1)
2+ X
z6=0,1
λ(z)
=
(q − 1)
2+ (q − 2) if λ = λ
1, (q − 1)
2− 1 if λ 6= λ
1, and the result follows.
Corollary 3.2. For λ as in Lemma 3.1, X
t6=0
|K
1(1, t; λ)| < (q − 1) √
q.
Hence, if d (> 1) is a divisor of q
n− 1 and ab ∈ F
∗q, then
X
t6=0
K
1(1, tab; b ψ
d)K
n(1, t; ψ
d)
< 2(q − 1)q
(n+1)/2. P r o o f. By Cauchy’s inequality and Lemma 3.1,
X
t6=0
|K
1(1, t; λ)| ≤ p
q − 1 X
t6=0
|K
1(1, t; λ)|
2 1/2< p
q − 1 p
q(q − 1)
= (q − 1) √ q.
Granted Lemma 2.2, the other inequality is immediate.
Taking λ = λ
1in Corollary 3.2, we see that δ
∗q:= δ
q,1∗(see Section 2) satisfies δ
∗q< 1 (cf. (2.2)). Indeed, for a range of prime values of q tested (with 7 < q < 200), δ
∗qlay within the interval (0.82, 0.89).
Moreover, we can effectively improve further on Corollary 3.2 under the following circumstances: q, n odd, d even, d | 2m (so that b ψ
d= λ
2).
Lemma 3.3. Let q be an odd prime power and χ be the canonical additive character on F
q. Then
X
s
|χ(s) + 1| = 2q p cosec
π 2p
. P r o o f. Suppose q = p, an odd prime. Then
X
s
|χ(s) + 1| = X
s
2|cos(πs/p)| = 2
1 + 2
(p−1)/2
X
s=0
cos(πs/p)
= 2 cosec(π/(2p)).
This implies the result for a prime power q, since the elements of F
qare uniformly distributed with respect to absolute trace.
Lemma 3.4. Let q be an odd prime power. Define κ
qby
κ
q=
4
π if q = p or p
2, 4
π
1 + 2
p
2otherwise.
Then X
s6=0
|χ(s) + 1| < κ
q(q − 1).
P r o o f. Using Lemma 3.3 and setting x = π/(2p), we obtain X
s6=0
|χ(s) + 1| = 2
q
p cosec x − 1
< 2
q
p(x − x
3/6)
− 1 < 4
π (q − 1),
provided
q p
2< 12
π
1 − 2
π
1 − x
26
.
Because x = π/(2p) ≤ π/6, the right side of this inequality exceeds 1.3, and so the inequality holds whenever q = p or p
2. The adjustment shown suffices for general values of q.
Lemma 3.5. For any α ∈ F
∗qnand multiplicative character ψ, K
n(1, α; ψ) = ψ(α)K
n(1, α; ψ).
P r o o f. Replace ξ by α/ξ in the definition of K
n(1, α; ψ).
The climax of the preceding few lemmas comes next. In it, for a character ψ, set
S(ψ) := X
t6=0
K
1(1, t; b ψ)K
n(1, t; ψ).
Lemma 3.6. Let q be an odd prime power and n an odd integer. Suppose that a, b ∈ F
∗qand d is an even divisor of 2m. If λ
2(ab) = −1, then S(ψ
d) + S(ψ
d) = 0, whereas if λ
2(ab) = 1, then
|S(ψ
d) + S(ψ
d)| < 2κ
q(q − 1)q
(n+1)/2,
where κ
qis as defined in Lemma 3.4; in particular , κ
q= 4/π whenever q = p or p
2.
P r o o f. By Lemma 3.5, and the fact that b ψ
d= λ
2, S(ψ
d) + S(ψ
d) = X
t6=0
(1 + λ
2(t))K
1(1, tab; λ
2)K
n(1, t; ψ
d).
If λ
2(ab) = −1, then either λ
2(t) = −1 or λ
2(tab) = −1 whenever t ∈ F
∗q. Hence, the above expression is zero by Lemma 2.7. If λ
2(ab) = 1, with c
2= ab (c ∈ F
∗q), then, again by Lemma 2.7,
|S(ψ
d) + S(ψ
d)| = 2|G
1(λ
2)|
X
s6=0
χ(2cs)K
n(1, s
2; ψ
d)
= √ q
X
s6=0
(χ(2cs) + χ(−2cs))K
n(1, s
2; ψ
d)
≤ 2q
(n+1)/2X
s6=0
|χ(2cs) + χ(−2cs)|, by Lemma 2.2. The result follows from Lemma 3.4, since
X
s6=0
|χ(2cs) + χ(−2cs)| = X
s6=0
|χ(s/2) + χ(−s/2)| = X
s6=0
|χ(s) + 1|.
Lemma 3.5 can be exploited for other characters χ, but not with such generality. The consequences tend to depend on the order of ab more specif- ically.
The remaining inequality is of a different nature and derives from a sieving process. The aim is to obtain a lower bound for N
q,n(M ) that may not be good asymptotically (as q → ∞) but is especially effective for small values of q.
As noted in Section 2, given q, n, a, b, the value of N (M ) (M is a divisor of q
n−1) depends only on the distinct prime factors of M . Accordingly, divisors M
1, . . . , M
r(r ≥ 1) of M will be called complementary divisors of M with common divisor M
0if the set of distinct prime divisors of lcm{M
1, . . . , M
r} is the same as that of M , and, for any pair (i, j) with 1 ≤ i 6= j ≤ r, the set of distinct prime divisors of gcd(M
i, M
j) is that of M
0. When r = 1, we have M
1= M
0= M . Though easy to prove (see [ChCo], Proposition 6.1), the following inequality is extremely useful.
Proposition 3.7. Let q be a prime power and n (≥ 1) an integer. Sup- pose that a, b ∈ F
q. Let M
1, . . . , M
rbe complementary divisors of M (it- self a divisor of q
n− 1) with common divisor M
0. Then, with N (M ) = N
q,n(M ; a, b), etc., we have
N (M ) ≥ n X
ri=1
N (M
i) o
− (r − 1)N (M
0).
Proposition 3.7 motivates a generalisation of the ratio Θ(M ) defined in Section 2. With the same notation, define
Θ(M
1, . . . , M
r) :=
n X
ri=1
Θ(M
i) o
− (r − 1)Θ(M
0).
This number represents the coefficient of q
non the right hand side of the inequality when Theorems 2.9 or 2.10 are applied to each of the constituents.
To be useful, therefore, it is vital that Θ(M
1, . . . , M
r) be positive. Indeed, preferably the ratio b Θ(M
1, . . . , M
r) := Θ(M
1, . . . , M
r)/Θ(M
0) should not be too small. For this reason, when q is odd we suppose (except in one place) that M
0is even, because of the effect of the prime 2 otherwise. Consequently, we say that the complementary divisors are regular if qM
0is even.
4. Criteria for success. These criteria are effective only if n ≥ 4, which we assume from now on. Again, assume also that the prime power q and a (6= 0), b ∈ F
qare given. Also, from now on, fix M as q
n− 1; M
1, . . . , M
rwill be regular complementary divisors of M with common divisor M
0. We
supplement the previously defined integer m = (q
n− 1)/(q − 1) with the
definitions m
i:= gcd(M
i, m), i = 0, 1, . . . , r. For any positive integer h, let
W (h) = 2
ω(h)be the number of square-free divisors of h (where ω(h) is the number of distinct prime factors of h).
We derive criteria for N
q,n(M ; a, b) to be positive. First, we suppose b 6= 0.
Proposition 4.1. Let q be a prime power and n (≥ 4) be an integer.
Suppose that a, b ∈ F
q. Let M
1, . . . , M
rbe regular complementary divisors of M with common divisor M
0. Assume that Θ := Θ(M
1, . . . , M
r) is posi- tive. If the following condition (labelled A
q,n(M
1, . . . , M
r)) is satisfied, then N
q,n(M ; a, b) is positive:
(4.1) q
(n−3)/2> 2
W (M
0) − W (m
0) + δ
∗q1 − 1
q
(W (m
0) − η
1) + 1
q
3/2(W (m
0) − 1)
+ η
13 2 √
q − η
22 − κ
q1 − 1
q
(W (m
0) − 1) + 1
+ Θ
−1X
ri=1
Θ(M
i)
2
W (M
i) − W (M
0) − W (m
i) + W (m
0)
+
δ
∗q1 − 1
q
+ 1
q
3/2(W (m
i) − W (m
0))
− η
22 − κ
q1 − 1
q
(W (m
i) − W (m
0))
, where κ
q(∼ 4/π) and δ
q∗(< 1) are as in Section 3, and
η
1=
1 if q is odd and n is even,
0 otherwise; η
2=
n 1 if q and n are odd, 0 otherwise.
P r o o f. By Proposition 3.7 and Theorem 2.9, N (M ) = ΘN (M
0) +
X
r i=1Θ(M
i) n \
d|Mi
d
-
M0X
t6=0
K
1K
n− 2 \
d|mi
d
-
m0G
n(4.2)
+ \
d|Mi d
-
M0( b ψ
d(a) + b ψ
d(b))G
1G
no
,
where we have employed an abbreviated notation based on (2.4). Further, N (M
0) itself is given by (2.4) with M = M
0, M
∗= m
0. In particular, since the complementary divisors are regular, all contributions relating to ψ
2are counted within N (M
0). Now, for each of the φ(d) characters ψ
d, we have an absolute bound of 2q
(n+3)/2for P
t6=0
K
1K
n+ ( b ψ
d(a) + b ψ
d(b))G
1G
nin (4.2), by Corollary 3.2 and Lemma 2.4(iii). Indeed, for those d that are divisors of m
i, but not m
0(so that b ψ
dis trivial), we have the improve- ment that the factor 2 may be replaced by 2(1 − q
−1)δ
q∗, and we have a further contribution (bounded absolutely by 2q
n/2) from the minor term
−2 T
d|mi, d
-
m0G
n.
More significantly, when q and n are odd (so that η
2= 1) and d is an even divisor of 2m
i, we can, on average (when ψ
dis paired with ψ
d), replace the factor 2 by κ
q(1 − q
−1), by Lemma 3.6. Because each d in T
d|Mi, d
-
M0, say, appears with a weighting factor µ(d)/φ(d) (so that only square-free d matter, and each carries an absolute weight 1/φ(d)), and because the main term in ΘN (M
0) is Θq
n, we deduce that (4.1) is correct insofar as the terms under the sum P
ri=1
are concerned.
The estimate for the remaining terms of N (M
0) is similar. The only modifications are as follows. There is no contribution from d = 1 to
−2 T
d|M0, d
-
2G
n. Also, when q is odd, then M
0is even and we need to adjust the terms with ψ
2using the expression for ∆
2in Theorem 2.9. Thus, if n (and so m
0) are even, then η
1= 1 and |∆
2| <
32q
n/2+1, whereas, if n (and so m
0) are odd, the contribution in the worst case (when ab = n
2) is halved to q
(n+1)/2. This completes the proof.
When b = 0, we analogously employ Theorem 2.10.
Proposition 4.2. Let q be a prime power and n (≥ 4) an integer. Sup- pose that a ∈ F
∗q. Let M
1, . . . , M
rbe regular complementary divisors of M with common divisor M
0. Assume that Θ is positive. If the following condi- tion (labelled B
q,n(M
1, . . . , M
r)) is satisfied, then N
q,n(M ; a, 0) is positive:
(4.3) q
(n−3)/2− q − 1 q
(n+3)/2>
2 − 1
q
(W (M
0) − W (m
0) − η
2) + (−1)
(n−1)(q−1)/4λ
2(a)ε
2+ 1
√ q
(W (m
0) − 1 − η
1) +
1 − 1
q
δ
q,n+ (−1)
n(q−1)/4ε
1+ Θ
−1X
r i=1Θ(M
i)
2 − 1 q
(W (M
i) − W (M
0) − W (m
i) + W (m
0))
+ 1
√ q
3 − 4
q
(W (m
i) − W (m
0))
where η
1and η
2are as in Proposition 4.1, δ
q,n(with absolute value at most
2) is as defined in Section 2, and ε
1and ε
2are zero, unless q is odd and
p | n, in which case, ε
1=
1 if n is even,
0 otherwise; ε
2=
n 1 if n is odd, 0 otherwise.
P r o o f. This is similar to Proposition 4.1. But note the extra (mi- nor) term on the left hand side of (4.3) springing from terms of the shape
−Θ(M
i)(q −1) on the right side of (2.6). Further, for d a divisor of M
i(say), but neither of m
inor M
0, the contribution from b ψ
d(a)G
1(G
n+ P
t6=0
K
n) is (1 + 2(q − 1))q
(n+1)/2= (2 − q
−1)q
(n+3)/2, and, for d a divisor of m
i, but not m
0, that of (q −2)G
n− P
t6=0
K
nis ((q −2)+2(q −1))q
n/2= (3−4/q)q
n/2+1. In this way, the result is proved.
We shall sometimes abbreviate A
q,n(M
1, . . . , M
r), say, to A
q,n. We also use R
q,n(M
1, . . . , M
r) (possibly abbreviated) and L
q,nto denote the right and left sides of (4.1) or (4.3) as appropriate to the context. Note that, even for B
q,n, L
q,nis essentially q
(n−3)/2as the other term is generally negligible.
Although these conditions seem complicated, for larger values of q and n, it suffices to use the coarser estimate obtained by selecting only the terms involving M
0, M
1, . . . , M
r. Note finally that, because the terms in respect of divisors of m are generally diminished in B
q,nby a factor of order √
q, the condition A
q,nis essentially more stringent than B
q,nand therefore, as a rule, B
q,nholds whenever A
q,ndoes.
5. Theorem 1.1 for “almost all” pairs (q, n). In this section we shall take r = 1 and show that A
q,n(M ) and B
q,n(M ) hold for all but finitely many pairs (q, n). Nevertheless, in interpreting this conclusion, caution must be exercised because the number of potential exceptions is huge. Hence, properly understood, this is merely the first stage in the application of the theory. Note that, in A
q,n(M ), say, all the “Θ-terms” are absent.
We begin with a weak, but convenient, lemma to bound the function W . Note that 2 · 3 · 5 · 7 · 11 · 13 = 30030.
Lemma 5.1. Set γ := 64/30030
1/4< 4.9. Suppose that h is an integer indivisible by a prime p. Then
W (h) ≤ γ
ph
1/4, where γ
p=
12