LXXVI.2 (1996)
Equations in roots of unity
by
Hans Peter Schlickewei (Ulm)
1. Introduction. Suppose that n ≥ 1 and that a
1, . . . , a
nare nonzero complex numbers. We study equations
(1.1) a
1ξ
1+ . . . + a
nξ
n= 1
to be solved in roots of unity ξ
i. We call a solution ξ = (ξ
1, . . . , ξ
n) of (1.1) nondegenerate if P
i∈I
a
iξ
i6= 0 for each nonempty subset I of {1, . . . , n}.
Write ν(a
1, . . . , a
n) for the number of nondegenerate solutions ξ of (1.1), whose components are roots of unity. Equations (1.1) have been first studied by H. B. Mann [2]. His result implies that for a
1, . . . , a
n∈ Q we have
ν(a
1, . . . , a
n) ≤ e
c1n2,
where c
1is a positive absolute constant. This was improved by J. H. Conway and A. J. Jones [1]. They proved that for a
1, . . . , a
n∈ Q we get
ν(a
1, . . . , a
n) = O
c(exp(cn
3/2(log n)
1/2))
for any c > 1. In the case when a
1, . . . , a
nlie in a number field K of degree d, A. Schinzel [4] has shown that
ν(a
1, . . . , a
n) ≤ c
2(n, d)
for some function c
2, which depends only upon n and d. U. Zannier [7] has improved upon Schinzel’s result and also determined c
2explicitly.
In the current paper we address the problem to derive a bound for ν(a
1, . . . , a
n) for arbitrary complex numbers a
i. We prove that in fact
ν(a
1, . . . , a
n) ≤ c
3(n),
i.e., we derive a uniform bound that depends only upon n.
Write E for the group of roots of unity. Given a point w = (w
1, . . . , w
n) in E
nand a natural number m we write A(w, m) for the set of points v = (v
1, . . . , v
n) ∈ E
nwith components of the shape v
i= η
iw
i, where η
iis an mth root of unity. We prove
[99]
Theorem. There exist points w
1, . . . , w
t∈ E
nwith
(1.2) t ≤ 2
2(n+1)!and there exist prime numbers
p
1< p
2< . . . < p
s≤ (n + 1)!
with the following property: Any nondegenerate solution ξ = (ξ
1, . . . , ξ
n) of (1.1) in roots of unity is contained in the union
[
t τ =1A(w
τ, p
1. . . p
s).
Moreover , we have for any nonzero complex numbers a
1, . . . , a
n(1.3) ν(a
1, . . . , a
n) ≤ 2
4(n+1)!.
We remark at this point that the results of [1], [2], [4] and [7] for rational or algebraic coefficients a
irespectively are constructive, i.e., in principle the proofs provide us with algorithms to determine explicitly the solutions ξ of (1.1). This is no longer the case with our Theorem. In fact, it is not clear how our method of proof could give us an effective procedure to construct the points w
1, . . . , w
t.
In a subsequent paper [5] we will apply our result to estimate the number of solutions of linear equations over division groups of finitely generated subgroups G of the multiplicative group C
∗. In that wider setting the current Theorem establishes the result of [5] for groups G of rank 0.
2. Rational coefficients. Suppose m ≥ 2. Let b
1, . . . , b
mbe nonzero integers and consider the relation
(2.1)
X
m i=1b
iξ
i= 0,
where the ξ
iare roots of unity. We say that a solution ξ = (ξ
1, . . . , ξ
m) of (2.1) is nondegenerate if for each nonempty proper subset I of {1, . . . , m}
we have
(2.2) X
i∈I
b
iξ
i6= 0.
Lemma 2.1. There exist distinct primes p
1, . . . , p
u≤ m such that any nondegenerate solution ξ = (ξ
1, . . . , ξ
m) of (2.1) in roots of unity is of the shape
(2.3) ξ
i= ξη
i,
where the η
iare p
1. . . p
u-th roots of unity (and where ξ is a suitable root of
unity).
This is Theorem 1 of H. B. Mann [2].
Lemma 2.2. Let the hypotheses be the same as in Lemma 2.1. Then the primes p
1, . . . , p
usatisfy
(2.4)
X
u i=1(p
i− 2) ≤ m − 2.
This is Theorem 5 of Conway and Jones [1].
3. Multilinear maps. Let C
Nbe the vector space of N -tuples (x
1, . . . , x
N). Let S
Nbe the group of permutations of {1, . . . , N }, so that
|S
N| = N !. Let V be the space of vectors with components z
σ(σ ∈ S
N), so that dim V = N !. We introduce the map from C
N× . . . × C
N(with N factors) into V given by
(x
1, . . . , x
N) 7→ z(x
1, . . . , x
N).
Here z has components
(3.1) z
σ= (sign σ)x
1,σ(1). . . x
N,σ(N ),
where x
i= (x
i1, . . . , x
iN) and sign σ = ±1 according as σ is an even or odd permutation. It is clear that z is linear in each x
iand we have
(3.2) X
σ∈SN
z
σ(x
1, . . . , x
N) = det(x
1, . . . , x
N).
Suppose we are given a hyperplane U of C
Ndefined by the equation (3.3)
X
N i=1c
ix
i= 0.
Assume that the coefficients in (3.3) satisfy
(3.4) c
1. . . c
N6= 0.
Given vectors x
1, . . . , x
N∈ U , we obviously get
(3.5) X
σ∈SN
z
σ(x
1, . . . , x
N) = 0.
Lemma 3.1. Let d
σ(σ ∈ S
N) be complex numbers such that
(3.6) X
σ∈SN
d
σz
σ(x
1, . . . , x
N) = 0
for each tuple of points (x
1, . . . , x
N) with x
i∈ U (1 ≤ i ≤ N ). Then (d
σ)
σ∈SNis proportional to (1, . . . , 1), i.e., (3.6) is a consequence of (3.5).
This is Lemma 4 of [6].
Write M for the set of proper nonempty subsets R of S
N. Given R ∈ M we define the multilinear form F
Rin vectors x
1, . . . , x
N∈ U , by
(3.7) F
R(x
1, . . . , x
N) = X
σ∈R
z
σ(x
1, . . . , x
N).
Lemma 3.1 implies that F
Rdoes not vanish identically on U × . . . × U (N factors). We may conclude that the set of points y ∈ U such that F
R(y, x
2, . . . , x
N) vanishes identically in x
2, . . . , x
N∈ U is a proper sub- space of U . We denote this subspace by U
1R. We may perform this procedure for every R ∈ M. Now pick an element y
1∈ U \ S
R∈M
U
1R. Consider the set of elements y ∈ U such that F
R(y
1; y, x
3, . . . , x
N) vanishes identically as x
3, . . . , x
Nrun through U . By our choice of y
1, this condition defines for each R ∈ M a proper subspace U
2Rof U . Pick y
2in U \ S
R∈M
U
2R. Our set of multilinear forms F
R(R ∈ M) is symmetric in the following sense:
For each R ∈ M and for each σ ∈ S
Nthere exists R
0∈ M such that F
R(x
σ(1), . . . , x
σ(N )) = ±F
R0(x
1, . . . , x
n).
It follows that we have
[
R∈M
U
2R⊃ [
R∈M
U
1R.
We may continue our construction in an obvious way. Finally, we get points y
1, . . . , y
N −1and subspaces U
iRfor i = 1, . . . , N − 1 and R ∈ M. Then each equation
(3.8) F
R(y
1, . . . , y
N −1, x) = 0 in x ∈ U will define a proper subspace U
N Rof U and we have
[
R∈M
U
N R⊃ [
R∈M
U
N −1,R⊃ . . . ⊃ [
R∈M
U
1R. Our construction now implies
Lemma 3.2. Suppose the points y
1, . . . , y
N −1and the subspaces U
N R(R ∈ M) are constructed as above. Then for each point x ∈ U \ S
R∈M
U
N Rwe have
(3.9) X
σ∈SN
z
σ(y
1, . . . , y
N −1, x) = 0
but no proper nonempty subsum of the left hand side of (3.9) vanishes.
For the proof it suffices to observe that the subsums of (3.9) are just our multilinear forms F
R. But our construction is such that for x ∈ U \ S
R∈M
U
N Rwe have F
R(y
1, . . . , y
N −1, x) 6= 0 for each R ∈ M.
We finally remark that the number of subspaces U
N R(R ∈ M) is bounded by
(3.10) 2
N !.
4. Linear subspaces. Let N ≥ 2 and suppose that c
1, . . . , c
Nare nonzero complex numbers. Consider the subspace U of C
Ndefined by (4.1) c
1x
1+ . . . + c
Nx
N= 0,
which was already studied in Section 3. Write B for the subset of points x = (x
1, . . . , x
N) in U whose components x
iare roots of unity. Denote by E the set of all roots of unity and by E
mthe set of mth roots of unity. If B 6= ∅, we define for u ∈ B and for l ∈ N the set B(u, l) by
(4.2) B(u, l) = {(ζζ
1u
1, . . . , ζζ
Nu
N) | ζ ∈ E, ζ
i∈ E
l}.
Lemma 4.1. Suppose that B 6= ∅. Then there exists u
0∈ B, there are primes q
1< . . . < q
awith
X
a i=1(q
i− 2) ≤ N ! − 2
and there are proper linear subspaces W
1, . . . , W
t1of U such that the follow- ing assertion holds true: The set of solutions B of (4.1) in roots of unity is contained in the union
B(u
0, q
1. . . q
a) ∪ W
1∪ . . . ∪ W
t1. Here we have
(4.3) t
1≤ 2
N !.
P r o o f. Recall the definition of the multilinear forms F
R(x
1, . . . , x
N) in (3.7). The subspaces U
1R(R ∈ M) in Section 3 do not depend on any choice of points. We distinguish two cases: Either B is contained in the union S
R∈M
U
1R. Then obviously the assertion of the lemma is satisfied with {W
1, . . . , W
t1} = {U
1R| R ∈ M}. In that case the set B(u
0, q
1. . . q
a) will not be needed at all. So any u
0∈ B and any primes q
1< . . . < q
aas in the assertion will do. Otherwise we may pick y
1∈ B \ S
R∈M
U
1Rand define subspaces U
2R(R ∈ M) with respect to y
1. Then if B ⊂ S
R∈M
U
2R, we may choose u
0∈ B, q
1< q
2< . . . < q
aaccording to the assertion arbitrarily and again the lemma follows. Continuing in this way, there are two alternatives: either the construction ends after step j with j ≤ N − 1, B will be contained in the union S
R∈M
U
jRand we are done. Or we may find
points y
1, . . . , y
N −1∈ B and define subspaces U
N R(R ∈ M) with respect
to these points.
Then we take {W
1, . . . , W
t1} = {U
N R| R ∈ M}. We now assume that B \ S
R∈M
U
N R6= ∅. We may apply Lemma 3.2 and conclude that for each x ∈ B \ S
R∈M
U
N R(4.4) X
σ∈SN
z
σ(y
1, . . . , y
N −1, x) = 0
but no proper nonempty subsum of the left hand side of (4.4) vanishes.
However, by definition of B, the summands z
σ(y
1, . . . , y
N −1, x) in (4.4) are roots of unity. So (4.4) is an equation of the type considered in (2.1), and in fact we have nondegenerate solutions (z
σ(y
1, . . . , y
N −1, x))
σ∈SN. We may apply Lemma 2.2 with m = N ! and with b
1= . . . = b
m= 1. Now Lemma 2.2 says the following: For any solution x ∈ B \ S
R∈M
U
N Rthe point (z
σ(y
1, . . . , y
N −1, x))
σ∈SNis of the shape (ζη
σ)
σ∈SN, where ζ is an arbitrary root of unity and where the η
σare q
1. . . q
a-th roots of unity with
X
a i=1(q
i− 2) ≤ N ! − 2.
We may assume without loss of generality that 2 ∈ {q
1, . . . , q
a}. Other- wise we enlarge the set by taking {2, q
1, . . . , q
a}.
Recall that the components z
σare the summands in the Laplace expan- sion of the determinant
det(y
1, . . . , y
N −1, x) =
y
11. . . y
1Ny
21. . . y
2N. . . . x
1. . . x
N.
We claim that for i = 1, . . . , N we can find a root of unity ζ
iof order dividing q
1. . . q
asuch that
(4.5) x
i= x
1y
11y
1iζ
iholds true for i = 1, . . . , N .
To verify (4.5) consider in the expansion of the determinant the ele- ment we get along the main diagonal, i.e., y
11y
22. . . y
N −1,N −1x
N. Com- pare this element with the one where we replace the top left corner by the bottom left corner and the bottom right corner with the top right corner but otherwise we remain on the main diagonal, i.e., consider the element x
1y
22. . . y
N −1,N −1y
1N. Taking quotients we get
y
11x
Ny
1Nx
1= ±η
N,
where η
Nis a root of unity of order dividing q
1. . . q
a. By our assumption 2 ∈
{q
1, . . . , q
a} however, and therefore −η
Nis a root of unity of order dividing
q
1. . . q
aas well. Thus (4.5) in the case i = N is verified with ζ
N= ±η
N.
Interchanging columns in the determinant we get (4.5) in general. Therefore taking u
0= y
1we may infer that each point x ∈ B \ S
R∈M
U
N Ris of the shape
(x
1, . . . , x
N) = (ζζ
iy
1i) = (ζζ
iu
0i) with ζ = x
1/y
11and the lemma follows.
5. Proof of the Theorem. For n = 1 equation (1.1) has at most a single solution in E and the Theorem follows trivially.
Now suppose that n > 1 and that the assertion is proved for all n
0< n.
Put n + 1 = N and write equation (1.1) in homogenized form as (5.1) a
1x
1+ . . . + a
N −1x
N −1− x
N= 0,
to be solved in roots of unity x
i.
To prove assertion (1.2) of the Theorem it will suffice to show that there are points u
1, . . . , u
t∈ E
Nwith
t ≤ 2
2N !such that any nondegenerate solution x = (x
1, . . . , x
N) of (5.1) is contained in the union
[
t τ =1B(u
τ, p
1. . . p
s)
with B(u, l) as in (4.2). Then clearly the general solution ξ = (ξ
1, . . . , ξ
n) of (1.1) will be of the shape
ξ
i= x
ix
N(i = 1, . . . , n).
Thus in the Theorem the sets A(w
τ, p
1. . . p
s) (τ = 1, . . . , t) will do, where w
τ= (w
τ 1, . . . , w
τ n) is defined by
w
τ i= u
τ iu
τ N(i = 1, . . . , n).
We may apply Lemma 4.1 to (5.1). Thus, there exist proper linear sub- spaces W
1, . . . , W
2N !of the (N − 1)-dimensional space defined by (5.1) and there exists a tuple u
0= (u
01, . . . , u
0N) of roots of unity such that any solution x = (x
1, . . . , x
N) of (5.1) is contained in the union
B(u
0, q
1. . . q
a) ∪ W
1∪ . . . ∪ W
2N !. Here
X
a i=1(q
i− 2) ≤ N ! − 2.
Consider a typical subspace, say W . It may be defined by an equation (5.2) b
1x
1+ . . . + b
N −1x
N −1= 0,
where not all b
iare equal to zero.
Let I be a nonempty subset of {1, . . . , N − 1} and let C(I) be the subset of solutions of (5.2) satisfying
(5.3) X
i∈I
b
ix
i= 0,
but no proper subsum of the left hand side of (5.3) vanishes. We may apply the induction hypothesis to the solutions of (5.3) lying in C(I).
Consequently, there exist 2
2|I|!points v
jin |I|-dimensional space, and with roots of unity as components, such that any nondegenerate solution of (5.3) is contained in the union
[ B(v
j, r
1. . . r
b).
Here r
1, . . . , r
bare suitable primes satisfying r
1< . . . < r
b≤ |I|!.
Let v = (v
i)
i∈Ibe a typical point among the v
j. Then the elements in B(v, r
1. . . r
b) may be written as (x
i)
i∈Iwith components
(5.4) x
i= xη
iv
i(i ∈ I),
where x is an arbitrary root of unity and η
iis a root of unity whose order divides r
1. . . r
b. We may substitute (5.4) into (5.1) and obtain, writing a
N=
−1,
(5.5) X
i∈I
a
iη
iv
ix + X
i6∈I
a
ix
i= 0.
This is an equation in the N −|I|+1 variables x
iwith i 6∈ I and x. As in (5.3) we clearly have |I| ≥ 2, we may apply the inductive hypothesis again. By the hypothesis of our Theorem, on the left hand side of (5.5) no proper subsum vanishes. Thus by induction we get 2
2(N −|I|+1)!points y
j∈ E
N −|I|+1and primes s
1< . . . < s
c≤ (N − |I| + 1)! such that the solutions x, x
i(i 6∈ I) of (5.5) are contained in the union of the sets B(y
j, s
1. . . s
c). We now combine the results we have derived for (5.3) and (5.5).
Given a point v corresponding to (5.3) and a point y corresponding to (5.5) we construct a point u ∈ E
Nas follows: Suppose y has the components y, y
i(i 6∈ I). Then we put
u
i=
v
iy for i ∈ I, y
ifor i 6∈ I.
So u = (u
1, . . . , u
N) has components which are roots of unity.
Recall that the subspace W with the set I produces 2
2|I|!points v and 2
2(N −|I|+1)!points y. So the pair W , I gives rise to no more than
(5.6) 2
2(|I|!+(N −|I|+1)!)points u.
A closer look at (5.3) and (5.5) shows that for |I| = 2 in (5.3) a single point v and for |I| = N − 1 in (5.5) a single point y will suffice. In both cases we obtain only 2
2(N −1)!points u. Estimating the total number of points u corresponding to a single subspace W we finally obtain the bound
N 2
+
N
N − 1
2
2(N −1)!+
N −2
X
|I|=3
N
|I|
2
2(|I|!+(N −|I|+1)!)≤ 2
2(N −1)!(2
N− N ).
Consequently, each subspace W gives rise to not more than 2
N !−1 points u.
Allowing a factor 2
N !from Lemma 4.1 for the number of subspaces W , we finally see that altogether 2
2N !points u will suffice for the set of sub- spaces W . This bound also easily takes care of the extra point u
0arising from the assertion of Lemma 4.1.
We still have to discuss how the primes r
1, . . . , r
band s
1, . . . , s
cfrom (5.3) and (5.5) and the primes q
1, . . . , q
acorresponding to u
0fit together.
However, we had
q
1< . . . < q
a, X
a i=1(q
i− 2) ≤ N ! − 2, r
1< . . . < r
b≤ |I|!,
s
1< . . . < s
c≤ (N − |I| + 1)!.
Therefore, we have only primes not exceeding N !. Thus in the assertion of the Theorem it suffices to take the set {p
1, . . . , p
s} as the union of the sets {r
1, . . . , r
b}, {s
1, . . . , s
c}, the union being taken over all points v and y in our construction together with the primes {q
1, . . . , q
a} coming from the extra point u
0in Lemma 4.1. Assertion (1.2) follows with N = n + 1.
As for assertion (1.3), we remark that by Theorem 9 of J. B. Rosser and L. Schoenfeld [3] we have for x > 0
(5.7) X
p≤x