The number of families of solutions of decomposable form equations

(1)

LXXX.4 (1997)

The number of families of solutions of decomposable form equations

by

J.-H. Evertse (Leiden) and K. Gy˝ ory (Debrecen)

1. Introduction. In [16], Schmidt introduced the notion of family of solutions of norm form equations and showed that there are only finitely many such families. In [18], Voutier gave an explicit upper bound for the number of families. Independently, in [5], Gy˝ory extended the notion of family of solutions of norm form equations to decomposable form equations and gave an explicit upper bound for the number of families. In this paper, we obtain a significant improvement of the upper bounds of Voutier and Gy˝ory, by applying the results from Evertse [4].

Let β be a non-zero rational integer. Further, let M denote an algebraic number field of degree r and l(X) = α

₁

X

₁

+ . . . + α

_m

X

_m

a linear form with coefficients in M . There is a non-zero c ∈ Q such that the norm form (1.1) F (X) = cN

_M/Q

(l(X)) = c

Y

r i=1

(α

⁽ⁱ⁾₁

X

₁

+ . . . + α

⁽ⁱ⁾_m

X

_m

)

has its coefficients in Z. Here, we denote by α

⁽¹⁾

, . . . , α

^(r)

the conjugates of α ∈ M . We deal among other things with norm form equations of the shape

F (x) = ±β in x ∈ Z

^m

.

It is more convenient for us to consider the equivalent equation which is also called a norm form equation,

(1.2) cN

_M/Q

(x) = ±β in x ∈ M,

where M is the Z-module {x = l(x) : x ∈ Z

^m

} which is contained in M . In 1971, Schmidt [15] proved his fundamental result that (1.2) has only finitely many solutions if M satisfies some natural non-degeneracy condition.

Later, Schmidt [16] dealt also with the case where M is degenerate and

Research of the second author was supported in part by Grants 16975 and 16791 from the Hungarian National Foundation for Scientific Research and by the Foundation for Hungarian Higher Education and Research.

[367]

(2)

showed that in that case, the set of solutions of (1.2) can be divided in a natural way into families, and is the union of finitely many such families.

Below, we give a precise definition of a family of solutions of (1.2); here we mention that it is a coset xU

M,J

contained in M, where x is a solution of (1.2) and U

_M,J

is a particular subgroup of finite index in the unit group of the ring of integers of some subfield J of M . Schmidt’s results have been generalised to equations of the type

(1.3) cN

_M/K

(x) ∈ βO

^∗_S

in x ∈ M,

where K is an algebraic number field, O

S

is the ring of S-integers in K for some finite set of places S, O

^∗_S

is the unit group of O

_S

, c, β are elements of K

^∗

= K\{0}, M is a finite extension of K, and M is a finitely generated O

_S

-module contained in M . In fact, Schlickewei [13] proved the analogue of Schmidt’s result on families of solutions in the case where O

_S

is contained in Q, and Laurent [9] generalised this to arbitrary algebraic number fields K. The main tools in the proofs of these results were Schmidt’s subspace theorem and Schlickewei’s generalisation to the p-adic case and to number fields.

In [5], Gy˝ory generalised the concept of family of solutions to decompos- able form equations over O

S

, i.e. to equations of the form

(1.4) F (x) ∈ βO

^∗_S

in x = (x

1

, . . . , x

m

) ∈ O

^m_S

,

where K, S are as above, β is a non-zero element of O

S

and F (X) = F (X

₁

, . . . , X

_m

) is a decomposable form with coefficients in O

_S

, that is, F can be expressed as a product of linear forms in m variables with coeffi- cients in some extension of K. We can reformulate (1.4) in a shape similar to (1.3) as follows. According to [1], pp. 77–81, there are finite extension fields M

₁

, . . . , M

_t

of K, linear forms l

_j

(X) = α

_1j

X

₁

+ . . . + α

_mj

X

_m

with coefficients in M

j

for j = 1, . . . , t and c ∈ K

^∗

such that

(1.5) F (X) = c

Y

t j=1

N

_M_j_/K

(l

_j

(X)).

Now let

A = M

1

⊕ . . . ⊕ M

t

be the direct K-algebra sum of M

₁

, . . . , M

_t

, that is, the cartesian product M

₁

× . . . × M

_t

endowed with coordinatewise addition and multiplication. If we express an element of A as (α

1

, . . . , α

t

), then we implicitly assume that α

_j

∈ M

_j

for j = 1, . . . , t. We define the norm N

_A/K

(a) of a = (α

₁

, . . . , α

_t

) ∈ A to be the determinant of the K-linear map x 7→ ax from A to itself. This norm is known to be multiplicative. Further, we have

(1.6) N

_A/K

(a) = N

_M₁_/K

(α

₁

) . . . N

_M_t_/K

(α

_t

)

(3)

where N

_M_j_/K

is the usual field norm. Note that the O

S

-module M = {x = (l

₁

(x), . . . , l

_t

(x)) : x ∈ O

_S^m

}

is contained in A. Now (1.5) and (1.6) imply that (1.4) is equivalent to (1.7) cN

_A/K

(x) ∈ βO

^∗_S

in x ∈ M;

(1.7) will also be referred to as a decomposable form equation. In [5], Gy˝ory showed that the set of solutions of (1.7) is the union of finitely many fam- ilies. Further, in [5] he extended some of his results to decomposable form equations over arbitrary finitely generated integral domains over Z.

In [17], Schmidt made a further significant advancement by deriving, as a consequence of his quantitative subspace theorem, an explicit upper bound for the number of solutions of norm form equation (1.2) over Z for every non-degenerate module M. Schlickewei proved a p-adic generalisation of Schmidt’s quantitative subspace theorem and used it to derive an explicit upper bound for the number of solutions of S-unit equations [14]. Among others, this was used by Gy˝ory [5] to obtain an explicit upper bound for the number of families of solutions of decomposable form equation (1.7).

Independently, Voutier [18] obtained upper bounds similar to Gy˝ory’s for the number of families of solutions of norm form equation (1.3), in the special case where K = Q, β = 1. Recently, Evertse [4] improved the results of Schmidt and Schlickewei just mentioned. In this paper, we apply the results from [4] to obtain an upper bound for the number of families of solutions of (1.7) which is much sharper than Gy˝ory’s and Voutier’s (cf. Theorem 1 in Section 1.2).

In Section 1.1 we introduce the necessary terminology. In Section 1.2 we state our main results (Theorems 1 and 2) and some corollaries. In partic- ular, in Corollary 2 we give an upper bound for the number of O

^∗_S

-cosets of solutions of (1.7) in the case where that number is finite; here, an O

^∗_S

-coset is a set xO

_S^∗

= {εx : ε ∈ O

^∗_S

} where x is a fixed solution of (1.7). Further, in Section 2 we derive from Theorem 1 an asymptotic formula (cf. Corollary 4) for the number of O

^∗_S

-cosets of solutions of (1.7), whenever this number is infinite. The other sections are devoted to the proofs of Theorems 1 and 2.

1.1. Terminology. Here and in the sequel we use the following notation:

the unit group of a ring R with 1 is denoted by R

^∗

and for x ∈ R and a

subset H of R we define xH := {xh : h ∈ H}. Let K be an algebraic number

field. Denote by O

_K

the ring of integers and by M

_K

the collection of places

(equivalence classes of absolute values) on K. Recall that M

_K

consists of

finitely many infinite (i.e. archimedean) places (the number of these being

r

₁

+r

₂

where r

₁

, r

₂

denote the number of isomorphic embeddings of K into R

and the number of complex conjugate pairs of isomorphic embeddings of K

into C, respectively) and of infinitely many finite (non-archimedean) places

(4)

which may be identified with the prime ideals of O

K

. For every v ∈ M

K

we choose an absolute value | · |

_v

from v. Now let S be a finite subset of M

_K

containing all infinite places. The ring of S-integers and its unit group, the group of S-units, are defined by

O

_S

= {x ∈ K : |x|

_v

≤ 1 for v 6∈ S}, O

_S^∗

= {x ∈ K : |x|

_v

= 1 for v 6∈ S}, respectively, where v 6∈ S means v ∈ M

_K

\S. For a finite extension J of K, we denote by O

_J,S

the integral closure of O

_S

in J.

We first introduce families of solutions for norm form equations (1.3) cN

_M/K

(x) ∈ βO

^∗_S

in x ∈ M,

where, as before, M is a finite extension of K, M is a finitely generated O

_S

-module contained in M and c, β are elements of K

^∗

. Let V := KM be the K-vector space generated by M. For a subfield J of M containing K, define the sets

(1.8) V

^J

= {x ∈ V : xJ ⊆ V }, M

^J

= V

^J

∩ M.

As is easily seen, we have λx ∈ V

^J

for x ∈ V

^J

, λ ∈ J. Further, define the subgroup of the unit group of O

_J,S

,

(1.9) U

M,J

:= {ε ∈ O

^∗_J,S

: εM

^J

= M

^J

}.

For instance from Lemma 9 of [5] it follows that U

_M,J

has finite index in O

^∗_J,S

. Note that N

_M/K

(ε) ∈ O

^∗_S

for ε ∈ U

M,J

. Hence if x ∈ M

^J

is a solution of (1.3) then so is every element of the coset xU

_M,J

. Such a coset is called a family of solutions (or rather an (M, J)-family of solutions) of (1.3). Laurent [9] proved the generalisation of Schmidt’s result that the set of solutions of (1.3) is the union of at most finitely many families.

Now let A = M

₁

⊕. . .⊕M

_t

be the direct K-algebra sum of finite extension fields M

1

, . . . , M

t

of K. Note that A has unit element 1

A

= (1, . . . , 1) (t times) where 1 is the unit element of K and that the unit group of A is A

^∗

= {(ξ

₁

, . . . , ξ

_t

) ∈ A : ξ

₁

. . . ξ

_t

6= 0}. For each K-subalgebra B of A, denote by O

B,S

the integral closure of O

S

in B. Thus,

O

_A,S

= O

_M₁_,S

⊕ . . . ⊕ O

_M_t_,S

is the direct sum of the integral closures of O

_S

in M

₁

, . . . , M

_t

, respectively, and

O

_B,S

= O

_A,S

∩ B

for each K-subalgebra B of A. From these facts and (1.6) it follows easily that for b ∈ O

_A,S

we have N

_A/K

(b) ∈ O

_S

and that for b in the unit group O

^∗_A,S

we have N

_A/K

(b) ∈ O

^∗_S

.

Let c, β ∈ K

^∗

, let M be a finitely generated O

_S

-module contained in A, and consider the equation

(1.7) cN

_A/K

(x) ∈ βO

^∗_S

in x ∈ M.

(5)

Families of solutions of (1.7) are defined in precisely the same way as for (1.3), but now the role of the subfields J of M in (1.3) is played by the K-subalgebras B of A that contain the unit element 1

_A

of A. Thus, let V := KM be the K-vector space, contained in A, generated by M and for each K-subalgebra B of A with 1

_A

∈ B define the sets

(1.10) V

^B

:= {x ∈ V : xB ⊆ V }, M

^B

:= V

^B

∩ M and the subgroup of the unit group of O

_B,S

,

(1.11) U

M,B

:= {ε ∈ O

_B,S^∗

: εM

^B

= M

^B

}

which is known to have finite index [O

_B,S^∗

: U

M,B

] in O

^∗_B,S

(cf. [5], Lemma 9). Clearly, V

^B

is closed under multiplication by elements of B (and in fact the largest subspace of V with this property). An (M, B)-family of solutions of (1.7) is a coset xU

_M,B

, where B is a K-subalgebra of A containing 1

_A

and x ∈ M

^B

is a solution of (1.7); since N

_A/K

(ε) ∈ O

_S^∗

for ε ∈ U

M,B

, every element of xU

_M,B

is a solution of (1.7). If A = M is a finite extension field of K this notion of family of solutions coincides with that for norm form equation (1.3), since then the K-subalgebras of A containing 1

A

are precisely the subfields of M containing K. In [5], Gy˝ory proved among other things that the set of solutions of (1.7) is the union of finitely many families.

1.2. Results. Below, we first recall Gy˝ory’s result on the number of fam- ilies of solutions of (1.7) and then state our improvement. As before, let K be an algebraic number field, S a finite set of places on K containing all infinite places, A = M

₁

⊕ . . . ⊕ M

_t

where M

₁

, . . . , M

_t

are finite extensions of K, and M a finitely generated (not necessarily free) O

S

-submodule of A.

Let a

_i

= (α

_i1

, . . . , α

_it

) (i = 1, . . . , m) be a set of generators of M. Thus, M = {x = (l

₁

(x), . . . , l

_t

(x)) : x ∈ O

_S^m

}

where l

_j

(x) = α

_1j

x

₁

+ . . . + α

_mj

x

_m

for j = 1, . . . , t, and by (1.6) we have N

_A/K

(x) = Q

_t

j=1

N

_M_j_/K

(l

_j

(x)). We call d a denominator of M if d ∈ K

^∗

and if the polynomial d Q

_t

j=1

N

_M_j_/K

(l

j

(X)) has its coefficients in O

S

. This notion of denominator is easily shown to be independent of the choice of the generators a

1

, . . . , a

m

.

We consider equation (1.7), and impose the following conditions on S, A, M, β and c:

(1.12)

 

 

 



S has cardinality s, A has dimension P

_t

i=1

[M

i

: K] = r ≥ 2 as a K-vector space, the K-vector space V := KM has dimension n ≥ 2,

β ∈ O

_S

\{0}, c is a denominator of M.

For every finite place v on K, let ord

v

(·) denote the discrete valuation cor-

responding to v with value group Z; recall that | · |

_v

= C

_v^{− ord}^v^(·)

for some

(6)

C

v

> 1. For β ∈ K

^∗

, let ω

S

(β) denote the number of v 6∈ S with ord

v

(β) 6= 0 and put

ψ

₁

(β) :=

r n − 1

_ω_S_(β)

Y

v6∈S

r · ord

_v

(β) + n n

.

Further, let D be the degree over Q of the normal closure of the composite M

₁

. . . M

_t

over Q; thus, [K : Q] ≤ D ≤ (r[K : Q])!. Gy˝ory [5] proved that the set of solutions of (1.7) is contained in some finite union of cosets of unit groups

(1.13) x

1

O

_B^∗₁_,S

∪ . . . ∪ x

w

O

_B^∗_w_,S

with w ≤ (4sD)

²^37nD^s⁶

ψ

1

(β), where for i = 1, . . . , w, B

i

is a K-subalgebra of A with 1

A

∈ B

i

, x

i

∈ A

^∗

with x

_i

B

_i

⊂ V , and where the set of solutions of (1.7) contained in x

_i

O

^∗_B_i_,S

is the union of at most [O

^∗_B_i_,S

: U

M,B_i

] (M, B

i

)-families of solutions. This implies an upper bound for the number of families of solutions of (1.7) which depends on n, r, β, s and the indices [O

_B^∗_i_,S

: U

_M,B_i

] (cf. [5], Theorem 3), so ultimately on the module M. We mention that Voutier [18], Chap. V independently obtained a result similar to (1.13) but only for norm form equation (1.3) and with K = Q, β = 1.

Gy˝ory’s result can be improved as follows. A K-subalgebra B of A is said to be S-minimal if 1

A

∈ B, and if for each proper K-subalgebra B

⁰

of B with 1

_A

∈ B

⁰

, the quotient group O

^∗_B,S

/O

^∗_B0,S

is infinite. A family of solutions of (1.7) is said to be reducible if it is the union of finitely many strictly smaller families of solutions, and irreducible otherwise. Put

(1.14)

ψ

2

(β) :=

r n − 1

_ω_S_(β)

Y

v6∈S

ord

v

(β) + n − 1 n − 1

,

e(n) := 1

3 n(n + 1)(2n + 1) − 2.

Theorem 1. Assume (1.12). The set of solutions of (1.7) cN

_A/K

(x) ∈ βO

_S^∗

in x ∈ M

can be expressed as a finite union of irreducible families of solutions. More precisely, the set of solutions of (1.7) is contained in some finite union of cosets

(1.15) x

1

O

^∗_B₁_,S

∪ . . . ∪ x

w

O

^∗_B_w_,S

with w ≤ (2

³³

r

²

)

^e(n)s

ψ

2

(β)

such that for i = 1, . . . , w, B

i

is an S-minimal K-subalgebra of A, x

i

∈ A

^∗

with x

_i

B

_i

⊂ V , and the set of solutions of (1.7) contained in x

_i

O

_B^∗_i_,S

is the

union of at most [O

^∗_B_i_,S

: U

M,B_i

] (M, B

i

)-families of solutions which are all

irreducible.

(7)

R e m a r k 1. The right-hand side of Gy˝ory’s bound (1.13) depends dou- bly exponentially on n and in the worst case when D = (r[K : Q])! triply exponentially on r, whereas our bound (1.15) depends only polynomially on r and exponentially on n

³

. (1.13) can be better than (1.15) in terms of r only if D is very small compared with r, e.g. if A = Q

^r

for some large r.

It is likely that, in (1.15), 2

³³

can be improved upon, and that e(n) can be replaced by a linear expression of n.

For some very special type of norm form equation, Voutier succeeded in deriving an upper bound for the number of families of solutions independent of the module M (see the remark after Corollary 1). It is an open problem whether an explicit bound independent of M exists in full generality, for equations (1.3) or (1.7) (

¹

).

R e m a r k 2. We can express the set of solutions of (1.7) as a minimal finite union of irreducible families, that is, as a union F

₁

∪ . . . ∪ F

_g

where F

1

, . . . , F

g

are irreducible families of solutions, none of which is contained in the union of the others. We claim that any other irreducible family of solutions of (1.7) is contained in one of F

₁

, . . . , F

_g

. In other words, F

₁

, . . . , F

_g

are the maximal irreducible families of solutions of (1.7). Hence Theorem 1 above gives automatically an upper bound for the number of maximal irreducible families. To prove our claim, let G be an arbitrary irreducible family of solutions of (1.7). Then G is the union of the sets G ∩ F

i

for i = 1, . . . , g and by Lemma 3 in Section 2, each of these sets is a union of finitely many families. Then one of these families, contained in F

1

, say, is equal to G. Hence G ⊆ F

₁

.

R e m a r k 3. There is only one way to express the set of solutions of (1.7) as a minimal union of irreducible families, since the families appearing in such a union are the maximal irreducible families of solutions of (1.7).

We also investigate the problem to give an upper bound for the number of K-subalgebras B of A for which (1.7) has (M, B)-families of solutions. Let again V = KM. Suppose again that dim

_K

A = r and dim

_K

V = n. If x is a solution in M

^B

, then x ∈ V

^B

∩ A

^∗

, where A

^∗

is the unit group of A. Hence (1.7) can have (M, B)-families of solutions only for those K-subalgebras B of A for which

(1.16) 1

A

∈ B, V

^B

∩ A

^∗

6= ∅.

In [5], Gy˝ory proved that the number of algebras B with (1.16) is at most n

^r

. We can improve this as follows:

(

¹

) A d d e d i n p r o o f: W. M. Schmidt and P. Voutier have recently proved that, in

general, an upper bound for the number of families of solutions of (1.3) or (1.7) must

depend on the module M (see also footnote (

²

)).

(8)

Theorem 2. The number of K-subalgebras B of A with (1.16) is at most (n max(r − n, 2))

ⁿ

.

We do not know whether the dependence on r is necessary.

We derive some corollaries from Theorem 1. First we specialise Theorem 1 to norm form equation (1.3). Let K, S be as above so that in particular S has cardinality s. Further, let M be a finite extension of K of degree r ≥ 2, M a finitely generated O

_S

-submodule of M such that the K-vector space KM has dimension n ≥ 2, and c, β constants such that β ∈ O

S

\{0} and c is a denominator of M. Then, by applying Theorem 1 with A = M , we get at once the following result which improves upon the corresponding results in [5] and [18]:

Corollary 1. The set of solutions of

(1.3) cN

_M/K

(x) ∈ βO

^∗_S

in x ∈ M

can be expressed as a finite union of irreducible families of solutions. More precisely, the set of solutions of (1.3) is contained in some finite union of cosets

x

1

O

^∗_J₁_,S

∪ . . . ∪ x

w

O

^∗_J_w_,S

with w ≤ (2

³³

r

²

)

^e(n)s

ψ

2

(β)

such that for i = 1, . . . , w, J

i

is a subfield of M containing K, x

i

∈ M

^∗

is such that x

_i

J

_i

⊂ V , and the set of solutions of (1.3) in x

_i

O

^∗_J

i,S

is the union of at most [O

^∗_J_i_,S

: U

_M,J_i

] (M, J

_i

)-families of solutions which are all irreducible.

As mentioned before, for a very special type of norm form equation Voutier ([18], Theorem V.3) obtained an upper bound for the number of families independent of M: namely, he proved that if M is a Z-module of rank 3 contained in the ring of integers of an algebraic number field M of degree r > rank M = 3, then the set of solutions of the equation

N

_M/Q

(x) = 1 in x ∈ M is the union of at most r

²⁸⁶^r²

families (

²

).

We return to equation (1.7). In what follows, we consider K as a K- subalgebra of A by identifying α ∈ K with α · 1

_A

. The set of solutions of (1.7) can be divided into O

^∗_S

-cosets xO

^∗_S

. Gy˝ory [5], Corollary 2, gave an explicit upper bound for the number of O

_S^∗

-cosets of solutions of (1.7) in the case where this number is finite. We can improve this as follows:

Corollary 2. Assume (1.12). Suppose that (1.7) has only finitely many O

^∗_S

-cosets of solutions. Then this number is at most (2

³³

r

²

)

^e(n)s

ψ

2

(β).

(

²

) A d d e d i n p r o o f: W. M. Schmidt and P. Voutier have recently constructed a

class of ternary cubic norm form equations N

_M/_Q

(x) = 1 in which there are equations

with arbitrarily many families of solutions.

(9)

For β = 1, this gives the Corollary to Theorem 1 of [4].

P r o o f. Let B be one of the S-minimal K-subalgebras of A occurring in (1.15). We may assume that (1.7) has an (M, B)-family of solutions, xU

_M,B

, say. By identifying ε ∈ O

_S^∗

with ε · 1

A

, we may view O

_S^∗

as a subgroup of U

_M,B

. Let w ≤ ∞ be the index of O

^∗_S

in U

_M,B

. Then xU

_M,B

is the union of precisely w O

^∗_S

-cosets. So our assumption implies that w is finite. Therefore, [O

^∗_B,S

: O

_S^∗

] is finite. Now since B is S-minimal, it follows that B = K. So each algebra B

_i

occurring in (1.15) is equal to K, i.e. O

_B^∗_i_,S

= O

^∗_S

, and Corollary 2 follows.

In general, it is as yet not effectively decidable whether (1.7) has only finitely many O

_S^∗

-cosets of solutions. Schmidt [17], Theorem 3, derived an explicit upper bound for the number of solutions of norm form equations over Z satisfying an effectively decidable non-degeneracy condition. It is possible to give a similar effective non-degeneracy condition for (1.7) as well, which implies that for every β ∈ O

_S

\ {0}, the number of O

^∗_S

-cosets of solutions is finite. Moreover, under that condition we can derive an upper bound for the number of O

^∗_S

-cosets of solutions with a better dependence on β in that unlike the bound in Corollary 2, it does not depend on the quantities ord

_v

(β) (v ∈ M

K

\S) appearing in ψ

2

(β).

The vector space V = KM is said to be non-degenerate if V

^B

∩ A

^∗

= ∅ for every K-subalgebra B of A with 1

_A

∈ B, B 6= K, where A

^∗

is the unit group of A. (1.16) implies that in that case, each algebra B

i

occurring in (1.15) is equal to K. Hence the set of solutions of (1.7) is the union of finitely many O

^∗_S

-cosets.

Corollary 3. Assume (1.12) and in addition that V = KM is non- degenerate. Then the set of solutions of (1.7) is the union of at most (2

³³

r

²

)

^e(n)(s+ω^S^(β))

O

_S^∗

-cosets.

P r o o f. We apply Theorem 1 with S

⁰

:= S ∪ {v 6∈ S : ord

v

(β) > 0}

replacing S. Thus, β ∈ O

^∗_S0

. We have to replace s by the cardinality of S

⁰

which is s

⁰

:= s + ω

_S

(β). Moreover, in the definition of ψ

₂

(β), S has to be replaced by S

⁰

, which means that ψ

2

(β) has to be replaced by 1. Let M

⁰

be the O

_S0

-module generated by M. Thus, every solution of (1.7) satisfies (1.7

⁰

) cN

_A/K

(x) ∈ O

_S^∗0

in x ∈ M

⁰

.

Clearly, c is a denominator of M

⁰

. Moreover, since V is non-degenerate,

the set of solutions of (1.7

⁰

) is the union of finitely many O

^∗_S0

-cosets. So by

Corollary 2, the set of solutions of (1.7

⁰

), and hence also the set of solutions

of (1.7), is contained in the union of at most (2

³³

r

²

)

^e(n)s⁰

O

^∗_S0

-cosets. Now

if any two solutions x

₁

, x

₂

of (1.7) belong to the same O

^∗_S0

-coset then they

belong to the same O

_S^∗

-coset: for if x

2

= εx

1

with ε ∈ O

^∗_S0

, then ε

^r

=

cN

_A/K

(x

₂

)/cN

_A/K

(x

₁

) ∈ O

^∗_S

, hence ε ∈ O

_S^∗

. This proves Corollary 3.

(10)

2. An asymptotic formula. In this section, we state and prove an asymptotic density result for the collection of O

_S^∗

-cosets of solutions of equa- tion (1.7), in the case where the number of these is infinite. This asymptotic density result is a consequence of (the qualitative part of) Theorem 1.

We recall the definition of absolute (multiplicative) Weil height. Let Q denote the algebraic closure of Q. Let x = (x

₁

, . . . , x

_n

) ∈ Q

ⁿ

\{0}. Take any algebraic number field L containing x

1

, . . . , x

n

, and let σ

1

, . . . , σ

d

be the isomorphic embeddings of L into Q, where d = [L : Q]. Further, let (x

₁

, . . . , x

_n

) denote the fractional ideal with respect to the ring of integers of L, generated by x

1

, . . . , x

n

, and denote by N

_L/Q

((x

1

, . . . , x

n

)) its norm.

Then the absolute Weil height of x is defined by H(x) = H(x

₁

, . . . , x

_n

) :=

Q

_d

i=1

max(|σ

_i

(x

₁

)|, . . . , |σ

_i

(x

_n

)|) N

_L/Q

((x

1

, . . . , x

n

))

_1/d

. It is clear that H(x) does not depend on the choice of L. Further, (2.1) H(λx) = H(x) for x ∈ Q

ⁿ

\{0}, λ ∈ Q

^∗

.

Now let K be an algebraic number field and A = M

₁

⊕ . . . ⊕ M

_t

, where M

₁

, . . . , M

_t

are finite extension fields of K. We define the height H(x) of x = (ξ

1

, . . . , ξ

t

) ∈ A to be the absolute Weil height of the vector with coordinates consisting of ξ

₁

, . . . , ξ

_t

and their conjugates over K, that is, if τ

_i,1

, . . . , τ

_i,r_i

with r

_i

= [M

_i

: K] are the K-isomorphic embeddings of M

_i

into Q then we put

H(x) := H(τ

_1,1

(ξ

₁

), . . . , τ

_1,r₁

(ξ

₁

), . . . , τ

_t,1

(ξ

_t

), . . . , τ

_t,r_t

(ξ

_t

)).

Note that by (2.1) we have

(2.2) H(x) = H(λx) for x ∈ A\{0}, λ ∈ K

^∗

,

i.e. H may be viewed as a height on the collection (A\{0})/K

^∗

of K

^∗

-cosets xK

^∗

(x ∈ A\{0}). This height satisfies

(2.3) #{x ∈ (A\{0})/K

^∗

: H(x) ≤ X} < ∞ for X > 0.

Namely, by Northcott’s theorem [10], [11] we know that for every d >

0, X > 0, there are, up to multiplication by elements from Q

^∗

, only finitely

many x = (ξ

₁

, . . . , ξ

_n

) ∈ Q

ⁿ

\{0} with H(x) ≤ X and [Q(ξ

_i

) : Q] ≤ d

for i = 1, . . . , n. This implies that the set of non-zero elements x of A

with H(x) ≤ X can be divided into finitely many classes, where x =

(ξ

₁

, . . . , ξ

_t

), y = (η

₁

, . . . , η

_t

) ∈ A are said to belong to the same class

if (τ

1,1

(ξ

1

), . . . , τ

t,r_t

(ξ

t

)) = α(τ

1,1

(η

1

), . . . , τ

t,r_t

(η

t

)) for some α ∈ Q

^∗

. But

clearly, if for instance ξ

₁

6= 0, then α = τ

_1,1

(η

₁

/ξ

₁

) = . . . = τ

_1,r₁

(η

₁

/ξ

₁

),

which implies that α ∈ K. So if x, y ∈ A\{0} belong to the same class then

they belong to the same K

^∗

-coset.

(11)

For a finitely generated abelian group Λ, denote by Λ

tors

the torsion subgroup of Λ and by rank Λ the rank of the free abelian group Λ/Λ

_tors

. Let as usual S be a finite set of places on K which contains all infinite places.

For a K-subalgebra B of A containing the unit element 1

A

of A we put

%

B,S

:= rank O

_B,S^∗

/O

_S^∗

,

where we view O

^∗_S

as a subgroup of O

^∗_B,S

by identifying ε ∈ O

^∗_S

with ε·1

A

. By a straightforward generalisation of Dirichlet’s unit theorem, O

^∗_B,S

is finitely generated, hence %

_B,S

is finite.

Let again β, c ∈ K

^∗

, and let M be a finitely generated O

S

-submodule of A such that condition (1.12) holds. For every X > 0 we consider the set of solutions of

(2.4) cN

_A/K

(x) ∈ βO

^∗_S

in x ∈ M with H(x) ≤ X.

From (2.2) and O

_S^∗

⊂ K

^∗

it follows that the set of solutions of (2.4) can be divided into O

^∗_S

-cosets xO

_S^∗

. Denote by N (X) the maximal number of distinct O

_S^∗

-cosets contained in the set of solutions of (2.4). From (2.3) it follows that N (X) is finite: namely, if x, y are solutions of (2.4) with y = εx for some ε ∈ K

^∗

, then ε

^r

= N

_A/K

(y)/N

_A/K

(x) ∈ O

_S^∗

, so x, y belong to the same O

^∗_S

-coset. For norm form equations over Q, asymptotic formulas for N (X) were derived by Gy˝ory and Peth˝o [6] (in the archimedean case) and Peth˝o [12] (for an arbitrary finite set of places S); Gy˝ory and Peth˝o [7]

and Everest [2] obtained more precise results in certain special cases. From (the qualitative part of) Theorem 1 we derive the following generalisation of Peth˝o’s result [12]:

Corollary 4. We have

N (X) = γ(log X)

^%

+ O((log X)

^%−1

) as X → ∞,

where γ is a positive number independent of X and where % is the maximum of the numbers %

_B,S

, taken over all K-subalgebras B of A with 1

_A

∈ B for which the equation cN

_A/K

(x) ∈ βO

^∗_S

in x ∈ M has (M, B)-families of solutions.

We mention that for O

S

= Z, Everest and Gy˝ory [3] recently obtained some refinements for equations of the form (1.4).

R e m a r k 4. γ, % and the constant in the error term are all ineffec-

tive. By (1.16), we can estimate % from above by the effectively computable

number %

0

, which is the maximum of the numbers %

B,S

, taken over all K-

subalgebras B of A with 1

_A

∈ B, V

^B

∩ A

^∗

6= ∅. Further, using the explicit

bound in Theorem 1, one can effectively compute an upper bound for γ; we

shall not work this out.

(12)

To derive Corollary 4 we need some lemmas. The first lemma is undoubt- edly well-known but we could not find a proof of it in the literature.

Lemma 1. Let Λ be a finitely generated additive abelian group of rank %, and let f be a function from Λ to R with the following properties:

f (x) ≥ 0 for x ∈ Λ;

(2.5)

f (x + y) ≤ f (x) + f (y) for x, y ∈ Λ;

(2.6)

f (λx) = λf (x) for x ∈ Λ, λ ∈ Z

_≥0

; (2.7)

for every Y > 0, the set {x ∈ Λ : f (x) ≤ Y } is finite.

(2.8) Then

(2.9) #{x ∈ Λ : f (x) ≤ Y } = γY

^%

+ O(Y

^%−1

) as Y → ∞ where γ = γ(Λ, f ) is a positive constant.

P r o o f. We first assume that Λ = Z

^%

. For x = (ξ

1

, . . . , ξ

%

) ∈ R

^%

we define the maximum norm kxk := max(|ξ

₁

|, . . . , |ξ

_%

|). Letting e

_i

= (0, . . . , 1, . . . , 0) (i = 1, . . . , %) denote the vector in Z

^%

with a single 1 on the ith place, we infer from (2.5)–(2.7) that for x = (ξ

1

, . . . , ξ

%

), y = (η

1

, . . . , η

%

) ∈ Z

^%

we have

|f (x) − f (y)| ≤ max(f (x − y), f (y − x)) ≤ X

% i=1

|ξ

_i

− η

_i

| max(f (e

_i

), f (−e

_i

)), whence

(2.10) |f (x) − f (y)| ≤ Ckx − yk, where C := P

_%

i=1

max(f (e

_i

), f (−e

_i

)).

We extend f to a function on Q

^%

by putting f (x) := λ

⁻¹

f (λx) for x ∈ Q

^%

where λ is the smallest positive integer such that λx ∈ Z

^%

. This extended f satisfies again (2.5)–(2.7) and (2.10), but now for all x, y ∈ Q

^%

and λ ∈ Q

_≥0

. Using (2.10) and taking limits we can extend f to a continuous function f : R

^%

→ R which satisfies (2.5)–(2.7) and (2.10) for all x, y ∈ R

^%

and λ ∈ R

≥0

.

For Y > 0 we define the set C

Y

:= {x ∈ R

^%

: f (x) ≤ Y }. Since f is continuous, this set is Lebesgue measurable. By (2.7) we have C

_Y

= {Y x : x ∈ C

1

}. Hence C

Y

has Lebesgue measure γY

^%

, where γ is the Lebesgue measure of C

₁

. We can cover R

^%

by the unit cubes U

_z

:= {x ∈ R

^%

: kx − zk ≤ 1/2} (z ∈ Z

^%

). These cubes have Lebesgue measure 1, and any two different cubes have at most part of their boundary in common. (2.7) and (2.10) imply that

C

_{Y −C/2}

⊆ [

z∈Z^% f (z)≤Y

U

_z

⊆ C

_{Y +C/2}

for Y ≥ C/2.

(13)

Now let n(Y ) be the number of z ∈ Z

^%

with f (z) ≤ Y . By comparing Lebesgue measures, we get

(2.11) γ(Y − C/2)

^%

≤ n(Y ) ≤ γ(Y + C/2)

^%

for Y ≥ C/2.

From (2.8) it follows that n(Y ) is finite; hence γ is finite. Moreover, for Y sufficiently large, n(Y ) > 0, hence γ > 0. Now (2.9) follows at once from (2.11). This settles the case Λ = Z

^%

.

Now let Λ be an arbitrary additive abelian group. There are u

₁

, . . . , u

_%

∈ Λ such that every x ∈ Λ can be expressed uniquely as

x = t + ζ

1

u

1

+ . . . + ζ

%

u

%

with t ∈ Λ

tors

, z = (ζ

1

, . . . , ζ

%

) ∈ Z

^%

. Put f

⁰

(z) := f (ζ

₁

u

₁

+ . . . + ζ

_%

u

_%

). (2.6) implies that f

⁰

(z) − f (−t) ≤ f (x) ≤ f

⁰

(z)+f (t). Further, (2.7) with λ = 0 implies that f (0) = 0. More generally, (2.7) implies that f (t) = 0 for t ∈ Λ

_tors

since for such t there is a positive integer λ with λt = 0. Hence f (x) = f

⁰

(z) for x ∈ Λ. Clearly, f

⁰

and Z

^%

satisfy (2.5)–(2.8). So by what we proved above we have

#{z ∈ Z

^%

: f

⁰

(z) ≤ Y } = γ

⁰

Y

^%

+ O(Y

^%−1

) as Y → ∞

with some positive γ

⁰

. From this, one deduces easily that (2.9) holds with γ = γ

⁰

· #Λ

_tors

. This completes the proof of Lemma 1.

For a subset F of A with the property that for each x ∈ F the coset xO

_S^∗

is contained in F, we denote by N

_F

(X) the maximal number of distinct O

^∗_S

-cosets xO

^∗_S

with x ∈ F and H(x) ≤ X.

Lemma 2. Let F = xU

M,B

be a family of solutions of (1.7), where B is a K-subalgebra of A containing 1

_A

and x ∈ M

^B

. Then for some positive real γ depending only on M and B we have

(2.12) N

_F

(X) = γ(log X)

^%^B,S

+ O((log X)

^%^B,S⁻¹

) as X → ∞.

P r o o f. We use the following properties of the absolute Weil height which are straightforward consequences of its definition:

(2.13)

 

 

 



H(x) ≥ 1 for x ∈ Q

ⁿ

\{0},

H(x

₁

y

₁

, . . . , x

_n

y

_n

) ≤ H(x

₁

, . . . , x

_n

)H(y

₁

, . . . , y

_n

)

for x

₁

, . . . , x

_n

, y

₁

, . . . , y

_n

∈ Q, H(x

^λ₁

, . . . , x

^λ_n

) = H(x

1

, . . . , x

n

)

^λ

for x

1

, . . . , x

n

∈ Q, λ ∈ Z

≥0

. Let U := U

_M,B

and %

₀

:= %

_B,S

. Since U has finite index in O

_B,S^∗

, the factor group U/O

_S^∗

has rank %

₀

. We apply Lemma 1 to Λ = U/O

^∗_S

and f = log H.

By (2.2), f is well-defined on Λ. Further, (2.13) implies (2.5)–(2.7), and (2.8) follows from (2.3) and the fact that U/O

_S^∗

= U/(K

^∗

∩ U) may be viewed as a subgroup of A

^∗

/K

^∗

. It follows that

(2.14) N

_U

(X) = γ(log X)

^%⁰

+ O((log X)

^%⁰⁻¹

) as X → ∞

(14)

for some positive constant γ. By (2.13) we have c

1

H(xu) ≤ H(u) ≤ c

2

H(xu) for u ∈ U, where c

₁

= H(x)

⁻¹

and c

₂

= H(x

⁻¹

), and this implies that N

_U

(c

⁻¹₂

X) ≤ N

_xU

(X) ≤ N

_U

(c

⁻¹₁

X). Now Lemma 2 follows from (2.14) and the fact that both (log(c

⁻¹₁

X))

^%⁰

and (log(c

⁻¹₂

X))

^%⁰

differ from (log X)

^%⁰

by at most O((log X)

^%⁰⁻¹

).

Lemma 3. For any two K-subalgebras B

1

, B

2

of A containing 1

A

, the intersection of an (M, B

₁

)-family and an (M, B

₂

)-family is the union of at most finitely many (M, B

₁

∩ B

₂

)-families.

P r o o f. Let G

_i

= x

_i

U

_M,B_i

with x

_i

∈ M

^Bⁱ

for i = 1, 2 be the two families of solutions and put B := B

₁

∩ B

₂

. Let x

₀

∈ G

₁

∩ G

₂

. Then x

₀

∈ M

^B¹

∩ M

^B²

. From definition (1.10) it follows easily that M

^Bⁱ

⊆ M

^B

for i = 1, 2. Therefore, x

₀

∈ M

^B

. Further, we have G

_i

= x

₀

U

_M,B_i

for i = 1, 2, hence G

1

∩ G

2

= x

0

(U

M,B₁

∩ U

M,B₂

). We claim that U

M,B

is a subgroup of finite index in U

_M,B₁

∩ U

_M,B₂

; then it follows at once that G

₁

∩ G

₂

is the union of finitely many families yU

_M,B

with y ∈ M

^B

. To prove the claim, let ε ∈ U

M,B

and take i ∈ {1, 2}. Then ε ∈ B ⊆ B

i

, whence by (1.10), εM

^Bⁱ

⊆ V

^Bⁱ

where V = KM. Further, by (1.11) we have εM

^Bⁱ

⊆ εM

^B

= M

^B

⊆ M. Therefore, by (1.10), εM

^Bⁱ

⊆ M

^Bⁱ

. Similarly, we find ε

⁻¹

M

^Bⁱ

⊆ M

^Bⁱ

. Hence εM

^Bⁱ

= M

^Bⁱ

, i.e. ε ∈ U

M,B_i

for i = 1, 2. So U

_M,B

⊆ U

_M,B₁

∩ U

_M,B₂

. Now our claim follows from the fact that both groups have finite index in O

^∗_B,S

= O

_B^∗₁_,S

∩ O

_B^∗₂_,S

.

P r o o f o f C o r o l l a r y 4. By Theorem 1, the set of solutions of (1.7) can be expressed as

(2.15) F

1

∪ . . . ∪ F

p

where for each i, F

i

is an (M, B

i

)-family of solutions of (1.7) for some K-subalgebra B

_i

of A containing 1

_A

. For a tuple I = {i

₁

< . . . < i

_t

} of integers from {1, . . . , p}, let B

_I

:= B

_i₁

∩ . . . ∩ B

_i_t

, F

_I

:= F

_i₁

∩ . . . ∩ F

_i_t

, and let N

I

(X) be the number of cosets xO

_S^∗

with x ∈ F

I

and H(x) ≤ X.

Put %

₁

:= max{%

_B_i_,S

: i = 1, . . . , p}. Thus, %

_B_I_,S

≤ %

₁

for each tuple I as above. Lemma 3 implies that for each I, F

_I

is the union of finitely many (M, B

I

)-families. So by Lemma 2 we have

N

I

(X) = γ

I

(log X)

^%¹

+ O((log X)

^%¹⁻¹

) as X → ∞

where γ

I

= 0 if %

B_I,S

< %

1

. Note that γ

i

> 0 for at least one i ∈ {1, . . . , p}.

Now by (2.15) and the rule of inclusion and exclusion we have N (X) =

X

p i=1

N

_i

(X) − X

#I=2

N

_I

(X) + X

#I=3

N

_I

(X) − . . . , hence

N (X) = γ(log X)

^%¹

+ O((log X)

^%¹⁻¹

) as X → ∞

(15)

where

γ = X

p i=1

γ

i

− X

#I=2

γ

I

+ X

#I=3

γ

I

− . . .

Since N (X) ≥ N

i

(X) for i = 1, . . . , p we have γ ≥ γ

i

for i = 1, . . . , p, hence γ > 0. Lemma 2 implies that (1.7) does not have any family of solutions xU

_M,B

with %

_B,S

> %

₁

; therefore, %

₁

= %. This completes the proof of Corollary 4.

3. Reduction to O

_A,S^∗

-cosets. Let K be an algebraic number field, and let S, M

1

, . . . , M

t

, A = M

1

⊕ . . . ⊕ M

t

, M be as in Section 1.2. Further, let s = #S, r = dim

_K

A ≥ 2, n = dim

_K

KM ≥ 2, c, β be as in (1.12). For x ∈ A, we define the coset xO

_A,S^∗

= {εx : ε ∈ O

^∗_A,S

}. In this section we prove Lemma 4 below which is in fact an improvement of Lemma 5 of [5].

Lemma 4. The set of solutions of

(1.7) cN

_A/K

(x) ∈ βO

^∗_S

in x ∈ M

is contained in some union x

₁

O

^∗_A,S

∪ . . . ∪ x

_t₁

O

^∗_A,S

where t

₁

≤ ψ

₂

(β) and where for j = 1, . . . , t

1

, x

j

∈ M is a solution of (1.7).

We prove this by slightly refining some arguments of Schmidt [17]. In the proof of Lemma 4 we need some further lemmas. We first recall some lemmas from [17]. Let E be a field endowed with a non-archimedean additive valuation V (i.e. V (xy) = V (x) + V (y), V (x + y) ≥ min(V (x), V (y)) for x, y ∈ E, V (0) = ∞, and there is an x ∈ E with V (x) 6= 0, V (x) 6= ∞).

For z = (z

₁

, . . . , z

_n

) ∈ E

ⁿ

, put V (z) = min(V (z

₁

), . . . , V (z

_n

)). Further, let L

₁

, . . . , L

_r

be r ≥ n linear forms in n variables with coefficients in E.

Lemma 5. Let z ∈ E

ⁿ

with z 6= 0. There is a subset S of {1, . . . , r} of cardinality n − 1 such that every z

⁰

∈ E

ⁿ

with

V (z

⁰

) ≥ V (z), V (L

i

(z

⁰

)) ≥ V (L

i

(z)) for i ∈ S satisfies

V (L

_i

(z

⁰

)) ≥ V (L

_i

(z)) for i = 1, . . . , r.

P r o o f. This is precisely Lemma 13 of [17], except that that lemma has the additional condition V (z) = 0. Suppose that V (z) 6= 0. Let λ ∈ E be such that V (λ) = V (z) and put z

1

:= λ

⁻¹

z. Then V (z

1

) = 0. Now Lemma 5 follows at once from Lemma 13 of [17] applied to z

₁

, on observing that V (L

_i

(z

₁

)) = V (L

_i

(z)) − V (λ) for i = 1, . . . , r.

We call the subset S related to z as in Lemma 5 an anchor for z.

(16)

Lemma 6. Let d

1

, . . . , d

r

be positive rational numbers, γ a real and S a subset of {1, . . . , r} of cardinality n − 1. Put

T (S) :=

n

z ∈ E

ⁿ

: X

r

i=1

d

_i

V (L

_i

(z)) = γ, S is an anchor for z o

. Then for any z

₁

, z

₂

∈ T (S) with V (L

_i

(z

₁

)) = V (L

_i

(z

₂

)) for i ∈ S we have V (L

i

(z

1

)) = V (L

i

(z

2

)) for i = 1, . . . , r.

P r o o f. Let z

1

, z

2

∈ T (S) with V (L

i

(z

1

)) = V (L

i

(z

2

)) for i ∈ S.

We may assume without loss of generality that V (z

₂

) ≥ V (z

₁

). Then by Lemma 5 we have V (L

_i

(z

₂

)) ≥ V (L

_i

(z

₁

)) for i = 1, . . . , r. Together with P

_r

i=1

d

i

V (L

i

(z

j

)) = γ for j = 1, 2 this implies that V (L

i

(z

2

)) = V (L

i

(z

1

)) for i = 1, . . . , r.

As before, if we express an element of A as a t-tuple (ξ

₁

, . . . , ξ

_t

), say, then it is implicitly assumed that ξ

_i

∈ M

_i

for i = 1, . . . , t. Fix v ∈ M

_K

\ S.

For i = 1, . . . , t, let w

i1

, . . . , w

ig_i

denote the places on M

i

which lie above v, and denote by e

_ij

, f

_ij

the ramification index and residue class degree, respectively, of w

ij

over v. Let K denote the algebraic closure of K. Choose a continuation of ord

_v

to K and denote this also by ord

_v

; then ord

_v

assumes its values in Q. For i = 1, . . . , t let E

_i

denote the collection of K-isomorphic embeddings of M

i

into K; then E

i

can be expressed as a disjoint union,

E

_i

= E

_i1

∪ . . . ∪ E

_ig_i

with #E

_ij

= e

_ij

f

_ij

for j = 1, . . . , g

_i

such that for j = 1, . . . , g

_i

,

(3.1) ord

_w_ij

(α) = e

_ij

ord

_v

(σ(α)) for α ∈ M

_i

, σ ∈ E

_ij

.

Lemma 7. There are integers c

_ij

(i = 1, . . . , t, j = 1, . . . , g

_i

) and u

_v

with u

v

≤ ord

v

(β) such that for every solution x = (ξ

1

, . . . , ξ

t

) ∈ M of (1.7) we have

ord

_w_ij

(ξ

_i

) − c

_ij

≥ 0 for i = 1, . . . , t, j = 1, . . . , g

_i

, (3.2)

X

t i=1

g_i

X

j=1

f

_ij

{ord

_w_ij

(ξ

_i

) − c

_ij

} = u

_v

. (3.3)

P r o o f. Let {a

k

= (α

k1

, . . . , α

kt

) : k = 1, . . . , m} be a set of generators of M as an O

_S

-module. Define the integers

(3.4) c

_ij

= min{ord

_w_ij

(α

_ki

) : k = 1, . . . , m} for i = 1, . . . , t, j = 1, . . . , g

_i

. Let x = (ξ

₁

, . . . , ξ

_t

) ∈ M be a solution of (1.7). Then x = P

_m

k=1

β

_k

a

_k

for certain β

1

, . . . , β

m

∈ O

S

. Since the place w

ij

lies above v ∈ M

K

\ S, we have ord

_w_ij

(β

_k

) ≥ 0 for i = 1, . . . , t, j = 1, . . . , g

_i

. Together with ξ

_i

= P

_m

k=1

β

_k

α

_ki

for i = 1, . . . , t and (3.4), this implies ord

w_ij

(ξ

ij

) ≥ c

ij

for i = 1, . . . , t,

j = 1, . . . , g

_i