Abstract. The problem of nonparametric function ﬁtting using the complete orthogonal system of trigonometric functions e

(1)

W. P O P I ´ N S K I (Warszawa)

LEAST-SQUARES TRIGONOMETRIC REGRESSION ESTIMATION

Abstract. The problem of nonparametric function ﬁtting using the complete orthogonal system of trigonometric functions e

_k

, k = 0, 1, 2, . . . , for the observation model y

i

= f (x

in

) + η

i

, i = 1, . . . , n, is considered, where η

i

are uncorrelated random variables with zero mean value and ﬁnite variance, and the observation points x

_in

∈ [0, 2π], i = 1, . . . , n, are equidistant. Conditions for convergence of the mean-square prediction error (1/n) P

n

i=1

E(f (x

in

) − f b

N (n)

(x

in

))

²

, the integrated mean-square error Ekf − b f

N (n)

k

²

and the pointwise mean-square error E(f (x) − b f

_{N (n)}

(x))

²

of the estimator b f

_{N (n)}

(x) = P

N (n)

k=0

bc

^k

e

k

(x) for f ∈ C[0, 2π] and bc

⁰

, bc

¹

, . . . , bc

^{N (n)}

obtained by the least squares method are studied.

1. Introduction. Let y

i

, i = 1, . . . , n, be observations at equidistant points x

_in

= 2π(i − 1)/n, i = 1, . . . , n, which follow the model y

i

= f (x

_in

) + η

i

, where f : [0, 2π] → R is an unknown function satisfying appropriate conditions characterized in the sequel and η

i

, i = 1, . . . , n, are random variables satisfying the conditions Eη

_i

= 0 and Eη

_i

η

_j

= σ

²_η

δ

_ij

, where σ

_η²

> 0 and δ

ij

denotes the Kronecker delta.

The functions

(1) e

0

(x) = 1, e

2l−1

(x) = √

2 sin(lx), e

2l

(x) = √

2 cos(lx), l = 1, 2, . . . , constitute a complete orthogonal system in the space L

²

[0, 2π], normalized so that

1 2π

2π

\

0

e

²_k

(s) ds = 1, k = 0, 1, 2, . . .

1991 Mathematics Subject Classification: 62G07, 62F12.

Key words and phrases : Fourier coeﬃcients, trigonometric polynomial, least squares method, regression function, consistent estimator.

[121]

(2)

In consequence, any function f ∈ L

²

[0, 2π] has the representation f =

X

∞ k=0

c

k

e

k

, where c

k

= 1 2π

2π

\

0

f (s)e

k

(s) ds, k = 0, 1, 2, . . . We consider estimators of the Fourier coeﬃcients c

k

, k = 0, 1, . . . , N , having the form

(2) bc

^kn

= 1

n X

n i=1

y

i

e

k

(x

in

), k = 0, 1, . . . , N.

It is well known that in the case of equidistant observation points x

in

= 2π(i − 1)/n, i = 1, . . . , n, the above deﬁned estimators are for N = 2m, 2m + 1 ≤ n, least squares estimators of the Fourier coeﬃcients c

^k

, k = 0, 1, . . . , N , which is a consequence of the relations (see [1])

(3) 1

n X

n i=1

e

k

(x

in

)e

l

(x

in

) = δ

kl

for k, l = 0, 1, . . . , N , N = 2m, 2m + 1 ≤ n.

Observe that if the regression function f is continuous the estimators bc

^kn

of the Fourier coeﬃcients c

k

, k = 0, 1, . . . , are asymptotically unbiased and consistent in the mean-square sense. Indeed, for ﬁxed k, 0 ≤ k ≤ N, N = 2m, 2m + 1 ≤ n,

E(bc

^kn

− c

^k

)

²

= E(bc

^kn

− Ebc

^kn

)

²

+ (Ebc

^kn

− c

^k

)

²

and taking into account (2) we immediately obtain

E(bc

^kn

− Ebc

^kn

)(bc

^ln

− Ebc

^ln

) = σ

_η²

n

²

X

n i=1

e

k

(x

in

)e

l

(x

in

),

Ebc

^kn

− c

^k

= 1 n

X

n i=1

f (x

in

)e

k

(x

in

) − c

^k

, which in view of (3) yields

(4)

E(bc

^kn

− c

^k

)

²

= σ

²_η

n + (Ebc

^kn

− c

^k

)

²

, Ebc

kn

− c

k

= 1

2π 2π

n X

n i=1

f (x

_in

)e

_k

(x

_in

) − 1 2π

2π

\

0

f (s)e

_k

(s) ds.

The above equalities and continuity of f and e

k

imply that

n→∞

lim Ebc

^kn

− c

^k

= 0 and lim

n→∞

E(bc

^kn

− c

^k

)

²

= 0.

(3)

In the sequel we shall examine the asymptotic properties of the projection estimator of the regression function

f b

N

(x) = X

N k=0

bc

^kn

e

k

(x).

According to the Jackson theorem [6] for any 2π-periodic continuous function (i.e. for f ∈ C[0, 2π] satisfying f (0) = f (2π)) there exists a trigonometric polynomial of degree l

T

l

(x) = a

0

+ X

l k=1

(a

k

cos(kx) + b

k

sin(kx)), where a

²_l

+ b

²_l

6= 0, such that

sup

0≤s≤2π

|f (s) − T

^l

(s)| ≤ 12ω(1/l, f ),

where ω(δ, f ) (for δ > 0) denotes the modulus of continuity of f .

2. Asymptotic mean-square prediction error. Consider ﬁrst the mean-square prediction error of the estimator b f

N

, deﬁned by

D

_nN

= 1 n

X

n i=1

E(f (x

_in

) − b f

_N

(x

_in

))

²

.

In view of the orthogonality relations (3) the standard squared bias plus variance decomposition yields

(5) D

_nN

= 1

n X

n i=1

(f (x

_in

) − E b f

_N

(x

_in

))

²

+ σ

²_η

N + 1 n . It can be easily seen that for N = 2m, 2m + 1 ≤ n, the inequality

1 n

X

n i=1

(f (x

in

) − E b f

N

(x

in

))

²

≤ 1 n

X

n i=1

(f (x

in

) − T

^l

(x

in

))

²

holds for any trigonometric polynomial T

_l

of degree l ≤ m. Consequently, using (5) and applying the Jackson theorem we immediately see that for a 2π-periodic function f ∈ C[0, 2π] we have lim

^n→∞

D

nN (n)

= 0 on condition that lim

n→∞

N (n) = ∞ and lim

^n→∞

N (n)/n = 0.

From the equality (5) we see that for any regression function f the condition lim

_n→∞

N (n)/n = 0 is also necessary for lim

_n→∞

D

_{nN (n)}

= 0. For a continuous regression function f which is not a trigonometric polynomial of any ﬁnite order lim

n→∞

D

nN (n)

= 0 also implies that lim

n→∞

N (n) = ∞.

Indeed, if we assume that there exists a subsequence m

_k

, k = 1, 2, . . . , such

that the sequence N (m

k

) is bounded, then there also exists a subsequence n

l

(4)

such that N (n

l

) = M, l = 1, 2, . . . In consequence, putting f

M

= P

M k=0

c

k

e

k

we would have 1

n

l nl

X

i=1

(f (x

inl

) − E b f

N (nl)

(x

inl

))

²

= 1 n

l

nl

X

i=1

(f (x

_inl

) − f

M

(x

_inl

))

²

+ 1 n

l

nl

X

i=1

(f

_M

(x

_inl

) − E b f

_{N (n}_l₎

(x

_inl

))

²

+ 2 n

l

nl

X

i=1

(f

M

(x

inl

) − E b f

_{N (n}l)

(x

inl

))(f (x

inl

) − f

^M

(x

inl

))

and since the functions f and f

M

are continuous the second and third terms on the right-hand side would converge to zero because by the Schwarz inequality and (3),

1 n

l

nl

X

i=1

(f

_M

(x

_inl

) − E b f

_{N (n}_l₎

(x

_inl

))

²

≤ 1 n

l

nl

X

i=1

X

M k=0

(c

_k

− Ebc

knl

)

²

X

M k=0

e

²_k

(x

_inl

)

≤ (M + 1) X

M k=0

(c

k

− Ebc

^kn^l

)

²

. Consequently, we would have

l→∞

lim 1 n

l

nl

X

i=1

(f (x

inl

) − E b f

N (nl)

(x

inl

))

²

= 1 2π

2π

\

0

(f (s) − f

^M

(s))

²

ds > 0, so the sequence D

_{nN (n)}

would not converge to zero.

The above conclusions allow us to formulate the following theorem.

Theorem 2.1. If the regression function f is continuous, 2π-periodic and not a trigonometric polynomial of any finite order, then the projection estimator b f

N (n)

is consistent in the sense of the mean-square prediction error, i.e.

n→∞

lim 1 n

X

n i=1

E(f (x

in

) − b f

N (n)

(x

in

))

²

= 0,

if and only if the sequence of natural numbers N (n), n = 1, 2, . . . , satisfies

n→∞

lim N (n) = ∞, lim

n→∞

N (n)/n = 0.

Furthermore, since for a function f satisfying the Lipschitz condition with exponent 0 < α ≤ 1 we have ω(δ, f ) ≤ Lδ

^α

, where L > 0, it is easy to see that the following corollary holds.

Corollary 2.1. Assume that the regression function f is 2π-periodic

and satisfies the Lipschitz condition with exponent 0 < α ≤ 1. If the

(5)

sequence of even natural numbers N (n), n = 1, 2, . . . , satisfies N (n) ∼ n

^1/(1+2α)

(i.e. r

1

≤ n

^−1/(1+2α)

N (n) ≤ r

²

for r

1

, r

2

> 0), then

1 n

X

n i=1

E(f (x

in

) − b f

_{N (n)}

(x

in

))

²

= O(n

^{−2α/(1+2α)}

).

The above results complement the ones presented in [2] which were proved for a more general ﬁxed point design under more restrictive assumptions on the smoothness of the regression function.

3. Convergence of the integrated mean-square error. The for- mula for the bias of the estimator bc

kn

(see (4)) can be rewritten in the form

Ebc

^kn

− c

^k

=

2π

\

0

f e

k

d(F

n

− F ),

where F denotes the uniform distribution function on [0, 2π] and F

n

the empirical distribution function of the “sample” x

in

, i = 1, . . . , n. If f is absolutely continuous, then integrating by parts gives

Ebc

^kn

− c

^k

=

2π

\

0

(f

^′

e

k

+ f e

^′_k

)(F

n

− F ), and since sup |F − F

ⁿ

| ≤ 1/n this yields

|Ebc

kn

− c

k

| ≤ 1 n

^2π^\

0

|f

^′

e

_k

| +

2π\

0

|f e

^′k

| and ﬁnally in view of deﬁnition (1) we obtain

(6) (Ebc

⁰ⁿ

− c

⁰

)

²

≤ 1 n

²

kf

^′

k

²1

, (Ebc

^kn

− c

^k

)

²

≤ 4

n

²

(kf

^′

k

²1

+ l

²

kf k

²1

),

for k = 2l −1, 2l, l = 1, . . . , m, 2m+1 ≤ n, where k∗k

^p

denotes the L

^p

[0, 2π]

norm.

Now consider the integrated mean-square error of the estimator b f

N

, R

nN

= 1

2π E

2π

\

0

(f − b f

N

)

²

= p

N

+ X

N k=0

E(bc

^kn

− c

^k

)

²

, where p

N

= P

∞

k=N +1

c

²_k

. According to (4) we can write

(7) R

nN

= p

N

+

X

N k=0

(Ebc

^kn

− c

^k

)

²

+ σ

²_η

N + 1

n .

(6)

For N = 2m, 2m + 1 ≤ n, taking into account (6) we obtain X

N

k=0

(Ebc

^kn

− c

^k

)

²

≤ 1 n

²

h (8m + 1)kf

^′

k

²1

+ 8kf k

²1

X

m l=1

l

²

i (8)

≤ 1 n

²

(4N + 1)kf

^′

k

²1

+ N (N + 1)(N + 2) 3 kf k

²1

since P

m

l=1

l

²

= m(m + 1)(2m + 1)/6. The above estimate together with (7) allows us to formulate the following theorem.

Theorem 3.1. If the sequence of even natural numbers N (n), n = 1, 2, . . . , satisfies

n→∞

lim N (n) = ∞, lim

n→∞

N (n)

^3/2

/n = 0,

then the projection estimator b f

_{N (n)}

of the absolutely continuous regression function f is consistent in the sense of the integrated mean-square error , i.e.

n→∞

lim Ekf − b f

_{N (n)}

k

²2

= 0.

Rafaj lowicz [10] proved that a theorem similar to 3.1 holds in the case of regression functions for which the error of uniform approximation by trigonometric polynomials tends to zero as the polynomial degree increases, i.e. it holds for continuous and 2π-periodic regression functions [6]. It should be noted that our theorem extends the result from [10] since it is true for nonperiodic regression functions.

In order to obtain a result concerning the convergence rate of the integrated mean-square error we need the following lemma.

Lemma 3.1. If the function f is absolutely continuous, then for N = 2m, m = 1, 2, . . . ,

p

N

≤ 5kf

^′

k

²1

π

²

N .

P r o o f. Integrating by parts gives for l = 1, 2, . . . , c

2l−1

= 1

√ 2 lπ

h f (0) − f (2π) +

2π

\

0

f

^′

(s) cos(ls) ds i ,

c

_2l

= − 1

√ 2 lπ

2π

\

0

f

^′

(s) sin(ls) ds, and in consequence

|c

2l−1

| ≤ 2

√ 2 lπ kf

^′

k

1

, |c

2l

| ≤ 1

√ 2 lπ kf

^′

k

1

.

(7)

Hence, for N = 2m, p

_N

=

X

∞ k=N +1

c

²_k

= X

∞ l=m+1

(c

²_2l−1

+ c

²_2l

) ≤ 5kf

^′

k

²1

2π

²

X

∞ l=m+1

1 l

²

≤ 5kf

^′

k

²1

2π

²

X

∞ l=m+1

1 l(l − 1) = 5kf

^′

k

²1

2π

²

· 1 m .

According to (7), (8) and by Lemma 3.1 we have for N = 2m, R

_nN

≤ 5kf

^′

k

²1

π

²

N + 1 n

²

(4N + 1)kf

^′

k

²1

+ N (N + 1)(N + 2) 3 kf k

²1

+ σ

²_η

N + 1 n , which can be rewritten in the form

R

nN

≤ A N + 1

n

²

(BN + CN

³

) + DN n ,

where A, B, C, D > 0 are suitably chosen constants. From the last inequality it is easy to see that the following corollary holds.

Corollary 3.1. If the regression function f is absolutely continuous and the sequence of even natural numbers N (n), n = 1, 2, . . . , satisfies N (n) ∼ n

^1/2

(i.e. r

1

≤ n

^−1/2

N (n) ≤ r

²

for r

1

, r

2

> 0), then

Ekf − b f

N (n)

k

²2

= O(n

^−1/2

).

In papers on wavelet methods of nonparametric function estimation (e.g.

[5]) one can ﬁnd results giving the IMSE decay rate of a wavelet projection estimator for an equidistant point design.

4. Pointwise mean-square consistency of the estimator. In this section we derive suﬃcient conditions for pointwise mean-square consistency of the projection estimator b f

N

considered.

If the Fourier series of f converges to f (x) at some x ∈ [0, 2π], then E(f (x) − b f

N

(x))

²

= E X

^N

k=0

(c

k

− bc

^kn

)e

k

(x)

2

+ r

_N²

(x)

+ 2r

N

(x) X

N k=0

(c

k

− Ebc

^kn

)e

k

(x), where r

_N

(x) = P

∞

k=N +1

c

_k

e

_k

(x). From the Cauchy–Schwarz inequality it

further follows that

(8)

E(f (x) − b f

N

(x))

²

≤ X

N k=0

E(c

k

− bc

^kn

)

²

X

N k=0

e

²_k

(x) + r

²_N

(x)

+ 2|r

^N

(x)| X

^N

k=0

(c

k

− Ebc

^kn

)

²

1/2

X

^N

k=0

e

²_k

(x)

1/2

, and according to (4) since P

N

k=0

e

²_k

(x) = N + 1 for N = 2m, m ≥ 0, x ∈ [0, 2π], we ﬁnally have

(9) E(f (x) − b f

N

(x))

²

≤ (N + 1) X

N k=0

(c

k

− Ebc

^kn

)

²

+ σ

_η²

(N + 1)

²

n + 2|r

N

(x)|(N + 1)

^1/2

X

^N

k=0

(c

_k

− Ebc

kn

)

²

1/2

+ r

_N²

(x).

If we assume that the regression function f is absolutely continuous, then since such a function is both continuous and of bounded variation in [0, 2π], its Fourier series converges uniformly to f in (δ, 2π − δ) for δ > 0 (see Corollary 2.62 [11]), so that lim

n→∞

r

N (n)

(x) = 0 uniformly for x ∈ (δ, 2π − δ) if lim

n→∞

N (n) = ∞. Hence, the estimates in (8) and (9) imply that the following theorem holds.

Theorem 4.1. If the sequence N (n), n = 1, 2, . . . , of even natural numbers satisfies

n→∞

lim N (n) = ∞, lim

n→∞

N (n)

²

/n = 0,

then for any δ > 0 the projection estimator b f

_{N (n)}

of the absolutely continuous regression function f is uniformly consistent in the sense of the pointwise mean-square error in the interval (δ, 2π − δ), i.e.

n→∞

lim E(f (x) − b f

_{N (n)}

(x))

²

= 0 uniformly for x ∈ (δ, 2π − δ).

It should be noted that for an absolutely continuous and 2π-periodic regression function the pointwise mean-square error converges uniformly in the closed interval [0, 2π], which follows from the fact that then the Fourier series of the regression function converges uniformly in this interval [11].

Let us also remark that Rafaj lowicz [10] obtained suﬃcient conditions for uniform pointwise mean-square consistency of b f

N

in [0, 2π] only for 2π- periodic regression functions.

5. Selecting the regression order. In this section we study a method

of selecting a good value of N from the data, namely, the one based on

(9)

Mallows’s C

p

criterion [7]

(10) C(N ) = 1

n X

n i=1

(y

i

− b f

N

(x

in

))

²

+ 2N b σ

²

n ,

where b σ

²

is any consistent estimator of σ

²_η

(see [3], [4] for examples of such estimators). One selects a value b N

_n

by minimizing C(N ) over the integers 0 ≤ N ≤ n − 1.

We obtain results on asymptotic properties of this method for selecting the trigonometric regression order N . First, we prove the following theorem.

Theorem 5.1. Assume that

(a) the function f is absolutely continuous and is not a trigonometric polynomial of any finite order ,

(b) there exists a sequence of nonnegative real numbers ε

_k

, k = 0, 1, . . . , such that the sequence (k + 1)ε

k

is nonincreasing and

|c

^k

| ≤ ε

^k

, X

∞ k=0

ε

k

< ∞, (c) µ

4

= sup Eη

⁴_i

< ∞,

(d) b σ

²

−→ σ

^p η²

as n → ∞.

If b N

n

is the minimizer of (10), then

2π\

0

(f − b f b

^Nⁿ

)

²

= O

_p

(n

^−1/2

).

P r o o f. Under the above assumptions Theorem 2 of [9] holds, which asserts that for the loss function r

n

(N ) =

T2π

0

(f − b f

N

)

²

we have

(11) r

n

( b N

n

)

min

0≤N ≤n−1

r

n

(N )

−→ 1,

p

n → ∞,

even if the absolute continuity assumption is not satisﬁed. If this assumption is satisﬁed we have, by Corollary 3.1,

0≤N ≤n−1

min E

2π

\

0

(f − b f

_N

)

²

= O(n

^−1/2

), and consequently

0≤N ≤n−1

min

2π

\

0

(f − b f

N

)

²

= O

p

(n

^−1/2

).

The last equality together with (11) implies that r

_n

( b N

_n

) = O

_p

(n

^−1/2

), which

completes the proof.

(10)

We can also consider the loss function d

n

(N ) = 1

n X

n i=1

(f (x

in

) − b f

N

(x

in

))

²

.

Under the assumptions of Theorem 5.1 (without the absolute continuity assumption) Theorem 2 of [9] also assures that

d

n

( b N

n

) min

0≤N ≤n−1

d

n

(N )

−→ 1,

p

n → ∞.

Thus, if the regression function is 2π-periodic we can prove analogously using Theorem 2.1 that lim

_n→∞

d

_n

( b N

_n

) = 0. If the 2π-periodic regression

^p

function f also satisﬁes the Lipschitz condition with exponent 0 < α ≤ 1, then using Corollary 2.1 we can prove that d

n

( b N

n

) = O

p

(n

^{−2α/(1+2α)}

).

A sequence of real numbers ε

k

, k = 0, 1, . . . , which satisﬁes the conditions of assumption (b) in the above theorem exists for example when (see [9])

X

∞ k=0

|c

^k

| ln . . . ln(k + 1) < ∞,

where ln . . . ln(k + 1) denotes a multiple logarithm of k + 1.

Results concerning other asymptotic properties of the estimator considered, e.g. the limit distribution of its integrated squared error, are obtained in [8].

References

[1] B. D r o g e, On finite-sample properties of adaptive least-squares regression estimates, Statistics 24 (1993), 181–203.

[2] R. L. E u b a n k and P. S p e c k m a n, Convergence rates for trigonometric and polynomial-trigonometric regression estimators, Statist. Probab. Lett. 11 (1991), 119–

124. [3] T. G a s s e r, L. S r o k a and C. J e n n e n - S t e i n m e t z, Residual variance and residual pattern in nonlinear regression, Biometrika 73 (1986), 625–633.

[4] P. H a l l, J. W. K a y and D. M. T i t t e r i n g t o n, Asymptotically optimal difference- based estimation of variance in nonparametric regression, ibid. 77 (1990), 521–528.

[5] P. H a l l and P. P a t i l, On wavelet methods for estimating smooth functions, J.

Bernoulli Soc. 1 (1995), 41–58.

[6] G. G. L o r e n t z, Approximation of Functions, Holt, Reinehart & Winston, New York, 1966.

[7] C. L. M a l l o w s, Some comments on C

p

, Technometrics 15 (1973), 661–675.

[8] E. N a d a r a y a, Limit distribution of the integrated squared error of trigonometric series regression estimator , Proc. Georgian Acad. Sci. Math. 1 (1993), 221–237.

[9] B. T. P o l y a k and A. B. T s y b a k o v, Asymptotic optimality of the C

p

criterion in projection type estimation of regression functions, Teor. Veroyatnost. Primenen.

35 (1990), 305–317 (in Russian).

(11)

[10] E. R a f a j l o w i c z, Nonparametric least-squares estimation of a regression function, Statistics 19 (1988), 349–358.

[11] A. Z y g m u n d, Trigonometrical Series, Dover, 1955.

Waldemar Popi´ nski Department of Standards Central Statistical Oﬃce Al. Niepodleg lo´sci 208 00-925 Warszawa, Poland E-mail: w.popinski@stat.gov.pl

Received on 30.10.1997;

revised version on 18.1.1999