• Nie Znaleziono Wyników

Abstract. The problem of nonparametric function fitting using the com- plete orthogonal system of trigonometric functions e

N/A
N/A
Protected

Academic year: 2021

Share "Abstract. The problem of nonparametric function fitting using the com- plete orthogonal system of trigonometric functions e"

Copied!
11
0
0

Pełen tekst

(1)

W. P O P I ´ N S K I (Warszawa)

LEAST-SQUARES TRIGONOMETRIC REGRESSION ESTIMATION

Abstract. The problem of nonparametric function fitting using the com- plete orthogonal system of trigonometric functions e

k

, k = 0, 1, 2, . . . , for the observation model y

i

= f (x

in

) + η

i

, i = 1, . . . , n, is considered, where η

i

are uncorrelated random variables with zero mean value and finite variance, and the observation points x

in

∈ [0, 2π], i = 1, . . . , n, are equidistant. Conditions for convergence of the mean-square prediction error (1/n) P

n

i=1

E(f (x

in

) − f b

N (n)

(x

in

))

2

, the integrated mean-square error Ekf − b f

N (n)

k

2

and the point- wise mean-square error E(f (x) − b f

N (n)

(x))

2

of the estimator b f

N (n)

(x) = P

N (n)

k=0

bc

k

e

k

(x) for f ∈ C[0, 2π] and bc

0

, bc

1

, . . . , bc

N (n)

obtained by the least squares method are studied.

1. Introduction. Let y

i

, i = 1, . . . , n, be observations at equidistant points x

in

= 2π(i − 1)/n, i = 1, . . . , n, which follow the model y

i

= f (x

in

) + η

i

, where f : [0, 2π] → R is an unknown function satisfying appropriate conditions characterized in the sequel and η

i

, i = 1, . . . , n, are random variables satisfying the conditions Eη

i

= 0 and Eη

i

η

j

= σ

2η

δ

ij

, where σ

η2

> 0 and δ

ij

denotes the Kronecker delta.

The functions

(1) e

0

(x) = 1, e

2l−1

(x) = √

2 sin(lx), e

2l

(x) = √

2 cos(lx), l = 1, 2, . . . , constitute a complete orthogonal system in the space L

2

[0, 2π], normalized so that

1 2π

\

0

e

2k

(s) ds = 1, k = 0, 1, 2, . . .

1991 Mathematics Subject Classification: 62G07, 62F12.

Key words and phrases : Fourier coefficients, trigonometric polynomial, least squares method, regression function, consistent estimator.

[121]

(2)

In consequence, any function f ∈ L

2

[0, 2π] has the representation f =

X

∞ k=0

c

k

e

k

, where c

k

= 1 2π

\

0

f (s)e

k

(s) ds, k = 0, 1, 2, . . . We consider estimators of the Fourier coefficients c

k

, k = 0, 1, . . . , N , having the form

(2) bc

kn

= 1

n X

n i=1

y

i

e

k

(x

in

), k = 0, 1, . . . , N.

It is well known that in the case of equidistant observation points x

in

= 2π(i − 1)/n, i = 1, . . . , n, the above defined estimators are for N = 2m, 2m + 1 ≤ n, least squares estimators of the Fourier coefficients c

k

, k = 0, 1, . . . , N , which is a consequence of the relations (see [1])

(3) 1

n X

n i=1

e

k

(x

in

)e

l

(x

in

) = δ

kl

for k, l = 0, 1, . . . , N , N = 2m, 2m + 1 ≤ n.

Observe that if the regression function f is continuous the estimators bc

kn

of the Fourier coefficients c

k

, k = 0, 1, . . . , are asymptotically unbiased and consistent in the mean-square sense. Indeed, for fixed k, 0 ≤ k ≤ N, N = 2m, 2m + 1 ≤ n,

E(bc

kn

− c

k

)

2

= E(bc

kn

− Ebc

kn

)

2

+ (Ebc

kn

− c

k

)

2

and taking into account (2) we immediately obtain

E(bc

kn

− Ebc

kn

)(bc

ln

− Ebc

ln

) = σ

η2

n

2

X

n i=1

e

k

(x

in

)e

l

(x

in

),

Ebc

kn

− c

k

= 1 n

X

n i=1

f (x

in

)e

k

(x

in

) − c

k

, which in view of (3) yields

(4)

E(bc

kn

− c

k

)

2

= σ

2η

n + (Ebc

kn

− c

k

)

2

, Ebc

kn

− c

k

= 1

2π 2π

n X

n i=1

f (x

in

)e

k

(x

in

) − 1 2π

\

0

f (s)e

k

(s) ds.

The above equalities and continuity of f and e

k

imply that

n→∞

lim Ebc

kn

− c

k

= 0 and lim

n→∞

E(bc

kn

− c

k

)

2

= 0.

(3)

In the sequel we shall examine the asymptotic properties of the projection estimator of the regression function

f b

N

(x) = X

N k=0

bc

kn

e

k

(x).

According to the Jackson theorem [6] for any 2π-periodic continuous func- tion (i.e. for f ∈ C[0, 2π] satisfying f (0) = f (2π)) there exists a trigono- metric polynomial of degree l

T

l

(x) = a

0

+ X

l k=1

(a

k

cos(kx) + b

k

sin(kx)), where a

2l

+ b

2l

6= 0, such that

sup

0≤s≤2π

|f (s) − T

l

(s)| ≤ 12ω(1/l, f ),

where ω(δ, f ) (for δ > 0) denotes the modulus of continuity of f .

2. Asymptotic mean-square prediction error. Consider first the mean-square prediction error of the estimator b f

N

, defined by

D

nN

= 1 n

X

n i=1

E(f (x

in

) − b f

N

(x

in

))

2

.

In view of the orthogonality relations (3) the standard squared bias plus variance decomposition yields

(5) D

nN

= 1

n X

n i=1

(f (x

in

) − E b f

N

(x

in

))

2

+ σ

2η

N + 1 n . It can be easily seen that for N = 2m, 2m + 1 ≤ n, the inequality

1 n

X

n i=1

(f (x

in

) − E b f

N

(x

in

))

2

≤ 1 n

X

n i=1

(f (x

in

) − T

l

(x

in

))

2

holds for any trigonometric polynomial T

l

of degree l ≤ m. Consequently, using (5) and applying the Jackson theorem we immediately see that for a 2π-periodic function f ∈ C[0, 2π] we have lim

n→∞

D

nN (n)

= 0 on condition that lim

n→∞

N (n) = ∞ and lim

n→∞

N (n)/n = 0.

From the equality (5) we see that for any regression function f the con- dition lim

n→∞

N (n)/n = 0 is also necessary for lim

n→∞

D

nN (n)

= 0. For a continuous regression function f which is not a trigonometric polynomial of any finite order lim

n→∞

D

nN (n)

= 0 also implies that lim

n→∞

N (n) = ∞.

Indeed, if we assume that there exists a subsequence m

k

, k = 1, 2, . . . , such

that the sequence N (m

k

) is bounded, then there also exists a subsequence n

l

(4)

such that N (n

l

) = M, l = 1, 2, . . . In consequence, putting f

M

= P

M k=0

c

k

e

k

we would have 1

n

l nl

X

i=1

(f (x

inl

) − E b f

N (nl)

(x

inl

))

2

= 1 n

l

nl

X

i=1

(f (x

inl

) − f

M

(x

inl

))

2

+ 1 n

l

nl

X

i=1

(f

M

(x

inl

) − E b f

N (nl)

(x

inl

))

2

+ 2 n

l

nl

X

i=1

(f

M

(x

inl

) − E b f

N (nl)

(x

inl

))(f (x

inl

) − f

M

(x

inl

))

and since the functions f and f

M

are continuous the second and third terms on the right-hand side would converge to zero because by the Schwarz in- equality and (3),

1 n

l

nl

X

i=1

(f

M

(x

inl

) − E b f

N (nl)

(x

inl

))

2

≤ 1 n

l

nl

X

i=1

X

M k=0

(c

k

− Ebc

knl

)

2

X

M k=0

e

2k

(x

inl

)

≤ (M + 1) X

M k=0

(c

k

− Ebc

knl

)

2

. Consequently, we would have

l→∞

lim 1 n

l

nl

X

i=1

(f (x

inl

) − E b f

N (nl)

(x

inl

))

2

= 1 2π

\

0

(f (s) − f

M

(s))

2

ds > 0, so the sequence D

nN (n)

would not converge to zero.

The above conclusions allow us to formulate the following theorem.

Theorem 2.1. If the regression function f is continuous, 2π-periodic and not a trigonometric polynomial of any finite order, then the projection estimator b f

N (n)

is consistent in the sense of the mean-square prediction error, i.e.

n→∞

lim 1 n

X

n i=1

E(f (x

in

) − b f

N (n)

(x

in

))

2

= 0,

if and only if the sequence of natural numbers N (n), n = 1, 2, . . . , satisfies

n→∞

lim N (n) = ∞, lim

n→∞

N (n)/n = 0.

Furthermore, since for a function f satisfying the Lipschitz condition with exponent 0 < α ≤ 1 we have ω(δ, f ) ≤ Lδ

α

, where L > 0, it is easy to see that the following corollary holds.

Corollary 2.1. Assume that the regression function f is 2π-periodic

and satisfies the Lipschitz condition with exponent 0 < α ≤ 1. If the

(5)

sequence of even natural numbers N (n), n = 1, 2, . . . , satisfies N (n) ∼ n

1/(1+2α)

(i.e. r

1

≤ n

−1/(1+2α)

N (n) ≤ r

2

for r

1

, r

2

> 0), then

1 n

X

n i=1

E(f (x

in

) − b f

N (n)

(x

in

))

2

= O(n

−2α/(1+2α)

).

The above results complement the ones presented in [2] which were proved for a more general fixed point design under more restrictive assump- tions on the smoothness of the regression function.

3. Convergence of the integrated mean-square error. The for- mula for the bias of the estimator bc

kn

(see (4)) can be rewritten in the form

Ebc

kn

− c

k

=

\

0

f e

k

d(F

n

− F ),

where F denotes the uniform distribution function on [0, 2π] and F

n

the empirical distribution function of the “sample” x

in

, i = 1, . . . , n. If f is absolutely continuous, then integrating by parts gives

Ebc

kn

− c

k

=

\

0

(f

e

k

+ f e

k

)(F

n

− F ), and since sup |F − F

n

| ≤ 1/n this yields

|Ebc

kn

− c

k

| ≤ 1 n



\

0

|f

e

k

| +

\

0

|f e

k

|  and finally in view of definition (1) we obtain

(6) (Ebc

0n

− c

0

)

2

≤ 1 n

2

kf

k

21

, (Ebc

kn

− c

k

)

2

≤ 4

n

2

(kf

k

21

+ l

2

kf k

21

),

for k = 2l −1, 2l, l = 1, . . . , m, 2m+1 ≤ n, where k∗k

p

denotes the L

p

[0, 2π]

norm.

Now consider the integrated mean-square error of the estimator b f

N

, R

nN

= 1

2π E

\

0

(f − b f

N

)

2

= p

N

+ X

N k=0

E(bc

kn

− c

k

)

2

, where p

N

= P

k=N +1

c

2k

. According to (4) we can write

(7) R

nN

= p

N

+

X

N k=0

(Ebc

kn

− c

k

)

2

+ σ

2η

N + 1

n .

(6)

For N = 2m, 2m + 1 ≤ n, taking into account (6) we obtain X

N

k=0

(Ebc

kn

− c

k

)

2

≤ 1 n

2

h (8m + 1)kf

k

21

+ 8kf k

21

X

m l=1

l

2

i (8)

≤ 1 n

2



(4N + 1)kf

k

21

+ N (N + 1)(N + 2) 3 kf k

21



since P

m

l=1

l

2

= m(m + 1)(2m + 1)/6. The above estimate together with (7) allows us to formulate the following theorem.

Theorem 3.1. If the sequence of even natural numbers N (n), n = 1, 2, . . . , satisfies

n→∞

lim N (n) = ∞, lim

n→∞

N (n)

3/2

/n = 0,

then the projection estimator b f

N (n)

of the absolutely continuous regression function f is consistent in the sense of the integrated mean-square error , i.e.

n→∞

lim Ekf − b f

N (n)

k

22

= 0.

Rafaj lowicz [10] proved that a theorem similar to 3.1 holds in the case of regression functions for which the error of uniform approximation by trigonometric polynomials tends to zero as the polynomial degree increases, i.e. it holds for continuous and 2π-periodic regression functions [6]. It should be noted that our theorem extends the result from [10] since it is true for nonperiodic regression functions.

In order to obtain a result concerning the convergence rate of the inte- grated mean-square error we need the following lemma.

Lemma 3.1. If the function f is absolutely continuous, then for N = 2m, m = 1, 2, . . . ,

p

N

≤ 5kf

k

21

π

2

N .

P r o o f. Integrating by parts gives for l = 1, 2, . . . , c

2l−1

= 1

√ 2 lπ

h f (0) − f (2π) +

\

0

f

(s) cos(ls) ds i ,

c

2l

= − 1

√ 2 lπ

\

0

f

(s) sin(ls) ds, and in consequence

|c

2l−1

| ≤ 2

√ 2 lπ kf

k

1

, |c

2l

| ≤ 1

√ 2 lπ kf

k

1

.

(7)

Hence, for N = 2m, p

N

=

X

∞ k=N +1

c

2k

= X

∞ l=m+1

(c

22l−1

+ c

22l

) ≤ 5kf

k

21

2

X

∞ l=m+1

1 l

2

≤ 5kf

k

21

2

X

∞ l=m+1

1

l(l − 1) = 5kf

k

21

2

· 1 m .

According to (7), (8) and by Lemma 3.1 we have for N = 2m, R

nN

≤ 5kf

k

21

π

2

N + 1 n

2



(4N + 1)kf

k

21

+ N (N + 1)(N + 2) 3 kf k

21



+ σ

2η

N + 1 n , which can be rewritten in the form

R

nN

≤ A N + 1

n

2

(BN + CN

3

) + DN n ,

where A, B, C, D > 0 are suitably chosen constants. From the last inequality it is easy to see that the following corollary holds.

Corollary 3.1. If the regression function f is absolutely continuous and the sequence of even natural numbers N (n), n = 1, 2, . . . , satisfies N (n) ∼ n

1/2

(i.e. r

1

≤ n

−1/2

N (n) ≤ r

2

for r

1

, r

2

> 0), then

Ekf − b f

N (n)

k

22

= O(n

−1/2

).

In papers on wavelet methods of nonparametric function estimation (e.g.

[5]) one can find results giving the IMSE decay rate of a wavelet projection estimator for an equidistant point design.

4. Pointwise mean-square consistency of the estimator. In this section we derive sufficient conditions for pointwise mean-square consistency of the projection estimator b f

N

considered.

If the Fourier series of f converges to f (x) at some x ∈ [0, 2π], then E(f (x) − b f

N

(x))

2

= E  X

N

k=0

(c

k

− bc

kn

)e

k

(x) 

2

+ r

N2

(x)

+ 2r

N

(x) X

N k=0

(c

k

− Ebc

kn

)e

k

(x), where r

N

(x) = P

k=N +1

c

k

e

k

(x). From the Cauchy–Schwarz inequality it

further follows that

(8)

E(f (x) − b f

N

(x))

2

≤ X

N k=0

E(c

k

− bc

kn

)

2

X

N k=0

e

2k

(x) + r

2N

(x)

+ 2|r

N

(x)|  X

N

k=0

(c

k

− Ebc

kn

)

2



1/2

 X

N

k=0

e

2k

(x) 

1/2

, and according to (4) since P

N

k=0

e

2k

(x) = N + 1 for N = 2m, m ≥ 0, x ∈ [0, 2π], we finally have

(9) E(f (x) − b f

N

(x))

2

≤ (N + 1) X

N k=0

(c

k

− Ebc

kn

)

2

+ σ

η2

(N + 1)

2

n + 2|r

N

(x)|(N + 1)

1/2

 X

N

k=0

(c

k

− Ebc

kn

)

2



1/2

+ r

N2

(x).

If we assume that the regression function f is absolutely continuous, then since such a function is both continuous and of bounded variation in [0, 2π], its Fourier series converges uniformly to f in (δ, 2π − δ) for δ > 0 (see Corollary 2.62 [11]), so that lim

n→∞

r

N (n)

(x) = 0 uniformly for x ∈ (δ, 2π − δ) if lim

n→∞

N (n) = ∞. Hence, the estimates in (8) and (9) imply that the following theorem holds.

Theorem 4.1. If the sequence N (n), n = 1, 2, . . . , of even natural numbers satisfies

n→∞

lim N (n) = ∞, lim

n→∞

N (n)

2

/n = 0,

then for any δ > 0 the projection estimator b f

N (n)

of the absolutely con- tinuous regression function f is uniformly consistent in the sense of the pointwise mean-square error in the interval (δ, 2π − δ), i.e.

n→∞

lim E(f (x) − b f

N (n)

(x))

2

= 0 uniformly for x ∈ (δ, 2π − δ).

It should be noted that for an absolutely continuous and 2π-periodic regression function the pointwise mean-square error converges uniformly in the closed interval [0, 2π], which follows from the fact that then the Fourier series of the regression function converges uniformly in this interval [11].

Let us also remark that Rafaj lowicz [10] obtained sufficient conditions for uniform pointwise mean-square consistency of b f

N

in [0, 2π] only for 2π- periodic regression functions.

5. Selecting the regression order. In this section we study a method

of selecting a good value of N from the data, namely, the one based on

(9)

Mallows’s C

p

criterion [7]

(10) C(N ) = 1

n X

n i=1

(y

i

− b f

N

(x

in

))

2

+ 2N b σ

2

n ,

where b σ

2

is any consistent estimator of σ

2η

(see [3], [4] for examples of such estimators). One selects a value b N

n

by minimizing C(N ) over the integers 0 ≤ N ≤ n − 1.

We obtain results on asymptotic properties of this method for selecting the trigonometric regression order N . First, we prove the following theorem.

Theorem 5.1. Assume that

(a) the function f is absolutely continuous and is not a trigonometric polynomial of any finite order ,

(b) there exists a sequence of nonnegative real numbers ε

k

, k = 0, 1, . . . , such that the sequence (k + 1)ε

k

is nonincreasing and

|c

k

| ≤ ε

k

, X

∞ k=0

ε

k

< ∞, (c) µ

4

= sup Eη

4i

< ∞,

(d) b σ

2

−→ σ

p η2

as n → ∞.

If b N

n

is the minimizer of (10), then

\

0

(f − b f b

Nn

)

2

= O

p

(n

−1/2

).

P r o o f. Under the above assumptions Theorem 2 of [9] holds, which asserts that for the loss function r

n

(N ) =

T

0

(f − b f

N

)

2

we have

(11) r

n

( b N

n

)

min

0≤N ≤n−1

r

n

(N )

−→ 1,

p

n → ∞,

even if the absolute continuity assumption is not satisfied. If this assumption is satisfied we have, by Corollary 3.1,

0≤N ≤n−1

min E

\

0

(f − b f

N

)

2

= O(n

−1/2

), and consequently

0≤N ≤n−1

min

\

0

(f − b f

N

)

2

= O

p

(n

−1/2

).

The last equality together with (11) implies that r

n

( b N

n

) = O

p

(n

−1/2

), which

completes the proof.

(10)

We can also consider the loss function d

n

(N ) = 1

n X

n i=1

(f (x

in

) − b f

N

(x

in

))

2

.

Under the assumptions of Theorem 5.1 (without the absolute continuity assumption) Theorem 2 of [9] also assures that

d

n

( b N

n

) min

0≤N ≤n−1

d

n

(N )

−→ 1,

p

n → ∞.

Thus, if the regression function is 2π-periodic we can prove analogously using Theorem 2.1 that lim

n→∞

d

n

( b N

n

) = 0. If the 2π-periodic regression

p

function f also satisfies the Lipschitz condition with exponent 0 < α ≤ 1, then using Corollary 2.1 we can prove that d

n

( b N

n

) = O

p

(n

−2α/(1+2α)

).

A sequence of real numbers ε

k

, k = 0, 1, . . . , which satisfies the conditions of assumption (b) in the above theorem exists for example when (see [9])

X

∞ k=0

|c

k

| ln . . . ln(k + 1) < ∞,

where ln . . . ln(k + 1) denotes a multiple logarithm of k + 1.

Results concerning other asymptotic properties of the estimator consid- ered, e.g. the limit distribution of its integrated squared error, are obtained in [8].

References

[1] B. D r o g e, On finite-sample properties of adaptive least-squares regression esti- mates, Statistics 24 (1993), 181–203.

[2] R. L. E u b a n k and P. S p e c k m a n, Convergence rates for trigonometric and poly- nomial-trigonometric regression estimators, Statist. Probab. Lett. 11 (1991), 119–

124.

[3] T. G a s s e r, L. S r o k a and C. J e n n e n - S t e i n m e t z, Residual variance and residual pattern in nonlinear regression, Biometrika 73 (1986), 625–633.

[4] P. H a l l, J. W. K a y and D. M. T i t t e r i n g t o n, Asymptotically optimal difference- based estimation of variance in nonparametric regression, ibid. 77 (1990), 521–528.

[5] P. H a l l and P. P a t i l, On wavelet methods for estimating smooth functions, J.

Bernoulli Soc. 1 (1995), 41–58.

[6] G. G. L o r e n t z, Approximation of Functions, Holt, Reinehart & Winston, New York, 1966.

[7] C. L. M a l l o w s, Some comments on C

p

, Technometrics 15 (1973), 661–675.

[8] E. N a d a r a y a, Limit distribution of the integrated squared error of trigonometric series regression estimator , Proc. Georgian Acad. Sci. Math. 1 (1993), 221–237.

[9] B. T. P o l y a k and A. B. T s y b a k o v, Asymptotic optimality of the C

p

criterion in projection type estimation of regression functions, Teor. Veroyatnost. Primenen.

35 (1990), 305–317 (in Russian).

(11)

[10] E. R a f a j l o w i c z, Nonparametric least-squares estimation of a regression function, Statistics 19 (1988), 349–358.

[11] A. Z y g m u n d, Trigonometrical Series, Dover, 1955.

Waldemar Popi´ nski Department of Standards Central Statistical Office Al. Niepodleg lo´sci 208 00-925 Warszawa, Poland E-mail: w.popinski@stat.gov.pl

Received on 30.10.1997;

revised version on 18.1.1999

Cytaty

Powiązane dokumenty

For functions of one and two real variables, the rate of pointwise and uniform convergence of some Bernstein type algebraic polynomials is investigated..

K EY WORDS : coal, dry separation method, air concentrating tables, deshaling, coal sludge, deposits, fuel carbon, substitute natural aggregates... Ireneusz B AIC *, Wies³aw B

Fixed points of a generalized smoothing transformation and applications to the branching random walk.. On generalized

The serious work towards proving Theorem 1 begins with the following Lemma..

State its domain

If f satisfies a compactness condition expressed in terms of measures of weak noncompactness, and f is Pettis-integrable, then the set of pseudo-solutions of this problem is a

More- over, our results and methods used in the proof suggest that in the class of bounded pseudoconvex complete Reinhardt domains the symmetry of the Green function is equivalent

Chistyakov, Superposition operators in the algebra of functions of two variables with finite total variation, Monatshefte Math.. Adams, On definitions of bounded variation for