W. P O P I ´ N S K I (Warszawa)
LEAST-SQUARES TRIGONOMETRIC REGRESSION ESTIMATION
Abstract. The problem of nonparametric function fitting using the com- plete orthogonal system of trigonometric functions e
k, k = 0, 1, 2, . . . , for the observation model y
i= f (x
in) + η
i, i = 1, . . . , n, is considered, where η
iare uncorrelated random variables with zero mean value and finite variance, and the observation points x
in∈ [0, 2π], i = 1, . . . , n, are equidistant. Conditions for convergence of the mean-square prediction error (1/n) P
ni=1
E(f (x
in) − f b
N (n)(x
in))
2, the integrated mean-square error Ekf − b f
N (n)k
2and the point- wise mean-square error E(f (x) − b f
N (n)(x))
2of the estimator b f
N (n)(x) = P
N (n)k=0
bc
ke
k(x) for f ∈ C[0, 2π] and bc
0, bc
1, . . . , bc
N (n)obtained by the least squares method are studied.
1. Introduction. Let y
i, i = 1, . . . , n, be observations at equidistant points x
in= 2π(i − 1)/n, i = 1, . . . , n, which follow the model y
i= f (x
in) + η
i, where f : [0, 2π] → R is an unknown function satisfying appropriate conditions characterized in the sequel and η
i, i = 1, . . . , n, are random variables satisfying the conditions Eη
i= 0 and Eη
iη
j= σ
2ηδ
ij, where σ
η2> 0 and δ
ijdenotes the Kronecker delta.
The functions
(1) e
0(x) = 1, e
2l−1(x) = √
2 sin(lx), e
2l(x) = √
2 cos(lx), l = 1, 2, . . . , constitute a complete orthogonal system in the space L
2[0, 2π], normalized so that
1 2π
2π
\
0
e
2k(s) ds = 1, k = 0, 1, 2, . . .
1991 Mathematics Subject Classification: 62G07, 62F12.
Key words and phrases : Fourier coefficients, trigonometric polynomial, least squares method, regression function, consistent estimator.
[121]
In consequence, any function f ∈ L
2[0, 2π] has the representation f =
X
∞ k=0c
ke
k, where c
k= 1 2π
2π
\
0
f (s)e
k(s) ds, k = 0, 1, 2, . . . We consider estimators of the Fourier coefficients c
k, k = 0, 1, . . . , N , having the form
(2) bc
kn= 1
n X
n i=1y
ie
k(x
in), k = 0, 1, . . . , N.
It is well known that in the case of equidistant observation points x
in= 2π(i − 1)/n, i = 1, . . . , n, the above defined estimators are for N = 2m, 2m + 1 ≤ n, least squares estimators of the Fourier coefficients c
k, k = 0, 1, . . . , N , which is a consequence of the relations (see [1])
(3) 1
n X
n i=1e
k(x
in)e
l(x
in) = δ
klfor k, l = 0, 1, . . . , N , N = 2m, 2m + 1 ≤ n.
Observe that if the regression function f is continuous the estimators bc
knof the Fourier coefficients c
k, k = 0, 1, . . . , are asymptotically unbiased and consistent in the mean-square sense. Indeed, for fixed k, 0 ≤ k ≤ N, N = 2m, 2m + 1 ≤ n,
E(bc
kn− c
k)
2= E(bc
kn− Ebc
kn)
2+ (Ebc
kn− c
k)
2and taking into account (2) we immediately obtain
E(bc
kn− Ebc
kn)(bc
ln− Ebc
ln) = σ
η2n
2X
n i=1e
k(x
in)e
l(x
in),
Ebc
kn− c
k= 1 n
X
n i=1f (x
in)e
k(x
in) − c
k, which in view of (3) yields
(4)
E(bc
kn− c
k)
2= σ
2ηn + (Ebc
kn− c
k)
2, Ebc
kn− c
k= 1
2π 2π
n X
n i=1f (x
in)e
k(x
in) − 1 2π
2π
\
0
f (s)e
k(s) ds.
The above equalities and continuity of f and e
kimply that
n→∞
lim Ebc
kn− c
k= 0 and lim
n→∞
E(bc
kn− c
k)
2= 0.
In the sequel we shall examine the asymptotic properties of the projection estimator of the regression function
f b
N(x) = X
N k=0bc
kne
k(x).
According to the Jackson theorem [6] for any 2π-periodic continuous func- tion (i.e. for f ∈ C[0, 2π] satisfying f (0) = f (2π)) there exists a trigono- metric polynomial of degree l
T
l(x) = a
0+ X
l k=1(a
kcos(kx) + b
ksin(kx)), where a
2l+ b
2l6= 0, such that
sup
0≤s≤2π
|f (s) − T
l(s)| ≤ 12ω(1/l, f ),
where ω(δ, f ) (for δ > 0) denotes the modulus of continuity of f .
2. Asymptotic mean-square prediction error. Consider first the mean-square prediction error of the estimator b f
N, defined by
D
nN= 1 n
X
n i=1E(f (x
in) − b f
N(x
in))
2.
In view of the orthogonality relations (3) the standard squared bias plus variance decomposition yields
(5) D
nN= 1
n X
n i=1(f (x
in) − E b f
N(x
in))
2+ σ
2ηN + 1 n . It can be easily seen that for N = 2m, 2m + 1 ≤ n, the inequality
1 n
X
n i=1(f (x
in) − E b f
N(x
in))
2≤ 1 n
X
n i=1(f (x
in) − T
l(x
in))
2holds for any trigonometric polynomial T
lof degree l ≤ m. Consequently, using (5) and applying the Jackson theorem we immediately see that for a 2π-periodic function f ∈ C[0, 2π] we have lim
n→∞D
nN (n)= 0 on condition that lim
n→∞N (n) = ∞ and lim
n→∞N (n)/n = 0.
From the equality (5) we see that for any regression function f the con- dition lim
n→∞N (n)/n = 0 is also necessary for lim
n→∞D
nN (n)= 0. For a continuous regression function f which is not a trigonometric polynomial of any finite order lim
n→∞D
nN (n)= 0 also implies that lim
n→∞N (n) = ∞.
Indeed, if we assume that there exists a subsequence m
k, k = 1, 2, . . . , such
that the sequence N (m
k) is bounded, then there also exists a subsequence n
lsuch that N (n
l) = M, l = 1, 2, . . . In consequence, putting f
M= P
M k=0c
ke
kwe would have 1
n
l nlX
i=1
(f (x
inl) − E b f
N (nl)(x
inl))
2= 1 n
lnl
X
i=1
(f (x
inl) − f
M(x
inl))
2+ 1 n
lnl
X
i=1
(f
M(x
inl) − E b f
N (nl)(x
inl))
2+ 2 n
lnl
X
i=1
(f
M(x
inl) − E b f
N (nl)(x
inl))(f (x
inl) − f
M(x
inl))
and since the functions f and f
Mare continuous the second and third terms on the right-hand side would converge to zero because by the Schwarz in- equality and (3),
1 n
lnl
X
i=1
(f
M(x
inl) − E b f
N (nl)(x
inl))
2≤ 1 n
lnl
X
i=1
X
M k=0(c
k− Ebc
knl)
2X
M k=0e
2k(x
inl)
≤ (M + 1) X
M k=0(c
k− Ebc
knl)
2. Consequently, we would have
l→∞
lim 1 n
lnl
X
i=1
(f (x
inl) − E b f
N (nl)(x
inl))
2= 1 2π
2π
\
0
(f (s) − f
M(s))
2ds > 0, so the sequence D
nN (n)would not converge to zero.
The above conclusions allow us to formulate the following theorem.
Theorem 2.1. If the regression function f is continuous, 2π-periodic and not a trigonometric polynomial of any finite order, then the projection estimator b f
N (n)is consistent in the sense of the mean-square prediction error, i.e.
n→∞
lim 1 n
X
n i=1E(f (x
in) − b f
N (n)(x
in))
2= 0,
if and only if the sequence of natural numbers N (n), n = 1, 2, . . . , satisfies
n→∞
lim N (n) = ∞, lim
n→∞
N (n)/n = 0.
Furthermore, since for a function f satisfying the Lipschitz condition with exponent 0 < α ≤ 1 we have ω(δ, f ) ≤ Lδ
α, where L > 0, it is easy to see that the following corollary holds.
Corollary 2.1. Assume that the regression function f is 2π-periodic
and satisfies the Lipschitz condition with exponent 0 < α ≤ 1. If the
sequence of even natural numbers N (n), n = 1, 2, . . . , satisfies N (n) ∼ n
1/(1+2α)(i.e. r
1≤ n
−1/(1+2α)N (n) ≤ r
2for r
1, r
2> 0), then
1 n
X
n i=1E(f (x
in) − b f
N (n)(x
in))
2= O(n
−2α/(1+2α)).
The above results complement the ones presented in [2] which were proved for a more general fixed point design under more restrictive assump- tions on the smoothness of the regression function.
3. Convergence of the integrated mean-square error. The for- mula for the bias of the estimator bc
kn(see (4)) can be rewritten in the form
Ebc
kn− c
k=
2π
\
0
f e
kd(F
n− F ),
where F denotes the uniform distribution function on [0, 2π] and F
nthe empirical distribution function of the “sample” x
in, i = 1, . . . , n. If f is absolutely continuous, then integrating by parts gives
Ebc
kn− c
k=
2π
\
0
(f
′e
k+ f e
′k)(F
n− F ), and since sup |F − F
n| ≤ 1/n this yields
|Ebc
kn− c
k| ≤ 1 n
2π\0
|f
′e
k| +
2π\
0
|f e
′k| and finally in view of definition (1) we obtain
(6) (Ebc
0n− c
0)
2≤ 1 n
2kf
′k
21, (Ebc
kn− c
k)
2≤ 4
n
2(kf
′k
21+ l
2kf k
21),
for k = 2l −1, 2l, l = 1, . . . , m, 2m+1 ≤ n, where k∗k
pdenotes the L
p[0, 2π]
norm.
Now consider the integrated mean-square error of the estimator b f
N, R
nN= 1
2π E
2π
\
0
(f − b f
N)
2= p
N+ X
N k=0E(bc
kn− c
k)
2, where p
N= P
∞k=N +1
c
2k. According to (4) we can write
(7) R
nN= p
N+
X
N k=0(Ebc
kn− c
k)
2+ σ
2ηN + 1
n .
For N = 2m, 2m + 1 ≤ n, taking into account (6) we obtain X
Nk=0
(Ebc
kn− c
k)
2≤ 1 n
2h (8m + 1)kf
′k
21+ 8kf k
21X
m l=1l
2i (8)
≤ 1 n
2(4N + 1)kf
′k
21+ N (N + 1)(N + 2) 3 kf k
21since P
ml=1
l
2= m(m + 1)(2m + 1)/6. The above estimate together with (7) allows us to formulate the following theorem.
Theorem 3.1. If the sequence of even natural numbers N (n), n = 1, 2, . . . , satisfies
n→∞
lim N (n) = ∞, lim
n→∞
N (n)
3/2/n = 0,
then the projection estimator b f
N (n)of the absolutely continuous regression function f is consistent in the sense of the integrated mean-square error , i.e.
n→∞
lim Ekf − b f
N (n)k
22= 0.
Rafaj lowicz [10] proved that a theorem similar to 3.1 holds in the case of regression functions for which the error of uniform approximation by trigonometric polynomials tends to zero as the polynomial degree increases, i.e. it holds for continuous and 2π-periodic regression functions [6]. It should be noted that our theorem extends the result from [10] since it is true for nonperiodic regression functions.
In order to obtain a result concerning the convergence rate of the inte- grated mean-square error we need the following lemma.
Lemma 3.1. If the function f is absolutely continuous, then for N = 2m, m = 1, 2, . . . ,
p
N≤ 5kf
′k
21π
2N .
P r o o f. Integrating by parts gives for l = 1, 2, . . . , c
2l−1= 1
√ 2 lπ
h f (0) − f (2π) +
2π
\
0
f
′(s) cos(ls) ds i ,
c
2l= − 1
√ 2 lπ
2π
\
0
f
′(s) sin(ls) ds, and in consequence
|c
2l−1| ≤ 2
√ 2 lπ kf
′k
1, |c
2l| ≤ 1
√ 2 lπ kf
′k
1.
Hence, for N = 2m, p
N=
X
∞ k=N +1c
2k= X
∞ l=m+1(c
22l−1+ c
22l) ≤ 5kf
′k
212π
2X
∞ l=m+11 l
2≤ 5kf
′k
212π
2X
∞ l=m+11
l(l − 1) = 5kf
′k
212π
2· 1 m .
According to (7), (8) and by Lemma 3.1 we have for N = 2m, R
nN≤ 5kf
′k
21π
2N + 1 n
2(4N + 1)kf
′k
21+ N (N + 1)(N + 2) 3 kf k
21+ σ
2ηN + 1 n , which can be rewritten in the form
R
nN≤ A N + 1
n
2(BN + CN
3) + DN n ,
where A, B, C, D > 0 are suitably chosen constants. From the last inequality it is easy to see that the following corollary holds.
Corollary 3.1. If the regression function f is absolutely continuous and the sequence of even natural numbers N (n), n = 1, 2, . . . , satisfies N (n) ∼ n
1/2(i.e. r
1≤ n
−1/2N (n) ≤ r
2for r
1, r
2> 0), then
Ekf − b f
N (n)k
22= O(n
−1/2).
In papers on wavelet methods of nonparametric function estimation (e.g.
[5]) one can find results giving the IMSE decay rate of a wavelet projection estimator for an equidistant point design.
4. Pointwise mean-square consistency of the estimator. In this section we derive sufficient conditions for pointwise mean-square consistency of the projection estimator b f
Nconsidered.
If the Fourier series of f converges to f (x) at some x ∈ [0, 2π], then E(f (x) − b f
N(x))
2= E X
Nk=0
(c
k− bc
kn)e
k(x)
2+ r
N2(x)
+ 2r
N(x) X
N k=0(c
k− Ebc
kn)e
k(x), where r
N(x) = P
∞k=N +1
c
ke
k(x). From the Cauchy–Schwarz inequality it
further follows that
E(f (x) − b f
N(x))
2≤ X
N k=0E(c
k− bc
kn)
2X
N k=0e
2k(x) + r
2N(x)
+ 2|r
N(x)| X
Nk=0
(c
k− Ebc
kn)
21/2X
Nk=0
e
2k(x)
1/2, and according to (4) since P
Nk=0
e
2k(x) = N + 1 for N = 2m, m ≥ 0, x ∈ [0, 2π], we finally have
(9) E(f (x) − b f
N(x))
2≤ (N + 1) X
N k=0(c
k− Ebc
kn)
2+ σ
η2(N + 1)
2n + 2|r
N(x)|(N + 1)
1/2X
Nk=0
(c
k− Ebc
kn)
21/2+ r
N2(x).
If we assume that the regression function f is absolutely continuous, then since such a function is both continuous and of bounded variation in [0, 2π], its Fourier series converges uniformly to f in (δ, 2π − δ) for δ > 0 (see Corollary 2.62 [11]), so that lim
n→∞r
N (n)(x) = 0 uniformly for x ∈ (δ, 2π − δ) if lim
n→∞N (n) = ∞. Hence, the estimates in (8) and (9) imply that the following theorem holds.
Theorem 4.1. If the sequence N (n), n = 1, 2, . . . , of even natural numbers satisfies
n→∞
lim N (n) = ∞, lim
n→∞
N (n)
2/n = 0,
then for any δ > 0 the projection estimator b f
N (n)of the absolutely con- tinuous regression function f is uniformly consistent in the sense of the pointwise mean-square error in the interval (δ, 2π − δ), i.e.
n→∞
lim E(f (x) − b f
N (n)(x))
2= 0 uniformly for x ∈ (δ, 2π − δ).
It should be noted that for an absolutely continuous and 2π-periodic regression function the pointwise mean-square error converges uniformly in the closed interval [0, 2π], which follows from the fact that then the Fourier series of the regression function converges uniformly in this interval [11].
Let us also remark that Rafaj lowicz [10] obtained sufficient conditions for uniform pointwise mean-square consistency of b f
Nin [0, 2π] only for 2π- periodic regression functions.
5. Selecting the regression order. In this section we study a method
of selecting a good value of N from the data, namely, the one based on
Mallows’s C
pcriterion [7]
(10) C(N ) = 1
n X
n i=1(y
i− b f
N(x
in))
2+ 2N b σ
2n ,
where b σ
2is any consistent estimator of σ
2η(see [3], [4] for examples of such estimators). One selects a value b N
nby minimizing C(N ) over the integers 0 ≤ N ≤ n − 1.
We obtain results on asymptotic properties of this method for selecting the trigonometric regression order N . First, we prove the following theorem.
Theorem 5.1. Assume that
(a) the function f is absolutely continuous and is not a trigonometric polynomial of any finite order ,
(b) there exists a sequence of nonnegative real numbers ε
k, k = 0, 1, . . . , such that the sequence (k + 1)ε
kis nonincreasing and
|c
k| ≤ ε
k, X
∞ k=0ε
k< ∞, (c) µ
4= sup Eη
4i< ∞,
(d) b σ
2−→ σ
p η2as n → ∞.
If b N
nis the minimizer of (10), then
2π\
0
(f − b f b
Nn)
2= O
p(n
−1/2).
P r o o f. Under the above assumptions Theorem 2 of [9] holds, which asserts that for the loss function r
n(N ) =
T2π
0
(f − b f
N)
2we have
(11) r
n( b N
n)
min
0≤N ≤n−1r
n(N )
−→ 1,
pn → ∞,
even if the absolute continuity assumption is not satisfied. If this assumption is satisfied we have, by Corollary 3.1,
0≤N ≤n−1
min E
2π
\
0
(f − b f
N)
2= O(n
−1/2), and consequently
0≤N ≤n−1
min
2π
\
0