22,4 (1995), pp. 427–446
M. B E R T R A N D - R E T A L I and L. A I T - H E N N A N I (Rouen)
UNIFORM CONVERGENCE OF DENSITY ESTIMATORS ON SPHERES
Non-parametric estimation of a probability density for random variables taking values on an s-dimensional unit sphere is studied in [1], [5], [6]. The object of the present paper is to establish new uniform convergence theo- rems for several estimators: we use successively the histogram method, the spherical cap and the kernel methods. In part D, we present simulation results.
Let D be the set of continuous densities, defined on the sphere S; we estimate f, an element of D, from a sample of size n, denoted by X
1, . . . , X
n. The density f satisfies R
S
f (x) dµ(x) = 1, where µ is the Lebesgue measure on S.
A. The histogram estimator. We are going to describe a partition of the sphere which will allow us to use the main theorem of [4].
This theorem establishes a necessary and sufficient condition for uniform convergence—in probability and almost completely—using the histogram estimator on a metric space, for every f in D. To use it for S, it will be sufficient to construct a sequence ∆
k(n)of partitions ∆
k= {∆
k,r: r ∈ R
k}, the Borel sets ∆
k,rbeing such that
k→∞
lim sup
r∈Rk
(diam ∆
k,r) = 0, lim
k→∞
sup
r∈Rk
(area ∆
k,r) = 0, lim sup
k→∞
sup
r∈Rk(area ∆
k,r) inf
r∈Rk(area ∆
k,r) < ∞.
We choose the integer k(n) such that lim
n→∞k(n) = +∞. For r ∈ R
k, let ν
n,rbe the number of X
i’s belonging to ∆
k,r.
1991 Mathematics Subject Classification: 62G07, 62G20.
Key words and phrases :histogram estimator, spherical cap estimator, kernel estimator.
The histogram estimator b f
nis given by
∀r ∈ R
k, ∀x ∈ ∆
k,r, f b
n(x) = ν
nrnµ(∆
k,r) ,
µ(∆
k,r) denoting the area of ∆
k,r. With these notations, the main theo- rem of [4] states that b f
nis uniformly convergent, in probability and almost completely, if and only if
[ inf
r∈Rk
µ(∆
k,r)]
−1= o(n/ log n) where k = k(n).
First, we are going to construct the partition for s = 3. Then we shall explain it for any s.
1. Partition for s = 3. A parametric representation of S is x
1= cos θ
1, θ
1∈ [0, π],
x
2= sin θ
1cos θ
2,
x
3= sin θ
1sin θ
2, θ
2∈ [0, 2π[.
The “poles” of S, corresponding to θ
1= 0 and θ
1= π, must belong to a unique element of the partition, so we define the Borel sets ∆
k,r= ∆
k,r1,r2in the following manner:
∆
k,0= [0, arccos(1 − 1/k
2)[ × [0, 2π[,
∆
k,1,r2= [arccos(1 − 1/k
2), arccos(1 − 2/k)[ × [(r
2− 1)π/k, r
2π/k[
for r
2= 1, . . . , 2k,
∆
k,r1,r2= [arccos(1 − 2(r
1− 1)/k), arccos(1 − 2r
1/k)[ × [(r
2− 1)π/k, r
2π/k[
for r
1= 2, . . . , k − 1; r
2= 1, . . . , 2k,
∆
k,k,r2= [arccos(−1 + 2/k), arccos(−1 + 1/k
2)[ × [(r
2− 1)π/k, r
2π/k[
for r
2= 1, . . . , 2k,
∆
k,k+1= [arccos(−1 + 1/k
2), π] × [0, 2π[,
these intervals being closed when necessary. Then we can easily see that, for each ∆
k,r, µ(∆
k,r) is equivalent to 2π/k
2, and that there are 2k
2+ 2 elements in the partition. The necessary and sufficient condition is then
k
2= o(n/ log n).
2. Construction for arbitrary s. A parametric representation of S is: for θ
i∈ [0, π] when i = 1, . . . , s − 2 and θ
s−1∈ [0, 2π[,
x
1= cos θ
1, x
i=
i−1
Y
j=1
sin θ
jcos θ
i, i = 2, . . . , s − 1,
x
s=
s−1
Y
j=1
sin θ
j.
In R
s, the distance between two points M and M
0belonging to the sphere, associated with (θ
i)
i=1,...,s−1and (θ
i0)
i=1,...,s−1, is
d
2(M, M
0) = 4
s−1
X
i=1 i−1
Y
j=1
sin θ
jsin θ
j0sin
2θ
i− θ
i02 .
We notice that, for i = 1, . . . , s − 2, sin θ
i= 0 implies that θ
i+1, . . . , θ
s−1are arbitrary.
The area of a part S
0⊂ S is µ(S
0) = R
S0 s−1
Y
i=1
sin
miθ
idθ
iwith m
i= s − 1 − i; i = 1, . . . , s − 1.
For positive integers q ≥ 0, define I
q=
π/2
R
0
sin
2q+1θ dθ, J
q=
R
π 0sin
2qθ dθ.
First, let us construct the elements which do not contain the poles—i.e.
the points such that, for one index i = 1, . . . , s−2, sin θ
i= 0. These elements can be written as
∆
k,r=
s−1
Y
i=1
[α
ri−1, α
ri[, r ∈ R
0k.
We choose the values α
ri, i = 1, . . . , s−1, in the following manner. Consider the integral
αri
R
αri−1
sin
miθ
idθ
i. If m
i= 2q
i+ 1 with q
i∈ N, then define
F
qi(α) =
R
α 0sin
2qi+1θ
idθ
ifor α ∈ [0, π].
Then F
qi(α) is increasing from 0 to F
qi(π) = 2I
qi; we define α
rifrom F
qi(α
ri) = 2r
ik I
qifor r
i= 1, . . . , k.
Then
αri
R
αri−1
sin
2qi+1θ
idθ
i= 2
k I
qi.
If m
i= 2q
iwith q
i∈ N
∗, then define G
qi(α) =
R
α 0sin
2qiθ
idθ
ifor α ∈ [0, π].
Then G
qi(α) is increasing from 0 to J
qi; we define α
rifrom G
qi(α
ri) = r
ik J
qifor r
i= 1, . . . , k.
Then
αri
R
αri−1
sin
2qiθ
idθ
i= 1 k J
qi. For m
i= 0, i.e. i = s − 1, we choose
[α
rs−1−1, α
rs−1[ = [(r
s−1− 1)π/k, r
s−1π/k[, r
s−1= 1, . . . , 2k.
Using the values of I
qiand J
qi, we can easily see that for r
i= 2, . . . , k−1;
i = 1, . . . , s − 2; and r
s−1= 1, . . . , 2k,
µ(∆
k,r) = C(s) k
s−1,
where C(s) is a constant; its value follows from the preceding formulations.
The whole partition is constructed by generalization of the method explained for s = 3. When, for an index i = 1, . . . s − 2, sin θ
i= 0, the associated ele- ment of the partition satisfies: θ
i+1, . . . , θ
s−2are in [0, π], and θ
s−1in [0, 2π[;
the intervals for θ
1, . . . , θ
iare chosen to make the area of ∆
k,requivalent to the preceding expression.
Example (for s = 4). For r
1= 2, . . . , k − 1; r
2= 2, . . . , k − 1; and r
3= 1, . . . , 2k,
∆
k,r= [α
r1−1, α
r1[
× [arccos(1 − 2(r
2− 1)/k), arccos(1 − 2r
2/k)[ × [(r
3− 1)π/k, r
3π/k[, µ(∆
k,r) = π
2/k
3,
α
r1being given from 1
2 α
r1− 1
4 sin
2α
r1= r
12k π, and
∆
k,0= [0, (3π/4)
1/3/k[ × [0, π] × [0, 2π[,
∆
k,k+1= [π − (3π/4)
1/3/k, π] × [0, π] × [0, 2π[,
∆
k,1,0= [(3π/4)
1/3/k, α
1[ × [0, √
2/k[ × [0, 2π[,
∆
k,1,k+1= [(3π/4)
1/3/k, α
1[ × [π − √
2/k, π] × [0, 2π[,
∆
k,1,1,r3= [(3π/4)
1/3/k, α
1[ × [ √
2/k, arccos(1 − 1/k)[
× [(r
3− 1)π/k, r
3π/k[, r
3= 1, . . . , 2k, and so on.
The number of elements in the partition is
K
n,4= 2 + k(2k
2+ 2) = 2k
3+ 2k + 2.
Coming back to the general case, we have
K
n,s= 2k
s−1+ 2 k
s−2− 1 k − 1 . The necessary and sufficient condition is then
k
s−1= o(n/ log n).
B. The spherical cap estimator. For the sphere S in R
s, the spherical cap estimator is defined as in [6].
With each x ∈ S, we associate the spherical cap with pole x and radius h
n, denoted by B
n,x; here h
nis a sequence of positive real numbers such that
n→∞
lim h
n= 0.
The area of B
n,xis
µ(B
n,x) = C
sh
s−1n+ o(h
s−1n), where C
s= 2π
(s−1)/2(s − 1)Γ ((s − 1)/2) . We estimate the density f in the following manner. Let ν
n,xbe the number of X
i’s belonging to B
n,x. Define
∀x ∈ S, f e
n(x) = ν
nxnC
sh
s−1n. We are going to prove the following theorem:
For each element f ∈ D, e f
nis uniformly convergent—in probability and almost completely—if and only if
h
1−sn= o(n/ log n).
P r o o f o f t h e “i f” p a r t. We suppose that h
1−sn= o(n/ log n),
and we are going to prove that, for every f in D, e f
nconverges almost completely to f, uniformly on S.
Let x be an element of S, and e f
n(x) the associated estimator. We choose 0x
1= 0x. Let k
n= [1/h
n]. Then
k
nk
n+ 1 < k
nh
n≤ 1.
Now, k
nbeing chosen, we construct the partition as in part A; x belongs to
∆
kn,0, and the corresponding histogram estimator is f b
n,kn(x) = ν
n,0(k
n)
nµ(∆
kn,0) , where ν
n,0(k
n) is the number of X
i’s in ∆
kn,0. We do the same construction with the integer k
n+ 1:
f b
n,kn+1(x) = ν
n,0(k
n+ 1) nµ(∆
kn+1,0) .
Since ∆
kn,0(resp. ∆
kn+1,0) is (by part A) the spherical cap of pole x and radius 1/k
n(resp. 1/(k
n+ 1)), we can write
ν
n,0(k
n+ 1)
nµ(∆
kn,0) ≤ e f
n(x) ≤ ν
n,0(k
n) nµ(∆
kn+1,0) , or
µ(∆
kn+1,0)
µ(∆
kn,0) f b
n,kn+1(x) ≤ e f
n(x) ≤ µ(∆
kn,0)
µ(∆
kn+1,0) f b
n,kn(x).
From the choices of h
nand k
n, we claim that b f
n,knand b f
n,kn+1converge to f uniformly almost completely.
Choosing a positive η, we suppose that the events {d( b f
n,kn, f ) < η} and {d( b f
n,kn+1,f ) < η} are realized. For large n
−η +
µ(∆
kn+1,0) µ(∆
kn,0) − 1
f (x) ≤ e f
n(x) − f(x) ≤
µ(∆
kn,0) µ(∆
kn+1,0) − 1
f (x) + 2η.
Let H be such that f < H. Then, for large n,
µ(∆
kn+1,0) µ(∆
kn,0) − 1
H < η.
Thus, for large n,
P [d( e f
n, f ) > 3η] ≤ P [d( b f
n,kn, f ) > η] + P [d( b f
n,kn+1, f ) > η].
The choices of h
nand k
nimply the convergence of the series on the right- hand side.
The uniform and almost complete convergence of e f
nto f follows imme- diately.
P r o o f o f t h e “o n l y i f” p a r t. We suppose that, for every f in D, f e
nconverges to f uniformly in probability. First, we show h
1−sn= o(n).
We choose a coordinate system and we consider the spherical cap with radius 1/4 and pole x (θ
1= 0); we choose f to be an element of D such that, on this cap, f is an arbitrary positive number α.
From this choice of f, and from the hypothesis, we get
n→∞
lim P [ν
nx= 0] = 0,
that is,
n→∞
lim (1 − αC
sh
s−1n)
n= 0,
so that lim
n→∞n log(1 − αC
sh
s−1n) = −∞ and thus h
1−sn= o(n).
Now, we show that h
1−sn= o(n/ log n). Let β be fixed in ]0, π/2[, and let S
0be the part of S defined by β ≤ θ
i≤ π − β for i = 1, . . . , s − 2, and 0 ≤ θ
s−1< 2π.
Let k
nbe an integer to be defined later; we construct the corresponding partition (as in part A), and let {∆
kn,r: r ∈ R
0kn} be the set of its elements included in S
0.
For each ∆
kn,r, we define its center x
kn,ras follows. For large n, ∆
kn,rcan be written as Q
s−1i=1
[α
ri−1, α
ri[ for every r ∈ R
0kn. Then x
kn,r= α
ri−1/2, i = 1, . . . , s − 1, with
αri−1/2
R
0
sin
miθ
idθ
i=
2r
i− 1 k I
qior 2r
i− 1
2k J
qifor i = 1, . . . , s − 2,
and
α
rs−1−1/2= 2r
s−1− 1 2k π.
Consider the distance (in R
s) from x
kn,rto the boundary of ∆
kn,r. Using the expression for d(M, M
0) (part A), we can easily see that there exists a positive constant C(s, β) such that
r∈R
inf
0kn
d(x
kn,r, boundary of ∆
kn,r) ≥ C(s, β)
1/2/k
n. This implies that, for each r in R
0kn
, ∆
kn,rcontains the spherical cap with pole x
kn,rand radius C(s, β)
1/2/k
n. Choose k
n= [C(s, β)
1/2/h
n]. Then, for each r in R
0kn
, ∆
kn,rcontains the spherical cap with pole x
kn,rand radius h
n, i.e. B
n,¯xkn,r.
Moreover, by definition of S
0, R
0knhas [C
0(s, β)k
ns−1] elements, where C
0(s, β) is a positive number depending only on s and β.
We choose f in D with f = α on S
0, α being an arbitrarily small positive number. From the hypothesis, e f
nconverges to f uniformly in probability, so
n→∞
lim P [d( e f
n, f ) > α/2] = 0.
If one of the ∆
kn,rincluded in S
0contains no X
i, then neither does the cap B
n,¯xkn,rand e f
n(x
kn,r) = 0, so d( e f
n, f ) ≥ α. The convergence hypothesis implies
n→∞
lim P h [
r∈R0
kn
{ν
n,r(k
n) = 0} i
= 0,
ν
n,r(k
n) being the number of X
i’s belonging to ∆
kn,r. That is,
n→∞
lim P h \
r∈R0kn
{ν
n,r(k
n) ≥ 1} i
= 1.
Here, we remind that two events A and B of positive probability are in negative correlation if
P (A|B) ≤ P (A), that is, P (A ∩ B) ≤ P (A)P (B).
More generally, the events A
1, . . . , A
nof positive probability are in negative correlation if
∀I ⊂ {1, . . . , n}, P h \
i∈I
A
ii ≤ Y
i∈I
P (A
i),
that is, the realization of one of the A
idiminishes the probability that the others are realized.
The events in the intersection several lines above are in negative corre- lation, thus
n→∞
lim Y
r∈R0kn
P [ν
n,r(k
n) ≥ 1] = 1.
Then, remembering that f = α on S
0, we have
n→∞
lim Y
r∈R0
kn
[1 − (1 − αµ(∆
kn,r))
n] = 1.
From part A, µ(∆
kn,r) = C(s)/k
ns−1; taking the logarithm, we obtain, for large n,
∀α > 0, 1 − nαC(s)
k
s−2nlog[C
0(s, β)k
s−1n] < 0, thus
n→∞
lim
k
ns−1log[C
0(s, β)k
s−1n]
n = 0.
Using the definition of k
nfrom h
n, and h
1−sn= o(n), we obtain the desired result.
C. The kernel estimator. Let K be a positive function, defined on R
+, such that
R
∞ 0K(u)u
(s−3)/2du < ∞.
For this function K and for a sequence of positive numbers h
nwith lim
n→∞h
n= 0 the kernel estimator of f is
f e
n(x) = 1 nh
s−1nC
K,s(h
n)
X
n i=1K
1 − hx, X
ii h
2n,
where hx, X
ii is the scalar product and C
K,s(h
n) = h
1−snR
S
K
1 − hx, yi h
2ndµ(y), dµ(y) being the area element on S.
The constant C
K,s(h
n) does not depend on x and can be written as C
K,s(h
n) = 2π
(s−1)/2Γ ((s − 1)/2)
2/h
R
2n0
(2u − u
2h
2n)
(s−3)/2K(u) du with
n→∞
lim C
K,s(h
n) = 2π
(s−1)/2Γ ((s − 1)/2)
R
∞ 0(2u)
(s−3)/2K(u) du.
Notice first that if we choose
K(u) = 1
[0,1/2](u), then
C
K,s(h
n) = 2π
(s−1)/2Γ ((s − 1)/2)
1/2
R
0
(2u − u
2h
2n)
(s−3)/2du, that is,
C
K,s(h
n) = h
1−sn2π
(s−1)/2Γ ((s − 1)/2)
2 arcsin h
R
n/2 0sin
s−2θ dθ.
From part B, we see that h
s−1nC
K,s(h
n) is the area of the cap B
n,x, and thus the estimator e f
ndefined from that function K is the spherical cap estimator.
We are going to prove two uniform convergence theorems for the ker- nel estimator: a necessary condition for convergence in probability, and a sufficient condition for almost complete convergence. In the proofs, we will follow the method used in [3]. Thus, we do not give all the details; we just indicate how these methods can be adapted for S.
1. Necessary condition for convergence. The theorem is:
Suppose that
y→∞
lim y
R
∞ yK(u)(2u)
(s−3)/2du = 0.
Then , for every f in D, if e f
nconverges to f uniformly in probability, then h
1−sn= o(n/ log n).
First, we show that
h
1−sn= o(n).
As in [3], we suppose that this condition is not satisfied, and we show that, for an element f in D, e f
ndoes not converge in probability.
If h
1−snis not o(n), there exists a positive α and an infinite subset N
1of N such that
∀n ∈ N
1, h
1−sn> αn.
We define a parametric representation of S, and we choose f in D equal to α on C defined by
C = {x ∈ S : 0 ≤ θ
1≤ π/4; θ
i∈ [0, π], i = 1, . . . , s − 2; θ
s−1∈ [0, 2π]}.
Let H be an upper bound of f.
We choose a positive number M such that
R
∞ MK(u)(2u)
(s−3)/2du < inf
1 4 , α
4H
R∞ 0
K(u)(2u)
(s−3)/2du.
Let
%
n= h
n√ 2M
and let Q
nbe the cap with pole ξ (θ
1= 0) and radius %
n. Let H
nbe the event: no one of the X
i’s belongs to Q
n.
We get
P (H
n) = [1 − αµ(Q
n)]
n.
We use the hypothesis on h
nand the choice of %
nto obtain P (H
n) > e
−2(2M )(s−1)/2Cs> 0 for large n in N
1. Let f
Hnbe the density of X conditioned by H
n:
f
Hn(x) =
0 on Q
n,
f (x)
1 − αC
s(2M )
(s−1)/2h
s−1non S − Q
n.
Then we bound the mean of e f
n(ξ) conditioned by H
n; as in [3], we obtain E[ e f
n(ξ) | H
n]
≤ [2π
(s−1)/2/Γ ((s − 1)/2)]H (1 − αC
s%
s−1n)C
K,s(h
n)
2/h
R
2n%2n/(2h2n)
K(u)(2u − u
2h
2n)
(s−3)/2du.
For large n, using %
2n/(2h
2n) = M , we get E[ e f
n(ξ) | H
n] ≤ [2π
(s−1)/2/Γ ((s − 1)/2)]H
(1 − αC
s%
s−1n)C
K,s(h
n)
R
∞ MK(u)(2u)
(s−3)/2du,
and, from the definition of M ,
E[ e f
n(ξ) | H
n]
≤ α
4(1 − αC
s%
s−1n)
2π
(s−1)/2/Γ ((s − 1)/2)] R
∞0
K(u)(2u)
(s−3)/2du
C
K,s(h
n) .
Remembering that lim
n→∞%
n= 0 and
n→∞
lim C
K,s(h
n) = 2π
(s−1)/2Γ ((s − 1)/2)
R
∞ 0K(u)(2u)
(s−3)/2du we obtain, for large n, E[ e f
n(ξ) | H
n] ≤
14α(1 + ε).
The proof is then as in [3], using the Markov inequality, and the fact that, for large n in N
1, P (H
n) is strictly positive.
Now we show
h
1−sn= o(n/ log n).
We suppose that h
1−sn= o(n), but that the condition h
1−sn= o(n/ log n) is not satisfied. Then there exists a positive β and an infinite subset N
1of N such that
∀n ∈ N
1, h
1−sn> βn/ log n.
Let α be a positive number, to be made precise further, and let us choose f : f (x) =
( f (θ
1, . . . , θ
s−1) = α on C = [0, π/2] × [0, π]
s−3× [0, 2π[, a sin θ
1+ b on [π/2, 2π/3] × [0, π]
s−3× [0, 2π[,
H elsewhere.
The constants a, b, H are well known from α, using the continuity con- dition, and R
S
f dµ = 1. More precisely, we get H = d
s− a
sα
b
s,
d
s, a
s, b
sbeing positive numbers, known from the choice of s.
We choose β
0= β/(12C
s), decreasing the value of β if necessary to get β
0< d
s/a
s. Using the hypothesis on K:
y→∞
lim y
R
∞ yK(u)(2u)
(s−3)/2du = 0, that is, ∀ε > 0, ∃M
0, ∀M > M
0,
M
R
∞ MK(u)(2u)
(s−3)/2du < ε
R
∞ 0K(u)(2u)
(s−3)/2du, we choose ε = inf(β
0b
s/(4d
s), 1/4); then M
0is known.
Next, we choose a positive M such that
M > max(M
0, a
sβ
0/d
s, β
0, 1)
and
α = β
0M . Then H is known and
α
4H = β
0b
s4(d
sM − a
sβ
0) ; thus,
α
4H > β
0b
s4M d
s, and from the choices of ε and M,
R
∞ MK(u)(2u)
(s−3)/2du < α 4H
R
∞ 0K(u)(2u)
(s−3)/2du.
We shall use this inequality at the end of the proof.
We choose the integer k
n=
h
−1n√ 2M (C
s2
s)
1/(s−1)and let
%
n= k
n−1(C
s2
s)
1/(s−1). Then, for large n, 2
sk
s−1n> 1/(3M C
s).
For large n in N
1, we have k
s−1n> β
0n/ log n, where β
0= β/(3M C
s) = 4α. This inequality is valid if β is chosen small enough.
We make a partition of C, similar to the partition defining b f
non S:
without going into details, we simply note that we divide [0, π/2] for θ
1and the partition is associated with the integer 2k
n.
Let K
nbe the number of elements in this partition; K
nis equivalent to 2
sk
s−1n. For each element, the area is equivalent to C
s%
s−1n.
We obtain a similar result to Proposition 1 of [3]:
Let J
nbe the exact number of ∆
n,t, t = 1, . . . , K
n, containing no element of the sample. Then for every ε > 0,
n→∞
lim P [1 ≤ J
n≤ εK
n] = 1.
We can also state (cf. [3]):
Let j an integer in {1, . . . , K
n} and integers t
1, . . . , t
jbe such that 1 ≤ t
1< . . . < t
j≤ K
n.
Let V
n(t
1, . . . , t
j) be the event: each ∆
n,t, t = t
1, . . . , t
j, is empty, while each
among the others contains at least a point of the sample; the hypothesis
h
1−sn= o(n) implies K
n= o(n). Let α
0and α
00be the positive numbers
defined in [3]; suppose n is so large that K
n< α
0n, and let ν be an integer
such that [α
0n] + 1 ≤ ν ≤ α
00n; let ν
nbe the number of X
i’s belonging to C.
Then the distribution of each X
i(i = 1, . . . , n) conditioned by the event E
n(ν; t
1, . . . , t
j) = {ν
n= ν} ∩ V
n(t
1, . . . , t
j)
admits the density
f
∗(x) =
n − ν
n
f (x)
1 − αK
nC
s%
s−1nif x ∈ S − C, ν
nα
f (x)
(K
n− j)%
s−1nC
sif x ∈ C − [
j r=1∆
n,tr,
0 if x ∈
[
j r=1∆
n,tr. We now conclude as in [3]. Let
ψ(x) = E[ e f
n(x) | E
n(ν; t
1, . . . , t
j)].
Then
ψ(x) = 1
h
s−1nC
K,s(h
n)
R
S−C
K
1 − (x, u) h
2nn − ν
n(1 − α) f (u) dµ(u)
+ 1
h
s−1nC
K,s(h
n)
× R
C−
∪
jr=1∆n,trK
1 − hx, ui h
2nν
nα(K
n− j)%
s−1nC
sf (u) dµ(u).
Let ε be in ]0, 1[, and suppose 1 ≤ j ≤ εK
n. Then, for large n, (K
n− j)%
s−1nC
s> 1 − ε,
and we can bound
ψ(x) ≤ 1
h
s−1nC
K,s(h
n)
R
S−C
K
1 − hx, ui h
2n1 − α
01 − α f (u) dµ(u)
+ 1
h
s−1nC
K,s(h
n)
R
C−
∪
jr=1∆n,trK
1 − hx, ui h
2nα
00α(1 − ε) f (u) dµ(u).
If α
0and α
00are chosen such that 1 − α
01 − α < 1 + 2ε and α
00α(1 − ε) < 1 + 2ε then
ψ(x) ≤ R
S−
∪
jr=1∆n,tr1 + 2ε h
s−1nC
K,s(h
n) K
1 − hx, ui h
2nf (u) dµ(u).
Let us choose x = ξ, corresponding to θ
1= 0, a pole of ∆
n,t1= ∆
kn,0. We obtain
ψ(ξ) ≤ 1 + 2ε h
s−1nC
K,s(h
n)
R
S−
∪
jr=1∆n,trK
1 − hξ, ui h
2nf (u) dµ(u),
that is,
ψ(ξ) ≤ 1 + 2ε h
s−1nC
K,s(h
n)
R
S−
∪
jr=1∆n,trK
1 − cos θ
1h
2nf (θ
1, . . . , θ
s−1) dµ(θ).
Let D
00be the image of the integration domain under the change of variable u = (1 − cos θ
1)/h
2n. Then
ψ(ξ) ≤ 1 + 2ε C
K,s(h
n)
2π
(s−1)/2Γ ((s − 1)/2) (sup f ) R
D00
K(u)(2u)
(s−3)/2du.
The image of the cap ∆
n,t1has no common point with D
00and is the interval [0, %
2n/(2h
2n)]. Thus
ψ(ξ) ≤ (1 + 2ε)(sup f) 2π
(s−1)/2Γ ((s − 1)/2)C
K,s(h
n)
R
∞%2n/(2h2n)
K(u)(2u)
(s−3)/2du.
Remembering that
%
2n2h
2n= 1
(C
s2
s)
2/(s−1)2h
2nk
2nand k
n2≤ 1
2M h
2n(C
s2
s)
2/(s−1)we have %
2n/2h
2n≥ M and
ψ(ξ) ≤ (1 + 2ε)(sup f) 2π
(s−1)/2Γ ((s − 1)/2)C
K,s(h
n)
R
∞ MK(u)(2u)
(s−3)/2du.
Recall also that
R
∞ MK(u)(2u)
(s−3)/2du < inf
α 4H , α
4M
R∞ 0
K(u)(2u)
(s−3)/2du and, from the definition of M ,
R
∞ MK(u)(2u)
(s−3)/2du < inf
α 4H , 1
4
R∞ 0
K(u)(2u)
(s−3)/2du.
But sup f = sup(α, H) and thus ψ(ξ) <
1 2 + ε
[2π
(s−1)/2/Γ ((s − 1)/2)] R
∞0
K(u)(2u)
(s−3)/2du
C
K,s(h
n) α
and for large n,
ψ(ξ) <
1 2 + ε
α
1 − ε
0.
Choosing ε = ε
0= 1/10, for large n, we get ψ(ξ) <
23α, and the end of the proof is similar to [3].
2. Sufficient condition for convergence. In this part, too, we proceed as in [3].
We recall that a function defined on R
+is called π
m-simple if, for a fixed integer m, it is constant on each element of the partition π
m, where
π
m= {I
m,j= [j/2
m, (j + 1)/2
m[ : j ∈ N}.
We suppose that K is chosen such that there exist two sequences ϕ
+mand ϕ
−mof R
+-integrable π
m-simple functions with
ϕ
−m≤ ϕ
−m+1≤ K ≤ ϕ
+m+1≤ ϕ
+mfor large m.
For instance, every function K of bounded variation in the neighboorhood of infinity satisfies this condition.
We suppose, moreover, that u
(s−1)/2K(u) is decreasing for large u, and that R
∞0
u
(s−1)/2K(u) du exists, with
m→∞
lim
R
∞ 0u
(s−3)/2ϕ
+m(u) du = lim
m→∞
R
∞ 0u
(s−3)/2ϕ
−m(u) du
=
R
∞ 0u
(s−3)/2K(u) du.
We are going to prove the following theorem:
If K satisfies the above hypotheses and if h
1−sn= o(n/ log n), then for each element f of D, e f
nconverges to f uniformly almost completely.
We set
ϕ
+m= X
∞ j=0α
mj1
Imj, ϕ
−m= X
∞ j=0α
0mj1
Imj. We can write
1 nh
s−1nC
K,s(h
n)
X
n i=1X
∞ j=0α
0mj1
Imj1 − hx, X
ii h
2n≤ e f
n(x) ≤ 1 nh
s−1nC
K,s(h
n)
X
n i=1X
∞ j=0α
mj1
Imj1 − hx, X
ii h
2n.
Consider the event
1
Imj1 − hx, X
ii h
2n= 1
, that is,
j
2
m≤ 1 − hx, X
ii
h
2n< j + 1 2
m, or
{X
i∈ B
n,m,j+1,x− B
n,m,j,x= C
n,m,j,x},
where B
n,m,j,x(resp. B
n,m,j+1,x) is the spherical cap with pole x and radius a
n= (j/2
m−1)
1/2h
n(resp. b
n= ((j + 1)/2
m−1)
1/2h
n). Let
f e
n,m,j(x) = ν
n,m,j,xnC
sa
s−1nand f e
n,m,j+1(x) = ν
n,m,j+1,xnC
sb
s−1nbe the spherical cap estimators corresponding to these two caps. When j and m are chosen, the hypothesis about h
nimplies the uniform almost complete convergence of these two estimators. For the chosen j and m,
(2
m−1)
(s−1)/2nh
s−1nC
K,s(h
n)
X
m i=11
Imj1 − hx, X
ii h
2n= (2
m−1)
(s−1)/2nh
s−1nC
K,s(h
n) (ν
n,m,j+1,x− ν
n,m,j,x)
= C
sC
K,s(h
n) [(j + 1)
(s−1)/2f e
n,m,j+1(x) − j
(s−1)/2f e
n,m,j(x)].
So the preceding bounds allow us to write C
sC
K,s(h
n) X
∞ j=0α
0mj1 (2
m−1)
(s−1)/2× [(j + 1)
(s−1)/2f e
n,m,j+1(x) − j
(s−1)/2f e
n,m,j(x)] − f(x)
≤ e f
n(x) − f(x)
≤ C
sC
K,s(h
n) X
∞ j=0α
mj1 (2
m−1)
(s−1)/2× [(j + 1)
(s−1)/2f e
n,m,j+1(x) − j
(s−1)/2f e
n,m,j(x)] − f(x).
Consider, first, the upper bound of e f
n(x) − f(x). We can write it as C
sC
K,s(h
n) X
∞ j=0α
m,j2
(m−1)(s−1)/2× {(j + 1)
(s−1)/2[ e f
n,m,j+1(x) − f(x)] − j
(s−1)/2[f
n,m,j(x) − f(x)]}
+ f (x)
C
sC
K,s(h
n) X
∞ j=0α
m,j2
(m−1)(s−1)/2[(j + 1)
(s−1)/2− j
(s−1)/2] − 1
. Recall that
n→∞
lim C
K,s(h
n) = 2π
(s−1)/2Γ ((s − 1)/2)
R
∞ 0(2u)
(s−3)/2K(u) du,
C
s= 2π
(s−1)/2(s − 1)Γ ((s − 1)/2) . Moreover,
(s −1)
R
∞ 0ϕ
+m(u)(2u)
(s−3)/2du = X
∞ j=0α
mj2
(m−1)(s−1)/2[(j + 1)
(s−1)/2−j
(s−1)/2].
From the hypotheses about K, there exists an integer m
0such that, for m > m
0,
R
∞ 0ϕ
+m(u)(2u)
(s−3)/2du < (1 + ε)
R
∞ 0K(u)(2u)
(s−3)/2du.
Thus, for n > n
0and m > m
0, the coefficient of f (x) is smaller than an arbitrary positive number η.
Let us choose m > m
0. The hypotheses about u
(s−1)/2K(u) imply that, for each ε > 0, there exists a finite subset J of N such that
X
j6∈J
α
mj[j
(s−1)/2+ (j + 1)
(s−1)/2] < ε.
Let H be an upper bound for f. For n > n
0, e f
n(x) − f(x) is smaller than C
sC
K,s(h
n) X
j∈J
α
mj2
(m−1)(s−1)/2(j + 1)
(s−1)/2| e f
n,m,j+1(x) − f(x)|
+ C
sC
K,s(h
n) X
j∈J
α
mj2
(m−1)(s−1)/2j
(s−1)/2| e f
n,m,j(x) − f(x)| + 2Hε C
sC
K,s(h
n) + H.
The end of proof is similar to [3].
The lower bound for e f
n(x) − f(x) is obtained analogously.
f(θ
1, θ
2) = 1 π
2sin θ
1The histogram estimator
The kernel estimator
D. Simulation results. We now study the performance of these esti- mators by simulation methods, for the density
f (θ
1, θ
2) = 1 π
2sin θ
1with s = 3.
The histogram estimate is calculated from a sample of size n = 5000, with k = √
n/ log n.
The kernel estimate is calculated from a sample of size n = 1000, with K(u) =
12e
−u(u ≥ 0) and h
n= (log n)/ √
n.
References
[1] Z. D. B a i, C. R a d a k r i s h n a R a o and L. C. Z h a o, Kernel estimators of density function of directional data , J. Multivariate Anal. 27 (1988), 24–39.
[2] M. B e r t r a n d - R e t a l i, Convergence uniforme stochastique d’un estimateur d’une densit´ e de probabilit´ e dans R
s, C. R. Acad. Sci. Paris S´er. A 278 (1974), 451–453.
[3] —, Convergence uniforme d’un estimateur de la densit´e par la m´ethode du noyau, Rev. Roumaine Math. Pures Appl. 23 (1978), 361–385.
[4] J. G e f f r o y, Sur l’estimation d’une densit´e dans un espace m´etrique, C. R. Acad.
Sci. Paris S´er. A 278 (1974), 1449–1452.
[5] P. H a l l, G. S. W a t s o n and J. C a b r e r a, Kernel density estimation with spherical
data, Biometrika 74 (1987), 751–762.
[6] F. H. R u y m g a a r t, Strong uniform convergence of density estimators on spheres, J.
Statist. Plann. Inference 23 (1989), 45–52.
[7] G. S. W a t s o n, Statistics on Spheres, University of Arkansas Lecture Notes in Math.
Sci., Wiley, 1983.
MONIQUE BERTRAND-RETALI LARBI AIT-HENNANI
UNIVERSIT ´E DE ROUEN
UFR DES SCIENCES MATH ´EMATIQUES ANALYSE ET MOD `ELES STOCHASTIQUES URA CNRS 1378
76821 MONT SAINT AIGNAN CEDEX, FRANCE