Anna Janicka Statistics Mathematical

(1)

Mathematical Statistics

Anna Janicka

Lecture VII, 6.04.2020

ESTIMATOR PROPERTIES, PART III

(2)

Plan for Today

1. Asymptotic properties of estimators – cont.

 consistency

 asymptotic normality

 asymptotic efficiency

2. Consistency, asymptotic normality and asymptotic efficiency of MLE estimators

(3)

Consistency – reminder

Let X₁, X₂, ..., X_n,... be an IID sample (of

independent random variables from the same distribution) . Let be a

sequence of estimators of the value g( ).

is a consistent estimator, if for all , for any  >0:

(i.e. converges to g( ) in probability) )

,..., ,

ˆ(X₁ X₂ X_n g

gˆ

1 )

| ) ( )

,..., ,

ˆ( (|

lim ₁ ₂   



 P_ g X X X_n g  

n

gˆ

(4)

Strong consistency – reminder

Let X₁, X₂, ..., X_n,... be an IID sample (of

independent random variables from the same distribution). Let be a

sequence of estimators of the value g( ).

is strong consistent, if for any :

(i.e. converges to g( ) almost surely) )

,..., ,

ˆ(X₁ X₂ X_n g

gˆ ^P^



_n^lim__ ^g^ˆ⁽^X¹^, ^X²^,..., ^Xⁿ ⁾ ^ ^g⁽^ ⁾



^ ¹

gˆ

(5)

Consistency – how to verify?

 From the definition: for example with the use of a version of the Chebyshev inequality:

Given that the MSE of an estimator is

we get a sufficient condition for consistency:

 From the LLN

2

))2

( )

( ) (

| ) ( )

(

(| 

 

 ^E ^g ^X ^g

g X

g

P 





 



))2

( )

ˆ( ( ˆ)

,

( ^g ^E_ ^g ^X ^g 

MSE  

0 ˆ)

, (

lim 



 MSE g

n 

(6)

Consistency – examples

 For any family of distributions with an

expected value: the sample mean is a consistent estimator of the expected value

⁽^)=E_ ^(X₁). Convergence from the SLLN.

 For distributions having a variance:

and

are consistent estimators of the variance

²⁽^)=Var_ ^(X₁). Convergence from the SLLN.

X

n





 

 ⁿ

i i

n n X X

S 1

2 1

)

( 



ⁿ 

i i

n n X X

S 1

1 2

2 ( )

ˆ

(7)

Consistency – examples/properties

 An estimator may be unbiased but

unconsistent; eg. T_n(X₁, X₂, ..., X_n)=X₁ as an estimator of ⁽^)=E_ ^(X₁^).

 An estimator may be biased but

consistent; eg. the biased estimator of the variance or any unbiased consistent estimator + 1/n.

(8)

Asymptotic normality

is an asymptotically normal estimator of g( ), if for any  there exists

²( ) such that, when n→

Convergence in distribution, i.e. for any a

in other words, the distribution of is for large n similar to

) ,...,

,

ˆ(X₁ X₂ X_n g

 ^g ^ˆ ⁽ ^X

₁

^, ^X

₂

^,..., ^X ⁾ ^g ⁽ ^ ⁾  ^N ⁽ ⁰ ^, ^

²

⁽ ^ ⁾⁾

n

_n

  

^D



^ˆ⁽ ^, ^,..., ⁾ ⁽ ⁾



⁽ ⁾

)

lim (n g X₁ X₂ X g a a

P _n

n   









  



 



 

) ,...,

,

ˆ(X₁ X₂ X_n g

) ),

(

( g

_n²

N 

^

(9)

Asymptotic normality – properties

 An asymptotically normal estimator is consistent (not necessarily strongly).

 A similar condition to unbiasedness – the expected value of the asymptotic

distribution equals g( ) (but the estimator does not need to be unbiased).

 Asymptotic variance defined as

or – the variance of the asymptotic n distribution

)

2(



)

2(



(10)

Asymptotic normality – what it is not

 For an asymptotically normal estimator we usually have:

but these properties needn’t hold, because convergence in distribution does not imply convergence of moments.

) ( )

,..., ,

ˆ( ₁ ₂



g X X X g

E _n  ⁿ^^

) ( )

,..., ,

ˆ(

var g X₁ X₂ X_n  ⁿ^^



²



n

(11)

Asymptotic normality – example

 Let X₁, X₂, ..., X_n,... be an IID sample from a distribution with mean  and variance ²^. On the base of the CLT, for the sample

mean we have

In this case the asymptotic variance, , is equal to the estimator variance.

) ,

0 ( )

(X



N



²

n  ^D

n

 2

(12)

Asymptotic normality – how to prove it

In many cases, the following is useful:

Delta Method. Let T_n be a sequence of

random variables such that for n→ we have

and let h:R→R be a function differentiable at point  such that h’()0. Then

, ² are functions of 

usually used when estimators are functions of statistics T_n, which can be easily shown co converge on the base of CLT

) ,

0 ( )

(T



N



²

n _n  ^D



^h⁽^T ⁾ ^h⁽

^

⁾



^N⁽⁰^,

^

²⁽^h^'⁽

^

⁾⁾² ⁾

n _n  ^D

(13)

Asymptotic normality – examples cont.

In an exponential model:

From CLT, we get

so from the Delta Method for h(t)=1/t:

so is an asymptotically normal (and consistent) estimator of ^.

MLE (  ) 

X¹

) ,

0 ( )

(X _¹ N _¹₂ n  ^D

) ) (

, 0 ( )

( ²

) / 1 (

1 1

1

2

2 



^^  ^ ^

 N

n ^D

X

X 1

(14)

Asymptotic efficiency

For an asymptotically normal estimator

of g( ) we define asymptotic efficiency as

where ²⁽ )/n is the asymptotic variance, i.e.

for n→



^g^ˆ⁽^X₁^, ^X₂^,..., ^X ⁾ ^g⁽

^

⁾



^N⁽⁰^,

^

²⁽

^

⁾⁾

n _n  ^D

) ,...,

,

ˆ(X₁ X₂ X_n g

 

), ( )

(

) ( ) '

( ˆ

as.ef ₂

2







In

n g g

 

 

) ( ) (

) ( ) '

( ˆ as.ef

1 2

2





 I g g

 

modification of the definition of efficiency to the limit case, with the asymptotic

variance in place of the normal variance

(15)

Relative asymptotic efficiency

Relative asymptotic efficiency for asymptotically normal estimators and

ˆ ) ( as.ef

ˆ ) ( as.ef )

( ) ) (

, ˆ ( ˆ

as.ef

2 1 2

1 2 2 2

1 g

g g

g  









) ˆ₁(X

g gˆ₂(X )

Note. A less (asymptotically) efficient estimator may have other properties, which will make it preferable to a more efficient one.

(16)

Relative asymptotic efficiency – examples.

Is the mean better than the median?

Depends on the distribution!

a) normal model N(, ²):

b) Laplace model Lapl(, )

c) some distributions do not have a mean...

Theorem: For a sample from a continuous distribution with density f(x), the sample median is an asymptotically normal estimator for the median m (provided the density is continuous and 0 at point m):



^X ^



^N⁽⁰^,^ ²⁾

n  ^D

^m^eˆ^d ^ ^N⁽⁰^,^₂² ⁾

n  ^D

1 )

, d eˆ m (

as.ef X  _² 



^X ^



^N⁽⁰^,_²² ⁾

n  ^D

^m^eˆ^d ^ ^N⁽⁰^,_¹² ⁾

n  ^D as.ef(meˆd, X)  2  1

^m^eˆ^d ^m ^D ^N⁽⁰^,₄₍_f₍¹_m₎₎² ⁾

n  

(17)

Consistency of ML estimators

Let X₁, X₂, ..., X_n,... be a sample from a distribution with density f_ (x). If   R is an open set, and:

 all densities f_have the same support;

 the equation has exactly one solution, .

Then is the MLE( ) and it is consistent

Note. MLE estimators do not have to be unbiased!

0 ) ( ln  

 ^L d

d

^ˆ

(18)

Asymptotic normality of ML estimators

Let X₁, X₂, ..., X_n,... be a sample with density f_ (x), such that   R is open, and is a consistent

m.l.e. (for example, fulfills the assumptions of the previous theorem), and

 exists

 Fisher Information may be calculated, 0<I₁( )<

 the order of integration with respect to x and derivation with respect to  may be changed

then is asymptotically normal and

^ˆ

) (

2 ln

2 

 ^L

d d

^ˆ

  ^

^ˆ

^

^D ^N⁽⁰^, _I₁₍¹_ ₎⁾

n  

(19)

Asymptotic normality of ML estimators

Additionally, if g:R→R is a function

differentiable at point , such that g’( )  0, and is MLE(g(^{)), then}

 ^ˆ ⁽

₁

^,

₂

^,..., ⁾ ⁽ ⁾  ⁽ ⁰ ^,

⁽ ^'⁽₍ ⁾⁾₎

⁾

1

2



^D ^g_I 

n

g N

X X

X g

n   

) ,...,

,

ˆ(X₁ X₂ X_n g

(20)

Asymptotic efficiency of ML estimators

If the assumptions of the previous theorems are fulfilled, then the ML estimator (of ^or g( )) is asymptotically efficient.

(21)

Asymptotic normality and efficiency of ML estimators – examples

 In the normal model: the mean is an asymptotically efficient estimator of 

 In the Laplace model: the median is an asymptotically efficient estimator of 

(22)

Summary: basic (point) estimator properties

 bias

 variance

 MSE

 efficiency

 consistency

 asymptotic normality

 asymptotic efficiency

(23)

Anna Janicka Statistics Mathematical

Mathematical Statistics

Anna Janicka





X





 g ˆ ( X

, X

,..., X ) g (  )  N ( 0 , 

(  ))

n

  





) ),

(

( g

N 

























MLE (  ) 













 

























  



 ˆ (

,

,..., ) ( )  ( 0 ,

)



g N

X X

X g

n   

 ^g ^ˆ ⁽ ^X

^, ^X

^,..., ^X ⁾ ^g ⁽ ^ ⁾  ^N ⁽ ⁰ ^, ^

⁽ ^ ⁾⁾

^

^

^

^

^

^

  ^

^

 ^ˆ ⁽

^,

^,..., ⁾ ⁽ ⁾  ⁽ ⁰ ^,

⁾