Mathematical Statistics
Anna Janicka
Lecture XIII, 20.05.2019
ANOVA
NON-PARAMETRIC TESTS
Plan for Today
1. Analysis of variance tests (ANOVA)
2. Goodness-of-fit tests:
   Kolmogorov test
   Kolmogorov-Smirnov (two samples)
   Kolmogorov-Lilliefors
   chi-squared goodness-of-fit test
3. Tests of independence:
   chi-squared test
Tests for more than two populations
A naive approach: pairwise tests for all pairs.
But: in this case, the overall type I error is higher than the significance level assumed for each individual test...
More populations
Assume we have k samples:
$$X_{1,1}, X_{1,2}, \ldots, X_{1,n_1}, \quad X_{2,1}, X_{2,2}, \ldots, X_{2,n_2}, \quad \ldots, \quad X_{k,1}, X_{k,2}, \ldots, X_{k,n_k},$$
and:
all $X_{i,j}$ are independent ($i = 1, \ldots, k$, $j = 1, \ldots, n_i$),
$X_{i,j} \sim N(m_i, \sigma^2)$,
we do not know $m_1, m_2, \ldots, m_k$, nor $\sigma^2$.
Let $n = n_1 + n_2 + \ldots + n_k$.
Test of the Analysis of Variance (ANOVA) for significance level α
H_0: µ_1 = µ_2 = ... = µ_k
H_1: ¬H_0 (i.e. not all µ_i are equal)
An LR test; we get the test statistic
$$F = \frac{\sum_{i=1}^{k} n_i (\bar{X}_i - \bar{X})^2 \,/\, (k-1)}{\sum_{i=1}^{k} \sum_{j=1}^{n_i} (X_{i,j} - \bar{X}_i)^2 \,/\, (n-k)} \sim F(k-1,\ n-k),$$
where
$$\bar{X}_i = \frac{1}{n_i} \sum_{j=1}^{n_i} X_{i,j}, \qquad \bar{X} = \frac{1}{n} \sum_{i=1}^{k} \sum_{j=1}^{n_i} X_{i,j},$$
with critical region
$$K^* = \{x : F(x) > F_{1-\alpha}(k-1,\ n-k)\}.$$
For k = 2 the ANOVA is equivalent to the two-sample t-test.
ANOVA – interpretation
we have the decomposition
$$\sum_{i=1}^{k} \sum_{j=1}^{n_i} (X_{i,j} - \bar{X})^2 = \sum_{i=1}^{k} n_i (\bar{X}_i - \bar{X})^2 + \sum_{i=1}^{k} \sum_{j=1}^{n_i} (X_{i,j} - \bar{X}_i)^2,$$
i.e. Sum of Squares (SS) = Sum of Squares Between (SSB) + Sum of Squares Within (SSW),
$$\frac{1}{k-1} \sum_{i=1}^{k} n_i (\bar{X}_i - \bar{X})^2$$ – between-group variance estimator,
$$\frac{1}{n-k} \sum_{i=1}^{k} \sum_{j=1}^{n_i} (X_{i,j} - \bar{X}_i)^2$$ – within-group variance estimator.
ANOVA test – table
source of variability | sum of squares | degrees of freedom | value of the test statistic F
----------------------+----------------+--------------------+------------------------------
between groups        | SSB            | k-1                | –
within groups         | SSW            | n-k                | –
total                 | SS             | n-1                | F
ANOVA test – example
Yearly chocolate consumption in three cities: A, B, C, based on random samples of n_A = 8, n_B = 10, n_C = 9 consumers. Does consumption depend on the city? α = 0.01

                | A   | B   | C
sample mean     | 11  | 10  | 7
sample variance | 3.5 | 2.8 | 3

$$\bar{X} = \frac{8 \cdot 11 + 10 \cdot 10 + 9 \cdot 7}{27} \approx 9.3,$$
$$\text{SSB} = 8 \cdot (11 - 9.3)^2 + 10 \cdot (10 - 9.3)^2 + 9 \cdot (7 - 9.3)^2 \approx 75.63,$$
$$\text{SSW} = 7 \cdot 3.5 + 9 \cdot 2.8 + 8 \cdot 3 = 73.7,$$
$$F = \frac{75.63 / 2}{73.7 / 24} \approx 12.31 \quad \text{and} \quad F_{0.99}(2, 24) \approx 5.61.$$
Since 12.31 > 5.61:
→ reject H_0 (equality of means), consumption depends on the city.
ANOVA test – table – example
source of variability | sum of squares | degrees of freedom | value of the test statistic F
----------------------+----------------+--------------------+------------------------------
between groups        | 75.63          | 2                  | –
within groups         | 73.7           | 24                 | –
total                 | 149.33         | 26                 | 12.31
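The table above can be reproduced directly from the summary statistics. A minimal sketch in Python (scipy assumed available); since only means, variances, and sample sizes are given, the F statistic is computed by hand rather than with `scipy.stats.f_oneway`, which needs the raw observations:

```python
from scipy import stats

# Summary statistics from the example.
means = [11.0, 10.0, 7.0]
variances = [3.5, 2.8, 3.0]   # unbiased sample variances
sizes = [8, 10, 9]

n = sum(sizes)                # 27
k = len(sizes)                # 3
grand_mean = sum(m * s for m, s in zip(means, sizes)) / n

# SSB from group means, SSW from group variances: (n_i - 1) * S_i^2.
ssb = sum(s * (m - grand_mean) ** 2 for m, s in zip(means, sizes))
ssw = sum((s - 1) * v for v, s in zip(variances, sizes))

F = (ssb / (k - 1)) / (ssw / (n - k))
crit = stats.f.ppf(0.99, k - 1, n - k)   # quantile F_{0.99}(2, 24)

print(round(F, 2), round(crit, 2))        # F ≈ 12.31, critical value ≈ 5.61
print(F > crit)                           # True → reject H0
```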
Non-parametric tests
we check whether a random variable fits a given distribution (goodness-of-fit tests),
we check whether random variables have the same distribution,
we check whether variables/characteristics are independent (tests of independence).
Kolmogorov goodness-of-fit test
Model: X_1, X_2, ..., X_n are an IID sample from a distribution with CDF F.
H_0: F = F_0 (F_0 specified)
H_1: ¬H_0 (i.e. the CDF is different)
If F_0 is continuous, we use the statistic
$$D_n = \sup_{t \in \mathbb{R}} |F_n(t) - F_0(t)| = \max\{D_n^+, D_n^-\},$$
where
$$D_n^+ = \max_{i=1,\ldots,n} \left( \frac{i}{n} - F_0(X_{i:n}) \right), \qquad D_n^- = \max_{i=1,\ldots,n} \left( F_0(X_{i:n}) - \frac{i-1}{n} \right),$$
and F_n(t) – the empirical CDF based on the n observations.
Kolmogorov goodness-of-fit test – cont.
The test: we reject H_0 when
$$D_n > c(\alpha, n)$$
for a critical value c(α, n).
Theorem. If H_0 is true, the distribution of D_n does not depend on F_0.
Problem: this distribution requires separate tables for each n.
Theorem. In the limit,
$$P\left(\sqrt{n}\, D_n \le d\right) \xrightarrow[n \to \infty]{} K(d) = \sum_{k=-\infty}^{+\infty} (-1)^k e^{-2k^2 d^2};$$
the approximation may be used for n ≥ 100.
Kolmogorov goodness-of-fit test – cont. (2)
Tables of the asymptotic distribution K(d)
1-α                | 0.8     | 0.9     | 0.95    | 0.99
quantile of K(d)   | 1.07    | 1.22    | 1.36    | 1.63
c(n,α) for n ≥ 100 | 1.07/√n | 1.22/√n | 1.36/√n | 1.63/√n
Kolmogorov goodness-of-fit test – example
Does the sample
0.4085 0.5267 0.3751 0.8329 0.0846 0.8306 0.6264 0.3086 0.3662 0.7952 come from a uniform distribution U(0,1)?
Source: W. Niemiro
Kolmogorov goodness-of-fit test – example cont.
D_n = 0.2086, c(10; 0.9) = 0.369
→ no grounds to reject the null hypothesis that the distribution is uniform
Xi:10 (i-1)/10 i/10 i/10 - F(Xi:10) F(Xi:10) - (i-1)/10
0.0846 0 0.1 0.0154 0.0846
0.3086 0.1 0.2 -0.1086 0.2086
0.3662 0.2 0.3 -0.0662 0.1662
0.3751 0.3 0.4 0.0249 0.0751
0.4085 0.4 0.5 0.0915 0.0085
0.5267 0.5 0.6 0.0733 0.0267
0.6264 0.6 0.7 0.0736 0.0264
0.7952 0.7 0.8 0.0048 0.0952
0.8306 0.8 0.9 0.0694 0.0306
0.8329 0.9 1 0.1671 -0.0671
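The table can be reproduced in one call. A minimal sketch with `scipy.stats.kstest` (scipy assumed available), testing the sample against U(0, 1):

```python
from scipy import stats

# The sample from the example, tested against the uniform distribution U(0, 1).
x = [0.4085, 0.5267, 0.3751, 0.8329, 0.0846,
     0.8306, 0.6264, 0.3086, 0.3662, 0.7952]

result = stats.kstest(x, "uniform")   # F0 = CDF of U(0, 1)
print(round(result.statistic, 4))     # 0.2086, matching D_n from the table
print(result.pvalue)                  # large p-value → no grounds to reject H0
```

`kstest` takes the maximum of the D_n^+ and D_n^- columns internally, exactly as in the hand computation above.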
Kolmogorov-Smirnov test of equality of distributions
Model: X_1, X_2, ..., X_n are an IID sample from a distribution with CDF F; Y_1, Y_2, ..., Y_m are an IID sample from a distribution with CDF G.
H_0: F = G
H_1: ¬H_0 (i.e. the CDF functions/distributions differ)
If F (and G) is continuous, we test with
$$D_{n,m} = \sup_{t \in \mathbb{R}} |F_n(t) - G_m(t)|,$$
where F_n(t) – the empirical CDF for the first sample, and G_m(t) – the empirical CDF for the second sample.
Kolmogorov-Smirnov test of equality of distributions – cont.
The test: we reject H_0 if
$$D_{n,m} > c(\alpha, n, m)$$
for a critical value c(α, n, m).
Theorem. If H_0 is true, the distribution of D_{n,m} does not depend on F (or G).
Theorem. In the limit,
$$P\left( \sqrt{\tfrac{nm}{n+m}}\, D_{n,m} \le d \right) \xrightarrow[n, m \to \infty]{} K(d) = \sum_{k=-\infty}^{+\infty} (-1)^k e^{-2k^2 d^2};$$
the approximation is OK for n, m ≥ 100.
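The two-sample statistic is available as `scipy.stats.ks_2samp`. A minimal sketch on two small hypothetical samples (illustrative values only, not from the lecture):

```python
from scipy import stats

# Two small hypothetical samples (illustrative values only).
x = [0.1, 0.3, 0.5, 0.7]
y = [0.2, 0.4, 0.6, 0.8, 1.0]

result = stats.ks_2samp(x, y)    # D_{n,m} = sup_t |F_n(t) - G_m(t)|
print(result.statistic)          # 0.4: the largest gap between the two empirical CDFs
print(result.pvalue)             # large → no grounds to reject F = G
```

For samples this small scipy computes an exact p-value; the K(d) approximation above is only intended for n, m ≥ 100.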
Kolmogorov-Lilliefors goodness-of-fit test
Model: X_1, X_2, ..., X_n are an IID sample from a distribution with CDF F.
H_0: F is a CDF of a normal distribution (with unknown parameters)
H_1: ¬H_0 (i.e. the distribution is not normal)
We test with
$$D_n = \max\{D_n^+, D_n^-\},$$
where
$$D_n^+ = \max_{i=1,\ldots,n} \left( \frac{i}{n} - \Phi(z_i) \right), \qquad D_n^- = \max_{i=1,\ldots,n} \left( \Phi(z_i) - \frac{i-1}{n} \right),$$
and
$$z_i = \frac{X_{i:n} - \bar{X}}{S}, \qquad \bar{X} = \frac{1}{n} \sum_{i=1}^{n} X_i, \qquad S^2 = \frac{1}{n-1} \sum_{i=1}^{n} (X_i - \bar{X})^2.$$
Kolmogorov-Lilliefors goodness-of-fit test – cont.
The test: we reject H_0 if
$$D_n > D_n(\alpha)$$
for a critical value D_n(α).
Theorem. If H_0 is true, the distribution of D_n does not depend on the parameters of the normal distribution.
Problem: we need tables, and we do not know the analytical form of this distribution...
Used for small samples (n ≤ 30), where it performs better than the chi-squared test.
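The statistic itself is easy to compute from the formulas above. A minimal sketch in Python (scipy assumed available for Φ; the sample below is hypothetical, illustrative values only):

```python
import numpy as np
from scipy import stats

def lilliefors_statistic(x):
    """Kolmogorov-Lilliefors statistic D_n for a normality test
    with estimated mean and standard deviation."""
    x = np.sort(np.asarray(x, dtype=float))
    n = len(x)
    z = (x - x.mean()) / x.std(ddof=1)       # z_i = (X_{i:n} - X-bar) / S
    phi = stats.norm.cdf(z)                  # Φ(z_i) at the order statistics
    d_plus = np.max(np.arange(1, n + 1) / n - phi)
    d_minus = np.max(phi - np.arange(0, n) / n)
    return max(d_plus, d_minus)

# Hypothetical small sample (n = 10, illustrative values only).
sample = [4.9, 5.1, 5.3, 4.7, 5.0, 5.2, 4.8, 5.4, 5.0, 4.6]
d = lilliefors_statistic(sample)
print(d)   # compare against the tabulated critical value D_n(α)
```

The value is then compared with the Lilliefors tables on the next slide; standardizing with the estimated parameters is what distinguishes this from the plain Kolmogorov test.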
Kolmogorov-Lilliefors goodness-of-fit test – critical values
Source: H. Lilliefors
Chi-squared goodness-of-fit test
Model: X_1, X_2, ..., X_n are an IID sample from a discrete distribution with k values (1, ..., k).
H_0: the distribution probabilities are equal to

value i  | 1  | 2  | 3  | ... | k
P(X = i) | p1 | p2 | p3 | ... | pk

H_1: ¬H_0 (i.e. the distribution is different)
If the results of the experiment are

value i | 1  | 2  | 3  | ... | k
Ni      | N1 | N2 | N3 | ... | Nk

where N_i denotes the number of outcomes equal to i:
$$N_i = \sum_{j=1}^{n} \mathbb{1}\{X_j = i\}.$$
Chi-squared goodness-of-fit test – cont.
General form of the test statistic:
$$\chi^2 = \sum \frac{(\text{observed value} - \text{expected value})^2}{\text{expected value}},$$
here:
$$\chi^2 = \sum_{i=1}^{k} \frac{(N_i - np_i)^2}{np_i}.$$
Theorem. If H_0 is true, the distribution of the χ² statistic converges to a chi-squared distribution with k-1 degrees of freedom, χ²(k-1), for n → ∞.
Procedure: we reject H_0 if χ² > c, where c = χ²_{1-α}(k-1) is the quantile of rank 1-α of the chi-squared distribution with k-1 degrees of freedom.
Chi-squared goodness-of-fit test – example
Is a die symmetric? For a significance level α = 0.05, n = 150 tosses. Results:

i  | 1  | 2  | 3  | 4  | 5  | 6
Ni | 15 | 27 | 36 | 17 | 26 | 29

H_0: (N_1, N_2, N_3, N_4, N_5, N_6) ~ Mult(150; 1/6, 1/6, 1/6, 1/6, 1/6, 1/6)
H_1: ¬H_0
$$\chi^2 = \frac{(15-25)^2}{25} + \frac{(27-25)^2}{25} + \frac{(36-25)^2}{25} + \frac{(17-25)^2}{25} + \frac{(26-25)^2}{25} + \frac{(29-25)^2}{25} = 12.24,$$
$$\chi^2_{1-0.05}(5) \approx 11.07.$$
Since 12.24 > 11.07:
→ we reject H_0.
Source: W. Niemiro
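The die example in one call. A minimal sketch with `scipy.stats.chisquare` (scipy assumed available), which defaults to equal expected counts when none are supplied:

```python
from scipy import stats

observed = [15, 27, 36, 17, 26, 29]   # counts from 150 die tosses
# Under H0 each face has probability 1/6, so each expected count is 150/6 = 25.
result = stats.chisquare(observed)     # default expected: uniform counts
print(round(result.statistic, 2))      # 12.24, as in the hand computation

crit = stats.chi2.ppf(0.95, df=5)      # χ²_{0.95}(5) ≈ 11.07
print(result.statistic > crit)         # True → reject H0
```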
Chi-squared goodness-of-fit test – distribution with an unknown parameter.
Model: X_1, X_2, ..., X_n are an IID sample from a discrete distribution with k values (1, ..., k).
H_0: the distribution probabilities are equal to

value i  | 1     | 2     | 3     | ... | k
P(X = i) | p1(θ) | p2(θ) | p3(θ) | ... | pk(θ)

where θ is an unknown parameter of dimension d.
H_1: ¬H_0 (i.e. the distribution is different)
Chi-squared goodness-of-fit test – distribution with an unknown parameter, cont.
The test statistic is constructed as in the previous case, with the expected values calculated using ML estimators of the parameter θ. Only the number of degrees of freedom changes:
Theorem. If H_0 is true, the distribution of the χ² statistic converges to a chi-squared distribution with k-d-1 degrees of freedom, χ²(k-d-1), for n → ∞.
Chi-squared goodness-of-fit test – version for continuous distributions
Kolmogorov tests are better, but the chi-squared test may also be used.
Model: X_1, X_2, ..., X_n are an IID sample from a continuous distribution.
H_0: the distribution is given by F
H_1: ¬H_0 (i.e. the distribution is different)
It suffices to divide the range of values of the random variable into classes and count the observations. The expected values are known (they result from F). Then: the chi-squared test.
Chi-squared goodness-of-fit test – practical notes
The test should be used for large samples. The expected counts cannot be too small (< 5); if they are smaller, observations should be grouped. The classes in the "continuous" version may be chosen arbitrarily, but it is best if the theoretical probabilities are balanced.
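The continuous version can be sketched as follows: bin the sample at quantiles of F so the theoretical probabilities are balanced, then run the usual chi-squared test. A minimal sketch assuming F = N(0, 1) and a simulated sample (both choices are illustrative, not from the lecture):

```python
import numpy as np
from scipy import stats

# Hypothetical sample, tested against the standard normal N(0, 1).
rng = np.random.default_rng(0)
x = rng.normal(size=200)

# Quartiles of N(0, 1) give four classes with theoretical probability 0.25 each,
# so the expected counts are balanced, as the practical notes recommend.
cuts = stats.norm.ppf([0.25, 0.5, 0.75])            # finite inner class boundaries
observed = np.bincount(np.searchsorted(cuts, x), minlength=4)
expected = np.full(4, 0.25 * len(x))                # 50 per class

result = stats.chisquare(observed, expected)        # χ²(k-1) = χ²(3) under H0
print(result.statistic, result.pvalue)
```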
Chi-squared test of independence
Model: (X_1, Y_1), ..., (X_n, Y_n) are an IID sample from a two-dimensional distribution with r·s values (denoted by the set {1, ..., r} × {1, ..., s}).
Let the theoretical distribution be
$$p_{ij} = P(X = i, Y = j), \quad i = 1, \ldots, r, \quad j = 1, \ldots, s.$$
Denote
$$p_{i\bullet} = \sum_{j=1}^{s} p_{ij}, \qquad p_{\bullet j} = \sum_{i=1}^{r} p_{ij}.$$
We want to verify the independence of X and Y:
H_0: $p_{ij} = p_{i\bullet} \cdot p_{\bullet j}$ for $i = 1, \ldots, r$, $j = 1, \ldots, s$
H_1: ¬H_0
Chi-squared test of independence – cont.
The empirical distribution may be summarized by a table (a so-called contingency table, or crosstab):
i \ j 1 2 ... s Ni•
1 N11 N12 N1s N1•
2 N21 N22 N2s N2•
...
r Nr1 Nr2 Nrs Nr•
N•j N•1 N•2 N•s n
Chi-squared test of independence – cont. (2)
This is a special case of a goodness-of-fit test with (r-1) + (s-1) parameters to be estimated. The test statistic
$$\chi^2 = \sum_{i=1}^{r} \sum_{j=1}^{s} \frac{(N_{ij} - N_{i\bullet} N_{\bullet j}/n)^2}{N_{i\bullet} N_{\bullet j}/n}$$
has (asymptotically) a chi-squared distribution with (r-1)(s-1) degrees of freedom (if H_0 is true).
Chi-squared test of independence – example
We verify the independence of political and musical preferences, for significance level α = 0.05.
Source: W. Niemiro
Support X Do not support X Total
Listen to jazz 25 10 35
Listen to rock 20 20 40
Listen to hip-hop 15 10 25
Total 60 40 100
$$\chi^2 = \frac{(25 - 60 \cdot 35/100)^2}{60 \cdot 35/100} + \frac{(20 - 60 \cdot 40/100)^2}{60 \cdot 40/100} + \frac{(15 - 60 \cdot 25/100)^2}{60 \cdot 25/100} + \frac{(10 - 40 \cdot 35/100)^2}{40 \cdot 35/100} + \frac{(20 - 40 \cdot 40/100)^2}{40 \cdot 40/100} + \frac{(10 - 40 \cdot 25/100)^2}{40 \cdot 25/100} \approx 3.57,$$
$$\chi^2_{1-0.05}((3-1)(2-1)) = \chi^2_{0.95}(2) \approx 5.99.$$
Since 3.57 < 5.99:
→ no grounds to reject H_0.
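The same computation in one call. A minimal sketch with `scipy.stats.chi2_contingency` (scipy assumed available); for tables larger than 2×2 scipy applies no continuity correction, so the statistic matches the hand computation:

```python
import numpy as np
from scipy import stats

# Contingency table from the example: rows = music, columns = support for X.
table = np.array([[25, 10],    # jazz
                  [20, 20],    # rock
                  [15, 10]])   # hip-hop

chi2, p, df, expected = stats.chi2_contingency(table)
print(round(chi2, 2), df)   # 3.57 with (3-1)(2-1) = 2 degrees of freedom
print(p > 0.05)             # True → no grounds to reject independence
```

The returned `expected` array holds the N_{i•}N_{•j}/n counts (21, 14, 24, 16, 15, 10) used in the statistic.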