Mathematical Statistics
Anna Janicka
Lecture XII, 18.05.2020
HYPOTHESIS TESTING IV:
PARAMETRIC TESTS: COMPARING TWO OR MORE
POPULATIONS
Plan for today
1. Parametric LR tests for one population – cont.
2. Asymptotic properties of the LR test
3. Parametric LR tests for two populations
4. Comparing more than two populations: ANOVA
Notation
$x_q$ always means a quantile of rank $q$ (of the relevant distribution).
Model IV: comparing the fraction – reminder
Asymptotic model: $X_1, X_2, \ldots, X_n$ are an IID sample from a two-point distribution, $n$ – large:
$$P_p(X = 1) = p = 1 - P_p(X = 0).$$
$H_0: p = p_0$. Test statistic:
$$U^* = \frac{\bar X - p_0}{\sqrt{p_0(1-p_0)/n}} = \frac{\hat p - p_0}{\sqrt{p_0(1-p_0)/n}}$$
has an approximate $N(0,1)$ distribution for large $n$ (assuming $H_0$ is true).
$H_0: p = p_0$ against $H_1: p > p_0$: critical region $C^* = \{x : U^*(x) > u_{1-\alpha}\}$
$H_0: p = p_0$ against $H_1: p < p_0$: critical region $C^* = \{x : U^*(x) < u_{\alpha} = -u_{1-\alpha}\}$
$H_0: p = p_0$ against $H_1: p \neq p_0$: critical region $C^* = \{x : |U^*(x)| > u_{1-\alpha/2}\}$
Model IV: example
We toss a coin 400 times and get 180 heads. Is the coin symmetric?
$H_0: p = 1/2$. The value of the test statistic is
$$U^* = \frac{180/400 - 1/2}{\sqrt{\tfrac{1}{2}(1-\tfrac{1}{2})/400}} = -2.$$
For $\alpha = 0.05$ and $H_1: p \neq 1/2$ we have $u_{0.975} = 1.96$; $|-2| > 1.96$ → we reject $H_0$.
For $\alpha = 0.05$ and $H_1: p < 1/2$ we have $u_{0.05} = -u_{0.95} = -1.64$; $-2 < -1.64$ → we reject $H_0$.
For $\alpha = 0.01$ and $H_1: p \neq 1/2$ we have $u_{0.995} = 2.58$ → we do not reject $H_0$.
For $\alpha = 0.01$ and $H_1: p < 1/2$ we have $u_{0.01} = -u_{0.99} = -2.33$ → we do not reject $H_0$.
p-value for $H_1: p \neq 1/2$: $\approx 0.0455$; p-value for $H_1: p < 1/2$: $\approx 0.0228$.
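As a quick numerical check (not part of the lecture), the coin example can be reproduced with scipy's standard normal distribution; the numbers are the ones from the slide:

```python
from math import sqrt
from scipy.stats import norm

n, successes, p0 = 400, 180, 0.5
p_hat = successes / n

# Test statistic U* = (p_hat - p0) / sqrt(p0 (1 - p0) / n)
u = (p_hat - p0) / sqrt(p0 * (1 - p0) / n)

# p-values from the standard normal approximation
p_two_sided = 2 * norm.cdf(-abs(u))   # H1: p != 1/2
p_left = norm.cdf(u)                  # H1: p < 1/2

print(u)                              # -2.0
print(round(p_two_sided, 4))          # 0.0455
print(round(p_left, 4))               # 0.0228
```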
Likelihood ratio test for composite hypotheses – reminder
$X \sim P_\theta$, $\{P_\theta : \theta \in \Theta\}$ – family of distributions. We are testing $H_0: \theta \in \Theta_0$ against $H_1: \theta \in \Theta_1$, such that $\Theta_0 \cap \Theta_1 = \emptyset$, $\Theta_0 \cup \Theta_1 = \Theta$. Let
$H_0: X \sim f_0(\theta_0, \cdot)$ for some $\theta_0 \in \Theta_0$,
$H_1: X \sim f_1(\theta_1, \cdot)$ for some $\theta_1 \in \Theta_1$,
where $f_0$ and $f_1$ are densities (for $\theta \in \Theta_0$ and $\theta \in \Theta_1$, respectively).
Likelihood ratio test for composite hypotheses – reminder (cont.)
Test statistic:
$$\tilde\lambda = \frac{\sup_{\theta \in \Theta} f(\theta, X)}{\sup_{\theta_0 \in \Theta_0} f_0(\theta_0, X)} \qquad \text{or} \qquad \tilde\lambda = \frac{f(\hat\theta, X)}{f_0(\hat\theta_0, X)},$$
where $\hat\theta$, $\hat\theta_0$ are the ML estimators for the model without restrictions and for the null model, respectively.
We reject $H_0$ if $\tilde\lambda > \tilde c$ for a constant $\tilde c$.
The second form is more convenient if the null is simple or if the models are nested.
Asymptotic properties of the LR test
We consider two nested models; we test $H_0: h(\theta) = 0$ against $H_1: h(\theta) \neq 0$, under the assumptions that:
$h$ is a nice function,
$\Theta$ is a $d$-dimensional set,
$\Theta_0 = \{\theta : h(\theta) = 0\}$ is a $(d-p)$-dimensional set.
Theorem: If $H_0$ is true, then for $n \to \infty$ the distribution of the statistic $2\ln\tilde\lambda$ converges to a chi-squared distribution with $p$ degrees of freedom.
(The number of degrees of freedom equals the number of restrictions.)
Asymptotic properties of the LR test – example
Exponential model: $X_1, X_2, \ldots, X_n$ are an IID sample from $\mathrm{Exp}(\theta)$. We test $H_0: \theta = 1$ against $H_1: \theta \neq 1$. Then:
$$\mathrm{MLE}(\theta) = \hat\theta = 1/\bar X,$$
$$\tilde\lambda = \frac{\prod f_{\hat\theta}(x_i)}{\prod f_1(x_i)} = \frac{\frac{1}{\bar X^n}\exp\!\left(-\frac{1}{\bar X}\sum x_i\right)}{\exp\!\left(-\sum x_i\right)} = \frac{1}{\bar X^n}\exp\!\big(n(\bar X - 1)\big).$$
From the Theorem:
$$2\ln\tilde\lambda = 2n\big((\bar X - 1) - \ln\bar X\big) \xrightarrow{D} \chi^2(1).$$
We reject $H_0$ in favor of $H_1$ if $\tilde\lambda > \tilde c \Leftrightarrow 2\ln\tilde\lambda > 2\ln\tilde c$. For significance level $\alpha = 0.05$ we have $\chi^2_{0.95}(1) \approx 3.84 \approx 2\ln\tilde c$, so we reject $H_0$ if $\tilde\lambda > e^{3.84/2}$.
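The asymptotic result can be checked by simulation (a sketch, not part of the lecture): under $H_0$ the statistic $2\ln\tilde\lambda$ should exceed $\chi^2_{0.95}(1) \approx 3.84$ about 5% of the time.

```python
import numpy as np
from scipy.stats import chi2

rng = np.random.default_rng(0)
n, reps, alpha = 100, 20000, 0.05
crit = chi2.ppf(1 - alpha, df=1)   # ~3.84

# Simulate under H0: theta = 1, i.e. Exp(1) samples
x = rng.exponential(scale=1.0, size=(reps, n))
xbar = x.mean(axis=1)

# 2 ln(lambda~) = 2n((Xbar - 1) - ln Xbar), as derived above
stat = 2 * n * ((xbar - 1) - np.log(xbar))

rejection_rate = np.mean(stat > crit)
print(round(rejection_rate, 3))    # should be close to 0.05
```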
Comparing two or more populations
We want to know whether the populations studied are "the same" in certain aspects:
parametric tests: we check the equality of certain distribution parameters;
nonparametric tests: we check whether the distributions are the same.
Model I: comparison of means, variances known, significance level α
$X_1, X_2, \ldots, X_{n_X}$ are an IID sample from $N(\mu_X, \sigma_X^2)$; $Y_1, Y_2, \ldots, Y_{n_Y}$ are an IID sample from $N(\mu_Y, \sigma_Y^2)$; $\sigma_X^2, \sigma_Y^2$ are known; the samples are independent.
$H_0: \mu_X = \mu_Y$. Test statistic (assuming $H_0$ is true):
$$U = \frac{\bar X - \bar Y}{\sqrt{\sigma_X^2/n_X + \sigma_Y^2/n_Y}} \sim N(0,1).$$
$H_0: \mu_X = \mu_Y$ against $H_1: \mu_X > \mu_Y$: critical region $C^* = \{x : U(x) > u_{1-\alpha}\}$
$H_0: \mu_X = \mu_Y$ against $H_1: \mu_X \neq \mu_Y$: critical region $C^* = \{x : |U(x)| > u_{1-\alpha/2}\}$
Model I – comparison of means. Example
$X_1, X_2, \ldots, X_{10}$ are an IID sample from $N(\mu_X, 11^2)$; $Y_1, Y_2, \ldots, Y_{10}$ are an IID sample from $N(\mu_Y, 13^2)$. Based on the sample: $\bar X = 501$, $\bar Y = 498$. Are the means equal, for significance level 0.05?
$H_0: \mu_X = \mu_Y$ against $H_1: \mu_X \neq \mu_Y$; we have $u_{0.975} \approx 1.96$ and
$$U = \frac{501 - 498}{\sqrt{11^2/10 + 13^2/10}} \approx 0.557.$$
$|0.557| < 1.96$ → no grounds to reject $H_0$.
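The arithmetic of this example is easy to verify; a minimal sketch using scipy for the normal quantile:

```python
from math import sqrt
from scipy.stats import norm

# Sample means, known standard deviations, and sample sizes from the example
xbar, ybar = 501, 498
sigma_x, sigma_y = 11, 13
nx, ny = 10, 10

u = (xbar - ybar) / sqrt(sigma_x**2 / nx + sigma_y**2 / ny)
u_crit = norm.ppf(0.975)      # ~1.96

print(round(u, 3))            # 0.557
print(abs(u) > u_crit)        # False -> no grounds to reject H0
```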
Model II: comparison of means, variance unknown but assumed equal, significance level α
$X_1, X_2, \ldots, X_{n_X}$ are an IID sample from $N(\mu_X, \sigma^2)$; $Y_1, Y_2, \ldots, Y_{n_Y}$ are an IID sample from $N(\mu_Y, \sigma^2)$ with $\sigma^2$ unknown; the samples are independent. Let
$$S_X^2 = \frac{1}{n_X - 1}\sum_{i=1}^{n_X}(X_i - \bar X)^2, \qquad S_Y^2 = \frac{1}{n_Y - 1}\sum_{i=1}^{n_Y}(Y_i - \bar Y)^2.$$
$H_0: \mu_X = \mu_Y$. Test statistic (assuming $H_0$ is true):
$$T = \frac{\bar X - \bar Y}{\sqrt{(n_X - 1)S_X^2 + (n_Y - 1)S_Y^2}}\sqrt{\frac{n_X n_Y}{n_X + n_Y}(n_X + n_Y - 2)} \sim t(n_X + n_Y - 2).$$
$H_0: \mu_X = \mu_Y$ against $H_1: \mu_X > \mu_Y$: critical region $C^* = \{x : T(x) > t_{1-\alpha}(n_X + n_Y - 2)\}$
$H_0: \mu_X = \mu_Y$ against $H_1: \mu_X \neq \mu_Y$: critical region $C^* = \{x : |T(x)| > t_{1-\alpha/2}(n_X + n_Y - 2)\}$
Model II: comparison of means, variance unknown but assumed equal, cont.
The test statistic
$$T = \frac{\bar X - \bar Y}{\sqrt{(n_X - 1)S_X^2 + (n_Y - 1)S_Y^2}}\sqrt{\frac{n_X n_Y}{n_X + n_Y}(n_X + n_Y - 2)} \sim t(n_X + n_Y - 2)$$
can be rewritten as
$$T = \frac{\bar X - \bar Y}{S^* \sqrt{\frac{1}{n_X} + \frac{1}{n_Y}}} \sim t(n_X + n_Y - 2),$$
where
$$S^{*2} = \frac{(n_X - 1)S_X^2 + (n_Y - 1)S_Y^2}{n_X + n_Y - 2}$$
is an estimator of the variance $\sigma^2$ based on the two samples.
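As a sanity check, the pooled-variance form of $T$ agrees with scipy's equal-variance two-sample t-test. A sketch on synthetic data (the samples below are assumptions, not lecture data):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
x = rng.normal(5.0, 2.0, size=12)
y = rng.normal(5.5, 2.0, size=15)
nx, ny = len(x), len(y)

# Pooled variance estimator S*^2 and the T statistic from the slide
sx2 = x.var(ddof=1)
sy2 = y.var(ddof=1)
s_star2 = ((nx - 1) * sx2 + (ny - 1) * sy2) / (nx + ny - 2)
t_manual = (x.mean() - y.mean()) / np.sqrt(s_star2 * (1 / nx + 1 / ny))

# scipy's equal-variance two-sample t-test should give the same statistic
t_scipy, p_value = stats.ttest_ind(x, y, equal_var=True)
print(np.isclose(t_manual, t_scipy))   # True
```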
Model II: comparison of variances, significance level α
$X_1, X_2, \ldots, X_{n_X}$ are an IID sample from $N(\mu_X, \sigma_X^2)$; $Y_1, Y_2, \ldots, Y_{n_Y}$ are an IID sample from $N(\mu_Y, \sigma_Y^2)$; $\sigma_X^2, \sigma_Y^2$ are unknown; the samples are independent.
$H_0: \sigma_X = \sigma_Y$. Test statistic (assuming $H_0$ is true):
$$F = \frac{S_X^2}{S_Y^2} \sim F(n_X - 1, n_Y - 1),$$
where $S_X^2$, $S_Y^2$ are the unbiased sample variances as before.
$H_0: \sigma_X = \sigma_Y$ against $H_1: \sigma_X > \sigma_Y$: critical region $C^* = \{x : F(x) > F_{1-\alpha}(n_X - 1, n_Y - 1)\}$
$H_0: \sigma_X = \sigma_Y$ against $H_1: \sigma_X \neq \sigma_Y$: critical region
$$C^* = \{x : F(x) < F_{\alpha/2}(n_X - 1, n_Y - 1) \;\vee\; F(x) > F_{1-\alpha/2}(n_X - 1, n_Y - 1)\}$$
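As far as I know scipy does not ship a ready-made two-sample variance-ratio test (it offers Bartlett's and Levene's tests instead), so a direct sketch of the F-test on assumed synthetic data:

```python
import numpy as np
from scipy.stats import f

rng = np.random.default_rng(2)
x = rng.normal(0.0, 1.0, size=20)
y = rng.normal(0.0, 1.5, size=25)
nx, ny = len(x), len(y)
alpha = 0.05

# F = S_X^2 / S_Y^2 ~ F(n_X - 1, n_Y - 1) under H0: sigma_X = sigma_Y
F_stat = x.var(ddof=1) / y.var(ddof=1)

# Two-sided critical values F_{alpha/2} and F_{1-alpha/2}
lo = f.ppf(alpha / 2, nx - 1, ny - 1)
hi = f.ppf(1 - alpha / 2, nx - 1, ny - 1)
reject = (F_stat < lo) or (F_stat > hi)
print(F_stat, reject)
```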
Model II: comparison of means, variances unknown and no equality assumption
$X_1, X_2, \ldots, X_{n_X}$ are an IID sample from $N(\mu_X, \sigma_X^2)$; $Y_1, Y_2, \ldots, Y_{n_Y}$ are an IID sample from $N(\mu_Y, \sigma_Y^2)$; $\sigma_X^2, \sigma_Y^2$ are unknown; the samples are independent. $H_0: \mu_X = \mu_Y$.
A natural test statistic would be very simple:
$$\frac{\bar X - \bar Y}{\sqrt{S_X^2/n_X + S_Y^2/n_Y}} \sim\ ?$$
But it is not possible to design a test statistic whose distribution does not depend on the (unknown) values of $\sigma_X^2$ and $\sigma_Y^2$...
Model III: comparison of means for large samples, significance level α
$X_1, X_2, \ldots, X_{n_X}$ are an IID sample from a distribution with mean $\mu_X$; $Y_1, Y_2, \ldots, Y_{n_Y}$ are an IID sample from a distribution with mean $\mu_Y$; both distributions have unknown variances; the samples are independent; $n_X, n_Y$ – large.
$H_0: \mu_X = \mu_Y$. Test statistic (assuming $H_0$ is true, approximately for large samples):
$$U = \frac{\bar X - \bar Y}{\sqrt{S_X^2/n_X + S_Y^2/n_Y}} \sim N(0,1),$$
where $S_X^2$, $S_Y^2$ are the unbiased sample variances as before.
$H_0: \mu_X = \mu_Y$ against $H_1: \mu_X > \mu_Y$: critical region $C^* = \{x : U(x) > u_{1-\alpha}\}$
$H_0: \mu_X = \mu_Y$ against $H_1: \mu_X \neq \mu_Y$: critical region $C^* = \{x : |U(x)| > u_{1-\alpha/2}\}$
Model III – example (equality of means?)
Model IV: comparison of fractions for large samples, significance level α
Two independent IID samples from two-point distributions: $X$ – number of successes in $n_X$ trials with probability of success $p_X$; $Y$ – number of successes in $n_Y$ trials with probability of success $p_Y$; $p_X$ and $p_Y$ unknown; $n_X$ and $n_Y$ large.
$H_0: p_X = p_Y$. Test statistic (assuming $H_0$ is true, approximately for large samples):
$$U^* = \frac{\frac{X}{n_X} - \frac{Y}{n_Y}}{\sqrt{p^*(1-p^*)\left(\frac{1}{n_X} + \frac{1}{n_Y}\right)}} \sim N(0,1), \qquad \text{where } p^* = \frac{X + Y}{n_X + n_Y}.$$
$H_0: p_X = p_Y$ against $H_1: p_X > p_Y$: critical region $C^* = \{x : U^*(x) > u_{1-\alpha}\}$
$H_0: p_X = p_Y$ against $H_1: p_X \neq p_Y$: critical region $C^* = \{x : |U^*(x)| > u_{1-\alpha/2}\}$
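A sketch of this pooled two-fraction test in Python; the counts below are hypothetical, chosen only to illustrate the computation:

```python
from math import sqrt
from scipy.stats import norm

# Hypothetical counts (not from the lecture): x, y successes in n_x, n_y trials
x, nx = 90, 200
y, ny = 60, 180

# Pooled fraction p* under H0: p_X = p_Y
p_star = (x + y) / (nx + ny)
u = (x / nx - y / ny) / sqrt(p_star * (1 - p_star) * (1 / nx + 1 / ny))

p_two_sided = 2 * norm.cdf(-abs(u))
print(round(u, 3), round(p_two_sided, 3))
```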
Model IV – example (equality of probabilities?)
Tests for more than two populations
A naive approach: pairwise tests for all pairs. But in this case the overall probability of a type I error is higher than the significance level assumed for each individual test...
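The inflation can be quantified: if the $\binom{k}{2}$ pairwise tests were independent, each at level $\alpha$, the probability of at least one false rejection would be $1 - (1-\alpha)^m$. A small sketch (the independence assumption is a simplification, since pairwise tests on shared samples are correlated):

```python
from math import comb

alpha = 0.05

# Familywise error rate if all m = C(k, 2) pairwise tests were independent,
# each performed at level alpha: P(at least one false rejection) = 1 - (1 - alpha)^m
fwer = {k: 1 - (1 - alpha) ** comb(k, 2) for k in (3, 5, 10)}

for k, rate in fwer.items():
    print(f"k={k}: m={comb(k, 2)} pairwise tests, FWER = {rate:.3f}")
# k=3:  m=3   -> 0.143
# k=5:  m=10  -> 0.401
# k=10: m=45  -> 0.901
```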
More populations
Assume we have k samples:
$$X_{1,1}, X_{1,2}, \ldots, X_{1,n_1},$$
$$X_{2,1}, X_{2,2}, \ldots, X_{2,n_2}, \ldots$$
$$X_{k,1}, X_{k,2}, \ldots, X_{k,n_k},$$
and:
all $X_{i,j}$ are independent ($i = 1, \ldots, k$, $j = 1, \ldots, n_i$),
$X_{i,j} \sim N(m_i, \sigma^2)$,
we do not know $m_1, m_2, \ldots, m_k$, nor $\sigma^2$;
let $n = n_1 + n_2 + \ldots + n_k$.
Test of the Analysis of Variance (ANOVA) for significance level α
$H_0: m_1 = m_2 = \ldots = m_k$ against $H_1: \neg H_0$ (i.e. not all $m_i$ are equal).
This is an LR test; we get the test statistic
$$F = \frac{\sum_{i=1}^{k} n_i(\bar X_i - \bar X)^2/(k-1)}{\sum_{i=1}^{k}\sum_{j=1}^{n_i}(X_{i,j} - \bar X_i)^2/(n-k)} \sim F(k-1, n-k),$$
where
$$\bar X_i = \frac{1}{n_i}\sum_{j=1}^{n_i} X_{i,j}, \qquad \bar X = \frac{1}{n}\sum_{i=1}^{k}\sum_{j=1}^{n_i} X_{i,j} = \frac{1}{n}\sum_{i=1}^{k} n_i \bar X_i,$$
with critical region
$$C^* = \{x : F(x) > F_{1-\alpha}(k-1, n-k)\}.$$
For $k = 2$, the ANOVA is equivalent to the two-sample t-test.
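The F statistic above can be cross-checked against `scipy.stats.f_oneway` on synthetic data (the group data are assumptions; only the group sizes are borrowed from the example that follows):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)
groups = [rng.normal(10, 2, size=n_i) for n_i in (8, 10, 9)]
k = len(groups)
n = sum(len(g) for g in groups)

# Between-group and within-group sums of squares, as in the lecture formula
grand_mean = np.concatenate(groups).mean()
ssb = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
ssw = sum(((g - g.mean()) ** 2).sum() for g in groups)

F_manual = (ssb / (k - 1)) / (ssw / (n - k))
F_scipy, p_value = stats.f_oneway(*groups)
print(np.isclose(F_manual, F_scipy))   # True
```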
ANOVA – interpretation
We have the decomposition
$$\underbrace{\sum_{i=1}^{k}\sum_{j=1}^{n_i}(X_{i,j} - \bar X)^2}_{\text{Sum of Squares (SS)}} = \underbrace{\sum_{i=1}^{k} n_i(\bar X_i - \bar X)^2}_{\text{Sum of Squares Between (SSB)}} + \underbrace{\sum_{i=1}^{k}\sum_{j=1}^{n_i}(X_{i,j} - \bar X_i)^2}_{\text{Sum of Squares Within (SSW)}}.$$
$\frac{1}{k-1}\sum_{i=1}^{k} n_i(\bar X_i - \bar X)^2$ – between-group variance estimator,
$\frac{1}{n-k}\sum_{i=1}^{k}\sum_{j=1}^{n_i}(X_{i,j} - \bar X_i)^2$ – within-group variance estimator.
ANOVA test – table

source of variability | sum of squares | degrees of freedom | value of the test statistic F
----------------------|----------------|--------------------|------------------------------
between groups        | SSB            | k − 1              | –
within groups         | SSW            | n − k              | –
total                 | SS             | n − 1              | F
ANOVA test – example
Yearly chocolate consumption in three cities A, B, C, based on random samples of $n_A = 8$, $n_B = 10$, $n_C = 9$ consumers. Does consumption depend on the city? ($\alpha = 0.01$)

                | A   | B   | C
sample mean     | 11  | 10  | 7
sample variance | 3.5 | 2.8 | 3

$$\bar X = \frac{1}{27}(11 \cdot 8 + 10 \cdot 10 + 7 \cdot 9) \approx 9.3,$$
$$SSB = (11 - 9.3)^2 \cdot 8 + (10 - 9.3)^2 \cdot 10 + (7 - 9.3)^2 \cdot 9 = 75.63, \qquad SSW = 3.5 \cdot 7 + 2.8 \cdot 9 + 3 \cdot 8 = 73.7,$$
$$F = \frac{75.63/2}{73.7/24} \approx 12.31 \quad \text{and} \quad F_{0.99}(2,24) \approx 5.61.$$
$12.31 > 5.61$ → we reject $H_0$ (equality of means): consumption depends on the city.
ANOVA test – table – example

source of variability | sum of squares | degrees of freedom | value of the test statistic F
----------------------|----------------|--------------------|------------------------------
between groups        | 75.63          | 2                  | –
within groups         | 73.7           | 24                 | –
total                 | 149.33         | 26                 | 12.31
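The chocolate example can be reproduced from the summary statistics alone, since SSB needs only group means and sizes, and SSW only the sample variances. A sketch:

```python
from scipy.stats import f

# Summary statistics from the example
ns = [8, 10, 9]
means = [11, 10, 7]
variances = [3.5, 2.8, 3]
k, n = len(ns), sum(ns)

grand_mean = sum(n_i * m for n_i, m in zip(ns, means)) / n
ssb = sum(n_i * (m - grand_mean) ** 2 for n_i, m in zip(ns, means))
ssw = sum((n_i - 1) * v for n_i, v in zip(ns, variances))

F_stat = (ssb / (k - 1)) / (ssw / (n - k))
crit = f.ppf(0.99, k - 1, n - k)   # F_{0.99}(2, 24)

print(round(F_stat, 2))            # 12.31
print(F_stat > crit)               # True -> reject H0
```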