F AND SELECTIVE F TESTS WITH BALANCED CROSS-NESTING AND ASSOCIATED MODELS
Célia Nunes
Departamento de Matemática, Universidade da Beira Interior, 6200 Covilhã, Portugal
e-mail: celia@mat.ubi.pt

Iola Pinto
Instituto Superior de Engenharia de Lisboa

and

João Tiago Mexia
Mathematics Department, Faculty of Science and Technology, New University of Lisbon, Monte da Caparica, 2829–516 Caparica, Portugal
Abstract
F tests and selective F tests for the fixed effects part of balanced models with cross-nesting are derived. The effects of perturbations in the numerator and denominator of the F statistics are considered.
Keywords: selective F tests, associated models, cross-nesting.
2000 Mathematics Subject Classification: 62J10, 62J12, 62J99.
1. Introduction
Balanced models with cross-nesting enable us to study the
action of a large number of factors. Whenever possible, F tests are
highly recommended due to their robustness and power. In what follows
such tests are derived for the fixed effects part of the models.
Besides the usual F tests, we will consider selective F tests, which have high power for chosen alternatives. We also consider the effects of perturbations on the numerator and denominator of the statistics. These perturbations arise when additional terms are added to the models, thus originating associated models.
2. Models and hypotheses
Throughout the text, superscripts will indicate vector dimensions; $1^r$ [$0^r$] will have all $r$ components equal to 1 [0]. Moreover, $I_r$ will be the $r \times r$ identity matrix, $R(A)$ the range space of matrix $A$, and $A \otimes B$ the Kronecker product of matrices $A$ and $B$. The transpose of matrix $A$ will be denoted by $A^\top$.
Let us assume there are $L$ groups with $u_1, \dots, u_L$ factors. The number of levels for the first factors in the different groups will be $a_\ell(1)$, $\ell = 1, \dots, L$; we also put $a_\ell(0) = 1$, $\ell = 1, \dots, L$. If $u_\ell > 1$, we will have balanced nesting in the $\ell$-th group of factors. For each level of the first factor there will be $a_\ell(2)$ levels of the second factor, and so on. With $h_\ell \le u_\ell$, we will have $c_\ell(h_\ell) = \prod_{t_\ell=0}^{h_\ell} a_\ell(t_\ell)$ level combinations for the first $h_\ell$ factors in the $\ell$-th group. The number of level combinations for the factors in the $\ell$-th group will be $c_\ell = c_\ell(u_\ell)$, $\ell = 1, \dots, L$. Each level combination of the first $h_\ell$ factors of the $\ell$-th group will nest $b_\ell(h_\ell) = c_\ell / c_\ell(h_\ell)$ level combinations of the remaining factors in the group. The number of level combinations for all the factors, that is, the number of treatments, will be $c = \prod_{\ell=1}^{L} c_\ell$. If, for each treatment, we have $r$ observations, the total number of observations will be $n = cr$.
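This counting can be sketched numerically; in this hedged illustration the encoding of the groups as lists of level numbers $a_\ell(1), \dots, a_\ell(u_\ell)$ and the function name `counts` are assumptions, not from the paper.

```python
# Small sketch of the counting above; the list-of-lists encoding of the
# groups and the helper name are illustrative assumptions.
from math import prod

def counts(a, r):
    """Return the c_l, the number of treatments c, and n = c * r."""
    c_l = [prod(levels) for levels in a]   # c_l = c_l(u_l)
    c = prod(c_l)                          # number of treatments
    return c_l, c, c * r

# Example: L = 2 groups, a 3-then-2 nesting in the first group and a
# single 4-level factor in the second, with r = 5 observations per treatment.
c_l, c, n = counts([[3, 2], [4]], 5)
```

Here $c_1 = 6$, $c_2 = 4$, $c = 24$ treatments and $n = 120$ observations; $b_\ell(h_\ell)$ then follows as $c_\ell / c_\ell(h_\ell)$.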
Let $\Delta$ be the family of vectors $h^L$ with components $h_\ell = 0, \dots, u_\ell$, $\ell = 1, \dots, L$, and let us assume that the first $L_0 < L$ groups are of fixed effects factors and that the remaining groups are of random effects factors.
The vectors associated to $\mu$ and the effects and interactions between fixed effects factors constitute the sub-family $\Delta_0$ of $\Delta$. Given $h^L \in \Delta_0$, $c(h^L) = \prod_{\ell=1}^{L} c_\ell(h_\ell)$ will be the number of effects or interactions associated to $h^L$. These effects and interactions are components of a fixed vector $\beta^{c(h^L)}(h^L)$, if $h^L \ne 0^L$. Besides this, the vectors $h^L \in \Delta_{0c} = \Delta - \Delta_0$ will correspond to effects or to interactions involving one or more random effects factors. Associated to them there will be random vectors $\tilde\beta^{c(h^L)}(h^L)$ with null mean vectors and variance–covariance matrices $\sigma^2(h^L) I_{c(h^L)}$, $h^L \in \Delta_{0c}$. These vectors are independent between themselves and of the error vector $e^n$, which has null mean vector and variance–covariance matrix $\sigma^2 I_n$. Let us put

(2.1)
$\Delta_0(t^L) = \{h^L : h^L \in \Delta_0;\ t^L \le h^L\}$
$\Delta_{0c}(t^L) = \{h^L : h^L \in \Delta_{0c};\ t^L \le h^L\}.$
Taking

$X(h^L) = \bigotimes_{\ell=1}^{L} X_\ell(h_\ell) \otimes 1^r,$

where $X_\ell(h_\ell) = I_{c_\ell(h_\ell)} \otimes 1^{b_\ell(h_\ell)}$, $h_\ell = 0, \dots, u_\ell$, $\ell = 1, \dots, L$, we will use a model formulation introduced by Fonseca et al. (2003),

(2.2) $Y^n = \sum_{h^L \in \Delta_0} X(h^L)\,\beta^{c(h^L)}(h^L) + \sum_{h^L \in \Delta_{0c}} X(h^L)\,\tilde\beta^{c(h^L)}(h^L) + e^n.$
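As an illustration of how the incidence matrices of model (2.2) are assembled from Kronecker products, here is a minimal sketch; the helper names and the list-of-lists encoding of the groups are assumptions for this example.

```python
# Sketch of X_l(h_l) = I_{c_l(h_l)} (x) 1^{b_l(h_l)} and of
# X(h^L) = ((x)_l X_l(h_l)) (x) 1^r; names and encodings are assumed.
import numpy as np
from functools import reduce
from math import prod

def X_group(a_l, h_l):
    """X_l(h_l) for a group with level numbers a_l = [a_l(1), ..., a_l(u_l)]."""
    c_h = prod(a_l[:h_l])                  # c_l(h_l), with c_l(0) = 1
    b_h = prod(a_l) // c_h                 # b_l(h_l) = c_l / c_l(h_l)
    return np.kron(np.eye(c_h), np.ones((b_h, 1)))

def X(a, h, r):
    """X(h^L) for groups a = [a_1, ..., a_L] and h^L = (h_1, ..., h_L)."""
    blocks = [X_group(a_l, h_l) for a_l, h_l in zip(a, h)]
    return np.kron(reduce(np.kron, blocks), np.ones((r, 1)))
```

Each row of $X(h^L)$ has a single 1, marking the level combination of the first $h_\ell$ factors that the corresponding observation belongs to.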
With $T_s$ a matrix such that $[T_s;\ s^{-1/2} 1^s]$ is an orthogonal matrix of order $s$, we put

(2.3) $K_\ell(t_\ell) = I_{c_\ell(t_\ell - 1)} \otimes T_{a_\ell(t_\ell)} \otimes \frac{1^{b_\ell(t_\ell)}}{\sqrt{b_\ell(t_\ell)}};\quad t_\ell = 0, \dots, u_\ell;\ \ell = 1, \dots, L,$

as well as $P(t^L) = K(t^L)K(t^L)^\top$, with

$K(t^L) = \bigotimes_{\ell=1}^{L} K_\ell(t_\ell) \otimes \frac{1}{\sqrt{r}}\, 1^r$
and $\mathrm{rank}(P(t^L)) = \mathrm{rank}(K(t^L)) = g(t^L)$. We also put $\bar P = I_n - \sum_{t^L \in \Delta} P(t^L)$, with $\mathrm{rank}(\bar P) = g$, so we can write

(2.4) $Y^n = \sum_{t^L \in \Delta} P(t^L) Y^n + \bar P Y^n.$
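One possible way to obtain a matrix $T_s$ whose rows, stacked on $s^{-1/2}(1^s)^\top$, form an orthogonal matrix is a QR-based Helmert-type construction. This is an assumed numerical recipe, not a construction prescribed by the paper.

```python
# Sketch: build a (s-1) x s matrix T_s of orthonormal contrasts, each
# orthogonal to 1^s, via QR; this recipe is an assumption.
import numpy as np

def T_matrix(s):
    """Rows are orthonormal and orthogonal to the vector of ones."""
    M = np.column_stack([np.ones(s), np.eye(s)[:, :-1]])
    Q_full = np.linalg.qr(M)[0]            # first column is +/- 1^s/sqrt(s)
    return Q_full[:, 1:].T

Ts = T_matrix(4)
# Stacking with the normalized row of ones gives an orthogonal matrix:
Q = np.vstack([Ts, np.ones(4) / 2.0])
```

Any orthonormal basis of the orthogonal complement of $1^s$ works equally well; only the orthogonality property in the text is needed.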
Let us point out that

$M(h^L) = X(h^L)X(h^L)^\top = \bigotimes_{\ell=1}^{L} M_\ell(h_\ell) \otimes J_r,$

where $J_r = 1^r (1^r)^\top$. These matrices commute and, taking $b(h^L) = r \prod_{\ell=1}^{L} b_\ell(h_\ell)$, the matrices

$Q(h^L) = b(h^L)^{-1} M(h^L) = \sum_{t^L : t^L \le h^L} P(t^L)$

are orthogonal projection matrices, while the $P(t^L)$, $t^L \in \Delta$, and $\bar P$ are mutually orthogonal projection matrices.
Let us take

(2.5) $\tilde\eta^{g(t^L)}(t^L) = K(t^L)^\top Y^n = \eta^{g(t^L)}(t^L) + \sum_{h^L \in \Delta_{0c}(t^L)} K(t^L)^\top X(h^L)\,\tilde\beta^{c(h^L)}(h^L) + K(t^L)^\top e^n;\quad t^L \in \Delta,$

where

(2.6) $\eta^{g(t^L)}(t^L) = \sum_{h^L \in \Delta_0(t^L)} K(t^L)^\top X(h^L)\,\beta^{c(h^L)}(h^L);\quad t^L \in \Delta$

is the mean vector of $\tilde\eta^{g(t^L)}(t^L)$, $t^L \in \Delta$. From the independence of the $\tilde\beta^{c(h^L)}(h^L)$, $h^L \in \Delta_{0c}$, and of $e^n$, we also get the variance–covariance matrix of $\tilde\eta^{g(t^L)}(t^L)$,

(2.7) $\Sigma(\tilde\eta^{g(t^L)}(t^L)) = \gamma(t^L)\, I_{g(t^L)};\quad t^L \in \Delta,$

with
(2.8) $\gamma(t^L) = \sum_{h^L \in \Delta_{0c}(t^L)} b(h^L)\,\sigma^2(h^L) + \sigma^2,$

since $\Sigma(\tilde\beta^{c(h^L)}(h^L)) = \sigma^2(h^L) I_{c(h^L)}$, $h^L \in \Delta_{0c}$, $\Sigma(e^n) = \sigma^2 I_n$, and

(2.9)
$K(t^L)^\top M(h^L) K(t^L) = b(h^L)\, I_{g(t^L)};\quad t^L \le h^L$
$K(t^L)^\top M(h^L) K(t^L) = 0_{g(t^L), g(t^L)};\quad t^L \not\le h^L,$

since $K(t^L)^\top K(t^L) = I_{g(t^L)}$, $t^L \in \Delta$. If $t^L \in \Delta_{0c}$, $\Delta_0(t^L) = \emptyset$, and

(2.10) $\eta^{g(t^L)}(t^L) = 0^{g(t^L)}.$
Let us consider the family of estimable vectors

(2.11) $\Lambda(t^L) = \{\psi^s(t^L) = B(t^L)\,\eta^{g(t^L)}(t^L) : R(B(t^L)^\top) \subseteq R(K(t^L)^\top)\};\quad t^L \in \Delta_0.$

In what follows we will use the $\psi^s(t^L) \in \Lambda_h(t^L)$, the family of homoscedastic estimable vectors, $t^L \in \Delta_0$, to define the hypotheses

(2.12) $H_0(t^L; d^s): \psi^s(t^L) = d^s;\quad t^L \in \Delta_0,$

which hold if and only if $\|\psi^s(t^L) - d^s\|^2 = 0$. The vectors in $\Lambda_h(t^L)$ are characterized by $B(t^L)B(t^L)^\top = k I_s$.
3. F and selective F tests

When $\Delta_{0c}(t^L) = \emptyset$ we have, according to (2.8),

(3.1) $\gamma(t^L) = \sigma^2.$

To single out these cases we put $t^L \in \Delta_0^\emptyset$ when this happens.
If the vectors $\tilde\beta^{c(h^L)}(h^L)$, $h^L \in \Delta_{0c}$, and $e^n$ are normal and independent, we obtain, see Mexia (1995, p. 35–42):

a) $\tilde\eta^{g(t^L)}(t^L) \sim N(\eta^{g(t^L)}(t^L),\, \gamma(t^L) I_{g(t^L)})$, $t^L \in \Delta$;

b) $\bar P e^n \sim N(0^n, \sigma^2 \bar P)$;

c) the vectors $\tilde\eta^{g(t^L)}(t^L)$, $t^L \in \Delta$, and $\bar P e^n$ are independent.

Then, with $t^L \in \Delta_0^\emptyset$, we will have $\tilde\psi^s(t^L) \sim N(\psi^s(t^L), \sigma^2 I_s)$ independent from $S = \|\bar P e^n\|^2 \sim \sigma^2 \chi^2_{g,0}$.
Thus it is straightforward to obtain, for the hypotheses $H_0(t^L, d^s)$, F tests with statistics

(3.2) $\mathcal{F}(t^L) = \frac{g}{s}\, \frac{\|\tilde\psi^s(t^L) - d^s\|^2}{S},$

where $\|\tilde\psi^s(t^L) - d^s\|^2 \sim \sigma^2 \chi^2_{s, \delta(t^L, d^s)}$, with

(3.3) $\delta(t^L, d^s) = \frac{1}{\sigma^2}\,\|\psi^s(t^L) - d^s\|^2.$

We will replace these test statistics by

(3.4) $T(t^L) = \frac{\|\tilde\psi^s(t^L) - d^s\|^2}{S},$

because these statistics have "more manageable" distributions than the previous ones.
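A minimal numerical sketch of statistics (3.2) and (3.4): under $H_0(t^L, d^s)$ the statistic in (3.2) follows an $F_{s,g}$ distribution, and the statistic in (3.4) is just $s/g$ times it, so both tests use the same $F$ tail. The function and argument names here are illustrative assumptions.

```python
# Sketch of statistics (3.2) and (3.4); names are illustrative.
import numpy as np
from scipy import stats

def f_and_t_statistics(psi_hat, d, S, g):
    """psi_hat: estimate of psi^s(t^L); d: tested value d^s;
    S: residual sum of squares with g degrees of freedom."""
    s = len(psi_hat)
    num = float(np.sum((np.asarray(psi_hat) - np.asarray(d)) ** 2))
    F = (g / s) * num / S                  # statistic (3.2)
    T = num / S                            # statistic (3.4): T = (s/g) F
    # Under H0, F ~ F_{s,g}; the p-value for T uses the same distribution.
    p_value = float(stats.f.sf(T * g / s, s, g))
    return F, T, p_value
```

The rejection thresholds for $T(t^L)$ are therefore $F$-distribution quantiles rescaled by $s/g$.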
Let us consider associated models. The models obtained by taking

(3.5) $\tilde\beta^{c(h^L)}(h^L) = \beta^{c(h^L)}(h^L) + U^{c(h^L)}(h^L),\quad h^L \in \Delta_0,$

that is, by adding random vectors $Z^{g(t^L)}(t^L)$ to the vectors $\tilde\eta^{g(t^L)}(t^L)$, $t^L \in \Delta_0$, will be the associated models of the first type. Now, with $t^L \in \Delta_0^\emptyset$, we will have $\|\tilde\psi^s(t^L) - d^s\|^2 \sim \sigma^2 \chi^2_{s, V(t^L, d^s)}$, with

(3.6) $V(t^L, d^s) = \frac{1}{\sigma^2}\,\|\psi^s(t^L) - d^s + B(t^L)Z^{g(t^L)}(t^L)\|^2;\quad t^L \in \Delta_0.$

If $pr(Z^{g(t^L)}(t^L) = 0^{g(t^L)}) = 1$, $V(t^L, d^s)$ is constant (with probability one) and is null if and only if $H_0(t^L, d^s)$ holds.
We also have associated models of the second type, obtained by adding to $\bar P e^n$ a random vector $W^g$ independent of all the other random vectors considered in the model. Thus only $S$ will be disturbed, and we will have $S \sim \sigma^2 \chi^2_{g, V}$, with

(3.7) $V = \frac{1}{\sigma^2}\,\|W^g\|^2.$

We point out that $\chi^2_{g, V}$ results from randomizing the non-centrality parameter of a chi-square distribution.
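This randomization of the non-centrality parameter can be sketched directly; the helper name and interface below are assumptions for illustration.

```python
# Sketch of a draw of S ~ sigma^2 * chi^2_{g,V} with randomized
# non-centrality V = ||W^g||^2 / sigma^2, as in (3.7); names assumed.
import numpy as np

def randomized_ncx2(g, sigma2, W, rng):
    """One draw of sigma^2 * chi^2_{g,V} given the perturbation vector W."""
    V = float(np.dot(W, W)) / sigma2       # non-centrality (3.7)
    return sigma2 * rng.noncentral_chisquare(g, V)
```

With $W^g = 0^g$ almost surely this reduces to the undisturbed $\sigma^2 \chi^2_{g,0}$.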
Lastly, we have the associated models of the third type. These models are obtained by adding random vectors $Z^{g(t^L)}(t^L)$ [$W^g$] to the vectors $\tilde\eta^{g(t^L)}(t^L)$, $t^L \in \Delta_0^\emptyset$ [$\bar P e^n$]. We now have $\|\tilde\psi^s(t^L) - d^s\|^2 \sim \sigma^2 \chi^2_{s, V(t^L, d^s)}$ and $S \sim \sigma^2 \chi^2_{g, V}$. Once more, with $t^L \in \Delta_0^\emptyset$, whenever $pr(Z^{g(t^L)}(t^L) = 0^{g(t^L)}) = 1$, $V(t^L, d^s)$ will be fixed, and null if and only if $H_0(t^L, d^s)$ holds.
We now intend to derive selective F tests for certain alternatives.
To define selected alternatives to $H_0(t^L, d^s)$, Dias (1994, p. 21–24) used polar coordinates $(r, \theta_1, \dots, \theta_{s-1})$. For the $\theta_1, \dots, \theta_{s-1}$ we have the bounds

(3.8) $-\frac{\pi}{2} \le \theta_j \le \frac{\pi}{2};\quad j = 1, \dots, s-2,\qquad 0 \le \theta_{s-1} < 2\pi,$

which define the domain $D$. With $\theta^{s-1}(\psi^s - d^s)$ the vector of central angles $\theta_1, \dots, \theta_{s-1}$ associated to $\psi^s(t^L) - d^s$, when one of those alternatives holds we will have $\theta^{s-1}(\psi^s - d^s) \in C$ and $\|\psi^s(t^L) - d^s\|^2 > a$. We will represent by $\nu$ the set of vectors $\psi^s(t^L)$ associated to these alternatives.
The joint density of $T(t^L)$ and $\Theta^{s-1} = \theta^{s-1}(\psi^s - d^s)$ for normal models, see Nunes and Mexia (2003), will be

(3.9) $f(z, \theta^{s-1} \mid \rho^s, g) = \frac{e^{-\delta/2}\, h(\theta^{s-1})}{2\pi^{s/2}\,\Gamma\!\left(\frac{g}{2}\right)} \sum_{j=0}^{+\infty} 2^{j/2}\,\Gamma\!\left(\frac{g+s+j}{2}\right) \frac{a^j(\theta^{s-1})\, z^{\frac{j+s}{2}-1}}{j!\,(1+z)^{\frac{g+s+j}{2}}},\quad z > 0,\ \theta^{s-1} \in D,$

where

(3.10)
$\rho^s = \frac{1}{\sigma}\,(\psi^s(t^L) - d^s)$
$\|\ell^s(\theta^{s-1})\|^2 = 1$
$a(\theta^{s-1}) = \frac{1}{\sqrt{\sigma^2}}\,(\psi^s(t^L) - d^s)^\top \ell^s(\theta^{s-1})$
$h(\theta^{s-1}) = \cos^{s-2}\theta_1 \cdots \cos\theta_{s-2}.$
The components of $\ell^s(\theta^{s-1})$ are

(3.11)
$\ell_1(\theta^{s-1}) = \cos\theta_1 \cdots \cos\theta_{s-1}$
$\ell_2(\theta^{s-1}) = \cos\theta_1 \cdots \cos\theta_{s-2}\,\sin\theta_{s-1}$
$\vdots$
$\ell_j(\theta^{s-1}) = \cos\theta_1 \cdots \cos\theta_{s-j}\,\sin\theta_{s-j+1}$
$\vdots$
$\ell_s(\theta^{s-1}) = \sin\theta_1.$
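A small sketch evaluating the components in (3.11); the function name and the 0-based indexing of the angles are implementation choices, not from the paper.

```python
# Sketch of the components l_j(theta^{s-1}) of (3.11).
import math

def l_vec(theta):
    """theta = (theta_1, ..., theta_{s-1}); returns (l_1, ..., l_s)."""
    s = len(theta) + 1
    comps = []
    for j in range(1, s + 1):
        v = 1.0
        for i in range(s - j):             # cos(theta_1)...cos(theta_{s-j})
            v *= math.cos(theta[i])
        if j > 1:                          # sin(theta_{s-j+1}) when j >= 2
            v *= math.sin(theta[s - j])
        comps.append(v)
    return comps
```

Since these are spherical coordinates of a direction, $\sum_{j=1}^{s} \ell_j^2(\theta^{s-1}) = 1$ for every admissible $\theta^{s-1}$, matching the norm condition in (3.10).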
When $H_0(t^L, d^s)$ holds, $\rho^s = 0^s$ and the joint density will now be

(3.12) $f(z, \theta^{s-1} \mid 0^s, g) = f(z \mid s, g)\, f(\theta^{s-1}),$

where $f(z \mid s, g)$ is the density of $\frac{\chi^2_s}{\chi^2_g}$ and

$f(\theta^{s-1}) = \frac{\Gamma\!\left(\frac{s}{2}\right)}{2\pi^{s/2}}\, h(\theta^{s-1});\quad \theta^{s-1} \in D,$

so $T(t^L)$ and $\Theta^{s-1}$ will then be independent. Let us assume that we have a critical region $(k, C)$, rejecting the tested hypothesis when $T(t^L) > k$ and $\Theta^{s-1} \in C$; the test level will be
(3.13) $\mathrm{level}(k, C) = (1 - F(k \mid s, g)) \int_C \cdots \int f(\theta^{s-1}) \prod_{j=1}^{s-1} d\theta_j.$
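Expression (3.13) factors the level into an $F$-tail term and an angular probability. A hedged numerical sketch: since $f(\theta^{s-1})$ is the angular density of a uniform direction on the unit sphere, the angular integral over $C$ can be approximated by Monte Carlo; encoding $C$ as a predicate on the direction vector is an assumed interface, not the paper's.

```python
# Sketch of level(k, C) in (3.13). The first factor is P(T > k) under H0,
# with T = (s/g) * F_{s,g}; the integral of f(theta) over C is estimated
# by sampling uniform directions on the sphere. Names are assumed.
import numpy as np
from scipy import stats

def level(k, in_C, s, g, n_draws=100_000, seed=0):
    rng = np.random.default_rng(seed)
    tail = stats.f.sf(k * g / s, s, g)          # 1 - F(k|s, g)
    u = rng.standard_normal((n_draws, s))       # uniform directions
    u /= np.linalg.norm(u, axis=1, keepdims=True)
    return tail * np.mean([in_C(x) for x in u])
```

With $C$ the whole sphere the level reduces to the usual $F$-test level, which is how different pairs $(k, C)$ can trade angular selectivity against the threshold $k$.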
From this expression we can conclude that, for a given test level, there exist several pairs $(k, C)$. To choose a convenient pair, Dias (1994, p. 53) introduced the max-min tests. These tests maximize the minimum power for selected alternatives. The selected alternatives for which this minimum is attained will be the critical alternatives. It is possible to have more than one of these alternatives; Dias (1994, p. 53) describes a case with $s = 2$ in which there are two critical alternatives.
Now we go over to associated models of the first type. The selected alternatives will now be those such that $pr((\psi^s(t^L) + B(t^L)Z^{g(t^L)}(t^L)) \in \nu) = 1$.
Now we establish

Proposition 1. If the test is max-min for the normal model, it is also max-min for the associated models of the first type, with the same minimum power for selected alternatives.
Proof. With $pot(\psi^s)$ the power function for the normal model and $G(v^s)$ the distribution of $\tilde\psi^s(t^L) + B(t^L)Z^{g(t^L)}(t^L)$, the power for the associated model will be

(3.14) $Pot(G) = \int_\nu \cdots \int pot(v^s)\, dG(v^s).$

Let us now point out that, if $H_1: \psi^s(t^L) = \psi^s_c(t^L)$, $t^L \in \Delta_0^\emptyset$, is a critical alternative for the normal model, we will have $pot(\psi^s_c) = \min\{pot(\psi^s) : \psi^s(t^L) \in \nu\}$. Since $pr((\tilde\psi^s(t^L) + B(t^L)Z^{g(t^L)}(t^L)) \in \nu) = 1$, we have $Pot(G) \ge pot(\psi^s_c)$. This minimum is attained for the "degenerate" selected alternatives $\tilde\psi^s(t^L) + B(t^L)Z^{g(t^L)}(t^L) = \psi^s_c(t^L)$, so the proof is complete.
Thus we can extend directly the max-min tests constructed for normal models to the corresponding associated models of the first type.
When the infimum of the power for selected alternatives exceeds the test level, the test will be selectively unbiased. It suffices that the power for the critical alternatives, if they exist, exceeds the test level for the test to be selectively unbiased. It is easy to establish the