TESTING HYPOTHESES IN UNIVERSAL MODELS

Eva Fišerová

Department of Mathematical Analysis and Applied Mathematics
Faculty of Science, Palacký University
Tomkova 40, 779 00 Olomouc, Czech Republic
e-mail: fiserova@inf.upol.cz

Abstract

A linear regression model in which the design matrix need not have full column rank and the covariance matrix may be singular is considered. The problem of testing hypotheses on the mean value parameters is studied. Conditions under which a hypothesis can be tested, or need not be tested at all, are given. Explicit forms of test statistics based on residual sums of squares are presented.

Key words: universal linear model, unbiased estimator, testing hypotheses.

2000 Mathematics Subject Classification: 62J05, 62F03, 62F10.

1. Introduction

Let a linear regression model be under consideration. In general, no assumptions on the ranks of the design and covariance matrices are made. When testing linear hypotheses on the mean value parameters in universal (singular) models, three typical situations can occur: either a hypothesis cannot be tested, or a hypothesis need not be tested since it is automatically true, or a hypothesis can be tested.

The aim of the paper is to investigate the possible situations which can occur when testing hypotheses in universal models and to find proper test statistics based on residual sums of squares.

Supported by the Council of Czech Government MSM 6 198 959 214.


2. Notations and auxiliary statements

Let $A$ be an $m \times n$ matrix. Let $\mathcal{M}(A) = \{Au : u \in \mathbb{R}^n\} \subset \mathbb{R}^m$ and $\mathrm{Ker}(A) = \{u \in \mathbb{R}^n : Au = 0\} \subset \mathbb{R}^n$ denote the column space and the null space of the matrix $A$, respectively. Let $W$ be an $m \times m$ symmetric positive semidefinite matrix such that $\mathcal{M}(A) \subset \mathcal{M}(W)$. Then $P^W_A = A(A'WA)^-A'W$ denotes a projector on $\mathcal{M}(A)$ in the $W$-seminorm. The symbol $M^W_A$ means $I - P^W_A$. If $W = I$ (the identity matrix), the symbols $P_A$ and $M_A$ are used. The $W$-seminorm of $x$, $x \in \mathbb{R}^m$, is given by $\|x\|_W = \sqrt{x'Wx}$. The symbols $A^-$ and $A^+$ mean a g-inverse and the Moore–Penrose inverse of the matrix $A$, respectively.

Let $N$ be an $n \times n$ symmetric positive semidefinite matrix. The symbol $A^-_{m(N)}$ denotes the minimum $N$-seminorm g-inverse of the matrix $A$, i.e., the matrix $A^-_{m(N)}$ satisfies the equations

(1) $AA^-_{m(N)}A = A$, $\quad NA^-_{m(N)}A = A'(A^-_{m(N)})'N$.

One representation of the matrix $A^-_{m(N)}$ is

$A^-_{m(N)} = \begin{cases} N^-A'(AN^-A')^-, & \text{if } \mathcal{M}(A') \subset \mathcal{M}(N), \\ (N + A'A)^-A'[A(N + A'A)^-A']^-, & \text{otherwise}. \end{cases}$

In more detail cf. [4].
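For numerical work this representation can be transcribed directly. The following minimal sketch (Python with NumPy; the Moore–Penrose inverse is taken as one admissible choice of every g-inverse, and the inclusion $\mathcal{M}(A') \subset \mathcal{M}(N)$ is decided by a rank comparison) also checks the defining equations (1):

```python
import numpy as np

def min_seminorm_ginv(A, N, tol=1e-10):
    # One representation of A^-_{m(N)}; pinv serves as each g-inverse.
    rank_N = np.linalg.matrix_rank(N, tol=tol)
    if np.linalg.matrix_rank(np.hstack([N, A.T]), tol=tol) == rank_N:
        Nm = np.linalg.pinv(N)                    # case M(A') subset M(N)
        return Nm @ A.T @ np.linalg.pinv(A @ Nm @ A.T)
    Km = np.linalg.pinv(N + A.T @ A)              # otherwise
    return Km @ A.T @ np.linalg.pinv(A @ Km @ A.T)

# check of the defining equations (1) on a singular N
rng = np.random.default_rng(0)
A = rng.normal(size=(4, 6))
B = rng.normal(size=(6, 3))
N = B @ B.T                                       # rank 3, positive semidefinite
G = min_seminorm_ginv(A, N)
assert np.allclose(A @ G @ A, A)                  # A A^- A = A
assert np.allclose(N @ G @ A, (N @ G @ A).T)      # N A^- A = A'(A^-)'N
```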

Lemma 2.1. Let $\mathcal{M}(B) \subset \mathcal{M}(A)$ and $\mathcal{M}(B') \subset \mathcal{M}(C)$. Then

(2) $(A - BC^-B')^- = A^- + A^-B(C - B'A^-B)^-B'A^-$.

Proof. It is an obvious consequence of the Rohde theorem (cf., e.g., [1], p. 446, Lemma 10.1.40).

3. Universal model

The universal linear model is considered in the form

(3) $Y \sim N_n(X\beta, \Sigma)$, $\quad \beta \in \mathbb{R}^k$,

where $Y$ is an $n$-dimensional normally distributed random vector, $X\beta$ is the mean value of $Y$ and $\Sigma$ its covariance matrix. $X$ is a given matrix of the type $n \times k$ and $\Sigma$ is a given $n \times n$ symmetric positive semidefinite matrix.

The best linear unbiased estimator (BLUE) of the function $X\beta$ in the universal model (3) is (cf. [4], p. 148)

(4) $\widehat{X\beta} = X[(X')^-_{m(\Sigma)}]'Y = XD^-X'T^-Y$

with the covariance matrix

$\mathrm{Var}(\widehat{X\beta}) = XD^-X'T^-\Sigma T^-XD^-X' = X(D^- - I)X'$,

where

$T = \Sigma + XX'$, $\quad D = X'T^-X$.
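Formula (4) is easy to evaluate numerically. A minimal sketch, assuming `X`, `Sigma` and an observed `Y` are given as NumPy arrays, with `pinv` standing in for $T^-$ and $D^-$:

```python
import numpy as np

def blue_Xbeta(X, Sigma, Y):
    # BLUE of X*beta in model (3), formula (4)
    T = Sigma + X @ X.T                           # T = Sigma + XX'
    Tm = np.linalg.pinv(T)
    D = X.T @ Tm @ X                              # D = X'T^-X
    Dm = np.linalg.pinv(D)
    Xb = X @ Dm @ X.T @ Tm @ Y                    # XD^-X'T^-Y
    cov = X @ (Dm - np.eye(D.shape[0])) @ X.T     # Var = X(D^- - I)X'
    return Xb, cov
```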

Let a null hypothesis

$H_0\colon h + H\beta = 0$, $\quad h \in \mathcal{M}(H)$,

where $H$ is a given $q \times k$ matrix and $h$ is a given $q$-dimensional vector, be tested in the universal model (3) against an alternative hypothesis

$H_a\colon h + H\beta = \xi \ne 0$.

If the hypothesis is taken into account as constraints on the parameter $\beta$, the estimator of $X\beta$ can be determined in the following way. Let $\beta_0$ be any solution of the equation $h + H\beta = 0$. Then the parameter $\beta$, $\beta \in \{u : h + Hu = 0\}$, can be expressed with the help of a new parameter $\gamma$ as

$\beta = \beta_0 + K_H\gamma$, $\quad \gamma \in \mathbb{R}^{k-\mathrm{rank}(H)}$,

where $K_H$ is a $k \times [k - \mathrm{rank}(H)]$ matrix such that $\mathcal{M}(K_H) = \mathrm{Ker}(H)$.

The new model without constraints is of the form

$Y \sim N_n(X\beta_0 + XK_H\gamma, \Sigma)$, $\quad \gamma \in \mathbb{R}^{k-\mathrm{rank}(H)}$.

Hence the BLUE of $X\beta$ in the universal model (3) respecting the null hypothesis is

$\widehat{\widehat{X\beta}} = X\beta_0 + \widehat{XK_H\gamma} = X\beta_0 + XK_H[(K_H'X')^-_{m(\Sigma)}]'(Y - X\beta_0)$.

Since

$\mathcal{M}(M_{H'}) = \mathrm{Ker}(H)$, $\quad \mathcal{M}(XK_H) = \mathcal{M}(XM_{H'})$, $\quad$ and $\quad P\{Y - X\beta_0 \in \mathcal{M}(\Sigma)\} = 1$,

it holds that

$XK_H[(K_H'X')^-_{m(\Sigma)}]'(Y - X\beta_0) = XM_{H'}[(M_{H'}X')^-_{m(\Sigma)}]'(Y - X\beta_0)$

and thus

(5) $\widehat{\widehat{X\beta}} = X\beta_0 + XM_{H'}[(M_{H'}X')^-_{m(\Sigma)}]'(Y - X\beta_0)$.

The symbol $\widehat{\ }$ denotes the estimator in the universal model (3) and $\widehat{\widehat{\ }}$ denotes the estimator in the universal model (3) respecting the null hypothesis.
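Formula (5) admits the same numerical treatment. A sketch reusing the `min_seminorm_ginv` helper from Section 2 (the helper name is a convention of these sketches, not of the paper); since $h \in \mathcal{M}(H)$, the Moore–Penrose solution $\beta_0 = -H^+h$ is one admissible particular solution:

```python
import numpy as np

def blue_Xbeta_constrained(X, Sigma, Y, H, h):
    # BLUE of X*beta respecting H0: h + H*beta = 0, formula (5)
    k = X.shape[1]
    beta0 = -np.linalg.pinv(H) @ h              # solves h + H*beta = 0 (h in M(H))
    M_H = np.eye(k) - np.linalg.pinv(H) @ H     # M_{H'} = I - P_{H'}
    G = min_seminorm_ginv(M_H @ X.T, Sigma)     # (M_{H'}X')^-_{m(Sigma)}
    return X @ beta0 + X @ M_H @ G.T @ (Y - X @ beta0)
```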

4. Testing linear hypotheses

Here the approach of $\chi^2$-tests based on the residual sums of squares

$R_0^2 = (Y - \widehat{X\beta})'[\mathrm{Var}(Y - \widehat{X\beta})]^-(Y - \widehat{X\beta})$,

$R_1^2 = (Y - \widehat{\widehat{X\beta}})'[\mathrm{Var}(Y - \widehat{\widehat{X\beta}})]^-(Y - \widehat{\widehat{X\beta}})$

is used (cf. [3], pp. 153–157, the first and the second theorems of the least squares theory).

Lemma 4.1. Let in the universal model (3) the null hypothesis be considered.

(i) The matrices $\Sigma^-$, $(\Sigma + XM_{H'}X')^-$ and $(\Sigma + XX')^-$ can be chosen as g-inverses of the matrix $\mathrm{Var}(Y - \widehat{X\beta})$.

(ii) The matrices $\Sigma^-$ and $(\Sigma + XM_{H'}X')^-$ can be chosen as g-inverses of the matrix $\mathrm{Var}(Y - \widehat{\widehat{X\beta}})$.

Proof. Obviously the covariance matrices of the residual vectors are

$\mathrm{Var}(Y - \widehat{X\beta}) = \Sigma - X[(X')^-_{m(\Sigma)}]'\Sigma$,

$\mathrm{Var}(Y - \widehat{\widehat{X\beta}}) = \Sigma - XM_{H'}[(M_{H'}X')^-_{m(\Sigma)}]'\Sigma$.

Let the matrix $(\Sigma + XM_{H'}X')^-$ be chosen. Then

$\{\Sigma - X[(X')^-_{m(\Sigma)}]'\Sigma\}(\Sigma + XM_{H'}X')^-\{\Sigma - \Sigma(X')^-_{m(\Sigma)}X'\}$
$= \{I - X[(X')^-_{m(\Sigma)}]'\}\Sigma(\Sigma + XM_{H'}X')^-(\Sigma + XM_{H'}X' - XM_{H'}X')\{I - (X')^-_{m(\Sigma)}X'\}$
$= \{I - X[(X')^-_{m(\Sigma)}]'\}\Sigma\{I - (X')^-_{m(\Sigma)}X'\} = \mathrm{Var}(Y - \widehat{X\beta})$.

The other statements can be proved in an analogous way.
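Computationally, Lemma 4.1 allows both residual sums of squares to be formed with the single matrix $(\Sigma + XM_{H'}X')^-$. A sketch building on the helpers introduced above:

```python
import numpy as np

def R1_minus_R0(X, Sigma, Y, H, h):
    # R_1^2 - R_0^2 with the common g-inverse (Sigma + X M_{H'} X')^-
    k = X.shape[1]
    M_H = np.eye(k) - np.linalg.pinv(H) @ H
    W = np.linalg.pinv(Sigma + X @ M_H @ X.T)     # one choice from Lemma 4.1
    r0 = Y - blue_Xbeta(X, Sigma, Y)[0]
    r1 = Y - blue_Xbeta_constrained(X, Sigma, Y, H, h)
    return r1 @ W @ r1 - r0 @ W @ r0
```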

Theorem 4.2. Let in the universal model (3) the null hypothesis be considered. Let $\mathcal{M}(H') \cap \mathcal{M}(X') = \{0\}$. Then $R_1^2 - R_0^2 = 0$, i.e., the hypothesis cannot be tested with the help of the statistic $R_1^2 - R_0^2$.

Proof. If $\mathcal{M}(H') \cap \mathcal{M}(X') = \{0\}$, then $\mathcal{M}(XM_{H'}) = \mathcal{M}(X)$, since

$\mathrm{rank}(X) + \mathrm{rank}(H) = \mathrm{rank}\binom{X}{H} = \mathrm{rank}(XM_{H'}) + \mathrm{rank}(H)$

(cf. [4], p. 137). Thus

$Y - \widehat{X\beta} = \{I - X[(X')^-_{m(\Sigma)}]'\}Y = \{I - X[(X')^-_{m(\Sigma)}]'\}(Y - X\beta_0)$
$= \{I - XM_{H'}[(M_{H'}X')^-_{m(\Sigma)}]'\}(Y - X\beta_0) = Y - \widehat{\widehat{X\beta}}$

and with respect to Lemma 4.1 we obtain $R_1^2 - R_0^2 = 0$.

The last theorem implies that those rows of the matrix $H$ which cannot be obtained from the rows of the matrix $X$ by a linear combination cannot be used in the hypothesis. Therefore, in the following text $\mathcal{M}(H') \subset \mathcal{M}(X')$ is assumed. Moreover, the assumption $\mathcal{M}(H') \subset \mathcal{M}(X')$ implies that the vector function $H\beta$ is unbiasedly estimable in the universal model (3), with

$\widehat{H\beta} = H[(X')^-_{m(\Sigma)}]'Y = HD^-X'T^-Y$,

$\mathrm{Var}(\widehat{H\beta}) = HD^-X'T^-\Sigma T^-XD^-H' = H(D^- - I)H'$.

Before a theorem on a test of a linear hypothesis, some auxiliary statements must be proved.

Lemma 4.3. Let in the universal model (3) the null hypothesis be considered.

(i) As a minimum $\Sigma$-seminorm g-inverse $(X')^-_{m(\Sigma)}$ of the matrix $X'$, the matrices $(X')^-_{m(\Sigma+XM_{H'}X')}$ and $(X')^-_{m(\Sigma+XX')}$ can also be chosen.

(ii) As a minimum $\Sigma$-seminorm g-inverse $(M_{H'}X')^-_{m(\Sigma)}$ of the matrix $M_{H'}X'$, the matrix $(M_{H'}X')^-_{m(\Sigma+XM_{H'}X')}$ can also be chosen.

Proof. (i) Both matrices $(X')^-_{m(\Sigma+XM_{H'}X')}$ and $(X')^-_{m(\Sigma+XX')}$ are g-inverses of the matrix $X'$. Thus it suffices to prove the symmetry of the matrices $\Sigma(X')^-_{m(\Sigma+XM_{H'}X')}X'$ and $\Sigma(X')^-_{m(\Sigma+XX')}X'$. The matrix

$(\Sigma + XM_{H'}X')(X')^-_{m(\Sigma+XM_{H'}X')}X'$

is symmetric with respect to the definition of the matrix $(X')^-_{m(\Sigma+XM_{H'}X')}$. Since

$(\Sigma + XM_{H'}X')(X')^-_{m(\Sigma+XM_{H'}X')}X' = \Sigma(X')^-_{m(\Sigma+XM_{H'}X')}X' + XM_{H'}X'$,

the matrix $\Sigma(X')^-_{m(\Sigma+XM_{H'}X')}X'$ is symmetric too. Analogously for the matrix $\Sigma(X')^-_{m(\Sigma+XX')}X'$.

(ii) It can be proved in the same way as (i).

Lemma 4.4. Let in the universal model (3) the null hypothesis be under consideration. Let $\mathcal{M}(X) \subset \mathcal{M}(\Sigma + XM_{H'}X')$ and $\mathcal{M}(H') \subset \mathcal{M}(X')$. Then one choice of the g-inverse of the matrix $\mathrm{Var}(\widehat{H\beta} + h)$ is

$\{H[X'(\Sigma + XM_{H'}X')^-X]^-H'\}^-$.

Proof. The covariance matrix of the random vector $\widehat{H\beta} + h$ is

$\mathrm{Var}(\widehat{H\beta}) = H[X'(\Sigma + XM_{H'}X')^-X]^- X'(\Sigma + XM_{H'}X')^-\Sigma(\Sigma + XM_{H'}X')^- X[X'(\Sigma + XM_{H'}X')^-X]^-H'$,

since the assumption $\mathcal{M}(X) \subset \mathcal{M}(\Sigma + XM_{H'}X')$ implies that one version of the minimum $\Sigma$-seminorm g-inverse of the matrix $X'$ is

$(X')^-_{m(\Sigma)} = (\Sigma + XM_{H'}X')^-X[X'(\Sigma + XM_{H'}X')^-X]^-$.

The last term of the expression for $\mathrm{Var}(\widehat{H\beta})$ is the matrix $H'$ and thus it is sufficient to prove the equality

$H'\{H[X'(\Sigma + XM_{H'}X')^-X]^-H'\}^-\mathrm{Var}(\widehat{H\beta}) = H'$.

It holds that

$H'\{H[X'(\Sigma + XM_{H'}X')^-X]^-H'\}^-\mathrm{Var}(\widehat{H\beta})$
$= H'\{H[X'(\Sigma + XM_{H'}X')^-X]^-H'\}^- H[X'(\Sigma + XM_{H'}X')^-X]^- X'(\Sigma + XM_{H'}X')^-(\Sigma + XM_{H'}X' - XM_{H'}X')(\Sigma + XM_{H'}X')^- X[X'(\Sigma + XM_{H'}X')^-X]^-H'$
$= H'\{H[X'(\Sigma + XM_{H'}X')^-X]^-H'\}^- H[X'(\Sigma + XM_{H'}X')^-X]^- [X'(\Sigma + XM_{H'}X')^-X][X'(\Sigma + XM_{H'}X')^-X]^-H'$
$= H'$.

The following theorem deals with a test of a linear hypothesis under the special condition $\mathcal{M}(X) \subset \mathcal{M}(\Sigma + XM_{H'}X')$.

Theorem 4.5. Let in the universal model (3) the null hypothesis be considered. If $\mathcal{M}(H') \subset \mathcal{M}(X')$ and $\mathcal{M}(X) \subset \mathcal{M}(\Sigma + XM_{H'}X')$, the test statistic is

$R_1^2 - R_0^2 = (\widehat{H\beta} + h)'\{H[X'(\Sigma + XM_{H'}X')^-X]^-H'\}^-(\widehat{H\beta} + h)$.

The statistic $R_1^2 - R_0^2$ has the central chi-squared distribution when the null hypothesis is true and the noncentral chi-squared distribution when the null hypothesis is not true; the parameter of noncentrality is

$\delta = (H\bar\beta + h)'\{H[X'(\Sigma + XM_{H'}X')^-X]^-H'\}^-(H\bar\beta + h)$,

where $\bar\beta$ is the actual value of the parameter $\beta$. The degrees of freedom are $f = \mathrm{rank}(H)$.

Proof. According to Lemma 4.3, the matrix $(M_{H'}X')^-_{m(\Sigma+XM_{H'}X')}$ is one choice of the minimum $\Sigma$-seminorm g-inverse of the matrix $M_{H'}X'$. Thus, using the relations

$[(M_{H'}X')^-_{m(\Sigma+XM_{H'}X')}]' = [M_{H'}X'(\Sigma + XM_{H'}X')^-XM_{H'}]^- M_{H'}X'(\Sigma + XM_{H'}X')^-$,

$[M_{H'}X'(\Sigma + XM_{H'}X')^-XM_{H'}]^- = [X'(\Sigma + XM_{H'}X')^-X]^- - [X'(\Sigma + XM_{H'}X')^-X]^-H' \{H[X'(\Sigma + XM_{H'}X')^-X]^-H'\}^- H[X'(\Sigma + XM_{H'}X')^-X]^-$,

$(X')^-_{m(\Sigma)} = (X')^-_{m(\Sigma+XM_{H'}X')}$

and

$\mathcal{M}(H') \subset \mathcal{M}(X') \Rightarrow H[(X')^-_{m(\Sigma+XM_{H'}X')}]'X\beta_0 = H\beta_0 = -h$,

the estimator $\widehat{\widehat{X\beta}}$ given by (5) can be rewritten as

$\widehat{\widehat{X\beta}} = \widehat{X\beta} + k$,

where

$k = -X[X'(\Sigma + XM_{H'}X')^-X]^-H'\{H[X'(\Sigma + XM_{H'}X')^-X]^-H'\}^-(\widehat{H\beta} + h)$.

Then

$R_1^2 - R_0^2 = (Y - \widehat{\widehat{X\beta}})'(\Sigma + XM_{H'}X')^-(Y - \widehat{\widehat{X\beta}}) - (Y - \widehat{X\beta})'(\Sigma + XM_{H'}X')^-(Y - \widehat{X\beta})$
$= (Y - \widehat{X\beta} - k)'(\Sigma + XM_{H'}X')^-(Y - \widehat{X\beta} - k) - (Y - \widehat{X\beta})'(\Sigma + XM_{H'}X')^-(Y - \widehat{X\beta})$
$= -2k'(\Sigma + XM_{H'}X')^-(Y - \widehat{X\beta}) + k'(\Sigma + XM_{H'}X')^-k$.

It is easy to show that

$k'(\Sigma + XM_{H'}X')^-(Y - \widehat{X\beta}) = 0$

and

$k'(\Sigma + XM_{H'}X')^-k = (\widehat{H\beta} + h)'\{H[X'(\Sigma + XM_{H'}X')^-X]^-H'\}^-(\widehat{H\beta} + h)$.

Further,

$f = \mathrm{rank}[\mathrm{Var}(\widehat{H\beta})]$
$= \mathrm{rank}\{H[X'(\Sigma + XM_{H'}X')^-X]^- X'(\Sigma + XM_{H'}X')^-(\Sigma + XM_{H'}X' - XM_{H'}X')(\Sigma + XM_{H'}X')^- X[X'(\Sigma + XM_{H'}X')^-X]^-H'\}$
$= \mathrm{rank}\{H[X'(\Sigma + XM_{H'}X')^-X]^-H' - HM_{H'}H'\} = \mathrm{rank}(H)$.

The rest of the proof is obvious.
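Under the assumptions of Theorem 4.5 the statistic can thus be computed from $\widehat{H\beta} + h$ alone, without forming any residuals. A sketch (SciPy supplies the central quantile; `pinv` again replaces every g-inverse):

```python
import numpy as np
from scipy.stats import chi2

def chi2_test_thm45(X, Sigma, Y, H, h, alpha=0.05):
    # statistic of Theorem 4.5; assumes M(H') in M(X')
    # and M(X) in M(Sigma + X M_{H'} X')
    k = X.shape[1]
    M_H = np.eye(k) - np.linalg.pinv(H) @ H
    W = np.linalg.pinv(Sigma + X @ M_H @ X.T)
    C = np.linalg.pinv(X.T @ W @ X)             # [X'(Sigma+XM_{H'}X')^-X]^-
    Hb = H @ C @ X.T @ W @ Y                    # Hbeta-hat
    stat = (Hb + h) @ np.linalg.pinv(H @ C @ H.T) @ (Hb + h)
    f = np.linalg.matrix_rank(H)                # degrees of freedom
    return stat, chi2.ppf(1 - alpha, df=f)      # reject H0 if stat >= quantile
```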

The condition $\mathcal{M}(X) \subset \mathcal{M}(\Sigma + XM_{H'}X')$, which enables us to state the result on testing linear hypotheses in the classical form, is an obstacle to a general solution of the problem. Therefore we shall investigate the situation in which $\mathcal{M}(H') \subset \mathcal{M}(X')$ is assumed, while $\mathcal{M}(X) \subset \mathcal{M}(\Sigma + XM_{H'}X')$ is not assumed.

Theorem 4.6. Let in the universal model (3) the null hypothesis be considered. Let $\mathcal{M}(H') \subset \mathcal{M}(X')$. Then the BLUE of the vector $\binom{X}{H}\beta$ is

$\widehat{\widehat{\binom{X}{H}\beta}} = \binom{\widehat{X\beta}}{\widehat{H\beta}} - \binom{X}{H}(D^- - I)H'[H(D^- - I)H']^-(\widehat{H\beta} + h)$.

The expression $\widehat{\widehat{\binom{X}{H}\beta}}$ is invariant with respect to the choice of the g-inverse.

Proof. The universal model (3) with the null hypothesis can be written in the form

$\binom{Y}{-h} \sim N_{n+q}\left[\binom{X}{H}\beta, \begin{pmatrix}\Sigma & 0\\ 0 & 0\end{pmatrix}\right]$.

Then the sought estimator is

$\widehat{\widehat{\binom{X}{H}\beta}} = \binom{X}{H}\Bigl[(X', H')^-_{m\left(\begin{smallmatrix}\Sigma & 0\\ 0 & 0\end{smallmatrix}\right)}\Bigr]'\binom{Y}{-h}$.

Further,

$\Bigl[(X', H')^-_{m\left(\begin{smallmatrix}\Sigma & 0\\ 0 & 0\end{smallmatrix}\right)}\Bigr]' = \Bigl[(X', H')\begin{pmatrix}T & XH'\\ HX' & HH'\end{pmatrix}^-\binom{X}{H}\Bigr]^-(X', H')\begin{pmatrix}T & XH'\\ HX' & HH'\end{pmatrix}^-$
$= \Bigl[(X', H')\begin{pmatrix}Q & R\\ S & U\end{pmatrix}\binom{X}{H}\Bigr]^-(X', H')\begin{pmatrix}Q & R\\ S & U\end{pmatrix}$,

where (cf. the Rohde theorem, e.g., in [1], p. 446, Lemma 10.1.40)

$Q = T^- + T^-XH'[H(I - D)H']^-HX'T^-$, $\quad R = -T^-XH'[H(I - D)H']^-$,
$S = -[H(I - D)H']^-HX'T^-$, $\quad U = [H(I - D)H']^-$.

Then the expression $(X', H')\begin{pmatrix}Q & R\\ S & U\end{pmatrix}$ can be rewritten as

$\bigl(X'T^- - (I - D)H'[H(I - D)H']^-HX'T^-,\; (I - D)H'[H(I - D)H']^-\bigr)$

and

$(X', H')\begin{pmatrix}Q & R\\ S & U\end{pmatrix}\binom{X}{H} = D + (I - D)H'[H(I - D)H']^-H(I - D)$.

Further, with respect to formula (2) we have

$\{D + (I - D)H'[H(I - D)H']^-H(I - D)\}^-$
$= D^- - D^-(I - D)H'\bigl[H(I - D)H' + H(I - D)D^-(I - D)H'\bigr]^-H(I - D)D^-$

and, using $DD^-H' = H'$, we obtain

$H(I - D)H' + H(I - D)D^-(I - D)H' = HH' + HD^-H' - 2HDD^-H' = H(D^- - I)H'$.

Now the expression for the BLUE of the vector $\binom{X}{H}\beta$ is obvious.

The statement on an arbitrary choice of a g-inverse is a consequence of the relationship

$P\left\{\binom{Y}{-h} \in \mathcal{M}\left[\begin{pmatrix}\Sigma & 0\\ 0 & 0\end{pmatrix} + \binom{X}{H}(X', H')\right]\right\} = 1$.

Theorem 4.7. Let in the universal model (3) the null hypothesis be considered. If $\mathcal{M}(H') \subset \mathcal{M}(X')$, then the test statistic is

$R_1^2 - R_0^2 = (\widehat{H\beta} + h)'[H(D^- - I)H']^-(\widehat{H\beta} + h)$,

where $H(D^- - I)H' = \mathrm{Var}(\widehat{H\beta} + h)$.

The statistic $R_1^2 - R_0^2$ has the central chi-squared distribution when the null hypothesis is true and the noncentral chi-squared distribution when the null hypothesis is not true; the parameter of noncentrality is

$\delta = (H\bar\beta + h)'[H(D^- - I)H']^-(H\bar\beta + h)$,

where $\bar\beta$ is the actual value of the parameter $\beta$. The degrees of freedom are $f = \mathrm{rank}[H(D^- - I)H']$.

Proof. With respect to Lemma 4.1, as a g-inverse of both matrices $\mathrm{Var}(Y - \widehat{\widehat{X\beta}})$ and $\mathrm{Var}(Y - \widehat{X\beta})$ the matrix $W^- = (\Sigma + XM_{H'}X')^-$ can be chosen, and at the same time the quadratic forms

$(Y - \widehat{\widehat{X\beta}})'[\mathrm{Var}(Y - \widehat{\widehat{X\beta}})]^-(Y - \widehat{\widehat{X\beta}})$ and $(Y - \widehat{X\beta})'[\mathrm{Var}(Y - \widehat{X\beta})]^-(Y - \widehat{X\beta})$

are invariant with respect to the choice of the g-inverse. Let

$\widehat{\widehat{X\beta}} = \widehat{X\beta} - Xa$, where $a = (D^- - I)H'[H(D^- - I)H']^-(\widehat{H\beta} + h)$.

Then

$(Y - \widehat{\widehat{X\beta}})'W^-(Y - \widehat{\widehat{X\beta}}) = (Y - \widehat{X\beta})'W^-(Y - \widehat{X\beta}) - 2(Y - \widehat{X\beta})'W^-Xa + a'X'W^-Xa$.

In the following consideration the matrix $W^+$ will be used instead of $W^-$; thus we can proceed in a simpler way.

At first the validity of the equality $(Y - \widehat{X\beta})'W^+Xa = 0$ will be proved. Using the relationship (2), the identity $(\Sigma + XM_{H'}X')^+ = (T - XP_{H'}X')^+$ can be rewritten as

$(T - XP_{H'}X')^+ = T^+ + T^+XP_{H'}\bigl(I - P_{H'}X'T^+XP_{H'}\bigr)^+P_{H'}X'T^+$.

Then we obtain

$(Y - \widehat{X\beta})'W^+Xa = Y'\bigl(I - T^+XD^-X'\bigr)\bigl[T^+ + T^+XP_{H'}\bigl(I - P_{H'}X'T^+XP_{H'}\bigr)^+P_{H'}X'T^+\bigr]Xa = 0$,

since $\bigl(I - T^+XD^-X'\bigr)T^+X = 0$.

Further,

$a'X'W^+Xa = (\widehat{H\beta} + h)'[H(D^- - I)H']^-H(D^- - I)X'(\Sigma + XM_{H'}X')^+X(D^- - I)H'[H(D^- - I)H']^-(\widehat{H\beta} + h)$
$= (\widehat{H\beta} + h)'[H(D^+ - P_{H'})H']^+H(D^+ - P_{H'})X'(\Sigma + XM_{H'}X')^+X(D^+ - P_{H'})H'[H(D^+ - P_{H'})H']^+(\widehat{H\beta} + h)$
$= (\widehat{H\beta} + h)'[H(D^+ - I)H']^+(\widehat{H\beta} + h) = (\widehat{H\beta} + h)'[H(D^- - I)H']^-(\widehat{H\beta} + h)$,

since

$X'(\Sigma + XM_{H'}X')^+X = X'\bigl[T^+ + T^+XP_{H'}(I - P_{H'}DP_{H'})^+P_{H'}X'T^+\bigr]X = D + DP_{H'}(I - P_{H'}DP_{H'})^+P_{H'}D = (D^+ - P_{H'})^+$.

Finally,

$\mathrm{Var}(\widehat{H\beta} + h) = HD^-X'T^-\Sigma T^-XD^-H' = H(D^- - I)H'$.

The other statements are obvious.
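Theorem 4.7 needs only $T$ and $D$, so the statistic can be sketched as follows; the degrees of freedom are taken as the numerical rank of $H(D^- - I)H'$, which may be sensitive to the tolerance (cf. Remark 4.9 below):

```python
import numpy as np

def chi2_test_thm47(X, Sigma, Y, H, h):
    # statistic of Theorem 4.7, via T = Sigma + XX' and D = X'T^-X
    T = Sigma + X @ X.T
    Tm = np.linalg.pinv(T)
    D = X.T @ Tm @ X
    Dm = np.linalg.pinv(D)
    Hb = H @ Dm @ X.T @ Tm @ Y                  # Hbeta-hat = HD^-X'T^-Y
    V = H @ (Dm - np.eye(D.shape[0])) @ H.T     # Var(Hbeta-hat + h)
    stat = (Hb + h) @ np.linalg.pinv(V) @ (Hb + h)
    f = np.linalg.matrix_rank(V)                # degrees of freedom
    return stat, f
```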

Theorem 4.8. Let in the universal model (3) the null hypothesis be considered. If $\mathcal{M}(H') \subset \mathcal{M}(X'M_\Sigma)$, then the hypothesis need not be tested, since in this case $P\{\widehat{H\beta} + h = 0\} = 1$.

Proof. This statement is implied by the fact that $\mathrm{Var}(\widehat{H\beta}) = 0$. It is a consequence of the relationship

$\mathcal{M}(H') \subset \mathcal{M}(X'M_\Sigma) \Rightarrow \exists E\colon H' = X'M_\Sigma E$,

which implies

$\mathrm{Var}(\widehat{H\beta}) = \mathrm{Var}\bigl(E'M_\Sigma X[(X')^-_{m(\Sigma)}]'Y\bigr) = E'M_\Sigma X[(X')^-_{m(\Sigma)}]'\Sigma(X')^-_{m(\Sigma)}X'M_\Sigma E = E'M_\Sigma X[(X')^-_{m(\Sigma)}]'\Sigma M_\Sigma E = 0$.

If the null hypothesis is not true, the test statistic $R_1^2 - R_0^2$ has the noncentral chi-squared distribution with $f$ degrees of freedom and the parameter of noncentrality

$\delta = (H\bar\beta + h)'[\mathrm{Var}(\widehat{H\beta} + h)]^-(H\bar\beta + h)$,

where $\bar\beta$ is the true value of the parameter $\beta$. The power of the test at the point $\xi$ is

$p(\xi) = P\bigl\{R_1^2 - R_0^2 \ge \chi^2_f(0; 1 - \alpha) \bigm| H_a\colon H\beta + h = \xi \ne 0\bigr\}$.

Here $\chi^2_f(0; 1 - \alpha)$ is the $(1 - \alpha)$-quantile of the central chi-squared distribution with $f$ degrees of freedom.

The random variable $R_1^2 - R_0^2 \sim \chi^2_f(\delta)$ can be approximated by (cf. [2])

$\chi^2_f(\delta) \approx c^2\chi^2_g(0)$, where $c^2 = \dfrac{f + 2\delta}{f + \delta}$, $\quad g = \dfrac{(f + \delta)^2}{f + 2\delta}$.
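Both the exact power and this approximation can be evaluated with SciPy's central and noncentral chi-squared distributions; a sketch, assuming $f$ and $\delta$ are already known:

```python
from scipy.stats import chi2, ncx2

def power_and_approx(f, delta, alpha=0.05):
    # exact power of the chi-squared test and the central approximation of [2]
    crit = chi2.ppf(1 - alpha, df=f)            # chi^2_f(0; 1 - alpha)
    exact = ncx2.sf(crit, df=f, nc=delta)       # P(R_1^2 - R_0^2 >= crit)
    c2 = (f + 2 * delta) / (f + delta)
    g = (f + delta) ** 2 / (f + 2 * delta)
    approx = chi2.sf(crit / c2, df=g)           # chi^2_f(delta) ~ c^2 chi^2_g(0)
    return exact, approx
```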

Remark 4.9. It is to be pointed out that in practical computing it is necessary to be very careful, since in some situations the derived formulae can be numerically unstable. For example, small numbers on the main diagonal of the matrix $\Sigma$ can make the covariance matrix $H(D^- - I)H'$ numerically unstable. In practice it is useful to compute both expressions

$\mathrm{Var}(\widehat{H\beta} + h) = H(D^- - I)H'$,

$\mathrm{Var}(\widehat{H\beta} + h) = HD^-X'T^-\Sigma T^-XD^-H'$,

and to compare the obtained results. If they are different, one can use, for example, the substitution $\Sigma \to k\Sigma$, $X \to \sqrt{k}X$, where $k > 0$ is a sufficiently large number, and compute them once more. This substitution does not influence the value of the original covariance matrix.

Another problem can occur when the degrees of freedom are computed. Here, e.g., the expressions $\mathrm{rank}[(\Sigma + XX')^-X]$ or $\mathrm{rank}[(\Sigma + XM_{H'}X')^-X]$ can be numerically unstable.

5. Example

Example 5.1. Let a linear part of a high-speed railway line be under consideration. One of the safety conditions is that the rails are in a line. For the sake of simplicity let the problem be studied in the plane only. An experiment for the verification that the rails are in a straight line can be done, e.g., in the following way. Firstly, points $X_i$, $i = 1, \dots, 4$, are chosen somewhere on the rails. Then other points $Z_1, Z_2, Z_3$ are chosen around the rails such that all distances $Z_iX_j$ and $Z_iZ_k$, $i, k = 1, 2, 3$, $i \ne k$, $j = 1, \dots, 4$, can be observed. Finally, the points $X_i$, $Z_j$ are put into a proper coordinate system (the map), see Figure 1. Let each distance be measured just once. Let the distances $Z_iX_j$, $i, j = 1, 2$, and $Z_1Z_2$ have been measured in a previous experiment by the Väisälä interferometer, i.e., the accuracy of the measurement is practically $\sigma_1 = 0$ (cf. [5], p. 50). Let the other distances be measured by an optical range-finder with the accuracy $\sigma = 0.01$ m. The problem is to test the hypothesis that all points $X_i$, $i = 1, \dots, 4$, are located on a straight line.

[Figure 1. The design of the experiment: the points $Z_1, Z_2, Z_3$ and $X_1, \dots, X_4$ in the $x$–$y$ plane (coordinates in meters) together with the fifteen measured distances $y_1, \dots, y_{15}$.]

Let the notation

• $y = (y_1, \dots, y_{15})'$ … the vector of observed distances $Z_iX_j$, $Z_iZ_k$, $i, k = 1, 2, 3$, $i \ne k$, $j = 1, \dots, 4$,

• $\beta = (\beta_1, \dots, \beta_{14})'$ … the vector of unknown coordinates of the points, $X_i = [\beta_{2i-1}, \beta_{2i}]'$, $i = 1, \dots, 4$, and $Z_{i-4} = [\beta_{2i-1}, \beta_{2i}]'$, $i = 5, 6, 7$,

be used. The mentioned process of measurement can be modelled by

$Y \sim N_{15}(f(\beta), \Sigma)$,

where, e.g.,

$f_1(\beta) = \sqrt{(\beta_9 - \beta_1)^2 + (\beta_{10} - \beta_2)^2}$,

and $\Sigma$ is a diagonal matrix given by

$\Sigma = \mathrm{Diag}\bigl(0, 0, \sigma^2, \sigma^2, 0, 0, \underbrace{\sigma^2, \dots, \sigma^2}_{6\times}, 0, \sigma^2, \sigma^2\bigr)$.

The linearized version of the model can be written in the form

$Y - f(\beta^{(0)}) \sim N_{15}(X\Delta\beta, \Sigma)$, $\quad \Delta\beta = \beta - \beta^{(0)}$,

where $\beta^{(0)}$ are approximate values of the vector $\beta$ and

$X = \left.\dfrac{\partial f(u)}{\partial u'}\right|_{u=\beta^{(0)}}$.

Let the straight line $p_i$ intersect the points $X_i$ and $X_{i+1}$, $i = 1, 2, 3$. The straight lines $p_i$ can be expressed as

$p_i\colon y = a_i + b_ix$, $\quad i = 1, 2, 3$,

where

$a_i = \beta_{2i} - \beta_{2i-1}\dfrac{\beta_{2i} - \beta_{2i+2}}{\beta_{2i-1} - \beta_{2i+1}}$, $\quad b_i = \dfrac{\beta_{2i} - \beta_{2i+2}}{\beta_{2i-1} - \beta_{2i+1}}$.

The problem is to test the null hypothesis

$H_0\colon p_1 = p_2 = p_3$

against the alternative hypothesis

$H_a\colon \exists\, i \ne j\colon p_i \ne p_j$, $\quad i, j \in \{1, 2, 3\}$.

The straight lines $p_1$ and $p_2$ are identical if and only if $a_1 = a_2$ and $b_1 = b_2$, i.e.,

$b_1 = b_2 = b \Leftrightarrow \dfrac{\beta_2 - \beta_4}{\beta_1 - \beta_3} = \dfrac{\beta_4 - \beta_6}{\beta_3 - \beta_5} = b$

and

$a_1 = a_2 \Leftrightarrow \beta_2 - \beta_1 b = \beta_4 - \beta_3 b \Leftrightarrow \dfrac{\beta_2 - \beta_4}{\beta_1 - \beta_3} = b$.

Analogously for $p_2, p_3$ and $p_1, p_3$. Thus

$p_1 = p_2 = p_3 \Leftrightarrow g(\beta) = 0$,

where

$g_i(\beta) = (\beta_{2i} - \beta_{2i+2})(\beta_{2i+1} - \beta_{2i+3}) - (\beta_{2i-1} - \beta_{2i+1})(\beta_{2i+2} - \beta_{2i+4})$, $\quad i = 1, 2$,

and

$g_3(\beta) = (\beta_2 - \beta_4)(\beta_5 - \beta_7) - (\beta_1 - \beta_3)(\beta_6 - \beta_8)$.

The linearized version of the null hypothesis can be written as $H_0\colon H\Delta\beta = 0$, where

$H = \left.\dfrac{\partial g(u)}{\partial u'}\right|_{u=\beta^{(0)}}$,

and the alternative hypothesis as

$H_a\colon H\Delta\beta = \xi \ne 0$.

Let the approximate values $\beta^{(0)}$ have been chosen as (in meters):

$Z_1^{(0)} = \begin{pmatrix}100\\400\end{pmatrix}$, $Z_2^{(0)} = \begin{pmatrix}800\\650\end{pmatrix}$, $Z_3^{(0)} = \begin{pmatrix}250\\1100\end{pmatrix}$,

$X_1^{(0)} = \begin{pmatrix}200\\660\end{pmatrix}$, $X_2^{(0)} = \begin{pmatrix}320\\696\end{pmatrix}$, $X_3^{(0)} = \begin{pmatrix}400\\720\end{pmatrix}$, $X_4^{(0)} = \begin{pmatrix}510\\753\end{pmatrix}$.

In this case the terms in the linearized model are the following:

$f(\beta^{(0)}) = [278.56777, 368.80347, 438.63424, 541.02588, 600.08333, 482.19913, 406.07881, 307.74827, 442.83180, 410.01951, 408.53396, 433.60005, 743.30344, 715.89105, 710.63352]'$,

the design matrix $X = (X_1, X_2)$, where

$X_1 = \begin{pmatrix}
5.99149 & 15.57786 & 0 & 0 & 0 & 0 & 0\\
0 & 0 & 11.45579 & 15.41325 & 0 & 0 & 0\\
0 & 0 & 0 & 0 & 14.32419 & 15.27913 & 0\\
0 & 0 & 0 & 0 & 0 & 0 & 17.62686\\
-24.49320 & 0.40822 & 0 & 0 & 0 & 0 & 0\\
0 & 0 & -21.85889 & 2.09481 & 0 & 0 & 0\\
0 & 0 & 0 & 0 & -19.84974 & 3.47370 & 0\\
0 & 0 & 0 & 0 & 0 & 0 & -16.53104\\
-2.37602 & -20.90900 & 0 & 0 & 0 & 0 & 0\\
0 & 0 & 3.45697 & -19.95166 & 0 & 0 & 0\\
0 & 0 & 0 & 0 & 7.42125 & -18.80050 & 0\\
0 & 0 & 0 & 0 & 0 & 0 & 12.48615\\
0 & 0 & 0 & 0 & 0 & 0 & 0\\
0 & 0 & 0 & 0 & 0 & 0 & 0\\
0 & 0 & 0 & 0 & 0 & 0 & 0
\end{pmatrix}$,

$X_2 = \begin{pmatrix}
0 & -5.99149 & -15.57786 & 0 & 0 & 0 & 0\\
0 & -11.45579 & -15.41325 & 0 & 0 & 0 & 0\\
0 & -14.32419 & -15.27913 & 0 & 0 & 0 & 0\\
15.17629 & -17.62686 & -15.17629 & 0 & 0 & 0 & 0\\
0 & 0 & 0 & 24.49320 & -0.40822 & 0 & 0\\
0 & 0 & 0 & 21.85889 & -2.09481 & 0 & 0\\
0 & 0 & 0 & 19.84974 & -3.47370 & 0 & 0\\
5.87137 & 0 & 0 & 16.53104 & -5.87137 & 0 & 0\\
0 & 0 & 0 & 0 & 0 & 2.37602 & 20.90900\\
0 & 0 & 0 & 0 & 0 & -3.45697 & 19.95166\\
0 & 0 & 0 & 0 & 0 & -7.42125 & 18.80050\\
-16.66421 & 0 & 0 & 0 & 0 & -12.48615 & 16.66421\\
0 & -25.67527 & -9.16974 & 25.67527 & 9.16974 & 0 & 0\\
0 & -5.60619 & -26.16222 & 0 & 0 & 5.60619 & 26.16222\\
0 & 0 & 0 & 20.63193 & -16.88067 & -20.63193 & 16.88067
\end{pmatrix}$,

and the matrix $H = (H_1, 0_{3\times 6})$, where

$H_1 = \begin{pmatrix}
24 & -80 & -60 & 200 & 36 & -120 & 0 & 0\\
0 & 0 & 33 & -110 & -57 & 190 & 24 & -80\\
33 & -110 & -33 & 110 & -36 & 120 & 36 & -120
\end{pmatrix}$.

Let the simulated data of the observed distances be (in meters):

$y = [278.56778, 368.80346, 438.63423, 541.02588, 600.07120, 482.18593, 406.08812, 307.74839, 442.82535, 410.02757, 408.53628, 433.59015, 743.31683, 715.89395, 710.64831]'$.
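The whole example can be assembled numerically. The sketch below rebuilds $f$, $g$ and their Jacobians by central differences instead of the closed-form derivatives, orders the observations as $Z_iX_j$ ($i = 1, 2, 3$, $j = 1, \dots, 4$) followed by $Z_1Z_2$, $Z_1Z_3$, $Z_2Z_3$ (the ordering consistent with $f(\beta^{(0)})$ above), and feeds the linearized model to the `chi2_test_thm47` helper from Section 4. It is an illustration under these assumptions, not the paper's computation; in particular, the printed design matrix appears to carry an additional row scaling, so a raw Jacobian need not reproduce it digit for digit:

```python
import numpy as np

# beta: X1..X4 coordinates (beta_1..beta_8), then Z1..Z3 (beta_9..beta_14)
b0 = np.array([200., 660., 320., 696., 400., 720., 510., 753.,
               100., 400., 800., 650., 250., 1100.])

def f(b):
    # the 15 distances: Z_i X_j (i = 1..3, j = 1..4), then Z1Z2, Z1Z3, Z2Z3
    X = b[:8].reshape(4, 2)
    Z = b[8:].reshape(3, 2)
    d = [np.linalg.norm(Z[i] - X[j]) for i in range(3) for j in range(4)]
    d += [np.linalg.norm(Z[0] - Z[1]), np.linalg.norm(Z[0] - Z[2]),
          np.linalg.norm(Z[1] - Z[2])]
    return np.array(d)

def g(b):
    # g(beta) = 0 iff p1 = p2 = p3 (the collinearity conditions above)
    return np.array([
        (b[1]-b[3])*(b[2]-b[4]) - (b[0]-b[2])*(b[3]-b[5]),
        (b[3]-b[5])*(b[4]-b[6]) - (b[2]-b[4])*(b[5]-b[7]),
        (b[1]-b[3])*(b[4]-b[6]) - (b[0]-b[2])*(b[5]-b[7])])

def jacobian(fun, b, eps=1e-6):
    # central-difference Jacobian of fun at b
    J = np.zeros((fun(b).size, b.size))
    for j in range(b.size):
        e = np.zeros(b.size)
        e[j] = eps
        J[:, j] = (fun(b + e) - fun(b - e)) / (2 * eps)
    return J

sigma = 0.01
var = np.full(15, sigma ** 2)
var[[0, 1, 4, 5, 12]] = 0.0        # Z1X1, Z1X2, Z2X1, Z2X2, Z1Z2: sigma_1 = 0
Sigma = np.diag(var)

y = np.array([278.56778, 368.80346, 438.63423, 541.02588, 600.07120,
              482.18593, 406.08812, 307.74839, 442.82535, 410.02757,
              408.53628, 433.59015, 743.31683, 715.89395, 710.64831])

X = jacobian(f, b0)                # design matrix of the linearized model
H = jacobian(g, b0)                # hypothesis matrix, of the form (H_1, 0)
stat, fdeg = chi2_test_thm47(X, Sigma, y - f(b0), H, np.zeros(3))
```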
