NIEPARAMETRYCZNE METODY IDENTYFIKACJI MOMENTU ZMIANY ROZKŁADU

Niech dany będzie model

t =

1

, • • • f ni, t = m+

1

, . . ., n xt

-F

1

(x), F

2

(^x ),

o nieznanych dystrybuantach typu ciągłego F^(x) i F

2

(x). Często czynione są ograniczające założenia, że zmiana rozkładu polega na zmianie wartości parametru położenia (Bhattacharya i Johnson (1968), Schechtman (1982)) albo skali (Talwar i Gentle (1981), Ali i Gia- cotto (1982)), tj. przyjmuje się, że F

2

(x) = F^(x-&) bądź F

2

(x) =

= F1(t x).

4.1. Nieparametryczne testy do weryfikacji stabilności rozkładu

Rozpatrujemy testy do weryfikacji stabilności rozkładu

n", H

0

• nx-(- ^ F/| (x), t =

1

9 •^{• •}⁹

IDENTYFIKACJA MOMENTU ZMIANY ROZKŁADU W CI£GU ZMIENNYCH... 77 przeciwko alternatywie zmiany rozkładu

1

: it3m« (1 , n-

1

} takie, że x^

1

(x), F2 (x ),

t < m,

t > m."

Zależnie od tego, czy zakładamy F^(x)> F

2

(x) (x ^ jest stochas-tycznie większe niż xm ), F

1

(x) < F

2

(x) lub F

1

(x) ^ F

2

(x) wyróżnia-my alternatywy jednostronne i dwustronne. W przypadku zmiany para-metru położenia bądź skali zakłada się odpowiednio

0

, © <

0

, 0 / 0 bądź £ <1, f > 1, tr ^ 1 . Przyjmijmy najpierw alternatywy F1(x) > F2(^x).

Przy założeniu znanego, symetrycznego względem 0 rozkładu F^(x) 1 danego rozkładu a priori p^m), m =

1

, ..., n-

1

, momentu zmiany Bhattacharya i Johnson (1968) proponują odrzucać hipotezę o stabil-ności rozkładu, gdy

(4.1) 2 Qtsgn(xt ) Ef-f^v^ t^)/f1(v^ t^)] > c,x, t

=1

przy czym ftj (x) jest gęstością rozkładu F^x), v

^1

^ < ...

••• < - uporządkowaną próbą losową o rozkładzie z gęstością (x) dla x >

0

, r^ - rangą | x^_ j w uporządkowanej próbie

x-i| »••*» |xn | i

2

Dla rozkładów podwójnie wykładni-m

=1

czego, logistycznego i normalnego Bhattacharya i Johnson podają jawną postać statystyki testowej (4.1).

Przy ograniczeniu, że zmiana polega na przesunięciu rozkładu, tj. zachodzi F

2

(x) = F

1

(x- G ), 0>0, Bhattacharya i Johnson propo-nują odrzucać hipotezę HQ , jeżeli

^ (r ) (r )

(4.2) 2 Qt E [-f^ v t )/f1(v t )] > ccc*

=1

gdzie f

1

(x) jest gęstością F^x), rt - rangą xt , v v < v v < ...

... < v^n ^ oznaczają uporządkowaną próbę losową o rozkładzie F^(x), a jest zdefiniowane jak wyżej.

Dla obu testów (4.1) i (4.2) zostały podane warunki ich lokal-nej optymalności oraz asymptotyczlokal-nej normalności statystyk testo-wych. Wartości krytyczne c^ mogą być określone jako kwantyle rzę-du ^ck asymptotycznego rozkładu normalnego.

Punktem wyjścia konstrukcji kolejnych nieparametrycznych testów są testy Manna-Whitneya i Kołmogorowa-Smirnowa do testowania zgod-ności rozkładów dwóch prób losowych x^, ..., xm i x >|, ...» xn « Pettitt (1979) i Schechtman (1982) startują od statystyki Manna- -Whitneya.

Pettitt (1979) wykorzystuje statystykę Manna-Whitneya

m n

(

4

-3) um =

2

S sgn(x. - x.) i

=1

j=m

+1 i

i odrzuca hipotezę o stabilności rozkładu na korzyść rozpatrywanych jednostronnych lub dwustronnych alternatyw, jeżeli

IDENTYFIKACJA MOMENTU ZMIANY ROZKŁADU W CI£GU ZMIENNYCH... 79

K = n max

1

^m^n

-1

|Um > c* dla

H1

: F

1

(x) F

2

(x) (4.4) n max Um > < ^dla

H1

: F

1

(x) < F

2

(x)

1

$m^n

-1

■ - min \<mx<n

-1

^{Um > C}

0

C dla Hr F

1

(x) > F£ (^x)

Ponieważ rozkłady statystyk K^, K*, K~ są nieznane, jako wartości krytyczne wybieramy kwantyle rzędu oc ich rozkładów asymptotycz-nych opisaasymptotycz-nych przez poniższe zależności:

(4.5) lim P (n"1(3/(n+1))l/2K ^ a) =

n - > o o

H0

1 + 2

1

)rexp(-

2

a2 ), r

=1

lim P„ (n“

1

(3/(n+1))1/2IĆ ^ a) =

1

- exp(-

2

a2 ).

n-»oo

0

Prawa strona wyrażenia (4.5) przedstawia rozkład Kołmogorowa. K* i K^ mają takie same rozkłady.

Schechtman (1982) stosuje statystykę Manna-Whitneya

fSJVm /Um,n-m _ l w / _ n ±

1

_ j )

1/2

vm(n-m)

2

'' '

12

m(n-m' *

przy czym U m,n—in oznacza liczbę inwersji x,., ..., x względemi m x xm+

1

’ ***’ xn* Odrzuca hipotezę o stabilności rozkładu, jeżeli

V = max |V | > Cpf, przy : F

1

(x) f F

2

(x), n

1

x<mN<n

-1

V+ = max V > c* , przy H

1

: F

1

(x) < Fp (x).

1

x<m^n

-1

Dla n =

6

, 7,

8

, 9 i oc = 0.01, 0.05, 0.10 oblicza wartości kryty-czne jako kwantyle rzędu oc dokładnego rozkładu Vn i V* przy HQ . Dla n > 9 uzyskuje wartości krytyczne przez symulację.

Przy założeniu zmiany wartości parametru położenia Schechtman dokonuje porównania metodą Monte Carlo mocy testów bazujących na V i K oraz V+ i K+. Dochodzi do konkluzji, że należy przedkładać n n n tl

Kn bądź K*, gdy moment zmiany rozkładu spełnia m n

/2

oraz wybie-rać Vn bądź V*, gdy moment zmiany znajduje się blisko początku lub końca serii doświadczeń (m jest małe lub duże).

Talwar i Gentle (1981) rozpatrują przypadek zmiany parametru skali i stosują zmodyfikowaną statystykę Manna-Whitneya

m n

Um = Z Z sgn(|x. - x| - | - x|) i =

1

j=m

+1

z medianą x = m e d ^ , ...» x ). Jako statystyki testowe wykorzys-tują K , K* lub K“ , przy czym w (4.4) zamiast statystyk Um są wsta-wione zmodyfikowane statystyki U . Przeprowadzone przez nich bada-nia symulacyjne wykazują, że uzyskane w taki sposób testy są testa-mi konserwatywnytesta-mi odpornytesta-mi (na odchylenia od rozkładu normalnego).

Deshayes i Picard (1980) rozważają ogólny przypadek zmiany roz-kładu i wychodzą od statystyki Kołmogorowa-Smirnowa. Jeżeli rozkład

IDENTYFIKACJA MOMENTU ZMIANY ROZKŁADU W ClfGU ZMIENNYCH... 81

F^ (x) jest znany, to HQ zostaje odrzucona, gdy

sup sup Vn (1 - ~) | F

2

⁽^x ⁾ - F.(x) | > .

1

v<m<:n

-1

x *

Tu F

2

m (x) jest dystrybuantą empiryczną określoną dla próby loso-wej x ^, ..., x^. Jeżeli F^(x) jest nieznane, to stosowana jest dystrybuanta empiryczna F^ m (x ) z próby x^, ..., xm » Hipoteza Hq

zostaje odrzucona, gdy

sup sup V5T

(1 1

$m^n

-1

nu rn i

n' n IF2,,4X > - F

1

,m(x) ^{I >} cioc.

Ponieważ rozkłady obydwu statystyk są nieznane, jako są wybie-rane kwantyle rzędu oc ich asymptotycznych rozkładów analizowanych przez Deshayesa i Picarda (1980).

Ali i Giacotto (1982) przedstawiają nieparametryczny test do weryfikowania stabilności rozkładu wobec alternatyw niestabilności w formie szczególnej postaci zmian parametrów położenia albo skali.

Bhattacharya i Frierson (1981) proponują sekwencyjny nieparame-tryczny test skumulowanych sum, który opiera się na tzw. sekwencyj-nych rangach. Badają oni asymptotyczne zachowanie tych skumulowasekwencyj-nych sum i pokazują, że zaproponowane przez nich testy asymptotycznie osiągają poziom istotności oc .

4.2. Nieparametryczne estymatory momentu zmiany rozkładu

Nieparametryczne estymatory dla momentu zmiany rozkładu były zapro-ponowane przez Pettitta'(1980b, 1981). Opierają się one na

statys-tykach Manna-Whitneya U (4.3) i ich uogólnieniach:

-1 2

= arg max [m(n-m)] U ,

1

$m<n

m = arg max [m(n-m)J

1

W^,

1

<m<n

W = 2 w ( r .),

nr m ^ 0

J =

1

' = arg max [m(n-m

)]"1

V^, =

2 2

(x^x.),

~ "1S m Sm - ^

m n

1

<m<n i

=1

j=m

+1

gdzie r. oznacza rangę x. w ciągu x^, . .., x a w( ) i

2 2

(f( ) są da nymi funkcjami. Dla w(r.) = 2r. - 1 - n i ę?(x) = sgn(x) zachodziJ ^U W = V = U . Pettitt wybiera m m m

(j) = rUj/cn+D)

oraz

<f(x)

“

1

' x, jeśli | x A " | ^

5

A sgn(x) w przeciwnym przypadku,

A = med |x,-x,

1

^t<n t t+

k 1 oznacza tu odwrotną funkcję do dystrybuanty standardowego roz-kładu normalnego.

IDENTYFIKACJA MOMENTU ZMIANY ROZKŁADU W CI£GU ZMIENNYCH... 83

W celu porównania tych trzech estymatorów Pettitt przeprowadził badania metodą Monte Carlo dla klasy zaburzonych rozkładów normal-nych. Wówczas m okazał się najlepszym, a m^, - najgorszym rem. W przypadku rozkładów normalnych m^ jest najlepszym estymato-rem, ale różnica względem m jest nieznaczna. Przy rozkładach „o grubych ogonach" m jest jednakże lepszy niż mr . Rozkłady tych trzech estymatorów okazują się stosunkowo odpornymi przy szerokim wyborze rozkładów F^(x) i F£(x).

Wreszcie zwróćmy jeszcze uwagę na nieparametryczne estymatory bayesowskie, które Pettitt (1981) utworzył dla przypadku takich F^(x) i F£(x ), że pozwalają się poprzez monotoniczne przekształcę-nie sprowadzić do rozkładów normalnych N(0, 1) i N(0, 1)./

PRACE CYTOWANE

M.M. Ali, C. Giacotto (1982), The identical distribution hypothesis for market prices - location - and scale-shift alternatives, J. Amer. Statist. Assoc., 77, 19-28.

T.W. Anderson, D.A. Darling (1952), Asymptotic theory of certain

„Goodness-of-fit" criteria based on stochastic processes, Ann.

Math. Statist., 23, 193-212.

M. Bagshaw, R.A. Johnson (1975), The influence of reference values and estimated variance on the ARL of Cusum tests, J. Royal Sta-tist. Soc., Ser. B, 37, 413-420.

G.A. Barnard (1959), Control charts and stochastic processes, J. Royal Statist. Soc., Ser. B, 21, 239-271.

P.K. Bhattacharya, D. Frierson Jr. (1981), A nonparametric control chart for detecting small disorders, Ann. Statist., 9, 544-554.

GIK. Bhattacharya, R.A. Johnson (1968), Nonparametric tests for a shift at unknown timepoint, Ann. Math. Statist., 39, 1731-1743.

A.F. Bissel (1969), Cusum techniques for quality control (with dis-cussion), Appl. Statist., 18, 1-30.

L.N. Bol'Sev, N.V. Smirnov (1983), Tablicy matematiceskoj statis- tiki, Moskva.

D. Brook, D.A. Evans (1972), An approach to the probability distri-bution of Cusum run length, Biometrika, 59, 539-549.

R.L. Brown, J. Durbin, J.M. Evans (1975), Techniques for testing the constancy of regression relationships over time, J. Royal.

Statist. Soc., Ser. B, 37, 149-192.

H. Chernoff, S. Zacks (1964), Estimating the current mean of a nor-mal distribution which is subjected to changes in time, Ann.

Math. Statist., 35, 999-1018.

J. Deshayes, D. Picard (1979), Application aux tests de repture de regression, Asterisque, 68, 73-98.

J.. Deshayes, D. Picard (1980), Testing for a change in statistical models, Prepublication, University de Paris-Sud, Departm. d.

Mathematique, Bat 425, 91405 Orsay, France.

J. Diaz (1982), Bayesian detection of a change of scale parameter in a sequence of independent gamma random variables, J. Econo-metrics, 19, 23-29.

L.A. Gardner (1969), On detecting changes in the mean of normal va-riates, Ann. Math. Statist., 40, 116-126.

P.L. Goldsmith, H. Whitfield (1961), Average run lengths in cumula- tive charts quality control schemes, Technometrics, 3, 11-20.

I. Guttman, U. Menzefricke (1982), On the use of loss functions in the changepoint problem, Ann. Inst. Statist.- Math., 34, 319-326.

IDENTYFIKACJA MOMENTU ZMIANY ROZKŁADU W CI£GU ZMIENNYCH... 85

P. Hackl (1980), Testing the constancy of regression models over time, Gttttingen: Vandenhoeck u. Ruprecht, Angew. Statist, u.

nOkonometrie, H. 16.

D.M. Hawkins (1977), Testing a sequence of observations for a shift in location, J. Amer. Statist. Assoc., 72, 180-186.

D.V. Hinkley (1970), Inference about the change-point in a sequence of random variables, Biometrika, 57, 1-17.

D.V. Hinkley (1971), Inference about the change-point from cumula-tive sum tests, Biometrika, 58, 509-523.

D.V. Hinkley (1972), Time ordered classification, Biometrika, 59, 509-523.

D.V. Hinkley,-,E.A. Hinkley (1970), Inference about the change-point in a sequence of binomial variables, Biometrica, 57, 477-4-88.

D. Holbert, L. Broemling (1977), Bayesian inferences related to shifting sequences and two-phase regression, Comm. Statist.- Theor. Meth., A6, 265-275.

D.A. Hsu (1979), Detecting shifts of parameter in gamma sequences with application to stock price and air traffic flow analysis, J. Amer. Statist. Assoc., 74, 31-40.

D.A. Hsu (1982), A Bayesian robust detection of shift in the risk structure of stock market returns, J. Amer. Statist. Assoc., 77, 29-39.

N.L. Johnson (1961), A simple theoretical approach to cumulative sum control charts, J. Amer. Statist. Assoc., 55, 835-840.

Z. Kander, S. Zacks (1966), Test procedures for possible changes in parameters of statistical distributions occuring at unknown time points, Ann. Math. Statist., 37, 1196-1210.

K.W. Kemp (1961), The average run length of the Cusum charts when

a V-mask is used, J. Royal Statist. Soc., Ser. B, 23, 149-153.

K.W. Kemp (1962), The use of cumulative sums for sampling inspec- tion schemes, Appl. Statist., 11, 16-31.

K.W. Kemp (1971), Formal expresions which can be applied to Cusum charts, J. Royal Statist. Soc., Ser. B, 33, 311-360.

r.^a. Khan (1981), A note on Page's two-sided cumulative sum proce- dure, Biometrika, 68, 717-719.

j.M. Lucas, R.B. Crosier (1982a), Fast initial response for Cusum quality-control schemes: Give your Cusum a head start, Technome- .tries, 24, 199-205.

J.M. Lucas, R.B. Crosier (1982b), Robust Cusum: A robustness study for Cusum quality control schemes, Comm. Statist.-Theor. Meth., A11, 2669-2687.

I. B. MacNeill (1974), Tests for change of parameter at unknown ti-mes and distributions of some related functionals on Brownian motion, Ann. Statist., 2, 950-962.

R. Maronna, V.J. Yohai (1978), A bivariate test for the detection of a systematic change in mean, J. Amer. Statist. Assoc., 73, 640-645.

U. Menzefricke (1981), A Bayesian analysis of a change in the pre- cision of a sequence of independent normal random variables at an unknown time point, J. Royal Statist. Soc., Ser. C, 30, 141-146.

C.K. Mustafi (1968), Inference problems about parameters which are subjected to changes over time, Ann. Math. Statist., 39, 840- - 854.

J. Nadler, N.B. Robbins (1971), Some characteristics of Page's two-sided procedure for detecting a change in a location para-meter, Ann. Math. Statist., 42, 538-551.

IDENTYFIKACJA MOMENTU ZMIANY ROZKŁADU ¥ CI£GU ZMIENNYCH... 87

D. B. Owen (1973), Sbornik statisticeskich tablic, Moskva.

E. S. Page (1954), Continuous inspection schemes, Biometrika, 41, 100-114.

E.S. Page (1955), A test for a change in parameter occuring at an unknown point, Biometrika, 42, 523-526.

E.S. Page (1957), On problems in which a change of parameters oc-curs at an unknown point, Biometrika, 44, 248-252.

E.S. Page (1961), Cumulative sum charts, Technometrics, 3, 1-9.

E.S. Pearson, H.O. Hartley (1972), Biometrika Tables for Statisti-cians, Vol. 2, Cambridge University Press.

A.N. Pettitt (1979), A non-parametric approach to the change-point problem, J. Royal Statist. Soc., Ser. C, 28, 126-135.

A.N. Pettitt (1980a), A simple cumulative sum type statistic for”

the change-point problem with zero-one observations, Biometri-ka, 67, 79-84.

A.N. Pettitt (1980b), Some results on estimating a change-point using non-parametric type statistics, J. Statist. Comput. Si- mul., 11, 261-272.

A.N. Pettitt (1981), Posterior probabilities for a change-point using ranks, Biometrika, 68, 443-450.

J.B. Ramsey (1969), Tests for specification errors in classical li-near least-squared regression analysis, J. Royal Statist. Soc.,

Ser. B, 31, 350-371.

S.N. Roy (1953), On a heuristic method of test construction and its use in multivariate analysis, Ann. Math. Statist., 24, 220-238.

E. Schechtman (1982), A nonparametric test for detecting changes in location, Commun. Statist.-Theor. Meth., A11, 1475-1482.

U. Schulze (1982), Regressionsmodelle mit Zustandsflnderungen, Diss.

A., Forschungsber. Kybern./Math. d. AdW der DDR.

A.K. Sen, M.S. Srivastava (1973), On multivariate tests for detec-ting change in mean, Sankhya, A35, 173-186.

A. Sen, M.S. Srivastava (1975a), On tests for detecting change in mean, Ann. Statist., 3, 98-108.

A. Sen, M.S. Srivastava (1975b), On tests for detecting a change in mean when variance in unknown, Ann. Inst. Statist. Math., 27, 479-^86.

A. Sen, M.S. Srivastava (1975c), Some one-sided tests for change in level, Technometrics, 17, 61-64.

A.F.M. Smith (1975), A Bayesian approach to inference about a chan- ge-point in a sequence of random variables, Biometrika, 62, 407-416.

M.S. Srivastava (1980), On tests for detecting change in the multi- variate mean, In: NATO Conference, Triest, Italy, Ed. G. P.

Patii.

P.P. Talwar, J.E. Gentle (1981), Detecting a scale shift in a'ran- dom sequence at an unknown time point, Appl. Statist., 30, 301-304.

W. Thalheim (1977), Prtifung linearer Modelle, Diplomarbeit, Hum- boldt-Universitflt~ Berlin.

K.J. Worsley (1979), On the likelihood ratio test for a shifts in location of normal populations, J. Amer. Statist. Assoc., 74, 365-367.

K.J. Worsley (1983), The power of the likelihood ratio and cumula-tive sum tests for a change in a binomial probability, Biometri-ka, 70, 455-464.

S. Zacks (1980), Numerical determination of the distributions of stopping variables associated with sequential procedures for

IDENTYFIKACJA MOMENTU ZMIANY ROZKŁADU W CI£GU ZMIENNYCH... 89

detecting epochs of shift in distributions of discrete random variables, Commun. Statist.-Simul. Cornput., B9, 1-18.

S. Zacks (1981), The probability distribution and the expected va- lue of a stopping variable associated with one-sided Cusum pro-cedures for nonnegative valued random variables, Commun. Sta-tist. -Theor. Meth., A10, 2245-2258.

S. Zacks, Z. Barzily (1981), Bayes procedures for detecting a shift in the probability of success in a series of Bernoulli trials, J. Statist. Planning and Inference, 5, 107-119.

R. Zieliński (1972), Tablice statystyczne, Warszawa.

W dokumencie Jeśli jakość w procesie produkcyjnym jest stabilna, to xt mają identyczny rozkład o pewnej dystrybuancie Fx| (x) (Stron 44-57)