• Nie Znaleziono Wyników

On Some Estimator of Finite Population Skewness Under Nonresponse

N/A
N/A
Protected

Academic year: 2021

Share "On Some Estimator of Finite Population Skewness Under Nonresponse"

Copied!
8
0
0

Pełen tekst

(1)

A C T A U N I V E R S I T A T I S L O D Z I E N S I S FOLIA OECONOMICA 228, 2009___________

* W ojciech G am rot

ON SOME ESTIM ATOR OF FINITE POPULATION

SKEWNESS UNDER NONRESPONSE

A bstract. One o f the most popular measures for the assymetry o f distribution is the coefficient o f skewness computed by standarizing the third central moment about the mean. In this paper the well-known two-phase sampling procedure is applied to estimate the finite population skewness under nonresponse. An estimator o f this parameter is constructed as a function o f well-known unbiased estimators o f population totals. The properties o f proposed estimator are investigated by the simulation study. The data ob-tained from agricultural census in boroughs o f the Dąbrowa Tarnowska district is used in simulations.

Key words: finite population skewness, incomplete data, estimation.

Consider finite population U={mi,...,«//}• Let X be some population charac-teristic taking fixed values X\,...j c n. This paper focuses on the estimation o f population skewness coefficient given by formula:

and X = V ' 1 Z » * - In order to estimate this parameter a simple random sam-ple s o f size n is drawn without replacement from U. It is assumed, that nonre-sponse appears in the survey and as a result the sample splits into two subsets Si and s2 o f sizes П] and n2 such that units from Si respond while units from

I. INTRODUCTION

where:

* Ph.D., Department of Statistics, University o f Economics, Katowice.

(2)

S2 do not. Data incompleteness is treated as a random phenomenon described by means o f response distribution ^(äi|í) = = q fa & ls ) determining individual response probabilities p i<[S = \ s ) (see. Cassel et al. 1983). To compen-sate for nonrespondent underrepresentation another phase o f the survey is car-ried out with simple subsample s ’ o f size n ’=cn2 being drawn without replace-ment from 52- It is assumed that all subsampled units respond when re-contacted.

II. ESTIM ATIO N

Let us consider the population total o f the Л-th power o f X defined by the following expression:

ieU

Let us also consider the double-sample-based statistic:

fh -n

/ \

N

W €Jj ies' )

Särndal et al (1992) shows that it is unbiased for th irrespective o f the under-lying response distribution, and provides the formula for the variance o f th as well as its unbiased estimator. In particular for h = 0 we have th = N and t0 is an unbiased estimator o f n.

In order to construct the estimator o f skewness coefficient, let us express this parameter as the following function o f population totals:

( V 2 - 'ľ ) 3/2

Now let us replace unknown totals with their respective unbiased estimators computed from the two-phase sample. Hence, we obtain the following estimator o f Лу.

í2í t í , i í3

Я2F =■/0% - З у , / 2 + 2/г

;2чЗ/2

(3)

ON SOME ESTIMATOR OF FINITE POPULATION SKEWNESS. 37

A simulation study was carried out to assess the properties o f the proposed estimator and to compare it with the single-phase based estimator:

í _ tąttyn — "F 2í|*

UN~

(ŕ0J2. - t f )3/2

which is constructed by replacing each unknown total with an uncorrected esti-mator:

The results o f the study are presented in the following paragraph.

111. SIM U LA TIO N RESULTS

During simulations, the data describing 2420 households acquired during the 1996’ Polish agricultural census in three boroughs o f the Dąbrowa Tar-nowska district represented the population under study. The farm area was used as the study variable. The nonresponse was assumed to follow logistic model with units responding independently and individual response probabilities re-spectively equal to p,\s = pi= (1 + exp(ßo + ßixi)) for ie U, with and ß\ being arbitrarily chosen constants. Several sample - subsample pairs were repeatedly drawn from the population using the two-phase sampling design involving sim-ple random sampling without replacement in both phases. The subsamsim-ple size was always equal to 30% o f the nonrespondent subset size. The empirical distri-bution o f 106 estimates served as a basis for assessing estimator properties. The first experiment was carried out for ß , = 0 and Д = 0 (this represents response probabilities unrelated to the characteristic under study). The relative efficiency o f estimators (ratio o f their M SE’s), their bias and share o f the bias in the mean square error are shown on pic. 1—3. The recorded relative efficiency is below unity which means that the estimator is more accurate than

XUN,

but the gain in accuracy is modest. The improvement is most substantial for n > 200. Biases o f both estimators are negative and diminish when n grows, with the bias o f estimator ä2f being somehow smaller. The share o f bias in MSE also quickly diminishes for both estimators.

(4)

Pic. 3. Share o f bias in MSE for ß0= О, Д = 0 <2 0,93

Pic. 1. Relative efficiency for ß 0 = 0, /?, = 0

Pic. 2. Bias for До = 0, Д = 0 .2 -0,3

(5)

ON SOME ESTIMATOR OF FINITE POPULATION SKEWNESS. 39

The second experiment was carried out for Д>= 0 and ß \ - -1 , which means that response probabilities increased with growing values o f the characteristic under study. The relative efficiency o f estimators, their bias and share o f the bias in the mean square error are shown on pic. 4-6. Now relative efficiency takes values below 0.5 which indicates substantial gains in precision resulting from the use o f second phase data, and it decreases when n grows. Also the bias o f the two-phase-based estimator is significantly lower than the dramatically high bias o f the uncorrected one. The shares o f bias are at extremes - respectively below 20% and above 60% which strongly favors the proposed estimator.

(6)

The third experiment was carried out for ßo = 0 and ß\ = 1, which corre-sponds to the situation where response probabilities decrease when values o f the characteristic under study grow. The relative efficiency o f estimators, their bias and share o f the bias in the mean square error are shown on pic. 7-9. The results are both surprising and disappointing: the relative efficiency exceeds unity and grows with n. So the use o f second phase data actually distorts the estimates instead o f improving them. The comparison o f biases shows, that the effect o f bias reduction is obtained only for quite large samples (n > 160) although for large samples the reduction is significant.

(7)

ON SOME ESTIMATOR OF FINITE POPULATION SKEWNESS. 41

Pic. 9. Share o f bias in MSE for/70= 0, ß\ = 1

IV. CONCLUSIONS

Presented simulation results indicate, that the two-phase sampling procedure has the potential to improve properties o f estimates for the finite population skewness under nonresponse. It is particularly useful to reduce the nonresponse bias. However, special care should be taken in the situation when individual response probabilities are negatively correlated with the characteristic under study. Analytical studies are necessary to determine conditions where the two- phase sampling is preferable to other estimation methods..

(8)

REFERENCES

Bethlehem J.G. (1988) Reduction o f Nonresponse Bias Through Regression Estimation Journal o f Official Statistics, Vol. 4, No. 3, 251-160.

Cassel C.M. Sämdal C.E. Wretman J.H. (1983), Some Uses o f Statistical M odels in Connection with the Nonresponse Problem [w:] Incomplete Data in Sample Surveys W.G. Madow, I.Olkin (red.), Academic Press New York.

Sämdal C.E. Swensson B. Wretman J.H. (1992), M odel A ssisted Survey Sampling Springer Verlag New York.

Srinath K.P. (1971), M ultiphase Sampling in Nonresponse Problems. Journal o f the American Statistical Association, Vol. 66, No 335, 583-586.

Wojciech Gamrot

O PEW NYM E STY M A TO R ZE W SPÓ ŁC ZY N N IK A SKOŚNOŚCI PRZY BRAK ACH ODPOW IEDZI

Jedną z najpopularniejszych miar asymetrii rozkładu cechy w populacji jest współ-czynnik skośności wyznaczany poprzez standaryzację trzeciego momentu centralnego względem średniej. W niniejszej pracy rozważono wykorzystanie powszechnie znanej procedury losowania dwufazowego do szacowania współczynnika skośności w populacji skończonej przy brakach odpowiedzi. Zaproponowano estymator tego współczynnika będący funkcją znanych nieobciążonych estymatorów wartości globalnych cechy w populacji. Własności skonstruowanego estymatora zbadano w drodze symulacji kom-puterowych. W eksperymentach wykorzystano dane uzyskane podczas spisu rolnego w wybranych gminach powiatu Dąbrowa Tarnowska.

Cytaty

Powiązane dokumenty

Bartoszewicz [10] has remarked that many stochastic orderings, among them the q-ordering, are preserved by the total time on test transform, used in reliability theory.. The

Mutation rates had a more substantial effect on MRSE (line colors in Fig. 5) than the maximum number of allelic states at a locus (line types in Fig. 5; Table S1), except for H MLE

On Some Generalization of the Well-known Class of Bounded Univalent Functions 51 is an extremal function in many questions investigated in the class S(M).. It is evident that,

Dlatego tak istotne jest dla życia społeczeństwa, by każdy człowiek, także ten, który pobiera najniższe wynagrodzene, mógł z pracy na jednym etacie zaspokoić potrzeby

The aim of this study is to enhance bone regeneration of large bone defects using porous titanium scaffolds incorpo- rated with nanostructured colloidal gelatin gels for time-

With this partial introductory information the proof of the global exis- tence becomes much simpler, and also suitable time independent estimate of the solutions (necessary

Keywords: linear models, least squares estimator, strong onsisten y

Chistyakov, Superposition operators in the algebra of functions of two variables with finite total variation, Monatshefte Math.. Adams, On definitions of bounded variation for