• Nie Znaleziono Wyników

The more formal description of the problem is given below.

N/A
N/A
Protected

Academic year: 2021

Share "The more formal description of the problem is given below."

Copied!
6
0
0

Pełen tekst

(1)

M. M E¸ C Z A R S K I (Warszawa)

STABILITY AND CONDITIONAL Γ -MINIMAXITY IN BAYESIAN INFERENCE

Abstract. Two concepts of optimality corresponding to Bayesian robust analysis are considered: conditional Γ -minimaxity and stability. Conditions for coincidence of optimal decisions of both kinds are stated.

1. In Bayesian statistical inference arbitrariness of a unique prior distri- bution is a permanent question. Robust Bayesian inference deals with the problem of expressing uncertainty of the prior information and of quantita- tive consequences of this uncertainty. A natural measure is width (oscilla- tion, diameter) of the range of a posterior quantity while the prior distribu- tion π runs over a class Γ of probability distributions. If the oscillation of the posterior quantity is small, then the presence of robustness with respect to the prior inexactness can be assured.

A natural goal of research are optimal decisions under a specified loss function and a class Γ of prior distributions, with an idea of optimality related to the robustness problem. The concept of conditional Γ -minimax actions was considered in DasGupta and Studden [3] and Betr` o and Ruggeri [1] and it was exhaustively substantiated therein. The idea of stability in Bayesian robust analysis was developed in M¸ eczarski and Zieli´ nski [5], with some additional results in M¸ eczarski [4] and in Boraty´ nska and M¸ eczarski [2].

The more formal description of the problem is given below.

Let (X , F , {P

θ

}

θ∈(Θ,B)

) be a statistical space, Θ ⊂ R. Let Γ be a class of probability distributions on (Θ, B), i.e. of prior distributions. It reflects the uncertainty of the prior. Let x ∈ R be a given observation and a ∈ A a decision (an action) about θ based on x, with the action space A ⊂ R being a compact interval. We consider a loss function L(θ, a) convex in a

1991 Mathematics Subject Classification: 62C10, 62F15, 62F35.

Key words and phrases: robust Bayesian analysis, conditional Γ -minimax action, stability of a statistical procedure.

(2)

and the corresponding expected posterior loss (the posterior risk) %(π, a) of the action a under the prior π (it depends on x).

The concept of conditional Γ -minimaxity is as follows: construct an action a

∈ A such that

sup

π∈Γ

%(π, a

) = inf

a∈A

sup

π∈Γ

%(π, a).

Such an action a

is termed a conditional Γ -minimax (CGM) action.

The concept of stability is as follows: construct an action a

#

∈ A such that

sup

π∈Γ

%(π, a

#

) − inf

π∈Γ

%(π, a

#

) = inf

a∈A

{sup

π∈Γ

%(π, a) − inf

π∈Γ

%(π, a)} , i.e. an action a

#

is said to be stable if it minimizes the oscillation of %(π, a) on Γ with respect to a ∈ A.

Solutions of some particular estimation problems show that CGM and stable actions may coincide. As seen in Boraty´ nska and M¸ eczarski [2] a stable solution may lead to large losses of the posterior risk. For that reason such coincidence is desirable and also the stability of CGM actions seems to be a favourable property.

2. The following theorem characterizes CGM actions.

Theorem 1 (Betr` o and Ruggeri [1]). Assume that %(π, a) is a strictly convex function of a for each π∈Γ . Let Π

a

={π

a

: %(π

a

, a)= sup

π∈Γ

%(π, a)}

be the set of least favourable priors for a decision a. Let a

Bπ

denote the Bayes action under π. If at b a ∈ A there exist π

1

and π

2

in Π

ˆa

such that a

Bπ1

≤ b a ≤ a

Bπ2

, then b a is a CGM action.

Therefore if the stable action satisfies the conditions of Theorem 1 then it is CGM. The problem is: when is the CGM action stable?

We assume hereafter that Γ = {π

α

: α ∈ [α

1

, α

2

] }.

Theorem 2. Let

(∀a ∈ A)(∀π

α

∈ Γ ) %(π

α

, a) = r(α, a) = (Aα + B − a)

2

+ Cα + D , with A 6= 0, B, C, D real constants. If Π

aˆ

= {π

α1

, π

α2

} for an action b a ∈ A, then b a is stable; conversely, Π

a#

= {π

α1

, π

α2

}.

P r o o f. Let α = (α

1

+ α

2

)/2 and α

min

(a) be the minimum point of

the function r(·, a). The elementary geometry of the quadratic curve implies

that the oscillation of r(·, a) over [α

1

, α

2

] is the least iff α

min

(a) = α or,

equivalently, r(α

1

, a) = r(α

2

, a), which is equivalent to Π

a

= {π

α1

, π

α2

}

because of the form of r. This yields the value a

#

which is the unique

(3)

solution. Since sup

π∈Γ

%(π, a) = sup

α∈[α12]

r(α, a) =  r(α

1

, a) if α

min

(a) ≥ α , r(α

2

, a) if α

min

(a) ≤ α ,

the conditions Π

aˆ

= {π

α1

, π

α2

} and r(α

1

, b a) = r(α

2

, b a) are equivalent, which ends the proof.

Corollary 1. Assume the conditions of Theorem 2. If the CGM action satisfies the conditions of Theorem 1, then it is stable. If the stable action is in the interval with endpoints a

Bπ

α1

and a

Bπ

α2

, then it is CGM.

The following examples show that the situation from Theorem 2 is real- istic.

Example 1 (M¸ eczarski and Zieli´ nski [5], Betr` o and Ruggeri [1]). The problem is to estimate the mean λ in the Poisson distribution P(λ), given an observation x, under the prior gamma G(α, β), α ∈ [α

1

, α

2

], β fixed and under the quadratic loss function. Thus

Γ = {G(α, β) : α ∈ [α

1

, α

2

], β is fixed} ,

%(π

α

, a) = r(α, a) =  α + x β + 1 − a



2

+ α + x (β + 1)

2

, a

Bπα

= α + x

β + 1 .

We have Π

ˆa

= {π

α1

, π

α2

} only for b a = (α + 1/2 + x)/(β + 1) and this is a stable action; if a

Bπ

α1

≤ b a ≤ a

Bπ

α2

, then b a is a CGM action.

Example 2 (DasGupta and Studden [3], Boraty´ nska and M¸ eczarski [2]).

The problem is to estimate the mean θ in the normal distribution N (θ, b

2

) with known b > 0, under the normal prior N (µ, σ

2

), where σ ∈ [σ

1

, σ

2

] ⊂ R

+

and under the quadratic loss. For α = (b

−2

+ σ

−2

)

−1

we can write Γ = {π

α

: α ∈ [α

1

, α

2

]}. Then

%(π

α

, a) = r(α, a) =



α x − µ

b

2

+ µ − a



2

+ α and

a

Bπα

= α x − µ b

2

+ µ . We have Π

ˆa

= {π

α1

, π

α2

} only for

b a = α x − µ b

2

+ 1

2 b

2

x − µ

and this is a stable action; if b a is in the closed interval with endpoints a

Bπ

α1

and a

Bπ

α2

, then b a is a CGM action. The last conclusions are valid for x 6= µ;

(4)

otherwise any a is stable and a = µ is CGM; therefore this CGM action is stable as well.

3. In Theorem 2 and Examples 1 and 2 the class Γ is defined by a real parameter α from a compact interval. The posterior risk is a quadratic function of α. By minimizing the oscillation of that function we arrive at the condition %(π

α1

, a) = %(π

α2

, a), which defines the stable action. From the shape of the function we obtain the conditions of Theorem 1 and the stability of the CGM action.

Let us consider the following example.

Example 3 (M¸ eczarski [4]). The problem is to estimate the parameter θ = e

−λ

in the Poisson distribution P(λ). The class of priors and the loss function are as in Example 1. Then

%(π

α

, a) = r(α, a) = (e

β1(α+x)

− a)

2

− e

1(α+x)

+ e

β2(α+x)

, β

i

= log β + 1

β + 1 + i , i = 1, 2 and a

Bπα

= e

β1(α+x)

.

By looking for inf

a∈A

sup

π∈Γ

%(π, a) we obtain Π

ˆa

= {π

α1

, π

α2

} only for the b a which is the unique solution of r(α

1

, b a) = r(α

2

, b a). If a

Bπ

α2

≤ b a ≤ a

Bπ

α1

, then b a is a CGM action. By investigating the monotonicity of the oscillation of r(·, a) over [α

1

, α

2

] we conclude that a

#

is its unique minimum point (a stable action) if and only if it satisfies r(α

1

, a

#

) = r(α

2

, a

#

).

Example 3 shows that the function r(·, a) need not be quadratic to obtain the considered connection between stability and conditional Γ -minimaxity.

As before, the condition r(α

1

, a) = r(α

2

, a) is a tool to construct an action with both properties.

Theorem 3. Define a function r by r(α, a) = %(π

α

, a). Assume that it satisfies the following conditions:

(a) r(α, ·) is strictly convex for any α;

(b) for any a the minimum point α

min

(a) of r(·, a) is unique and α

min

is a strictly monotone function of a;

(c) for any α and e e a such that α

min

( e a) = α we have e (∀a

0

< a

00

≤ e a) r( α, a e

00

) − r( α, a e

0

)

a

00

− a

0

< r(α

min

(a

00

), a

00

) − r(α

min

(a

0

), a

0

) a

00

− a

0

and

(∀a

00

> a

0

≥ e a) r( α, a e

00

) − r( α, a e

0

)

a

00

− a

0

> r(α

min

(a

00

), a

00

) − r(α

min

(a

0

), a

0

)

a

00

− a

0

;

(d) the function r(α

1

, a) − r(α

2

, a) is monotone in a.

Then the conclusion of Theorem 2 holds.

(5)

P r o o f. Denote the oscillation of r(·, a) over [α

1

, α

2

] by Q(a). Then

Q(a) =

 

 

 

 

 

 

r(α

2

, a) − r(α

1

, a) if α

min

(a) ≤ α

1

, r(α

2

, a) − r(α

min

(a), a) if α

1

< α

min

(a) < α

2

and r(α

1

, a) − r(α

2

, a) ≤ 0, r(α

1

, a) − r(α

min

(a), a) if α

1

< α

min

(a) < α

2

and r(α

1

, a) − r(α

2

, a) ≥ 0, r(α

1

, a) − r(α

2

, a) if α

min

(a) ≥ α

2

.

Let a

i

be a solution of α

min

(a) = α

i

, i = 1, 2. The type of monotonicity of the function r(α

1

, a) − r(α

2

, a) agrees with that of α

min

. Then for α

min

increasing we have

Q(a) =

 

 

 

 

 

 

r(α

2

, a) − r(α

1

, a) if a ≤ a

1

, r(α

2

, a) − r(α

min

(a), a) if a

1

< a < a

2

and r(α

1

, a) − r(α

2

, a) ≤ 0, r(α

1

, a) − r(α

min

(a), a) if a

1

< a < a

2

and r(α

1

, a) − r(α

2

, a) ≥ 0, r(α

1

, a) − r(α

2

, a) if a ≥ a

2

and for α

min

decreasing we have

Q(a) =

 

 

 

 

 

 

r(α

1

, a) − r(α

2

, a) if a ≤ a

2

, r(α

1

, a) − r(α

min

(a), a) if a

2

< a < a

1

and r(α

1

, a) − r(α

2

, a) ≥ 0, r(α

2

, a) − r(α

min

(a), a) if a

2

< a < a

1

and r(α

1

, a) − r(α

2

, a) ≤ 0, r(α

2

, a) − r(α

1

, a) if a ≥ a

1

.

Observe that the functions r(α

i

, a) − r(α

min

(a), a) have their unique min- imum points at a

i

and they are decreasing for a < a

i

and increasing for a > a

i

, i = 1, 2, respectively. This implies that Q has its unique minimum point at a

#

which is defined by the condition r(α

1

, a

#

) − r(α

2

, a

#

) = 0.

Now,

sup

α∈[α12]

r(α, a) =  r(α

1

, a) if r(α

1

, a) ≥ r(α

2

, a), r(α

2

, a) if r(α

1

, a) ≤ r(α

2

, a)

and therefore Π

a#

= {π

α1

, π

α2

}. Conversely, if Π

ˆa

= {π

α1

, π

α2

}, then b a = a

#

. This ends the proof.

Corollary 2. Assume the conditions of Theorem 3. Let the function f (α) = a

Bπα

be strictly monotone and f ([α

1

, α

2

]) = A. Then the conclusion of Corollary 1 holds.

Now we can see that the conditions of Theorem 3 and Corollary 2 hold

in Examples 1, 2 and 3.

(6)

References

[1] B. B e t r `o and F. R u g g e r i, Conditional Γ -minimax actions under convex losses, Comm. Statist. Theory Methods 21 (1992), 1051–1066.

[2] A. B o r a t y ´n s k a and M. M ¸e c z a r s k i, Robust Bayesian estimation in the one- dimensional normal model , submitted.

[3] A. D a s G u p t a and W. J. S t u d d e n, Frequentist behavior of robust Bayes estimates of normal means, Statist. Decisions 7 (1989), 333–361.

[4] M. M ¸e c z a r s k i, On stable Bayesian estimation in the Poisson model , Sci. Bull.

L´od´z Technical Univ. No. 687, Matematyka, z. 25 (1993), 83–91.

[5] M. M ¸e c z a r s k i and R. Z i e l i ´n s k i, Stability of the Bayesian estimator of the Poisson mean under the inexactly specified gamma prior , Statist. Probab. Lett. 12 (1991), 329–333.

MAREK ME¸ CZARSKI

INSTITUTE OF ECONOMETRICS WARSAW SCHOOL OF ECONOMICS AL. NIEPODLEG LO´SCI 162

02-554 WARSZAWA, POLAND

Received on 25.3.1993

Cytaty

Powiązane dokumenty

volvement  of  enterprises  in  CSR  activities  not  only  pleases  the  customers  and  positively  impacts  various  aspects  of  a  company’s  performance, 

Marianna Tomaszewska  University of Gdansk, Faculty of Economics  2 nd  year SS2 MSG, Międzynarodowe Strategie Marketingowe   

Our goal is to select the observational planes that must be used in order to obtain the most exhaustive characterization of the individual sources observed, in terms of

The aim of this study was to determine the beneficial effect of natural substances – enterocin M (Ent M; the proteinaceous substance produced by Enterococcus faecium CCM8558) and sage

kultury przeworskiej z młodszego okresu przedrzyniskiego i okresu wpływów rzymskich.. Badania trwały od 31VII do 1 IX

We then give an explicit example by showing that Lebesgue measure regarded as a function on the space of compact subsets of [0, 1], which is upper semicontinuous, is not

Dziś, kiedy mój syn wykształcił się w normalnych warunkach, kiedy wszyscy - od najwyższych władz oświatowych począwszy, a na rodzicach skończywszy - starają się tę

Skoro władze pan ´ stwowe, w tym władze prawodawcze, istnieja˛ce na tery- torium dzisiejszej Polski w latach 1944-1989 nie były władzami suwerennego Pan ´ stwa Polskiego,