• Nie Znaleziono Wyników

On the suprema of Bernoulli Processes

N/A
N/A
Protected

Academic year: 2021

Share "On the suprema of Bernoulli Processes"

Copied!
4
0
0

Pełen tekst

(1)

Received *****; accepted after revision +++++

Presented by Michel Talagrand

On the suprema of Bernoulli Processes

I

Witold Bednorz, Rafa l Lata la

Institute of Mathematics, University of Warsaw, Banacha 2, 02-097 Warszawa, Poland

Abstract

In this note we announce the affirmative solution of the so-called Bernoulli Conjecture concerning the characterization of the sample boundedness of Bernoulli processes. We also present some applications and discuss related open problems.

R´esum´e

Dans cette note nous annon¸cons la solution positive de la “Conjecture de Bernoulli ” concernant la caract´erisation des processus de Bernoulli born´es. Nous pr´esentons des applications et discutons de questions ouvertes reli´ees.

1. Introduction and Main Result

One of the fundamental issues of probability theory is the study of suprema of stochastic processes.

In particular in many situations one needs to estimate the quantity E supt∈TXt, where (Xt)t∈T is a stochastic process. (To avoid measurability problems one may either assume that T is countable or de- fine E supt∈TXt:= supFE supt∈FXt, where the supremum is taken over all finite sets F ⊂ T ). The modern approach to this problem is based on chaining techniques, present already in the works of Kolmogorov and successfully developed over the last 40 years (see the monographs [13] and [15]).

The most important case of centered Gaussian processes is well understood. In this case the boundedness of the process is related to the geometry of the metric space (T, d), where d(t, s) := (E(Xt− Xs)2)1/2. In 1967 R. Dudley [3] obtained an upper bound for E supt∈TXt in terms of entropy numbers and in 1975 X. Fernique [4] improved Dudley’s bound using so-called majorizing measures. In the seminal paper [10]

M. Talagrand showed that Fernique’s bound may be reversed and that for centered Gaussian processes (Xt), 1

2(T, d) ≤ E sup

t∈T

Xt≤ Lγ2(T, d),

where here and in the sequel L denotes a universal constant (which value may differ at each occurence).

There are many equivalent definitions of the functional γ2[12], for example one may set

γα(T, d) := inf sup

t∈T

X

n=0

2n/αd(t, Tn),

where the infimum runs over all sequences Tn of subsets of T such that |T0| = 1 and |Tn| ≤ 22n.

IResearch supported by the NCN grant DEC-2012/05/B/ST1/0041

Email addresses: wbednorz@mimuw.edu.pl (Witold Bednorz), rlatala@mimuw.edu.pl (Rafa l Lata la)

Preprint submitted to Comptes Rendus Mathematique February 25, 2013

(2)

Another fundamental class of processes is based on the Bernoulli sequence, i.e the sequence (εi)i≥1 of i.i.d. symmetric r.v’s taking values ±1. For t ∈ l2 the series Xt:=P

i≥1tiεi converges a.s. and for T ⊂ l2 we may define a Bernoulli process (Xt)t∈T and try to estimate b(T ) := E supt∈TXt. There are two easy ways to bound b(T ). The first is a consequence of the uniform bound |Xt| ≤ ktk1 = P

i≥1|ti|, so that b(T ) ≤ supt∈Tktk1. Another is based on the domination by the canonical Gaussian process Gt:=P

i≥1tigi, where giare i.i.d. N (0, 1) r.v’s. Indeed, assuming independence of (gi) and (εi), Jensen’s inequality implies

g(T ) := E sup

t∈T

X

i≥1

tigi= EX

i≥1

tiεi|gi| ≥ EX

i≥1

tiεiE|gi| = r2

πb(T ).

Obviously also if T ⊂ T1+ T2= {t1+ t2: ti ∈ Ti} then b(T ) ≤ b(T1) + b(T2), hence

b(T ) ≤ infn sup

t∈T1

ktk1+r π

2g(T2): T ⊂ T1+ T2

o≤ infn sup

t∈T1

ktk1+ Lγ2(T2): T ⊂ T1+ T2

o , where γ2(T ) = γ2(T, d2) and d2 is the l2-distance. It was open for about 25 years (under the name of Bernoulli conjecture) whether the above estimate may be reversed (see e.g. Problem 12 in [9] or Chapter 4 in [13]). The next theorem provides an affirmative answer.

Theorem 1. For any nonempty set T ⊂ l2(I) with b(T ) < ∞ one may find a decomposition T ⊂ T1+ T2

with supt∈T1P

i|ti| ≤ Lb(T ) and g(T2) ≤ Lb(T ).

As a corollary we obtain another useful criteria of boundedness for Bernoulli processes. For a random variable X and p > 0 we set kXkp:= (E|X|p)1/p.

Corollary 2. Suppose that (Xt)t∈T is a Bernoulli process with b(T ) < ∞. Then there exist t1, t2, . . . ∈ l2

such that T ⊂ conv{tn: n ≥ 1} and kXtnklog(n+2)≤ Lb(T ) for all n ≥ 1.

The converse statement easily follows follows from the union bound and Chebyshev’s inequality. Indeed, suppose that T ⊂ conv{tn: n ≥ 1} and kXtnklog(n+2)≤ M . Then for u ≥ 1,

P

 sup

t∈T

Xt≥ uM

≤ P sup

n≥1

Xtn≥ uM

≤X

n≥1

P(Xtn≥ ukXtnklog(n+2)) ≤X

n≥1

u− log(n+2)

and integration by parts easily yields E supt∈TXt≤ LM .

2. Ideas of the proof

In this section we try to briefly describe a few ideas behind the proof of Theorem 1, presented in details in [2]. The proof is not very long, but is quite technical. It builds on many ideas developed over the years by Michel Talagrand. As in the case of Gaussian processes it uses both concentration and minorization properties.

The main difficulty lies in the fact that there is no direct way of producing decomposition t = t1+ t2 for t ∈ T such that supt∈Tkt1k1 ≤ Lb(T ) and γ2({t2: t ∈ T }) ≤ Lb(T ). Following Talagrand we connect decompositions of the set T with suitable sequences of its partitions in the next theorem. We recall that an increasing sequence (An)n≥0 of partitions of T is called admissible if A0= {T } and |An| ≤ 22n. For such partitions and t ∈ T by An(t) we denote a unique set in An which contains t.

Theorem 3. Suppose that M > 0, r > 1, (An)n≥0 is an admissible sequence of partitions of T , and for each A ∈ An there exists an integer jn(A) and a point πn(A) ∈ T satisfying the following assumptions:

i) kt − sk2≤ M r−j0(T ) for t, s ∈ T ,

ii) if n ≥ 1, An 3 A ⊂ A0∈ An−1then either jn(A) = jn−1(A) and πn(A) = πn−1(A) or jn(A) > jn−1(A0), πn(A) ∈ A0 and

X

i∈In(A)

min{(ti− πn(A)i)2, r−2jn(A)} ≤ M 2nr−2jn(A) for all t ∈ A,

2

(3)

where for any t ∈ A,

In(A) = In(t) :=i ≥ 1: |πk+1(Ak+1(t))i− πk(Ak(t))i| ≤ r−jk(Ak(t)) for 0 ≤ k ≤ n − 1 . Then there exist sets T1, T2⊂ l2 such that T ⊂ T1+ T2 and

sup

t1∈T1

kt1k1≤ LM sup

t∈T

X

n=0

2nr−jn(An(t)) and γ2(T2) ≤ L

√ M sup

t∈T

X

n=0

2nr−jn(An(t)).

The main novelty here is the introduction of sets In(A). To build such partition we use another idea of Talagrand and construct functionals on nonempty subsets of T that satisfy certain growth conditions. Our functionals depend not only on two integer-valued parameters, but also on the subset. Their construction combines Talagrand’s “chopping maps” and ideas from [7]. The key ingredient to show the growth condition is the following modification of Proposition 1 in [7], which is based on concentration and minorization properties of Bernoulli processes.

Proposition 4. Let J ⊂ N, an integer m ≥ 2, σ > 0 and T ⊂ l2 be such that

 X

i∈J

(ti− si)21/2

≤ 1

Lσ and kt − sk≤ σ

√log m for all t, s ∈ T.

Then there exist t1, . . . , tm∈ T such that either T ⊂S

l≤mB(tl, σ) or the set S := T \S

l≤mB(tl, σ) satisfies bJ(S) := E sup

t∈S

X

i∈J

tiεi≤ b(T ) − 1 Lσp

log m.

The crucial point here is that we make no assumption about the diameter of the set T with respect to the l2 distance.

3. Selected applications

In this section we present two consequences of Theorem 1. The first shows that if a Bernoulli vector Y weakly dominates random vector X then Y strongly dominates X (cf. [8]).

Corollary 5. Let X, Y be random vectors in a separable Banach space (F, k · k) such that Y =P

i≥1uiεi

for some vectors ui∈ F and

P(|ϕ(X)| ≥ u) ≤ P(|ϕ(Y )| ≥ u) for all ϕ ∈ F, u > 0.

Then there exists universal constant L such that

P(kXk ≥ u) ≤ LP(kY k ≥ u/L) for all u > 0.

Another result is a Levy-Ottaviani type maximal inequality for VC-classes (see [6] for details). Recall that a class C of subsets of I is called a Vapnik-Chervonenkis class (or in short a VC-class) of order d if for any set A ⊂ I of cardinality d + 1 we have |{C ∩ A: C ∈ C}| < 2d+1.

Theorem 6. Let (Xi)i∈I be independent random variables in a separable Banach space (F, k · k) such that

|{i: Xi6= 0}| < ∞ a.s. and C be a countable VC-class of subsets of I of order d. Then P

 sup

C∈C

X

i∈C

Xi

≥ u

≤ K(d) sup

C∈C∪{I}

P



X

i∈C

Xi

≥ u

K(d)



for u > 0,

where K(d) is a constant that depends only on d. Moreover if the variables Xi are symmetric then P

 sup

C∈C

X

i∈C

Xi

≥ u

≤ K(d)P

X

i∈I

Xi

≥ u

for u > 0.

3

(4)

4. Further questions

The following generalization of the Bernoulli Conjecture was formulated by S.Kwapie´n (private commu- nication).

Problem 7. Let (F, k · k) be a normed space and (ui) be a sequence of vectors in F such that the series P

i≥1uiεiconverges a.s. Does there exist a universal constant L and a decomposition ui= vi+ wi such that E

X

i≥1

vigi ≤ LE

X

i≥1

uiεi

and sup

ηi=±1

X

i≥1

wiηi ≤ LE

X

i≥1

uiεi ? Theorem 1 states that the answer is positive for F = l.

It is natural to ask for bounds on suprema for another classes of stochastic processes. The majorizing measure upper bound works in quite general situations [1], however two-sided estimates are only known in very few cases. For “canonical processes” of the form Xt=P

i≥1tiXi, where Xi are independent centered r.v’s results in the spirit of Corollary 2 were obtained for certain symmetric variables with log-concave tails [11, 5]. A basic important class of canonical processes worth investigation is based on two points-valued r.v.’s. The following conjecture was stated by M. Talagrand (dp(t, s) = kt − skp denotes the lp-distance).

Conjecture 8. Let 0 < δ ≤ 1/2, δibe independent random variables such that P(δi= 1) = δ = 1−P(δi= 0) and δ(T ) := E supt∈T

P

i≥1tii − δ) for T ⊂ l2. Then for any set T with δ(T ) < ∞ one may find a decomposition T ⊂ T1+ T2 such that

δγ2(T1, d2) ≤ Lδ(T ), γ1(T1, d) ≤ Lδ(T ) and E sup

t∈T2

X

i≥1

|tii≤ Lδ(T ).

It may be showed that for δ = 1/2 the above conjecture follows from Theorem 1. Moreover Bernstein’s inequality and a generic chaining argument imply that δ(T ) ≤ L(√

δγ2(T, d2) + γ1(T, d)) (see [14]), so Conjecture 8 would result in a two-sided bound on δ(T ).

References

[1] W. Bednorz, A theorem on majorizing measures, Ann. Probab. 34 (2006), 1771–1781.

[2] W. Bednorz, R. Lata la, On the boundedness of Bernoulli processes, preprint.

[3] R. M. Dudley, The sizes of compact subsets of Hilbert space and continuity of Gaussian processes, J. Functional Analysis 1 (1967), 290–330.

[4] X. Fernique, Regularit´e des trajectoires des fonctions al´eatoires gaussiennes, ´Ecole d’ ´Et´e de Probabilit´es de Saint-Flour, IV-1974, Lecture Notes in Mathematics 480, 1–96, Springer, Berlin, 1975.

[5] R. Lata la, Sudakov minoration principle and supremum of some processes, Geom. Funct. Anal. 7 (1997), 936–953.

[6] R. Lata la, A note on the maximal inequalities for VC classes, Advances in stochastic inequalities (Atlanta, GA, 1997), 125–134, Contemp. Math. 234, Amer. Math. Soc., Providence, RI, 1999.

[7] R. Lata la, On the boundedness of Bernoulli processes over thin sets, Electron. Commun. Probab. 13 (2008), 175–186.

[8] R. Lata la, On weak tail domination of random vectors, Bull. Pol. Acad. Sci. Math. 57 (2009), 75–80.

[9] M. Ledoux, M. Talagrand, Probability in Banach spaces. Isoperimetry and processes, Springer-Verlag, Berlin, 1991.

[10] M. Talagrand, Regularity of Gaussian processes, Acta Math. 159 (1987), 99–149.

[11] M. Talagrand, The supremum of some canonical processes, Amer. J. Math. 116 (1994), 284–325.

[12] M. Talagrand, Majorizing measures without measures, Ann. Probab. 29 (2001), 411–417.

[13] M. Talagrand, The generic chaining. Upper and lower bounds of stochastic processes, Springer-Verlag, Berlin, 2005.

[14] M. Talagrand, Chaining and the geometry of stochastic processes, Proceedings of the 6th European Congress of Mathe- matics, to appear.

[15] M. Talagrand, Upper and lower bounds for stochastic processes. Modern methods and classical problems, Springer-Verlag, to appear.

4

Cytaty

Powiązane dokumenty

The strong Markov property implies that the conditional probability we will return one more time given that we have returned k − 1 times is ρ yy.. Thus, eventually the Markov chain

The first is that the existence of a majorizing measure is sufficient for the existence of a sequence of admissible nets (as recently introduced by Talagrand [5]), and the second

be independent identically distributed random variables having the uniform distribution in [0, 1 ], and N a Poisson random variable independent of ^%(r), fe/J and Xx,X2,....

Payne potw ierdza bowiem w swych badaniach, że strategicznym celem p artii komunistycznej było zap ro ­ wadzenie w Hiszpanii systemu sowieckiego, porównywalnego z systemem powstałym

First, we would like to find fairly general assumptions that will allow us to reverse inequality (1), that is we want to obtain the Sudakov minoration principle for a large class

Naast de mogelijkheid van het (cyclisch) winnen van klei uit de Dollard kwelders werd door de stakeholders het winnen van klei uit dichtgeslibde slenken, het gebruiken van klei

Abstract We present a method to determine lower and upper bounds to the predicted production or any other economic objective from history-matched reservoir mod- els. The method

Making use of dif­ ferent properties of the characteristic function, empirical dis­ tribution function and empirical characteristic function we dis­ cuss