Second order PoincarÃ© inequalities and CLTs on ...

Viewer
Transcript

Second order Poincaré inequalities and CLTs on Wiener space ∗

† and Gesine Reinert‡

by Ivan Nourdin , Giovanni Peccati

Université Paris VI, Université Paris Ouest and Oxford University Abstract: We prove innite-dimensional second order Poincaré inequalities on Wiener space, thus

closing a circle of ideas linking limit theorems for functionals of Gaussian elds, Stein's method and Malliavin calculus. We provide two applications: (i) to a new second order characterization of CLTs on a xed Wiener chaos, and (ii) to linear functionals of Gaussian-subordinated elds.

Key words:

central limit theorems; isonormal Gaussian processes; linear functionals; multi-

ple integrals; second order Poincaré inequalities; Stein's method; Wiener chaos

2000 Mathematics Subject Classication:

60F05; 60G15; 60H07

1 Introduction Let

N ∼ N (0, 1)

be a standard Gaussian random variable. In its most basic formulation, the

Gaussian Poincaré inequality states that, for every dierentiable function

Varf (N ) 6 Ef 0 (N )2 , with equality if and only if

f : R → R, (1.1)

f

is ane. The estimate (1.1) is a fundamental tool of stochastic

analysis: it implies that, if the random variable

f 0 (N ) has a small L2 (Ω) norm, then f (N ) has

necessarily small uctuations. Relation (1.1) has been rst proved by Nash in [14], and then rediscovered by Cherno in [9] (both proofs use Hermite polynomials). The Gaussian Poincaré inequality admits extensions in several directions, encompassing both the case of smooth functionals of multi-dimensional (and possibly innite-dimensional) Gaussian elds, and of

et al. [1], Bobkov [2], Cacoullos et al., Chen [5, 6, 7], Houdré and Perez-Abreu [10], and the references therein. In particular, the

non-Gaussian probability distributions see e.g. Bakry

results proved in [10] (which make use of the Malliavin calculus) allow to recover the following innite-dimensional version of (1.1). real separable Hilbert space

H

with values

X

be an isonormal Gaussian process over some

(see Section 2), and let

X . Then, the Malliavin in H, and it holds that

functional of

Let

derivative of

VarF 6 EkDF k2H , with equality if and only if chaos of

X.

F,

F ∈ D1,2

be a Malliavin-dierentiable

denoted by

DF ,

is a random element

(1.2)

F

has the form of a constant plus an element of the rst Wiener

In Proposition 3.1 below we shall prove a more general version of (1.2), involving

central moments of arbitrary even orders and based on the techniques developed in [16]. Note

Laboratoire de Probabilités et Modèles Aléatoires, Université Pierre et Marie Curie (Paris VI), Boîte courrier 188, 4 place Jussieu, 75252 Paris Cedex 05, France. Email: [email protected] † Equipe Modal'X, Université Paris Ouest Nanterre la Défense, 200 Avenue de la République, 92000 Nanterre, and LSTA, Université Paris VI, France. Email: [email protected] ‡ Department of Statistics, University of Oxford, 1 South Parks Road, Oxford OX1 3TG, UK. Email [email protected] ∗

1

that (1.2) contains as a special case the well-known fact that, if function of i.i.d.

N (0, 1)

random variables

X1 , ..., Xd ,

F = f (X1 , ..., Xd ) is a smooth

then

VarF 6 Ek∇f (X1 , ..., Xd )k2Rd , where

∇f

is the gradient of

(1.3)

f.

Now suppose that the random variable i.i.d.

N (0, 1))

is such that

f

F = f (X1 , ..., Xd )

X1 , ..., Xd

are again

is twice dierentiable. In the recent paper [4], Chatterjee has

pointed out that if one focuses also on the

∇f ,

(where the

d×d

Hessian matrix

Hess f ,

and not only on

then one can state an inequality assessing the total variation distance (see Section 3.2,

(3.21)) between the law of and variance.

F

and the law of a Gaussian random variable with matching mean

The precise result goes as follows (see [4, Theorem 2.2]).

Let

E(F ) = µ,

> 0, Z ∼ N (µ, σ), and denote by dT V (F, Z) the total variation distance F and Z , see (3.21). Then √ 1 1 2 5 dT V (F, Z) 6 2 E[kHess f (X1 , ..., Xd )k4op ] 4 × E[k∇f (X1 , ..., Xd )k4Rd ] 4 , σ

VarF =

σ2

between

the laws of

where

kHess f (X1 , ..., Xd )kop

is the operator norm of the (random) matrix

(1.4)

Hessf (X1 , ..., Xd ).

A relation such as (1.4) is called a second order Poincaré inequality: it is proved in [4] by combining (1.3) with an adequate version of Stein's method (see e.g. [8, 24]). In [16, Remark 3.6] the rst two authors of the present paper pointed out that the nitedimensional Stein-type inequalities leading to Relation (1.4) are special instances of much more general estimates, which can be obtained by combining Stein's method and Malliavin calculus on an innite-dimensional Gaussian space. It is therefore natural to ask whether the results of [16] can be used in order to obtain a general version of (1.4), involving a distance to Gaussian for smooth functionals of arbitrary innite-dimensional Gaussian elds. We shall show that the answer is positive. Indeed, one of the principal achievements of this paper is the proof of the following statement (dW denotes the Wasserstein distance, see (3.22)):

Theorem 1.1 (Second order innite-dimensional Poincaré inequality) Let isonormal Gaussian process over some real separable Hilbert space H, and let F ∈ that E (F ) = µ and Var (F ) = σ 2 > 0. Let Z ∼ N (µ, σ 2 ). Then dW

√ h i1 10 2 4 41 4 (F, Z) 6 E kD F kop × E kDF k4H . 2σ

D2,4 .

X be an

Assume

(1.5)

If, in addition, the law of F is absolutely continuous with respect to the Lebesgue measure, then dT V

√ h i1 10 2 4 14 4 4 (F, Z) 6 2 E kD F kop × E kDF kH . σ

The class 2; note that of

H

D2,4 D2 F

(1.6)

of twice Malliavin-dierentiable functionals is formally dened in Section is a random element with values in

with itself ) and that we used

2

D F op

H 2

to indicate the operator norm (or, equivalently,

the spectral radius) of the random Hilbert-Schmidt operator Theorem 1.1 is detailed in Section 4.1.

(the symmetric tensor product

f 7→ hf, D2 F iH .

The proof of

As discussed in Section 4.2, a crucial point is that

Theorem 1.1 leads to further (and very useful) inequalities, which we name random contraction

2

inequalities. These estimates involve a contracted version of the second derivative

D2 F , and

will lead (see Section 5) to the proof of new necessary and sucient conditions which ensure that a sequence of random variables belonging to xed Wiener chaos converges in law to a standard Gaussian random variable. This result generalizes and unies the ndings contained in [16, 20, 21, 23], and virtually closes a very fruitful circle of recent ideas linking Malliavin calculus, Stein's method and central limit theorems (CLTs) on Wiener space (see also [15]). The role of contraction inequalities is further explored in Section 6, where we study CLTs for linear functionals of Gaussian subordinated elds. The rest of the paper is organized as follows. In Section 2 we recall some preliminary results involving Malliavin operators. Section 3 concerns Poincaré type inequalities and bounds on distances between probabilities.

Section 4 deals with the proof of Theorem 1.1, as well as

with random contraction inequalities. Section 5 and Section 6 focus, respectively, on CLTs on Wiener chaos and on CLTs for Gaussian subordinated elds. Finally, Section 7 is devoted to a version of (1.5) for random variables of the type

F = (F1 , . . . , Fd ).

2 Preliminaries We shall now present the basic elements of Gaussian analysis and Malliavin calculus that are used in this paper. The reader is referred to the two monographs by Malliavin [12] and Nualart [19] for any unexplained denition or result.

H be a real separable Hilbert space. For any q > 1 let H⊗q be the q th tensor product of H and denote by H q the associated q th symmetric tensor product. We write X = {X(h), h ∈ H} to indicate an isonormal Gaussian process over H, dened on some probability space (Ω, F, P ). This means that X is a centered Gaussian family, whose covariance is given in terms of the inner product of H by E [X(h)X(g)] = hh, giH . We also assume that F is generated by X . For every q > 1, let Hq be the q th Wiener chaos of X , that is, the closed linear subspace 2 of L (Ω, F, P ) generated by the random variables of the type {Hq (X(h)), h ∈ H, khkH = 1}, 2 2 q x dq e− x2 . We write where Hq is the q th Hermite polynomial dened as Hq (x) = (−1) e 2 dxq ⊗q ) = q!H (X(h)) can be extended by convention H0 = R. For any q > 1, the mapping Iq (h q q equipped with the modied to a linear isometry between the symmetric tensor product H √ norm q! k·kH⊗q and the q th Wiener chaos Hq . For q = 0 we write I0 (c) = c, c ∈ R. 2 It is well-known (Wiener chaos expansion) that L (Ω, F, P ) can be decomposed into the innite orthogonal sum of the spaces Hq . Therefore, any square integrable random variable F ∈ L2 (Ω, F, P ) admits the following chaotic expansion Let

F =

∞ X

Iq (fq ),

(2.7)

q=0

f0 = E[F ], and the fq ∈ H q , q > 1, are uniquely determined by F . For every q > 0 we denote by Jq the orthogonal projection operator on the q th Wiener chaos. In particular, 2 if F ∈ L (Ω, F, P ) is as in (2.7), then Jq F = Iq (fq ) for every q > 0. p and g ∈ H q , Let {ek , k > 1} be a complete orthonormal system in H. Given f ∈ H ⊗(p+q−2r) for every r = 0, . . . , p ∧ q , the contraction of f and g of order r is the element of H where

3

dened by

∞ X

f ⊗r g =

hf, ei1 ⊗ . . . ⊗ eir iH⊗r ⊗ hg, ei1 ⊗ . . . ⊗ eir iH⊗r .

(2.8)

i1 ,...,ir =1

e rg ∈ f ⊗r g is not necessarily symmetric: we denote its symmetrization by f ⊗ (p+q−2r) H . Moreover, f ⊗0 g = f ⊗ g equals the tensor product of f and g while, for p = q , f ⊗q g = hf, giH⊗q . In the particular case where H = L2 (A, A, µ), where (A, A) is a measurable q = L2 (Aq , A⊗q , µ⊗q ) is the space and µ is a σ -nite and non-atomic measure, one has that H s q q , I (f ) space of symmetric and square integrable functions on A . Moreover, for every f ∈ H q Notice that

coincides with the multiple Wiener-Itô integral of order

q

of

f

with respect to

X

introduced

by Itô in [11]. In this case, (2.8) can be written as

Z (f ⊗r g)(t1 , . . . , tp+q−2r ) =

f (t1 , . . . , tp−r , s1 , . . . , sr ) Ar

× g(tp−r+1 , . . . , tp+q−2r , s1 , . . . , sr )dµ(s1 ) . . . dµ(sr ). It can then be also shown that the following multiplication formula holds: if

g∈

f ∈ H p

and

H q , then p∧q X p q e r g). Ip (f )Iq (g) = r! Ip+q−2r (f ⊗ r r

(2.9)

r=0

Let us now introduce some basic elements of the Malliavin calculus with respect to the

X.

isonormal Gaussian process

Let

S

be the set of all cylindrical random variables of the form

F = g (X(φ1 ), . . . , X(φn )) , where

n > 1, g : Rn → R

φ i ∈ H.

(2.10)

is an innitely dierentiable function with compact support and

The Malliavin derivative of

F

X

with respect to

is the element of

L2 (Ω, H)

dened as

n X ∂g DF = (X(φ1 ), . . . , X(φn )) φi . ∂xi i=1

In particular,

DX(h) = h

for every

Dm F , which is an element of denotes the closure of

kF kpm,p

S

h ∈ H.

with respect to the norm

p

= E [|F | ] +

m X

mth derivative m > 1 and p > 1, Dm,p

By iteration, one can dene the

L2 (Ω, H m ), for every

m > 2. For k · km,p , dened by

the relation

E kDi F kpH⊗i .

i=1 The Malliavin derivative

D

dierentiable with bounded partial derivatives and if of

D1,2 ,

then

ϕ(F ) ∈ D1,2

D ϕ(F ) =

ϕ : Rn → R is continuously F = (F1 , . . . , Fn ) is a vector of elements

veries the following chain rule. If

and

n X ∂ϕ (F )DFi . ∂xi i=1

4

P 2 F as in (2.7) is in D1,2 if and only if ∞ q=1 qkJq F kL2 (Ω) < ∞ P ∞ 2 2 2 and, in this case, E kDF kH = q=1 qkJq F kL2 (Ω) . If H = L (A, A, µ) (with µ non-atomic), then the derivative of a random variable F as in (2.7) can be identied with the element of L2 (A × Ω) given by Note also that a random variable

Dx F =

∞ X

qIq−1 (fq (·, x)) ,

x ∈ A.

(2.11)

q=1 We denote by

δ

the adjoint of the operator

D,

also called the divergence operator.

A

2 random element u ∈ L (Ω, H) belongs to the domain of δ , noted Domδ , if and only if it 1,2 , where c is a constant depending only on veries |EhDF, uiH | 6 cu kF kL2 (Ω) for any F ∈ D u

u.

If

u ∈ Domδ ,

then the random variable

δ(u)

is dened by the duality relationship (called

integration by parts formula)

E(F δ(u)) = EhDF, uiH , which holds for every

(2.12)

F ∈ D1,2 .

The divergence operator

δ

is also called the Skorohod integral

because in the case of the Brownian motion it coincides with the anticipating stochastic integral introduced by Skorohod in [26]. The family

Tt =

∞ X

(Tt , t > 0)

of operators is dened through the projection operators

Jq

e−qt Jq ,

as

(2.13)

q=0 and is called the Ornstein-Uhlenbeck semigroup. Assume that the process

X 0,

which stands

0 for an independent copy of X , is such that X and X are dened on the product probability 0 0 0 1,2 , we can regard its Malliavin space (Ω×Ω , F ⊗F , P ×P ). Given a random variable Z ∈ D H −1 -almost derivative DZ = DZ(X) as a measurable mapping from R to R, determined P ◦ X surely. Then, for any

t > 0,

we have the so-called Mehler's formula (see e.g. [12, Section 8.5,

Ch. I] or [19, formula (1.54)]):

Tt (DZ) = E 0 DZ(e−t X + where

E0

p 1 − e−2t X 0 ) ,

(2.14)

denotes the mathematical expectation with respect to the probability

The operator

L

is dened as

L=

q=0 −qJq , and it can be proven to be the innitesimal

generator of the Ornstein-Uhlenbeck semigroup

2

DomL = {F ∈ L (Ω) :

∞ X

P 0.

P∞

(Tt )t>0 .

The domain of

L

is

q 2 kJq F k2L2 (Ω) < ∞} = D2,2 .

q=1 There is an important relation between the operators 1.4.3]). A random variable

DF ∈ Domδ ),

F

D, δ and L (see e.g. [19, Proposition F ∈ Dom (δD) (i.e. F ∈ D1,2 and

2,2 if and only if belongs to D

and in this case

δDF = −LF. F ∈ L2 (Ω), we dene L−1 F = 2 pseudo-inverse of L. For any F ∈ L (Ω), we For any

(2.15)

P∞

1 q=0 − q Jq (F ). The operator −1 F ∈ DomL, and have that L

LL−1 F = F − E(F ).

L−1

is called the

(2.16)

5

We end the preliminaries by noting that Shigekawa [25] has developed an alternative framework which avoids the inverse of the Ornstein-Uhlenbeck operator

L.

This framework

could provide an alternative derivation of the integration by parts formula (2.30) in [16] which leads to Theorem 3.3.

3 Poincaré-type inequalities and bounds on distances 3.1

Poincaré inequalities

The following statement contains, among others, a general version (3.19) of the innitedimensional Poincaré inequality (1.2).

Proposition 3.1 Fix p > 2 and let F

∈ D1,p be such that E(F ) = 0.

1. The following estimate holds:

p E DL−1 F H 6 E kDF kpH .

(3.17)

2. If in addition F ∈ D2,p , then

p

p 1 E D2 L−1 F op 6 p E D2 F op , (3.18) 2

where D2 F op indicates the operator norm of the random Hilbert-Schmidt operator

H → H : f 7→ f, D2 F H .

(and similarly for kD2 L−1 F kop ). 3. If p is an even integer, then E F p 6 (p − 1)p/2 E kDF kpH .

Proof.

(3.19)

By virtue of standard arguments, we may assume throughout the proof that

L2 (A, A, µ),

where

(A, A)

is a measurable space and

1. In what follows, we will write

X0

µ

is a

σ -nite

H =

and non-atomic measure.

to indicate an independent copy of

X.

Let

F ∈ L2 (Ω)

have the expansion (2.7). Then, from (2.11),

−Dx L−1 F =

X

Iq−1 (fq (x, ·)) ,.

q>1 By combining this relation with Mehler's formula (2.14), one deduces that

−1

−Dx L

∞ p e Tt Dx F (X)dt = e−t EX 0 Dx F e−t X + 1 − e−2t X 0 dt 0 0 p = EY EX 0 Dx F e−Y X + 1 − e−2Y X 0 Z

F

=

∞

Z

−t

where

Y ∼ E(1)

t > 0}

is the Ornstein-Uhlenbeck semigroup (2.13). Note that we regard every random

is an independent exponential random variable of mean

6

1,

and

{Tt :

variable write

Dx F

as an application

RH → R

and that (for a generic random variable

EG to indicate that we take the expectation with respect to G. It

p p

p

E DL−1 F H = EX EY EX 0 DF e−Y X + 1 − e−2Y X 0

H p

p −Y 0 6 EX EY EX 0 DF e X + 1 − e−2Y X

H p

p

= EY EX EX 0 DF e−Y X + 1 − e−2Y X 0

G)

we

follows that

H

= EY EX kDF (X)kpH = EX kDF (X)kpH = E kDF kpH where we used the fact that

e−t X 0 +

√

law

1 − e−2t X = X

for any

t > 0.

2. From the relation

2 −Dxy L−1 F =

X

(q − 1) Iq−2 (fq (x, y, ·))

q>2 one deduces analogously that

2 −Dxy L−1 F

Z

∞

=

2 e−2t Tt Dxy F dt

Z0 ∞ = = where

Y ∼ E(2)

p 2 e−2t EX 0 Dxy F e−t X + 1 − e−2t X 0 dt 0 p 1 2 EY EX 0 Dxy F e−Y X + 1 − e−2Y X 0 2

is an independent exponential random variable of mean

p

E D2 L−1 F op = 6 = =

1 2 . Thus

p p 1

2 −Y −2Y X 0 0 D F 1 − e E E E e X +

X Y X 2p op

p 1

p EX EY EX 0 D2 F e−Y X + 1 − e−2Y X 0 p 2 op

p p 1

EY EX EX 0 D2 F e−Y X + 1 − e−2Y X 0 2p op

p

p

p

2

2 1 1 1 EY EX D F (X) op = p EX D F (X) op = p E D2 F op . p 2 2 2

p = 2k , we have 2k E F = E LL−1 F × F 2k−1 = −E δDL−1 F × F 2k−1 = (2k − 1)E hDF, −DL−1 F iF 2k−2 k k1 2k 1− k1 6 (2k − 1) E hDF, −DL−1 F i E F

3. Writing

by Hölder's inequality,

from which we infer that

k E F 2k 6 (2k − 1)k E hDF, −DL−1 F i 6 (2k − 1)k E kDF kkH kDL−1 F kkH q q k 2k 6 (2k − 1)k E kDF k2k E kDL−1 F k2k H H 6 (2k − 1) E kDF kH .

7

We also state the following technical result which will be needed in Section 4. The proof is standard and omitted.

Lemma 3.2 Let F and G be two elements of D2,4 . Then, the two random elements hD2 F, DGiH

and hDF, D2 GiH belong to L2 (Ω, H). Moreover, hDF, DGiH ∈ D1,2 and DhDF, DGiH = hD2 F, DGiH + hDF, D2 GiH . 3.2 Let

(3.20)

Bounds on the total variation and Wasserstein distances

U, Z

be two generic real-valued random variables.

U

distance between the law of

and the law of

Z

We recall that the total variation

is dened as

dT V (U, Z) = sup |P (U ∈ A) − P (Z ∈ A)|,

(3.21)

A where the supremum is taken over all Borel subsets

d with values in R ,

d > 1,

dW (U, Z) =

A

of

R.

For two random vectors

the Wasserstein distance between the law of

sup

U

U

and

and the law of

|E[f (U )] − E[f (Z)]|,

Z

Z is

(3.22)

f :kf kLip 61

k · kLip stands for the usual Lipschitz seminorm. We stress that the topologies induced dT V and dW , on the class of all probability measures on R, are strictly stronger than the

where by

topology of weak convergence. The following statement has been proved in [16, Theorem 3.1] by means of Stein's method.

Theorem 3.3 Suppose that Z ∼ N (0, 1). Let F

∈ D1,2 and E(F ) = 0. Then,

dW (F, Z) 6 E|1 − hDF, −DL−1 F iH | 6 E[(1 − hDF, −DL−1 F iH )2 ]1/2 .

(3.23)

If moreover F has an absolutely continuous distribution, then dT V (F, Z) 6 2E|1 − hDF, −DL−1 F iH | 6 2E[(1 − hDF, −DL−1 F iH )2 ]1/2 .

(3.24)

4 Proof of Theorem 1.1 and contraction inequalities 4.1

Proof of Theorem 1.1

We can assume, without loss of generality, that First, note that

W

µ = 0 and σ 2 = 1.

Set

W = DF, −DL−1 F H .

has mean 1, as

E(W ) = E[hDF, −DL−1 F iH ] = −E[F × δDL−1 F ] = E[F × LL−1 F ] = E[F 2 ] = 1. p By Theorem 3.3 it follows that we only need to bound Var (W ). By (1.2), we have Var (W ) 6 E kDW k2H . So, our problem is now to evaluate kDW k2H . By using Lemma 3.2 in the special −1 F , we deduce that case G = −L

2 kDW k2H = hD2 F, −DL−1 F iH + hDF, −D2 L−1 F iH H

2

2 6 2 hD2 F, −DL−1 F iH H + 2 hDF, −D2 L−1 F iH H . 8

We evaluate the last two terms separately. We have

2

hD F, −DL−1 F iH 2 6 D2 F 2 DL−1 F 2 H op H and

hDF, −D2 L−1 F iH 2 6 kDF k2 D2 L−1 F 2 . H op H It follows that

h

2 i

2

2 E kDW k2H 6 2 E DL−1 F H D2 F op + kDF k2H D2 L−1 F op

4 1/2

4

4 1/2 . 6 2 E DL−1 F H × E D2 F op + 2 E kDF k4H × E D2 L−1 F op The desired conclusion follows by using, respectively, (3.17) and (3.18) with

4.2

p = 4.

Random contraction inequalities

When the quantity

4

E D2 F op

appearing in (1.5)-(1.6) is analytically too hard to assess, one

can resort to the following inequality, which we name random contraction inequality :

Proposition 4.1 (Random contraction inequality). Let F

∈ D2,4 . Then

2 4

D F 6 D2 F ⊗1 D2 F 2 ⊗2 , H op

(4.25)

where D2 F ⊗1 D2 F is the random element of H 2 obtained as the contraction of the symmetric random tensor D2 F , see (2.8). Proof.

We can associate with the symmetric random elements

Schmidt operator

f 7→ f, D2 F H⊗2 .

Denote by

{γj }j>1

D2 F ∈ H 2 the random Hilbert-

the sequence of its (random) eigen-

values. One has that

X

2 4

2

D F = max |γj |4 6 |γj |4 = D2 F ⊗1 D2 F H⊗2 , op j>1

j>1

and the conclusion follows. The following result is an immediate corollary of Theorem 1.1 and Proposition 4.1.

Corollary 4.2 Let F Then

∈ D2,4 with E (F ) = µ and Var (F ) = σ 2 . Assume that Z ∼ N (µ, σ 2 ).

√ h i1 1 10 2 4 dW (F, Z) 6 E kD F ⊗1 D2 F k2H⊗2 4 × E kDF k4H . (4.26) 2σ If, in addition, the law of F is absolutely continuous with respect to the Lebesgue measure,

then

dT V

√ h i1 1 10 4 (F, Z) 6 2 E kD2 F ⊗1 D2 F k2H⊗2 4 × E kDF k4H . σ

Remark 4.3

When used in the context of central limit theorems, inequality (4.27) does not

give, in general, optimal rates. For instance, if such that

E

(4.27)

Fk2

→1

and

law

Fk = I2 (fk )

Fk −→ Z ∼ N (0, 1)

as

k → ∞,

is a sequence of double integrals then (4.27) implies that

1/2

dT V (Fk , Z) 6 cst × kfk ⊗1 fk kH⊗2 → 0, and the rate

1/2

kfk ⊗1 fk kH⊗2

is suboptimal (by a power of

9

1/2),

see Proposition 3.2 in [16].

5 Characterization of CLTs on a xed Wiener chaos The following statement collects results proved in [21] (for the equivalences between (i), (ii) and (iii)) and [20] (for the equivalence with (iv)).

Theorem 5.1 Fix

q > 2, and let Fk = Iq (fk ), k > 1, be a sequence of multiple Wiener-Itô integrals such that E Fk2 → 1. As k → ∞, the following four conditions are equivalent:

(i)

law

Fk −→ Z ∼ N (0, 1);

(ii)

E(Fk4 ) −→ E(Z 4 ) = 3;

(iii)

kfk ⊗r fk kH⊗(2q−2r) −→ 0 for all r = 1, . . . , q − 1;

(iv)

kDF k2H −→ q .

L2 (Ω)

See Section 9 in [22] for a discussion of the combinatorial aspects of the implication (ii)

→

(i) in the statement of Theorem 5.1. The next theorem, which is a consequence of the main results of this paper, provides two new necessary and sucient conditions for CLTs on a xed Wiener chaos.

Theorem 5.2 Fix q > 2, and let Fk = Iq (fk ) be a sequence of multiple Wiener-Itô integrals such that E Fk2 → 1. Then, the following three conditions are equivalent as k → ∞:

(i)

law

Fk −→ Z ∼ N (0, 1);

(ii) (iii)

2 (Ω)

2

D Fk ⊗1 D2 Fk ⊗2 L−→ 0; H

2 L4 (Ω)

D Fk −→ 0. op

Proof.

kDFk k2H live inside p a nite sum of Wiener chaoses (where all the L (Ω) norms are equivalent), we deduce that 4 the sequence EkDFk kH , k > 1, is bounded. In view of (1.5) and (4.25), it is therefore Since

EkDFk k2H = qE(Fk2 ) → q ,

enough to prove the implication (i)

H = L2 (A, A , µ)

where

(A, A )

→

and since the random variables

(ii). Without loss of generality, we can assume that

is a measurable space and

µ

is a

σ -nite

atoms. Now observe that

2 Da,b Fk = q(q − 1)Iq−2 fk (·, a, b) ,

10

a, b ∈ A.

measure with no

Hence, using the multiplication formula (2.9),

D2 Fk ⊗1 D2 Fk (a, b) Z Iq−2 fk (·, a, u) Iq−2 fk (·, b, u) µ(du) = q 2 (q − 1)2 A

Z q−2 X q−2 2 2 2 e r fk (·, b, u)µ(du) r! fk (·, a, u)⊗ = q (q − 1) I2q−4−2r r A r=0 q−2 X q−2 2 2 2 e r+1 fk (·, b) r! = q (q − 1) I2q−4−2r fk (·, a)⊗ r r=0 q−1 X q−2 2 2 2 e r fk (·, b) . = q (q − 1) (r − 1)! I2q−2−2r fk (·, a)⊗ r−1 r=1

Using the orthogonality and isometry properties of the integrals

Iq ,

we get

4 q−1 X

2

2 2 q−2 4 4 2

(r − 1)! (2q − 2 − 2r)!kfk ⊗r fk k2H⊗(2q−2r) . E D Fk ⊗1 D Fk H⊗2 6 q (q − 1) r−1 r=1

The desired conclusion now follows since, according to Theorem 5.1, if (i) is veried then, necessarily,

kfk ⊗r fk kH⊗(2q−2r) → 0

for every

r = 1, ..., q − 1.

6 CLTs for linear functionals of Gaussian subordinated elds Let B denote a centered |ρ(x)|dx < ∞, where ρ(u−v) := R E (Bu+1 −Bu )(Bv+1 −Bv ) . Also, in order to avoid trivialities, assume that ρ is not identically We now provide an explicit application of the inequality (4.26).

Gaussian process with stationary increments and such that

R

zero.

B can be identied with an isonormal Gaussian process of the type X = {X(h), h ∈ H}, for H dened as follows: (i) denote by E the set of all step functions on R, (ii) dene H as the Hilbert space obtained by closing E with respect to the inner product h1[s,t] , 1[u,v] iH = Cov(Bt − Bs , Bv − Bu ). In particular, with such a notation, one has that Bt − Bs = X(1[s,t] ). 2 Let f : R → R be a real function of class C , and Z ∼ N (0, 1). We assume that f is not constant, that E|f (Z)| < ∞ and that E|f 00 (Z)|4 < ∞. As a consequence of the 0 4 generalized Poincaré inequality (3.19), we see that we also automatically have E|f (Z)| < ∞ 4 and E|f (Z)| < ∞. Fix a < b in R and, for any T > 0, consider The Gaussian space generated by

1 FT = √ T

Z

bT

f (Bu+1 − Bu ) − E[f (Z)] du.

aT

Theorem 6.1 As T dW

√

FT VarFT

→ ∞, , Z = O(T −1/4 ).

(6.28)

11

Remark 6.2

We believe that the rate in (6.28) is not optimal (it should be

O(T −1/2 ) instead),

see also Remark 4.3.

Proof of Theorem 6.1.

We have

Z

1 DFT = √ T and

bT

f 0 (Bu+1 − Bu )1[u,u+1] du

aT

Z bT 1 D FT = √ f 00 (Bu+1 − Bu )1⊗2 [u,u+1] du. T aT Z 1 2 kDFT kH = f 0 (Bu+1 − Bu )f 0 (Bv+1 − Bv ) ρ(u − v)dudv T [aT,bT ]2 2

Hence

so that

kDFT k4H

=

Z

1 T2

f 0 (Bu+1 − Bu )f 0 (Bv+1 − Bv )f 0 (Bw+1 − Bw )

[aT,bT ]4

×f 0 (Bz+1 − Bz )ρ(w − z)ρ(u − v)dudvdwdz. By applying Cauchy-Schwarz inequality twice, and by using the fact that

law

Bu+1 − Bu = Z ,

we get

E f 0 (Bu+1 − Bu )f 0 (Bv+1 − Bv )f 0 (Bw+1 − Bw )f 0 (Bz+1 − Bz ) 6 E|f 0 (Z)|4 so that

EkDFT k4H 6 E|f 0 (Z)|4 0

4

6 E|f (Z)|

1 T

Z

1 T

Z

!2 |ρ(u − v)|dudv

[aT,bT ]2 bT

2

Z |ρ(x)|dx

du aT

= O(1).

(6.29)

R

On the other hand, we have

1 D FT ⊗1 D FT = T 2

2

Z [aT,bT ]2

f 00 (Bu+1 − Bu )f 00 (Bv+1 − Bv )ρ(u − v)1[u,u+1] ⊗ 1[v,v+1] dudv.

Hence

EkD2 FT ⊗1 D2 FT k2H⊗2 Z 1 = E f 00 (Bu+1 − Bu )f 00 (Bv+1 − Bv )f 00 (Bw+1 − Bw )f 00 (Bz+1 − Bz ) 2 T [aT,bT ]4 ×ρ(u − v)ρ(w − z)ρ(u − w)ρ(z − v)dudvdwdz Z 00 4 1 6 E|f (Z)| 2 |ρ(u − v)||ρ(w − z)||ρ(u − w)||ρ(z − v)|dudvdwdz T [aT,bT ]4 Z 00 4 b−a 6 E|f (Z)| |ρ(x)||ρ(y)||ρ(t)||ρ(x − y − t)|dxdydt = O(T −1 ). T R3 By combining all these facts and (4.26), the desired conclusion follows. Theorem 6.1 does not guarantee that

limT →∞ VarFT exists. The f is symmetric.

shows that the limit does indeed exist, at least when

12

following proposition

Proposition 6.3 Suppose that

f : R → R is a symmetric real function of class C 2 . Then σ 2 := limT →∞ VarFT exists in (0, ∞). Moreover, as T → ∞, law

FT −→ Z ∼ N (0, σ 2 ).

Proof of Proposition 6.3.

(6.30)

We expand

f

in terms of Hermite polynomials. Since

we can write

f (x) = E[f (Z)] +

∞ X

c2q H2q (x),

f

is symmetric,

x ∈ R,

q=1 where the real numbers

VarFT

= =

=

=

c2q Z

are given by

−→

Since assumed

so that

a

(b − a)

∞ X

c22q (2q)!

−T (u−a)

Z

ρ2q (x)dx =: σ 2 ,

by monotone convergence.

R

q=1

f is not constant, there exists q > 1 such that c2q 6= 0 so ρ 6≡ 0). Moreover, we also have q VarFT 6 E kDFT k2H 6 E kDFT k4H = O(1),

σ 2 < ∞.

that

σ2 > 0

(recall that we

see (6.29),

The assertion now follows from Theorem 6.1.

H < 1/2, Theorem 6.1 applies because, in this case, it is easily checked that < ∞. On the other hand, using R |ρ(x)|dx h i law 1 R b Bx+h −Bx f − E f (Z) dx for all the scaling property of B , observe that F1/h = √ H h h a When

B

Thus

1 Cov f (Bu+1 − Bu ), f (Bv+1 − Bv ) dudv T [aT,bT ]2 Z ∞ X 1 2 ρ2q (v − u)dudv c2q (2q)! T [aT,bT ]2 q=1 Z Z bT −u ∞ X 1 bT 2 c2q (2q)! du dxρ2q (x) T aT aT −u q=1 Z b Z T (b−u) ∞ X 2 c2q (2q)! du dxρ2q (x) q=1

T %∞

(2q)!c2q = E[f (Z)H2q (Z)].

is a fractional Brownian motion with Hurst index

R

xed

h > 0.

Hence, since

E|Bt − Bs |2 = σ 2 (|t − s|)

with

σ 2 (r) = r2H

a concave function, the

general Theorem 1.1 in [13] also applies, and this gives another proof of (6.30). We believe however that, even in this particular case, our proof is simpler (since not based on the rather technical method of moments). distance between the laws of

Moreover, note that [13] is not concerned with bounds on

F1/h /

p VarF1/h

and

Z ∼ N (0, 1).

7 A multidimensional extension Let

V, Y

be two random vectors with values in

between the laws of

V

and

Y

Rd , d > 2.

Recall that the Wasserstein distance

is dened in (3.22). The following statement, whose proof is

based on the results obtained in [18], provides a multidimensional version of (1.5).

13

Theorem 7.1 Fix

d > 2, and let C = {C(i, j) : i, j = 1, . . . , d} be a d × d positive denite matrix. Suppose that F = (F1 , . . . , Fd ) is a Rd -valued random vector such that E[Fi ] = 0 and Fi ∈ D2,4 for every i = 1, . . . , d. Assume moreover that F has covariance matrix C . Then √ d d X X 3 2 −1 1/4 2 4 1/4 dW F, Nd (0, C) 6 kC kop kCk1/2 EkD F k × EkDFj k4H , i op op 2 i=1

j=1

where Nd (0, C) indicates a d-dimensional centered Gaussian vector, with covariance matrix equal to C . Proof.

In [18, Theorem 3.5] it is shown that

v u d uX −1 1/2 t E (C(i, j) − hDFi , −DL−1 Fj iH )2 . F, Nd (0, C) 6 kC kop kCkop

dW

i,j=1 Since, using successively (2.12), (2.15) and (2.16), we have

E hDFi , −DL−1 Fj iH = −E Fi × δDL−1 Fj ] = E Fi × LL−1 Fj ] = E[Fi Fj ] = C(i, j), we deduce, applying successively (1.2), (3.20), Cauchy-Schwarz inequality and Proposition 3.1,

dW F, Nd (0, C) 6 kC

−1

kop kCk1/2 op

d q X Var hDFi , −DL−1 Fj iH

i,j=1

6 kC

−1

kop kCk1/2 op

d q X E kDhDFi , −DL−1 Fj iH k2H i,j=1

6

√

2kC

−1

kop kCk1/2 op

d q X q 2 2 2 −1 2 −1 E khD Fi , −DL Fj iH kH + E khDFi , −D L Fj iH kH i,j=1

6

√

2kC

−1

kop kCk1/2 op

d h X

1/4 1/4 E kD2 Fi k4op E kDL−1 Fj k4H

i,j=1

1/4 2 −1 1/4 i + E kDFi k4H E kD L Fj k4op 6

√

2kC

−1

kop kCk1/2 op

d h X

1/4 1/4 E kD2 Fi k4op E kDFj k4H

i,j=1

1/4 2 1/4 1 + E kDFi k4H E kD Fj k4op 2 =

√ d d X 2 4 1/4 X 1/4 3 2 −1 kC kop kCk1/2 E kD F k × E kDFj k4H . i op op 2 i=1

Acknowledgments.

j=1

We would like to thank Professor Paul Malliavin for very stimulating

discussions and an anonymous referee for pointing us towards the reference [25].

14

References [1] D. Bakry, F. Barthe, P. Cattiaux and A. Guillin (2008). A simple proof of the Poincaré inequality for a large class of probability measures including the log-concave case.

Comm. in Probab. 13, 6066 (Electronic)

Elect.

[2] S.G. Bobkov (1999). Isoperimetric and analytic inequalities for log-concave probability measures.

Ann. Probab. 27(4), 19031921.

[3] T. Cacoullos, V. Papahanasiou and S.A. Utev (1994). Variational inequalities with examples and an application to the central limit theorem.

Ann. Probab. 22(3), 16071618.

[4] S. Chatterjee (2009). Fluctuation of eigenvalues and second order Poincaré inequalities.

Probab. Theory Related Fields 143, 140.

[5] L.H.Y. Chen (1985). Poincaré-type inequalities via stochastic integrals.

Gebiete 69, 251277.

Z. Wahrsch. verw.

[6] L.H.Y. Chen (1987). Characterization of probability distributions by Poincaré-type inequalities.

Ann. Inst. H. Poincaré Probab. Statist. 23(1), 91110.

[7] L.H.Y. Chen (1988). The central limit theorem and Poincaré-type inequalities.

Probab. 16(1), 300304.

[8] L. Chen and Q.-M. Shao (2005). Stein's method for normal approximation. In:

duction to Stein's method,

Ann.

An intro-

159. Lect. Notes Ser. Inst. Math. Sci. Natl. Univ. Singap.

4,

Singapore Univ. Press, Singapore, 2005. [9] H. Cherno (1981). A note on an inequality involving the normal distribution.

Probab. 9(3), 533535.

Ann.

[10] C. Houdré and V. Pérez-Abreu (1995). Covariance identities and inequalities for functionals on Wiener and Poisson spaces.

Ann. Probab. 23, 400419.

[11] K. Itô (1951). Multiple Wiener integral. [12] P. Malliavin (1997).

J. Math. Soc. Japan 3, 157169.

Stochastic Analysis. Springer-Verlag, Berlin, Heidelberg, New York.

[13] M. Marcus and J. Rosen (2008). CLT for

Stoch. Proc. Appl. 118, 11071135.

Lp

moduli of continuity of Gaussian processes.

[14] J. Nash (1956). Continuity of solutions of parabolic and elliptic equations.

80, 931954.

Amer. J. Math.

[15] I. Nourdin and G. Peccati (2008). Non-central convergence of multiple integrals.

Probab., to appear.

[16] I. Nourdin and G. Peccati (2008). Stein's method on Wiener chaos.

Fields, to appear.

Ann.

Probab. Theory Rel.

[17] I. Nourdin and G. Peccati (2008). Stein's method and exact Berry-Essén bounds for functionals of Gaussian elds. Preprint.

15

[18] I. Nourdin, G. Peccati and A. Réveillac (2008). Multivariate normal approximation using Stein's method and Malliavin calculus. [19] D. Nualart (2006).

Ann. Inst. H. Poincaré Probab. Statist., to appear.

The Malliavin calculus and related topics. Springer-Verlag, Berlin, 2nd

edition. [20] D. Nualart and S. Ortiz-Latorre (2008). Central limit theorem for multiple stochastic integrals and Malliavin calculus.

Stoch. Proc. Appl. 118 (4), 614628.

[21] D. Nualart and G. Peccati (2005). Central limit theorems for sequence of multiple stochastic integrals. [22] G.

Peccati

mulae

for

Ann. Probab. 33 (1), 177193. and

M.S.

non-linear

Taqqu

(2008).

functionals

of

Moments, random

cumulants

measures.

and

Preprint

diagram available

forat

http://fr.arxiv.org/abs/0811.1726. [23] G. Peccati and C.A. Tudor (2005). Gaussian limits for vector-valued multiple stochastic integrals. In:

Séminaire de Probabilités XXXVIII , 247262. Lecture Notes in Math. 1857,

Springer-Verlag, Berlin. [24] G. Reinert (2005). Three general approaches to Stein's method. In:

An introduction to

Stein's method, 183221. Lect. Notes Ser. Inst. Math. Sci. Natl. Univ. Singap. 4, Singapore

Univ. Press, Singapore. [25] I. Shigekawa (1986). De Rham-Hodge-Kodaira's decomposition on an abstract Wiener space.

J. Math. Kyoto Univ. 26 (2), 191202.

[26] A. V. Skorohod (1975). On a generalization of a stochastic integral.

20, 219233.

16

Theory Probab. Appl.

On the Efficient Second Order Minimization and Image ...