Monograf´ıas del Semin. Matem. Garc´ıa de Galdeano. 27: 337–343, (2003).

An urn model with non-deterministic total replacement I. Higueras1 , J. A. Moler2 , F. Plo3 , M. San Miguel3 1 2

Departamento de Matem´atica e Inform´atica, Universidad P´ ublica de Navarra.

Depart. de Estad´ıstica e Investigaci´on Operativa, Universidad P´ ublica de Navarra. 3

Departamento de M´etodos Estad´ısticos. Universidad de Zaragoza. Abstract

A strong law is obtained for the process {Xn } that represents the proportion of balls of each colour in a generalized P´olya urn with non-deterministic total replacement. We prove that this process fits the Robbins-Monro scheme of stochastic approximation and, by means of the ODE method, we obtain its a.s. limit. Keywords: Generalized P´olya urn, ODE method, Differential Algebraic Equations. AMS Classification: 60F15, 34D23, 62L20.

1

Introduction

Let us consider an urn that initially contains T0 > 0 balls of two colours, for example black and white. The replacement policy consists in drawing one ball from the urn, putting it back into the urn and, if the ball is white, adding a white balls and b black balls and, otherwise, c white balls and d black balls. In the probabilistic literature it is usual to assume that a + b = c + d = s,

(1)

so that the total number of balls in each stage, {Tn }n≥0 , is a deterministic process Tn = T0 + ns, for all n ≥ 0. Let {Xn } be the bidimensional stochastic process that represents the proportion of balls of each colour in the urn at stage n. This process evolves in the 1-simplex ∆1 = {x = (x1 , x2 ) ∈ R2 : x1 + x2 = 1, x1 ≥ 0, x2 ≥ 0}. The asymptotic behaviour of this process is well-known (see, [4], [5]) and some generalizations have been studied taking advantage of the deterministic behaviour of {Tn } (see, for example, [6], [1], [2]). 337

When condition (1) is not assumed the process {Tn } is non-deterministic. This is the case in [3], where the exact distribution of the number of balls of each colour in each stage is obtained, but strong laws for {Xn } are not established. Our aim is to obtain a strong law for {Xn }. In section 2 we model the urn process as a recurrence equation and we prove that it fits the Robbins-Monro scheme of stochastic approximation. In section 3 we apply the EDO method to obtain a.s convergence of {Xn }. Since the EDO associated with the recurrence equation evolves in the ∆1 manifold, we can restate the EDO as a differential algebraic equation (DAE) and apply the qualitative theory of DAEs to solve it.

2

Recurrence equation of the urn model

Let us consider the replacement matrix

C=

a b

!

c d

where a, b, c, d are non-negative numbers, and b, c > 0. If we identify number 1 with white balls and number 2 with black balls, then the element cij of C represents the number of balls of colour j that will be added to the urn when a ball of colour i has been extracted. Let {An } be a bidimensional stochastic process such that, for each n, the distribution of An conditioned to Xn−1 is ( An =

(1, 0) with probabilityX1n−1 (0, 1) with probability X2n−1

Let Sn = An C1t , with n ≥ 1 and 1 = (1, 1). The sequence {Sn } indicates the total number of balls added to the urn in each stage, so that Tn = T0 +

n X

Sk , n ≥ 1.

k=1

Now, we can describe the evolution of the process by means of the recurrence equation Tn Xn + An+1 C Tn+1 An+1 C − Sn+1 Xn = Xn + . Tn+1

Xn+1 =

(2)

The process {(Xn , An )} will be referred to as a generalized P´olya urn model (GPUM), and we will consider the natural filtration {Fn }n≥1 , where Fn = σ((Xi , Ai ) 1 ≤ i ≤ n). 338

Lemma 1 Let {Ten } be a sequence such that Ten =

n X

E[Sk | Fk−1 ].

k=1

Then, 1 |Tn − Ten | → 0, n

a.s.

Proof. Let m = max{a + b, c + d}. Since the sequence {Sn } is uniformly bounded by m, the lemma follows straightforward from Theorem 2.19 in [7].



Theorem 1 Let {(Xn , An )} be a GPUM. Then, the process {Xn } fits the recurrence equation Xn+1 = Xn + γn Yn ,

(3)

that satisfies the conditions [A1] supn E|Yn |2 < ∞. [A2] Yn = F (Xn ) + εn + βn , where [A2.1] F : R2 → R2 is a continuous function. [A2.2] {εn } is a sequence of martingale differences relative to the σ-algebra {Fn }. [A2.3] βn → 0, a.s. [A3]

X

γn = ∞,

X

γn2 < ∞, γn > 0, ∀n, γn → 0.

Proof. Let s = (a + b, c + d). From Lemma 1 we have Ten = T0 +

n X

Xk−1 st .

k=1

From (2), we can write Xn+1 = Xn +

An+1 C − Sn+1 Xn 1 1 + (An+1 C − Sn+1 Xn )( − ). Tn+1 Ten+1 Ten+1

Now, we have that E[Xn+1 | Fn ] = Xn +

1 Ten+1

" Xn (C − Xn st ) + E (An+1 C − Sn+1 Xn )

339

! #! Ten+1 − 1 |Fn . Tn+1

As Xn+1 = E[Xn+1 | Fn ] + (Xn+1 − E[Xn+1 | Fn ]),

(4)

the process {Xn } satisfies the recurrence relation Xn+1 = Xn + γn+1 (F (Xn ) + εn+1 + βn+1 ). where F (Xn ) := Xn (C − Xn st ), 1 γn+1 := , Ten+1 εn+1 := Ten+1 # " (Xn+1 − E[Xn+1 | Fn ]), Ten+1 − 1)|Fn . βn+1 := E (An+1 C − Sn+1 Xn )( Tn+1 It is easy to check that conditions [A1], [A2.1], [A2.2] and [A3] hold. Thus, we only have to focus on condition [A2.3]. We consider, for x = (x1 , x2 ) ∈ R2 , the norm kxk = |x1 | + |x2 |. Let m = max{a + b, c + d}. Then, k(An+1 C − Sn+1 Xn )k ≤ 2m, and |Xn st − Sn+1 | ≤ 2m. Since Tn+1 = Tn +Sn+1 , Sn+1 is a positive random variable and Tn is Fn -measurable, it follows that # Te − T + X st − S n n n n+1 kβn+1 k ≤ E k(An+1 C − Sn+1 Xn )k | Fn Tn Te − T 1 n n ≤ 2m + 4m2 Tn Tn "

As Tn > n, the second addend converges to 0, and from Lemma 1 the first one also converges to 0. Therefore kβn k converges to 0 and the result follows.



Remark 1 The recurrence relation (3) with conditions [A1]-[A3] is usually referred to as the Robbins-Monro algorithm of stochastic approximation.

340

3

Almost sure convergence of the urn model

In order to obtain the a.s. convergence of the process {Xn } of a GPUM, we are going to use the ODE method. This method relates a recurrence equation with an ODE (see [8]). Then, under some assumptions, the asymptotic behaviour of the ODE corresponds with that of the recurrence equation. From Theorem 5.2.1 in [8], and the remark that follows it, we can state the following theorem. Theorem 2 Let {Xn } be a stochastic process that fits the recurrence equation (3) and the conditions [A1], [A2] and [A3] of Theorem 1. If u ∈ R2 is a globally asymptotically stable point to the ODE with restrictions ( x˙ = F (x),

(5)

x ∈ ∆1 then Xn → u, a.s.

 Now, in order to study (5), we point out that its equilibrium points and solutions are the same as those of the DAE

(

x˙ = F (x) − z1t 0 =

(6)

1 − x 1t

where x ∈ R2 and z ∈ R. The stability of equilibrium points for DAEs depends on the spectral abscissa of a regular matrix pencil, (A, B), where the spectral abscissa is α(A, B) := max{ Re (λ) : det (λA + B) = 0} . Now, from Theorem 4.3 in [9], an equilibrium point of the DAE (6) is asymptotically stable if and only if α(A, B) < 0, with

A=

I2 0t 0

0

! ,

B=

−JF (u) 1t −1

!

0

where JF is the the jacobian matrix of the function F . We are going to apply the previous comments to our problem. First, we calculate the equilibrium points by solving F (x1 , x2 ) = 0,

x1 + x2 = 1 , 341

which gives φ(x1 ) = (−a − b + c + d) x21 + (a − 2c − d) x1 + c = 0 . We have to distinguish two cases: i) If a + b = c + d, then there is a unique root x1 =

c c = ∈ [0, 1] , −a + 2c + d c+b

and thus a unique equilibrium point (x∗1 ,

x∗2 )

 =

c b , c+b c+b

 .

ii) If a + b 6= c + d, then there are two roots of the polynomial. As φ(0) = c > 0 and φ(1) = −b < 0, only one of them is in [0, 1], namely p −β − β 2 − 4αc ∗ , x1 = 2α where we have denoted α := −a − b + c + d, β := a − 2c − d. Denoting x∗2 = 1 − x∗1 , the unique equilibrium point in the 1-simplex is (x∗1 , x∗2 ). For x = (x1 , x2 ) ∈ ∆1 we have that F (x)1t = 0. The Jacobian matrix ! ! ! a+b a+b a b I2 (x1 , x2 ) − (x1 , x2 ) − JF (x) = c+d c+d c d has λ1 (x1 ) = −c − d + (−a − b + c + d)x1 = −c − d + αx1 as eigenvalue with eigenvector 1; the other eigenvalue is λ2 (x1 ) = a − 2c − d + 2(−a − b + c + d)x1 = 2αx1 + β .

(7)

To compute these eigenvalues we have already used that (x1 , x2 ) ∈ ∆1 . Evaluating (7) in the equilibrium point x1 = x∗1 , we obtain λ2 (x∗1 ) = −(c + b) < 0 if a + b = c + d and p λ2 (x∗1 ) = − β 2 − 4αc < 0 otherwise. From these results, we have established, in any case, the existence of a unique stable point to the ODE with restrictions (5), and therefore, it is also a globally asymptotically stable point. Now, we can apply Theorem 2 to obtain the following result. 342

Theorem 3 Let {(Xn , An )} be a GPUM. c b , ), a.s. c+b c+b

a) If a + b = c + d

then

Xn → (

b) If a + b 6= c + d

then

Xn → (u1 , 1 − u1 ), a.s.

where α := −a − b + c + d, β := a − 2c − d and u1 =

−β −

p β 2 − 4αc . 2α

References [1] Bai, Z. D. and Hu, F. (1999). Asymptotic theorems for urn models with nonhomogeneous generating matrices. Stoch. Proc. Appl. 80, 87–101. [2] Bandyopadhyay, U. and Biswas, A. (2000). A class of adaptive designs. Sequential Analysis 19, 1 & 2, 45-62. [3] Kotz, S., Mahmoud, H. and Robert, P. (2000). On generalized P´olya urn models. Stat. Probabil. Lett. 49, 163–173. [4] Gouet, R. (1989). A martingale approach to strong convergence in a generalized P´olya-Eggenberger urn model. Stat. Probabil. Lett. 8, 225–228. [5] Gouet, R. (1993). Martingale Functional Central Limit Theorems for a Generalized P´olya Urn. Ann. Prob. 21, 3, 1624–1639. [6] Gouet, R. (1997). Strong Convergence of Proportions in a multicolor P´olya Urn. J. Appl. Prob. 34, 426–435. [7] Hall, P. and Heyde, C. C. (1980). Martingale limit theory and its applications. Academic Press, San Diego. [8] Kushner, H. J. and Yin, G. G. (1997). Stochastic Approximation Algorithms and Applications. Springer-Verlag, New York. ¨rz, R. (1992). On quasilinear index 2 differential-algebraic equations. Semin. [9] Ma ber., Humboldt-Univ. Berl., Fachbereich Math. 92-1, 39–60.

343

An urn model with non-deterministic total replacement 1 Introduction

as eigenvalue with eigenvector 1; the other eigenvalue is λ2(x1) = a − 2c − d + 2(−a − b + c + d)x1 = 2αx1 + β . (7). To compute these eigenvalues we have ...

130KB Sizes 1 Downloads 146 Views

Recommend Documents

An urn model with non-deterministic total replacement 1 Introduction
Abstract. A strong law is obtained for the process {Xn} that represents the propor- tion of balls of each colour in a generalized Pólya urn with non-deterministic total replacement. We prove that this process fits the Robbins-Monro scheme of stochas

An urn model with non-deterministic total replacement 1 ...
Let {(Xn,An)} be a GPUM. Then, the process {Xn} fits the recurrence equation. Xn+1 = Xn + γnYn,. (3) that satisfies the conditions. [A1] supnE|Yn|2 < ∞. [A2] Yn = F(Xn) + εn + βn, where. [A2.1] F : R2 → R2 is a continuous function. [A2.2] {εn

Reverse Total Shoulder Replacement
How long will the procedure last and will I be under anesthesia? 4. Will I have dressings, bandages, or stitches after surgery? When should they be removed? 5.

Ebook Total Knee Replacement and Rehabilitation
... PDF Total Knee Replacement and Rehabilitation: The Knee Owner's Manual .... decision-making process to the surgery itself, how to prepare your home for.

\ REPLACEMENT
Mar 8, 2006 - US RE41,138 E. Page 2. US. PATENT DOCUMENTS. OTHER PUBLICATIONS. 5 265 206 A * “H993 shackelford et a1' ______ __ 719/316. Orafali et al. “The Essential Distributed Object Survival. 5,301,301 A * 4/1994 Kodoskyetal. ............ ..

\ REPLACEMENT
8 Mar 2006 - from the object B] Object-oriented programming is per formed by displaying objects connected by wirings that rep resent the ?ow of data, control or messages from one object to a second object. A coupling ofinput and output terminal so th

i Edge Replacement as a Model of Causal ...
Figure 20: Graph classes, data, and model predictions, for Mayrhofer et al (2010). .... auto mechanic has detailed knowledge about which intermediate events ...

1 Site Zoning Vignette: 1 hour total 1 problem with 2 tasks Problem ...
General notes / Tips: • Exact problem w/ single answer. • Follow the program exactly. • Be aware of sun exposure planes; corner of building will likely be clipped.

On Complementing Nondeterministic Buchi Automata
University of Colorado at Boulder. 2. Hebrew University. 3. Rice University ..... In this section we review the relevant technical details of [12]. Consider a UCW A =.

Neuroethics-An-Introduction-With-Readings-Basic-Bioethics.pdf ...
Retrying... Whoops! There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Neuroethics-An-Introduction-With-Readings-Basic-Bioethics.pdf. Neuroethics-An-Introducti

MODULO SCHEDULING WITH REGULAR UNWINDING 1 Introduction
1 Introduction. 1.1 Modulo Scheduling .... In parallel machine scheduling problems, an opera- ... πi = λi The processing period of operation Oi is λi, implying ...

On Unambigous Nondeterministic Finite Automata ...
ous nondeterministic finite automata (UFA), which has the strict tree property. ..... there is a v ≤i Cw, and a v′ ≤j Cw, such that v = xy, and v′ = xy′, for some y ...

Replacement
indicating their experience with similar engagements. In addition, any relevant. Continuing Education Programs should be listed. 9.1.5.1 Substitutions for the ...

An Empirical Model of Wage Dispersion with Sorting
Dec 5, 2016 - Job opportunities arrive at rate. – If unemployed: + , ≥ 0 and > 0. – If employed: • is arrival rate of offers unrelated to search, is equilibrium ...

Cash burns: An inventory model with a cash-credit choice
Further information is available online at http://www.nber.org/papers/w21110.ack ... Cash is used whenever the agent has enough of it, credit is used when cash ...

An Empirical Model of Wage Dispersion with Sorting
technology is assumed such that sorting does not arise. This paper ... The analysis will allow that the search technology may differ across ...... Vp(h, p) > 0. Hence ...

An Empirical Model of Firm Entry with Endogenous ...
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, .... in a technology-adoption game between internet service providers.

The CAPM Strikes Back? An Investment Model with ... -
response of log consumption to a disaster shock in the model mimics that in the ... Figure 2 reports that the model's impulse response is largely comparable with ...

PAC Reinforcement Learning with an Imperfect Model
than in M for all X ⊆ X ξ-inc . (see the agnostic version of the conditions in the paper.) Theorem 1: Under Condition 1, there exists an algorithm that achieves. O(|X ξ-inc. |. 2. H. 4 log(1/δ)/ε. 3. ) sample complexity for ξ = O(ε/H. 2. ). A

An estimated small open economy model with ... - Acrobat Planet
disconnect between the labor market and the rest of the economy. The rest of the paper is organized as follows. Section 2 details the model. Section 3 presents.