Bayesian Optimal Power-utility Grows Hyperbolically in the Long Run Hideaki Miyata† and Jun Sekine‡ † Department of Mathematics, Kyoto University, Kitashirakawa-Oiwakecho, Sakyo-ku, Kyoto 606-8502, Japan. e-mail: [email protected]

Graduate School of Engineering Science, Osaka University, 1-3, Machikaneyama, Toyonaka, Osaka 560-8531, Japan. e-mail: [email protected] March 5, 2012 Abstract A Bayesian power-utility maximization is considered, where meanreturn-rates of risky assets (or, more precisely, the market price of risk) is an unobservable random vector and the Arrow-Pratt’s risk-aversion parameter is larger than 1. It is shown that the optimal expected utility grows hyperbolically in the long run if we omit the effect of the risk-free interest rate. This provides a sharp contrast to the results of “non-Bayesian” settings: for instance, in the case of constant market price of risk, the optimal expected power-utility grows exponentially in the long run. Keywords: Bayesian CRRA-utility maximization, partial information, long-term growth rate, hyperbolic growth, hyperbolic discount, Kelly portfolio, fractional Kelly portfolio.

1

Introduction

In the present article, we introduce major findings of Hayashi, Miyata and Sekine (2012), where the expected power-utility maximization of terminal wealth (1.1)

U (T,γ) (x) := sup Eu(γ) (XTx,π ) π

is considered in a continuous-time financial market, consists of one riskless asset and n-risky assets. Here, we use notation for the CRRA-utility function u(γ) (x) := 1

x1−γ , 1−γ

where Arrow-Pratt’s relative risk-aversion parameter is set as γ > 1, and we denote by XTx,π the wealth of a self-financing investor at the terminal date T ∈ R++ , where x ∈ R++ is an initial wealth and π := (πt )t∈[0,T ] is a dynamic investment policy. In particular, we assume that the so-called market price of risk vector λ is a hidden, unobservable random variable, which means that (1.1) is a partially-observable (or Bayesian) optimization problem (see Section 2 for the detail of the setup). For this problem, we are interested in the long-term growth rate of optimal expected utility, i.e., writing {∫ } T

U (T,γ) (x) = u(γ) (x) exp

∂t log U (t,γ) (x)dt , 0

we are interested in the asymptotic behaviour of ∂t log U (t,γ) (x) as t ≫ 1. We obtain the following hyperbolic long-term growth rate, (1.2)

∂T log U (T,γ) (x) = (1 − γ)r −

n1 + ϵ(T ) 2T

as T → ∞,

where r is the constant risk-free interest rate and ϵ(T ) is a function “smaller” than 1/T as T → ∞ (see Proposition 5.2 and Remark 5.2 for the details). It is interesting to see that (1.2) provides a sharp contrast to the result of “nonBayesian” case: if the market price of risk vector λ is constant, then, we have the exact expression, ( ) 1 (T,γ) 2 ∂T log U (x) = (1 − γ) r + |λ| , 2γ (see (3.3)), i.e., this optimal non-Bayesian power-utility grows exponentially with respect to T , and the (norm) of the market price of risk vector affects the growth rate. It is also interesting to see that the right-hand-side of (1.2) has a “universal” value: it is independent of the law of λ (except for the residual term ϵ(T )), and the hyperbolic term depends on the number n of risky-assets (= the dimension of driving Brownian motion) only. Remark 1.1 (“Endogenous” Hyperbolic Discounting). The above hyperbolic growth of optimal power-utility plays an interesting role in the lifetime consumption maximization problem, { ∫ t } ∫ ∞ (1.3) U (γ) (x) := sup E exp − ρ(u)du u(γ) (ct )dt, (π,c)

0

0

which is studied in Miyata (2012), [15]. To consider (1.3), a similar market model with partial information is employed and a Bayesian self-financing investor with the wealth process (Xtx,π,c )t≥0 is considered, where x ∈ R++ is an initial wealth, π := (πt )t≥0 is a dynamic investment policy, and c := (ct )t≥0 is 2

a dynamic consumption plan, and, for a given discount rate process (ρ(t))t≥0 , the maximization is considered with respect to both π and c. Actually, with the hyperbolic discount rate β , ρ(t) := δ + 1 + αt we can characterize the critical values (δ, β) to ensure the solvability of (1.3), i.e., |U (γ) (x)| < ∞ holds if and only if one of the following conditions are satisfied, (a) δ > δ, or (b) δ = δ and β > β (see [15] for details). Among mathematical finance literatures, Zervos (2008), Bjork and Murgoci (2010), Eckland, Mbodji and Pirvu (2011), and so on treat optimal consumption problems on finite horizon with hyperbolic discounting. They all consider the problems with a priori hyperbolic discounting rates. It is interesting to see that, in Bayesian setting, contrarily to the above-mentioned studies, hyperbolic discounting is derived as a natural consequence, i.e., the critical and “minimal” discounting rate ρ(t) := δ +

β 1 + αt

contains a hyperbolic term. The organization of the present article is as follows. In Section 2, we formulate our financial market model with partial information. After introducing the setup, in Section 3, we mention about the standard baseline results: Merton’s optimal power-utility result, which grows exponentially with respect to the terminal time T . In Section 4, we introduce the results on Bayesian CRRAutility maximization of terminal wealth, which is studied in Karatzas and Zhao (2001). In Section 5, we analyze the long-time asymptotics of the optimal Bayesian CRRA-utility, and observe its hyperbolic growth in power-utility case. In Appendix, we mention about a dynamic programming approach to solve our Bayesian CRRA-utility maximization.

2

Market Model

Consider a continuous-time financial market, consisting of one riskless asset and n-risky assets. The price process S 0 := (St0 )t≥0 of the riskless asset is given by (2.1)

St0 := ert ,

where r ∈ R≥0 is the constant risk-free interest rate. The price process S := (S 1 , . . . , S n )⊤ , S i := (Sti )t≥0 of n-risky assets, where (·)⊤ denotes the transpose ˜ be a of a vector or a matrix, is defined in the following way: Let (Ω, F, P) standard probability space, endowed with the n-dimensional Brownian motion ˜ := (W ˜ 1, . . . , W ˜ n )⊤ , W ˜ i := (W ˜ i )t≥0 , where W ˜ 0 ∈ Rn is constant, and the W t

3

˜ . The law of λ is n-dimensional random variable λ, which is independent of W denoted by ν, i.e., ˜ ∈ dx). ν(dx) := P(λ We alway assume ∫ (2.2) Rn

|z|ν(dz) < ∞

(or a stronger condition (4.1)). We call this probability space the reference ˜ (Ft )t≥0 ), where probability space. On (Ω, F, P, ˜ u ; u ∈ [0, t]), Ft := σ(W we define (2.3)

˜ t, dSt = diag(St )σ(t, St )dW

S0 ∈ Rn++ ,

where σ : R+ × Rn+ ∋ (t, y) 7→ σ(t, y) ∈ Rn×n satisfies c1 I ≤ σσ ⊤ (t, y) ≤ c2 I for any (t, y) ∈ R+ × Rn+ with some constants 0 < c1 < c2 and diag(x) denotes the diagonal matrix whose (i, i)-element is equal to the i-th element xi of x := (x1 , . . . , xn )⊤ ∈ Rn . The stochastic differential equation (abbreviated to SDE, hereafter) (2.3) has a unique strong solution, which implies that Ft ⊃ σ(Su ; u ∈ [0, t]). Moreover, we see that ∫ (2.4)

t

˜t = W ˜0 + W

{ } σ(u, Su )−1 diag(Su )−1 dSu − r1du ,

0

which implies that Ft ⊂ σ(Su ; u ∈ [0, t]). Hence, we deduce the relation (2.5)

Ft = σ(Su ; u ∈ [0, t])

for all t ≥ 0. We next define the filtration (Gt )t≥0 by (2.6)

Gt := Ft ∨ σ(λ),

and the probability measure P on (Ω, ∨t≥0 Gt ) that satisfies (2.7)

˜G dP|Gt = Zt dP| t

for each t ≥ 0, where { } 2 ˜t − W ˜ 0 ) − |λ| t . Zt := exp λ⊤ (W 2 We call P the real-world probability measure. By Cameron-Martin-MaruyamaGirsanov’s theorem, the process W := (Wt )t≥0 , given by (2.8)

˜ t − λt, Wt := W 4

is a (P, Gt )-Brownian motion. Combining (2.3) and (2.8), we obtain the Pdynamics of S, dSt = diag(St ) {µ(t, St )dt + σ(t, St )dWt } ,

S0 ∈ Rn++ ,

where we define the mean-return-rate vector µ(t, y) := r1 + σ(t, y)λ with 1 := (1, . . . , 1)⊤ ∈ Rn . Here, note that W and λ are independent under the real-world probability measure P since the increment Wt2 − Wt1 of the (P, Gt )Brownian motion W is independent of G0 = σ(λ). Also, note that P(λ ∈ A) = ˜ ∈ A) = ν(A) for any A ∈ B(Rn ). P(λ Remark 2.1. For agents having information (Ft )t≥0 , the market price of risk λ = {σ −1 (µ − r1)}(t, St ) of S, which is a G0 -measurable random variable, cannot be directly observed. It is a hidden variable, which has to be estimated. The law ν of λ is called the prior distribution of λ, and the conditional expectation ˆ t := E[λ|Ft ], λ

(2.9)

t ≥ 0,

where E[·] denotes expectation with respect to P, is called the Bayesian estimator of λ. From the Bayes rule, we see ˜ ˆ t = E[Zt λ|Ft ] , λ ˜ t |Ft ] E[Z

(2.10)

˜ ˜ The denominator of where we denote by E[·], expectation with respect to P. (2.10) can be expressed as ˜ t |Ft ] = F (t, W ˜t − W ˜ 0 ), E[Z

(2.11) where

( ) |z|2 ⊤ F (t, y) := exp z y − t ν(dz), 2 Rn ∫

(2.12)

˜ t ). So, we see that and the numerator of (2.10) is equal to ∇F (t, W ˆ t = ∇ log F (t, W ˜t − W ˜ 0 ). λ

(2.13)

˜ , which is expressed as (2.4), can be interpreted as “cumulative Remark 2.2. W Sharpe ratio of the market”: ∫ t { } ˜ t =W ˜0 + W σ(s, Ss )−1 diag(Ss )−1 dSs − r1dt 0 ∫ t 0 ˜0 + =W (“vol. matrix”)−1 s (“return of Ss ” − “return of Ss ”1). 0

5

Next, on this financial market, consider a self-financing investor whose available information flow is (Ft )t≥0 . The wealth process X x,π := (Xtx,π )t≥0 of the investor is defined by the SDE { n ( ) } n ∑ dS i ∑ dSt0 x,π x,π t i i (2.14) dXt = Xt πt i + 1 − πt , X0x,π = x, 0 S S t t i=1 i=1 where x ∈ R++ is the initial wealth of the investor and π := (π 1 , . . . , π n )⊤ , π i := (πti )t≥0 is a dynamic investment strategy of the investor, which is Ft adapted. For a given finite time horizon T ∈ R++ and initial wealth x ∈ R++ , consider the utility maximization of terminal wealth, U (T,γ) (x) := sup Eu(γ) (XTx,π ) ,

(2.15)

π∈AT

where

 1−γ x u(γ) (x) := 1 − γ  log x

(2.16)

if γ > 0, ̸= 1, if γ = 1

is the CRRA-utility function with Arrow-Pratt’s relative risk-aversion parameter γ and { } n-dimensional Ft -progressively measurable, AT := (ft )t∈[0,T ] ∫ T |ft |2 dt < ∞ a.s. 0 is the totality of admissible investment strategies. Remark 2.3. Similar market models with partial information, where unobservable random market price of risks are employed, are studied in Brennan and Xia (2001), Cvitani´c et. al. (2006), Karatzas (1997), Karatzas and Zhao (2001), Lakner (1995), Pham and Quenez (2001), Rieder and B¨auerle (2005), Xia (2001), and Zohar (2001), for example.

3

Exponential Growth of Optimal Power-utility with Constant λ

Before analyzing Bayesian utility maximization (2.15), in this section, we consider a special “non-Bayesian” situation: let ν(dx) := δλ0 (dx) be the Dirac’s delta measure, i.e., let λ :≡ λ0 ∈ Rn be a constant vector. Then, the solution to (2.15), which has been originally investigated by Merton (1969, 1971), is now well-known and described as follows: (γ)

(A) The optimal investment strategy π ˆ (γ) := (ˆ πt )t∈[0,T ] ∈ AT is given by (3.1)

(γ)

π ˆt

:=

1 1 (σσ ⊤ )−1 (µ − r1)(t, St ) = (σ ⊤ )−1 (t, St )λ. γ γ 6

ˆ t(γ) )t∈[0,T ] is written as (B) The associated optimal wealth process (X { ( ) } (γ) ˆ t(γ) := Xtx,ˆπ = x exp 1 λ⊤ (Wt + λt) + r − 1 |λ|2 t . (3.2) X γ 2γ 2 (C) The optimal utility is computed as ) ( ( (γ) ) 2 1 ˆ U (T,γ) (x) = Eu(γ) X = u(γ) xe(r+ 2γ |λ| )T . T

(3.3)

In this constant λ case, since the optimal strategy (3.1) is T -independent, we (γ) can re-define the investment strategy π ˆ (γ) := (ˆ πt )t≥0 by (3.1) and the wealth (γ) ˆ (γ) := (X ˆ t )t≥0 by (3.2) on the time interval [0, ∞). Let process X { } A := (ft )t≥0 ; (ft )t∈[0,T ] ∈ AT for all T > 0 . ˆ (γ) . We note that, for We then see the following long-term optimalities of X obtaining (II)–(IV) below, the exponential growth of optimal power-utility (3.3) (with γ ̸= 1) with respect to T is essential. (I) (Maximizing long-term growth rate). We have that, for any π ∈ A , 1 1 ˆ (1) = −Γ′ (1) log XTx,π ≤ lim log X T T →∞ T T →∞ T lim

a.s..

where

1 −Γ′ (1) := r + |λ|2 . 2 For the proof, see Theorem 3.10.1 of Karatzas and Shreve (1998) [8].

(II) (Maximizing long-term growth rate of expected power-utility). When 0 < γ < 1, we see that, for any π ∈ A , lim

T →∞

1 1 ˆ (γ) ) = Γ(γ). log Eu(γ) (XTx,π ) ≤ lim log Eu(γ) (X T T →∞ T T

When γ > 1, we see that, for any π ∈ A , 1 1 ˆ (γ) ) = Γ(γ). log Eu(γ) (XTx,π ) ≥ lim log Eu(γ) (X T T →∞ T T →∞ T lim

Here, we set

) 1 2 |λ| . Γ(γ) := (1 − γ) r + 2γ (

(III) (Maximizing long-term upside-chance large deviation probability). Let 0 < γ < 1. We have that, for any π ∈ A , ( ( ) ) 1 1 1 1 (γ) x,π ˆ lim log P log XT ≥ k(γ) ≤ lim log P log XT ≥ k(γ) , T →∞ T T →∞ T T T 7

where the target growth rate k(γ) > −Γ′ (1) is defined by (3.4)

k(γ) := −Γ′ (γ) = r +

1 |λ|2 . 2γ 2

For the details, see Pham (2003). (IV) (Minimizing long-term downside-risk large deviation probability). Let γ > 1. We have that, for any π ∈ A , ( ) ( ) 1 1 1 1 (γ) x,π ˆ lim log P log XT ≤ k(γ) ≥ lim log P log XT ≤ k(γ) , T →∞ T T T T →∞ T where the target growth rate r < k(γ) < −Γ′ (1) is defined by (3.4). For the details, see Hata et. al. (2010). Remark 3.1 (Kelly and fractional Kelly portfolios). The log-optimal portfolio π ˆ (1) is sometimes called the GOP (Growth Optimal Portfolio) by the property (I), or the Kelly portfolio. The latter name comes from the pioneer work by Kelly (1956) [10]: the optimality in (I) can be interpreted as a corollary of the result obtained in [10]. Also, the power-optimal portfolio π ˆ (γ) with the risk-aversion parameter γ > 1 is sometimes called the fractional Kelly portfolio, which has been proposed to decrease the “risky” features of the “full” Kelly portfolio π ˆ (1) : see Chapter IV and Section 27 of Chapter III of Maclean et. al. (2011) [12]. Note that the above (II) and (IV) characterize long-term optimalities of the fractional Kelly portfolio π ˆ (γ) (γ > 1).

4

Bayesian CRRA Utility Maximization

In this section, we introduce the solution to our Bayesian CRRA-utility maximization (2.15), which has been obtained in Karatzas and Zhao (2001), [9], in an essential form. For γ = 1, i.e., log-utility case, we see the following. Theorem 4.1 (Theorem 3.2, Example 3.3 and 4.4 of [9]). Assume ∫ (4.1) |z|2 ν(dz) < ∞. Rn

For any T, x ∈ R>0 , the following are valid. ˆ (T,1) := (X ˆ t(T,1) )t∈[0,T ] is given by 1. The optimal wealth process X (4.2)

ˆ t(T,1) = xert F (t, W ˜t − W ˜ 0 ), X

where we use (2.12). (T,1) 2. The optimal strategy π ˆ (T,1) := (ˆ πt )t∈[0,T ] that satisfies ˆ t(T,1) = Xtx,ˆπ X

(T ,1)

8

t ∈ [0, T ]

is given by (4.3)

(T,1)

π ˆt

:= (σ(t, St )⊤ )−1

(

)

∇F F

ˆt, ˜t − W ˜ 0 ) = (σ(t, St )⊤ )−1 λ (t, W

where we use (2.9). 3. The optimal expected utility is expressed as ˜ (T, W ˜T − W ˜ 0 ) log F (T, W ˜T − W ˜ 0) U (T,1) (x) = log x + rT + EF ∫ T 1 ˆ t |2 dt. = log x + rT + E |λ 2 0 Proof. The first two assertions are derived directly from Example 3.3 of [9]. To see the third assertion, we first deduce ˜ (T, W ˜T − W ˜ 0 ) log F (T, W ˜T − W ˜ 0 ) = E log F (T, W ˜T − W ˜ 0 ). EF Use Itˆo’s formula and (2.13) to see that ∫ t ∫ 1 t ˆ 2 ˆ ⊤ dW ˜t − W ˜ 0) = ˜ (4.4) log F (t, W λ − |λu | du u u 2 0 0 ∫ t ∫ t ˆ ⊤ dBu + 1 ˆ u |2 du, = λ |λ u 2 0 0 where



t

ˆ u du, λ

˜t − Bt := W

t≥0

0

is a (P, Ft )-Brownian motion by Cameron-Martin-Maruyama-Girsanov’s formula. Note that ∫ t ∫ t 2 ˆ u |2 du = E |λ duE |E[λ|Fu ]| ≤ tE|λ|2 < ∞ 0

0

for any t > 0. Hence, it follows that ˜t − W ˜ 0) = 1 E E log F (t, W 2



T

ˆ t |2 dt. |λ 0

Next, we consider a power-utility, which is more risk-averse than the logutility, i.e., employ (2.16) with (4.5)

γ ∈ (1, ∞).

To treat this situation, we introduce, for 0 ≤ t ≤ T < ∞, [ ] ˜ F (T, y + W ˜t − W ˜ 0 ) γ1 (4.6) G(T,γ) (t, y) :=E ∫ √ 1 1 |z|2 − 2 = F (T, y + tz) γ dz n e (2π) 2 Rn 9

where we use (2.12). Here, recalling that ( )∫ ( 1 2 t F (t, y) = exp |y| exp − z − 2t 2 Rn we see that



) ( ) y 2 1 2 |y| , ν(dz) ≤ exp t 2t

√ 1 tz) γ

|z|2 1 − 2 dz n e (2π) 2 Rn √ { } ∫ |z|2 |y + tz|2 1 − ≤ exp dz n 2γT 2 Rn (2π) 2 ( ) n2 { } γT |y|2 = exp , γT − t 2(γT − t)

F (T, y +

hence, the integral in (4.6) has a finite value. Moreover, we see that, for 0 ≤ t ≤ T < ∞, [ ] 1−γ ˜ (∇F · F γ )(T, y + W ˜t − W ˜ 0) ∇G(T,γ) (t, y) =E ∫ √ 1−γ |z|2 1 − 2 = (∇F · F γ )(T, y + tz) dz n e (2π) 2 Rn and that the above integral has a finite value. Indeed, when t = 0, these equalities are trivial, and, for 0 < t ≤ T , we deduce that ( )∫ ( ) ( ) 1 2 t y 2 1 2 ˜ |∇F (t, y)| ≤ exp |y| |z| exp − z − ν(dz) ≤ exp |y| E|λ| 2t 2 t 2t Rn and that



√ 1 1−γ |z|2 − 2 dz (∇F · F γ )(T, y + tz) n e (2π) 2 Rn √ { } ∫ 1 |y + tz|2 |z|2 ˜ ≤E|λ| − dz n exp 2γT 2 Rn (2π) 2 ( ) n2 { } γT |y|2 ˜ =E|λ| exp . γT − t 2(γT − t) We now see the following. Theorem 4.2 (Theorem 3.2, Example 3.5 and 4.6 of [9]). Assume (2.2) and (4.5). For any T, x ∈ R>0 , the following are valid. ˆ (T,γ) := (X ˆ t(T,γ) )t∈[0,T ] is given by 1. The optimal wealth process X ˜t − W ˜ 0) (T − t, W , G(T,γ) (T, 0)

(T,γ)

(4.7)

ˆ t(T,γ) = xert G X

where we use (4.6). 10

(T,γ)

2. The optimal strategy π ˆ (T,γ) := (ˆ πt

ˆ t(T,γ) = Xtx,ˆπ X

)t∈[0,T ] that satisfies

(T ,γ)

t ∈ [0, T ]

is given by (4.8)

(T,γ)

π ˆt

:= (σ(t, St )⊤ )−1

˜t − W ˜ 0) ∇G(T,γ) (T − t, W . ˜t − W ˜ 0) G(T,γ) (T − t, W

3. The optimal expected utility is expressed as { }γ (4.9) U (T,γ) (x) = u(γ) (xerT ) G(T,γ) (T, 0) . In [9], the so-called martingale method is employed to solve this problem with partial information. In Appendix, we describe a different solution method, using a standard dynamic programming. Example 4.1 (Gaussian Prior). This example treats a slight generalization of computations demonstrated in Cvitani´c et. al. (2006). Let ν ∼ N (l, L), i.e., { } 1 1 ⊤ −1 √ exp − (z − l) L (z − l) dz, ν(dz) := 2 (2π)n/2 det(L) where L ∈ Rn×n is a symmetric and positive definite covariance matrix and l ∈ Rn is a mean vector. Then, the Bayesian estimator (2.9), which is expressed as (2.13), is computed as (4.10)

ˆ t = ∇ log F (t, W ˜ t ). = (L−1 + tI)−1 (W ˜ t + L−1 l). λ

Indeed, we see F (t, y), given by (2.12), is computed as { 1 F (t, y) = exp (y + L−1 l)⊤ (L−1 + tI)−1 (y + L−1 l) 2 } 1 1 − l⊤ L−1 l − log det(I + tL) . 2 2 Let

Kt := (L−1 + tI)−1 = L(I + tL)−1 = (I + tL)−1 L

and define, for θ ∈ (0, 1), (T )

Pt

{ } −1 (θ) = KT tθ (I − tθKT ) + KT−1 KT .

We recall that I − tθKT =I − tθ(I + T L)−1 L =(I + T L)−1 {I + (T − tθ)L} ≥(I + T L)−1 > 0 11

(T )

for all t ∈ [0, T ] and θ ∈ (0, 1). So, Pt (θ) is a well-defined, symmetric and (T ) positive definite matrix. Also, we may notice that (Pt (θ))t∈[0,T ] solves the differential Riccati equation d P = θP 2 , dt

P0 = KT .

Using these functions, we deduce that (4.6) is calculated as (4.11)

G(T,γ) (t, y) [ θ (T ) = exp (L−1 l + y)⊤ Pt (θ)(L−1 l + y) 2

] } 1 θ { ⊤ −1 − l L l + log det(I + T L) − log det (I − tθKT ) , 2 2

where we set θ :=

1 . γ

Indeed, defining kt :=Kt L−1 l ⊤

−1

κt :=l L

and

Kt L−1 l − l⊤ L−1 l − log det(I + tL),

we see G(T,γ) (t, y) { ( )} 1 ⊤ 1 ⊤ = exp θ y KT y + kT y + κT 2 2 ) { ( } ∫ √ 1 1 2 t ⊤ ⊤ × θ z KT z + t(kT + KT y) z − |z| dz n exp 2 2 Rn (2π) 2 [ ( ) 1 1 1 ⊤ y KT y + kT⊤ y + κT − log det (I − tθKT ) = exp θ 2 2 2 ] tθ2 −1 −1 ⊤ −1 + (L l + y) KT (I − tθKT ) KT (L l + y) , 2 hence, the expression (4.11) is obtained from this calculation. Inserting (4.11) into (4.8) in Theorem 4.2 and combining this with (4.10), we obtain (T,γ)

π ˆt

( ) 1 (T ) ˜t − W ˜ 0 + L−1 l) = (σ ⊤ )−1 (t, St )PT −t γ1 (W γ ( ) 1 (T ) ˆt. = (σ ⊤ )−1 (t, St )PT −t γ1 (L−1 + tI)λ γ

12

5 5.1

Long-time Asymptotics Log-optimal Case

First, consider the log-utility case, assuming (4.1). Then, both the optimal wealth (4.2) and the optimal investment strategy (4.3) are T -independent. So, we re-define ˆ (1) := (X ˆ t(1) )t≥0 X by (4.2) and (1)

π ˆ (1) := (ˆ πt )t≥0 ˜ u ; u ≤ t). by (4.3). We may assume that Ft (t ≥ 0) is the P-completion of σ(W We then see the following. Proposition 5.1 (Example 5.1 of [9]). For any π ∈ A , it holds that (5.1)

1 1 ˆ (1) = r + 1 E|λ|2 . E log XTx,π ≤ lim E log X T T →∞ T T →∞ T 2 lim

Proof. From Proposition 4.1, we can deduce that lim

T →∞

1 1 ˆ (1) E log XTx,π ≤ lim E log X T T →∞ T T

holds for any π ∈ A . Using (4.4) and Fubini’s theorem, we can see that the right-hand-side of the above is equal to 1 r + lim T →∞ 2T

(5.2)



T

ˆ t |2 dt. E|λ 0

ˆ t , we can apply the martingale convergence Recalling the definition (2.9) of λ ˆ theorem to (λt )t≥0 to deduce that ˆ ∞ := lim λ ˆ t = E[λ|F∞ ] P-a.s., λ

(5.3)

t→∞

ˆ ∞ = λ, P-a.s.. Actually, from where F∞ := ∨t≥0 Ft . Moreover, we see that λ (2.8), we see that 1 1 ˜ Wt = Wt + λ. t t 1 ˜ From this, we deduce that limt→∞ t Wt = λ, P-a.s. since we have limt→∞ 1t Wt = 0, P-a.s. by the strong law of large numbers. Hence, the F∞ -measurability of λ follows. So, we deduce that (5.2) is equal to the right-hand-side of (5.1). Remark 5.1. We can also deduce that, for any π ∈ A , lim

T →∞

1 1 ˆ (1) = r + 1 |λ|2 log XTx,π ≤ lim log X T T →∞ T T 2

13

P-a.s..

Indeed, the above inequality follows from Theorem 3.1 of [8]. To derive the expression of the right-hand-side of the above, using (4.4), we write as 1 ˆ (1) = r + 1 MT + 1 ⟨M ⟩T , log X T T T 2T where we define



t

Mt :=

ˆ ⊤ dBu . λ u

0

We first deduce that 1 1 lim ⟨M ⟩T = lim T →∞ T T →∞ T



T

ˆ t |2 dt = |λ|2 |λ

P-a.s.,

0

ˆ t → λ, P-a.s. as t → ∞. We next deduce that where we recall λ lim

T →∞

MT ⟨M ⟩T 1 MT = lim = 0, T →∞ ⟨M ⟩T T T

where the strong law of large numbers for square-integrable martingales is applied, recalling limT →∞ ⟨M ⟩T = ∞ on {λ ̸= 0}. Remark 5.2. The portfolio π ˆ (1) ∈ A is sometimes called the Bayesian Kelly portfolio.

5.2

Power-optimal Case

We next consider power-utility case. In this subsection, in addition to (2.2) and (4.5), we assume that (5.4)

ν(dz) = fν (z)dz

with fν ∈ L∞ (Rn ), which is continuous at 0 ∈ Rn .

We define two functions, (

( ) |x|2 exp − , 2 ( ( ) n2 ∫ ) t t 2 ψ(t, x) := exp − |x − z| ν(dz), 2π 2 Rn ϕ(x) :=

1 2π

) n2

recalling that ( ˜ Eh

) ∫ ˜t W +λ = h(x)ψ(t, x)dx t Rn

14

for any bounded Borel measurable function h : R → R. Using these, we see that (5.5) G(T,γ) (t, y) √ √ )} γ1 ( ) {( ) n2 ( ∫ |y + tz|2 2π y + tz = exp ψ T, ϕ(z)dz 2γT T T Rn n ( ( ) 2γ ) n2 { } 2π γT |y|2 = exp T γT − t 2(γT − t) ( √ √ )1 2 ) ( ) n2 ∫ ( γT − t y + tz γ γT − t t × z− ψ T, exp − y dz. 2πγT 2γT γT − t T Rn Also, when t = T , we see that ( (5.6)

G(T,γ) (T, y) =

2π T

n ( ) 2γ

γ γ−1

) n2

{ exp

|y|2 2(γ − 1)T

} Ψ(γ) (T, y),

where we define Ψ(γ) (T, y) := ∫ ( Rn

γ−1 2πγT

) n2

(

2 ) ( )1 γ − 1 1 y+z γ exp − y dz. z− ψ T, 2γT γ−1 T

We deduce the following. 1

γ for (T, y) ∈ Lemma 5.1. (1) |ψ(t, y)| ≤ ∥fν ∥∞ and |Ψ(γ) (T, y)| ≤ ∥fν ∥∞ n R+ × R . (2) lim ψ(t, x) = fν (x) for each x ∈ Rn , if fν is continuous at x ∈ Rn .

t→∞

1

(3) lim Ψ(γ) (T, y) = fν (0) γ for each y ∈ Rn . T →∞

Proof. (1) The assertion is straightforward to see. (2) We see that ( lim ψ(t, x) = lim

t→∞

t→∞

1 2π

) n2 ∫

( ) ( ) 1 2 1 exp − |y| fν x − √ y dy = fν (x) 2 t Rn

by the dominated convergence theorem. (3) We see that 1 (γ) Ψ (T, y) − fν (0) γ ( 2 ) ( )n )1 ∫ ( 1 γ − 1 1 y+z γ γ−1 2 γ dz. y exp − z − ψ T, − f (0) ≤ ν 2πγT 2γT γ−1 T Rn By the dominated convergence theorem, the desired assertion follows. 15

With the help of this lemma, we obtain the following. Recall a notation in asymptotic analysis: we write “a(T ) ∼ b(T ) as T → ∞” when limT →∞ a(T )/b(T ) = 1 holds. Proposition 5.2. It holds that { (5.7)

U

(T,γ)

rT

(x) =u(γ) (xe

) {

∼u(γ) (xerT ) and that (5.8)

2π T 2π T

( (

γ γ−1 γ γ−1

)γ } n2 Ψ(γ) (T, 0)γ )γ } n2 fν (0)

as T → ∞

{

ˆ (T,γ) X T

( ) n 1 log 1 − + rT − log Ψ(γ) (T, 0) 2 γ )} ( ˜T − W ˜0 1 ˜ 1 W 2 ˜ + |WT − W0 | + log ψ T, . 2γT γ T

=x exp

Proof. (5.7) is derived from (4.9) and (5.6), using Lemma 5.1. (5.8) is computed from (4.7), (5.5), and (5.6). Remark 5.3 (Hyperbolic Growth). From (5.7), we see that ∂T log U (T,γ) (x) = (1 − γ)r −

n1 + ϵ(T ), 2T

where we set ϵ(T ) := γ∂T log Ψ(γ) (T, 0). The residual term ϵ(T ) is “smaller” than 1/T as T → ∞ in the sense that ∫ T lim ϵ(t)dt = log fν (0) − log Ψ(γ) (1, 0) < ∞, T →∞ 1 which is deduced from Lemma 5.1 (3). Remark 5.4 (Bayesian Fractional Kelly Portfolio). Let γ > 1. Define the (γ) Bayesian fractional Kelly portfolio π ˜ (γ) := (˜ πt )t≥0 ∈ A by (γ)

π ˜t

:=

1 ⊤ −1 ˆt. (σ ) (t, St )λ γ

As mentioned in Remark 3.1, from a practical point of view, this portfolio may be a candidate for long-term “risk-averse” investment. Write the associated ˜ (γ) := (X ˜ t(γ) )t≥0 , i.e., wealth process as X ˜ t(γ) := Xtx,˜π X

16

(γ)

t ≥ 0.

We can deduce that ˜ (γ) = log X ˆ (T,γ) + log G(T,γ) (T, 0) + γ − 1 log X T T 2γ 2 and that



T

ˆ t |2 dt |λ 0

} 1{ ˜ (γ) − log X ˆ (T,γ) = γ − 1 |λ|2 log X T T T →∞ T 2γ 2 lim

5.3

P-a.s..

Cost of Uncertainty

In this subsection, we evaluate a “cost of uncertainty of λ” in the long run, which is proposed and discussed in [9]: We consider an “inside” investor, whose available information flow is (Gt )t≥0 , where we use (2.6). Note that, for this insider, the market price of risk vector λ is observable at time 0 and so, it can be regarded as a constant. The insider’s CRRA-utility maximization is described as [ ] sup E u(γ) (XTx,π )|G0 , π∈ATG

where maximization is considered over the space ATG of n-dimensional Gt -progressively ∫T measurable processes (ft )t∈[0,T ] so that 0 |ft |2 dt < ∞ a.s. From (3.3), we deduce that ( ) [ ] 2 1 sup E u(γ) (XTx,π )|G0 = u(γ) xe(r+ 2γ |λ| )T . π∈ATG

Let

[ ¯ (T,γ)

U

(x) :=E

] [ ] x,π sup E u(γ) (XT )|G0

π∈ATG

)] ( 2 1 =E u(γ) xe(r+ 2γ |λ| )T [

be the expected optimal utility for the inside investor, and we are interested in ¯ (T,γ) (x). evaluating the ratio of two optimized expected utilities U (T,γ) (x) and U We then see the following. Proposition 5.3. It holds that ¯ (T,1) (x) U (T,1) (x) ∼ U

(5.9)

as T → ∞

and that, for γ > 1, ( (5.10)

U

(T,γ)

(x) ∼ U

¯ (T,γ)

(x)

γ γ−1

) (γ−1)n 2

as T → ∞.

We may interpret that, for log-utility-investors, “cost of uncertainty of λ” becomes negligible as T → ∞, while for power-utility-investors who are risk-averse than log-utility-investors, the “cost of uncertainty of λ” does not disappears even when T → ∞. 17

Proof. We see that, from Theorem 4.1, ( ) ¯ (T,γ) (x) log x + r + 21 E|λ|2 T U ) ( = U (T,γ) (x) log x + r + 1 ∫ T E|λ ˆ t |2 dt T 2T 0 ( ) 1 x + r + 12 E|λ|2 T log ( ) → 1 as T → ∞ = ∫T 1 1 ˆ t |2 dt log x + r + E| λ T 2T 0 ∫T ˆ t |2 dt → E|λ|2 as we see in Proof of Proposition 5.1. since limT →∞ T1 0 E|λ Hence, (5.9) follows. To obtain (5.10) with γ > 1, we write the optimal expected power-utility of the “insider” as ∫ ( rT ) γ−1 2 (T,γ) ¯ U (x) =u(γ) xe e− 2γ T |z| fν (z)dz. Rn

By Laplace’s method, we see that { lim

T →∞

T (γ − 1) 2πγ

} n2 ∫

e−

γ−1 2 2γ T |z|

Rn

fν (z)dz = fν (0).

Hence, it follows that ( ) ¯ (T,γ) (x) ∼ u(γ) xerT U

{

2π T

(

γ γ−1

)} n2

fν (0) as T → ∞.

Combining it with Proposition 5.2, we complete the proof.

References ¨ rk, T. and A. Murgoci (2010). A general theory of Markovian time [1] Bjo inconsistent stochastic control problems, preprint. [2] Brennan, M. and Y. Xia (2001). Assessing asset pricing anomalies. Review of Financial Studies, 14, 905–945. ´, J., A. Lazrak, L. Martellini, and F. Zapatero (2006). [3] Cvitanic Dynamic portfolio choice with parameter uncertainty and the economic value of analysts’ recommendations. Review of Financial Studies 19 (4), 1113–1156. [4] Ekeland, I., O. Mbodji, and T.A. Pirvu (2011). Time consistent portfolio management, preprint. [5] Hata. H., H. Nagai and S.J. Sheu (2010). Asymptotics of the probability minimizing a “down-side” risk, Annals of Applied Probability 20 (1), 52–89.

18

[6] Hayashi, T., H. Miyata, and J. Sekine (2012). On hyperbolic growth of optimal Bayesian power-utility, preprint. [7] Karatzas, I. (1997). Adaptive control of a diffusion to a goal, and an associated parabolic Monge-Ampere-type equation. Asian J. Mathematics 1, 324–341. [8] Karatzas, I. and S. Shreve (1998). Methods of Mathematical Finance. Springer-Verlag, Berlin. [9] Karatzas, I. and X. Zhao (2001). Bayesian adaptive portfolio optimization. Handbook of Mathematical Finance, Optimization, Pricing, InterestRates, and Risk Management. Cambridge University Press, 632–669. [10] Kelly, J. L., Jr. (1956). A new interpretation of information rate. Bell System Technical Journal 35, 917–926. [11] Lakner, P. (1995). Utility maximization with partial information. Stoch. Proc. and their Appl. 56, 247–273. [12] Maclean, L.C., Thorp, E.O. and Ziemba, W.T. (2011). The Kelly capital growth investment criterion. World Scientific. [13] Merton, R. C., (1969). Lifetime portfolio selection under uncertainty: the continuous time case. Review of Economics and Statistics, 51, 247–257. [14] Merton, R. C., (1971). Optimum consumption and portfolio rules in a continuous-time model. Journal of Economic Theory, 3, 373–413. [15] Miyata, H. (2012). Lifetime consumption problems under partial observations, preprint. [16] Pham, H. (2003). A large deviations approach to optimal long term investment. Finance and Stochastics, 7, 169–195. [17] Pham, H. and M.-C. Quenez (2001). Optimal portfolio in partially observed stochastic volatility models. Ann. Appl. Probab. 11(1), 210–238. ¨uerle (2005). Portfolio optimization with unob[18] Rieder, U. and N. Ba servable Markov-modulated drift process. Journal of Applied Probability, 42, 362–378. [19] Xia, Y. (2001). Learning about predictability: the effect of parameter uncertainty on dynamic asset allocation. Journal of Finance, 56 (1), 205– 246. [20] Zervos, M. (2008). Optimal consumption and investment with habit formation and hyperbolic discounting. Lecture in Workshop on “Finance and related mathematical and statistical issues”, CSFI, Osaka University. [21] Zohar, G. (2001). A generalized Cameron-Martin formula with applications to partially observed dynamic portfolio optimization. Mathematical Finance 11(4), 475–494. 19

A

Appendix: HJB Approach

In this appendix, we sketch a standard dynamic programming approach for solving Bayesian CRRA-utility maximization (2.15), which is different to the martingale method, employed in [9]. First, we reformulate (2.15), introducing a measure-change. Note that ˜ T u(γ) (X x,π ) = E ˜ Z˜T u(γ) (X x,π ), Eu(γ) (XTx,π ) = EZ T T where we write ˜ t |Ft ] = F (t, W ˜t − W ˜ 0) Z˜t := E[Z and use (2.7) and (2.11) since X x,π is Ft -adapted. Combining (2.1), (2.3) and (2.14), we see { } ˜ t , X x,π = x. dXtx,π = Xtx,π rdt + πt⊤ σ(t, St )dW 0 So,

] [ { 2 } γ ˜t . d(Xtx,π )1−γ = (Xtx,π )1−γ (1 − γ) r − σt⊤ πt dt + (1 − γ)πt⊤ σt dW 2 Hence, we have { } ∫ γ(1 − γ) T ⊤ 2 x,π 1−γ 1−γ (1−γ)rT ⊤ =x e MT ((1 − γ)σ π) exp − (XT ) σt πt dt , 2 0 where we define (A.1)

(∫ Mt (α) := exp

t

˜u − αu⊤ dW

0

1 2



t

) 2 |αu | du .

0

(1) UT

Let be the totality of n-dimensional progressively measurable process p := ∫T (pt )t∈[0,T ] on the time-interval [0, T ] so that 0 |pt |2 dt < ∞ a.s. and that (1) ˜ (α) on EMT ((1 − γ)α) = 1. For α ∈ UT , we define the probability measure P T (Ω, FT ) by the formula ˜ (α) dP T := Mt ((1 − γ)α), t ∈ [0, T ]. ˜ F dP t (α)

˜t By Cameron-Martin-Maruyama-Girsanov’s theorem, the process (W defined by ∫ t (α) ˜ ˜ Wt := Wt − (1 − γ) αu du,

)t∈[0,T ] ,

0

˜ (α) , Ft )-Brownian motion. Recall that, when α := σ ⊤ π ∈ is an n-dimensional (P T (1) UT , we have (A.2)

log ET (XTπ )1−γ = (1 − γ)(log x + rT ) } { ∫ T γ(1 − γ) (α) 2 ˜ exp log F (T, W ˜T − W ˜ 0) − |αt | dt , + log E T 2 0

20

˜ (α) (·) denotes expectation with respect to P ˜ (α) . where E T T We now consider, for 0 ≤ t ≤ T < ∞, (A.3) V¯ (T ) (t, y) :=

{ } ∫ γ(γ − 1) T −t (α) (y) 2 ˜ ˜ inf log ET exp log F (T, YT −t − W0 ) + |αs | ds , α∈UT −t 2 0 (1)

(y)

where we set a suitable UT −t , a subset of UT −t , and define the process (Ys )s∈[0,T −t] by ˜ s(α) , Y (y) = y. dYs(y) = (1 − γ)αs ds + dW 0 The associated HJB equation is written down as { } ) γ(γ − 1) 2 1( |α| , −∂t V = ∆V + |∇V |2 + infn (1 − γ)α⊤ ∇V + α∈R 2 2 (A.4) ˜ V (T, y) = log F (T, y − W0 ). Here, we see

2 γ(γ − 1) 2 γ(γ − 1) 1 γ−1 2 (1 − γ)α ∇V + |α| = |∇V | . α − ∇V − 2 2 γ 2γ ⊤

So, the minimizer in (A.4) is given by α ¯ :=

1 ∇V γ

and (A.4) is rewritten as 1 1 −∂t V = ∆V + |∇V |2 , 2 2γ ˜ 0 ). V (T, y) = log F (T, y − W

(A.5) 1

Noting that L := e γ V satisfies 1 ˜ 0 ) γ1 , ∆L, L(T, y) = F (T, y − W 2 we deduce the expression for the solution to (A.5) [ ] ˜ F (T, W ˜T − W ˜ 0 ) γ1 W ˜t = y . V (t, y) = γ log E −∂t L =

From (A.2) and (A.3), we see that the relation { } ˜ 0) U (T,γ) (x) = u(γ) (xerT ) exp V¯ (T ) (0, W holds. So, we can deduce the expression ]}γ { [ ˜ F (T, W ˜T − W ˜ 0 ) γ1 U (T,γ) (x) = u(γ) (xerT ) E , which is nothing but the representation (4.9). After demonstrating the so-called verification steps, we can establish all assertions in Theorem 4.2. 21

Bayesian Optimal Power-utility Grows Hyperbolically in ...

Mar 5, 2012 - (see (3.3)), i.e., this optimal non-Bayesian power-utility grows exponentially ... standard probability space, endowed with the n-dimensional ...

162KB Sizes 4 Downloads 111 Views

Recommend Documents

Optimal Bayesian Hedging Strategies
Agent will typically want to use model to price and hedge an instrument but before she can ..... delta hedge when true underlying is Heston stochastic volatility.

Bayesian-Optimal Image Reconstruction for ...
May 12, 2008 - marginalized likelihood, is called free energy. In the present ..... control which part of the frequency domain is amplified by the filter. .... 100. 102. 104. (a). (b). Fig. 6. Optimal filter parameter for different noise and smoothne

A Bayesian approach to optimal monetary policy with parameter and ...
This paper undertakes a Bayesian analysis of optimal monetary policy for the United Kingdom. ... to participants in the JEDC conference and the Norges Bank conference, ... uncertainty that confront monetary policy in a systematic way. ...... 2 call f

A Bayesian approach to optimal monetary policy with parameter and ...
more useful communication tools. .... instance, we compare micro-founded and non micro-founded models, RE vs. non-RE models, .... comparison with the others. ...... Kimball, M S (1995), 'The quantitative analytics of the basic neomonetarist ...

Implicit Regularization in Variational Bayesian ... - Semantic Scholar
MAPMF solution (Section 3.1), semi-analytic expres- sions of the VBMF solution (Section 3.2) and the. EVBMF solution (Section 3.3), and we elucidate their.

Probabilistic inferences in Bayesian networks
tation of the piece of evidence that has or will have the most influence on a given hypothesis. A detailed discussion of ... Causal Influences in A Bayesian Network. ... in the network. For example, the probability that the sprinkler was on, given th

Bayesian sampling in visual perception
more, we show that attractor neural networks can sample proba- bility distributions if input currents add linearly ...... Sundareswara R, Schrater PR (2008) Perceptual multistability predicted by search model for Bayesian decisions. .... the text, we

Bayesian reasoning in science
Consider a contract whereby one party agrees to exchange ... p:(1 — p) fair, in the sense that to the best of .... Modern science deals extensively with statistical ...

United eXtra Electronics grows eCommerce ... Developers
Push Notifications for Android web users visiting their mobile website. These. Push Notifications sent via the mobile web look and feel exactly like native app.

Guggenheim grows beyond its insurance roots
Aug 5, 2013 - an core fixed-income manager, eVestment data show. Its five-year numbers ... The CIO is not worried about growing too big. He insists .... manager universe Comparison*: ... The EA institutional investment database consists of.

OPTIMAL PARAMETER SELECTION IN SUPPORT ...
Abstract. The purpose of the paper is to apply a nonlinear programming ... convex optimization, large scale linear and quadratic optimization, semi-definite op-.

Optimal Content Downloading in Vehicular Networks.pdf ...
by M2M (P2P) the links from a moving (parked) vehicle. to another, and by ... an already cached item, according to a Poisson distribution. with rate λ. We also ...

Optimal Transmission Regulation in Restructured ...
Nov 3, 2013 - ity supply into three categories: power producers (generators), power deliverers (transmission firms), and ... up the generator and retailer functions to entry and competition while keeping the separate ...... find a statistically signi

OPTIMAL PARAMETER SELECTION IN SUPPORT ...
Website: http://AIMsciences.org ... K. Schittkowski. Department of Computer Science ... algorithm for computing kernel and related parameters of a support vector.

Determining Optimal Parameters in Magnetic ...
difficulties, we present a solution approach based on derivative-free optimization. ..... D.R. Jones, “DIRECT global optimization”, in Encyclopedia of Optimization, ...

Guggenheim grows beyond its insurance roots
Aug 5, 2013 - such as energy or aviation, while the portfolio manager focuses on ... first place, said Josh Charney, alternatives investment strategist at.

Optimal Fees in Internet Auctions
Jan 14, 2008 - Keywords: Internet auctions; auctions with resale; auction house; ... by such giant commercial institutions as eBay, Yahoo, Amazon, and others.

Optimal information transmission in organizations
upon this characterization to address a design problem: the determination of ... (a star-like organization), any problem could reach its solution in, at most, two ...

Trust Grows From Faith -
higher experiences I choose to live the life of a meditator with purity in thought ... appearance is beautiful for others, we forget to make up our minds. This is not ...

MakeupAlley.com improves user experience, grows ... Services
affiliate programs, and has been using Google AdSense since 2003. Enhancing user experience with a light landing ... of this experiment, the MakeupAlley.com team set up Google Analytics Custom. Variables to track and compare visitor activity, .... a

Optimal Taxation in Life-Cycle Economies - ScienceDirect
May 31, 2002 - System Macro Meeting in Cleveland, as well as James Bullard and Kevin Lansing ... Key Words: optimal taxation; uniform taxation; life cycle.