Semiparametric regression for the mean and rate functions of recurrent events Chihyun Lee April 1, 2013

1

Introduction

In the analysis of recurrent event data, research interest may either lie on the gap times between successive events or on the mean and rate of events. This paper [5] focuses on the latter. Conventionally, counting process model with Cox [2] type of intensity functions introduced by Andersen and Gill [1] which is of the following form,

λZ (t) = exp{β0T Z(t)}λ0 (t), 1

(1)

have been assumed often for recurrent events. This model, however, requires an accurate specification of the dependence of recurrent events within subject. Thus, Pepe and Cai [6], Lawless and Nadeau [3], and Lawless et al. [4] have studied the following mean function model E{dN ∗ (t)|Z(t)} = dµZ (t) = exp{β0T Z(t)}dµ0 (t)2

(2)

which is more versatile than (1), allowing unknown dependence structure and being flexible to any counting process. A simple illustration of this point is provided in the paper. This works in the same fashion as the GEE approach where unknown correlation or unmeasured 1 2

All notations follow the notations used in class unless specified Let N ∗ (t) be the number of events that occur in [0, t].

1

dependence structure between outcomes are used. It is known to provide robust standard error estimates. In this paper, rigorous justification of using model (2) in inferences on the regression parameters is provided. In addition, model checking techniques are illustrated as well as methods of constructing simultaneous bands for µ0 (·).

2

Inferences on the regression parameters

Inferences on the regression parameters under two different model assumptions are compared in this paper. Under the intensity function assumption (1), the solution of the partial likeP Rτ lihood score function for β0 , U (β, τ ) = ni=1 0 {Zi (u) − ε(β, u)}dNi (u), βˆ becomes the estiRτ mates, and the covariance matrix is A−1 where A ≡ E[ 0 {Z(t)−¯ ε(β0 , t)}⊗2 Y (t)exp{β0T Z(t)}dµ0 (t)] and ε¯(β0 , t) is the limit of ε(β0 , t). This is called the naive method. For arbitrary counting processes, model (1) cannot be used. The distribution of U (β0 , τ ) = Rt Pn R τ {Z (u) − ε(β, u)}dM (u) where M (t) = N (t) − Y (u)exp{β0T Zi (u)}dµ0 (u) should i i i i i=1 0 0 i be derived through model (2). The limiting distribution of n−1/2 U (β0 , t) has mean zero and i hR Rt s T covariance matrix, Σ(s, t) = E 0 {z1 (u) − ε¯(β0 , u)}dM1 (u) 0 {Z1 (ν) − ε¯(β0 , ν)} dM1 (ν) . It is shown that the covariance matrix of βˆ is asymptotically normal with the sandwich covariance matrix, Γ ≡ A−1 ΣA−1 where Σ = Σ(τ, τ ). This covariance matrix Γ can be ˆ Aˆ−1 and refer as the robust estimator. ˆ ≡ Aˆ−1 Σ estimated by Γ

3

Inferences on the mean function

Let µ0 (t) be estimated by µ ˆQ (t) which is derived by incorporating a random weight function Q into the score function. Define V (t) ≡ n1/2 {ˆ µQ (t)−µ0 (t)}. The convergence of V (t) to the zero mean Gaussian process shows the asymptotic normality of µ ˆQ (t), and the estimates with

2

the derived consistent variance estimator enables the construction of pointwise confidence interval. The paper introduce a simple trick of applying a log transformation in the procedure as µ0 (t) is non-negative. In the construction of the simultaneous confidence bands, the supremum of V (t) needs to be evaluated. However, it is impossible to express the distribution analytically. Thus, the approximation of V (t), V˜ (t) was used to generate a number of realization. With the ˜

(t) realizations, we can find the value of cα/2 that satisfies P r{sup| Vξˆ1/2 | < cα/2 } = 1 − α where

ξˆ1/2 is the standard error estimator. Then, the confidence band for µ0 (t) can be easily established using the log-transformation as for the confidence interval.

4

Model checking techniques

As the recurrent event data is correlated within subjects, the difference between the observed ˆ i (t) is not martingales, so the martingale central limit and the predicted numbers of events, M theorem cannot be directly applied. Certain values are defined and the null distribution of those values are derived in the aim of observing unusual patterns. This paper presents model checking methods for 4 different things, i.e. the functional forms of the covariates, the exponential link function of model (2), the proportional rates or means assumption with respect to the jth covariate component, and the overall fit of the model. For all four model checking procedures, numbers are generated from the derived null distribution of defined values under each case. Then, evaluations on how extreme the actually observed values are compared to the generated values can be made. This is considered as a formal way of checking the adequacy of model because p-values can be obtained computing the probability of the unusual values.

3

5

Conclusion

If the dependence structure is complex or unknown, the intensity function model (1) may not be applicable since the specification of the dependence of recurrent events is impossible. The robust rate and mean model (2), however, can be used. This proposed method remains valid whether or not the dependence structure is correctly modelled. Simulation studies conducted in the paper show that the robust method of estimating variance assuming (2) presents more accurate estimation than the naive variance estimator which results in underestimated values. Also the coverage probabilities of the robust estimator seem to be more reasonable as it is close to 0.95. The robust estimator is applied to the real data of the CGD study, and the results are consistent to the simulation results. In the appendix of this paper, underlying asymptotic theory of the proposed methods are rigorously provided.

References [1] Andersen, P. K. and Gill, R. D. (1982) Cox’s regression model for counting processes: a large sample study. Annals of Statistics 10, 1100 - 1120. [2] Cox, D. R. (1972) Regression models and life-tables (with discussion). Journal of the Royal Statistical Society: Series B 34, 187 - 220. [3] Lawless, J. F. and Nadeau, C. (1995) Some simple robust methods for the analysis of recurrent events. Technometrics, 37 158 - 168. [4] Lawless, J. F., Nadeau, C. and Cook, R. J. (1997) Analysis of mean and rate functions for recurrent events. In Proc. 1st Seattle Symp. Biostatistics: Survival Analysis (eds D. Y. Lin and T. R. Fleming), pp. 37 - 49. New York: Springer. [5] Lin, D. Y., Wei, L. J., Yang, I., and Ying, Z. (2000) Semiparametric regression for the 4

mean and rate functions of recurrent events. Journal of the Royal Statistical Society: Series B, 62 711 - 730. [6] Pepe, M. S. and Cai, J. (1993) Some graphical displays and marginal regression analyses for recurrent failure times and time dependent covariates. Journal of the American Statistical Association 88, 811 - 820.

5

Semiparametric regression for the mean and rate ...

Apr 1, 2013 - E{dN∗(t)|Z(t)} = dµZ(t) = exp{βT. 0 Z(t)}dµ0(t)2. (2) which is more .... Biostatistics: Survival Analysis (eds D. Y. Lin and T. R. Fleming), pp. 37 - 49.

182KB Sizes 1 Downloads 163 Views

Recommend Documents

Real Exchange Rate Volatility and Mean Reversion
Adopting such working hypothesis, we implement panel data methods of East Asian versus Latin .... Koray (2001) use data from the Bretton Woods and the modern floating periods and find that real ...... HL takes into account the b (1) correction factor

Nonparametric/semiparametric estimation and testing ...
Mar 6, 2012 - Density Estimation Main Results Examples ..... Density Estimation Main Results Examples. Specification Test for a Parametric Model.

EFFICIENCY BOUNDS FOR SEMIPARAMETRIC ...
Nov 1, 2016 - real-valued functions on Rk. Assume that we are given a function ψ which maps Rp ×Rk into Rq with ..... rt = Λft + ut,. (6) with Λ = (λ1,λ2) ∈ R2, ft a common factor that is a R-valued process such that E(ft|Ft−1) = 0 and. Var

Identification and Semiparametric Estimation of ...
An important insight from these models is that plausible single-crossing assump- ...... in crime and commuting time to the city center in estimation using a partially.

Semiparametric forecast intervals
May 25, 2010 - include quarterly inflation fan charts published by the Bank of ... in Wiley Online Library ... Present value asset pricing models for exchange rates (e.g., Engel .... has also shown that the ANW estimator has good boundary ...

Nonparametric/semiparametric estimation and testing ...
Mar 6, 2012 - Consider a stochastic smoothing parameter h with h/h0 p−→ 1. We want to establish the asymptotic distribution of ˆf(x, h). If one can show that.

SAMPLING ALGORITHMS AND CORESETS FOR lp REGRESSION
Define the random variable Xi = (Tii|Ai⋆xopt −bi|)p, and recall that Ai⋆ = Ui⋆τ since ... Thus, since Xi − E[Xi] ≤ Xi ≤ |Ai⋆xopt − bi|p/qi, it follows that for all i such.

SAMPLING ALGORITHMS AND CORESETS FOR lp REGRESSION
regression problem for all p ∈ [1, ∞), we need tools that capture the geometry of lp- ...... Define the random variable Xi = (Tii|Ai⋆xopt −bi|)p, and recall that Ai⋆ =.

Sampling Algorithms and Coresets for lp Regression
Email: [email protected]. ‡Computer Science, University of Pennsylvania, Philadelphia,. PA 19107. Work done while the author was visiting Yahoo! Research. Email: [email protected] ficient sampling algorithms for the classical ℓp regres- sion p

The Bootstrap for Regression and Time-Series Models
Corresponding to a bootstrap resample χ∗ is a bootstrap replication ...... univariate bootstrap estimation of bias and variance for an arbitrary statistic, theta. It.

A Semiparametric Test of Agent's Information Sets for ...
Mar 15, 2012 - Both the assumptions of independent and correlated private .... In particular, our second test relaxes the independence of private shocks, which ...

A Semiparametric Test of Agent's Information Sets for ...
Mar 15, 2012 - analyst can recover from the data as a function of observable covariates coincide with the ... We apply our test to data on entry in the US airline.

(EBMAL) for Regression
†Human Research and Engineering Directorate, U.S. Army Research Laboratory, Aberdeen Proving Ground, MD USA ... ¶Brain Research Center, National Chiao-Tung University, Hsinchu, Taiwan ... Administration (NHTSA) [36], 2.5% of fatal motor vehicle ..

Identification and Semiparametric Estimation of Equilibrium Models of ...
Research in urban and public economics has focused on improving our under- standing of the impact of local public goods and amenities on equilibrium sort- ing patterns of households.1 These models take as their starting point the idea that households

pdf-1831\mean-monthly-seasonal-and-annual-pan-evaporation-for ...
... apps below to open or edit this item. pdf-1831\mean-monthly-seasonal-and-annual-pan-evapor ... oaa-technical-report-nws-by-richard-k-farnsworth.pdf.

Mean-shift and hierarchical clustering for textured ...
Sci. & Software Eng. Dept., Laval Univ., Quebec City, Que., Canada . Touzi, R.‪‬ ... collective works, for resale or redistribution to servers or lists, or reuse of any.

Semiparametric Estimation of the Random Utility Model ...
Apr 15, 2017 - ... the consistent estimation of the ratios of coefficients despite stochastic mis- ... is asymptotically normal, meaning that it is amenable to the ...

heat exchanger mean temperature differences for ... - CiteSeerX
performance possibilities, but the solution appears to lie with synthetic mixtures. ... In the analysis of heat exchangers, the Log Mean Tempera- ture Difference ...

Confidence Sets for the Aumann Mean of a Random ... - CiteSeerX
indeed enough to observe their support lines on a grid of directions); this .... Section 4 presents an application to simulated data and Sec- .... Z (ui) , ui ∈ Sd−1; ...

Confidence Sets for the Aumann Mean of a Random ... - CiteSeerX
Almost the same technique can be used to obtain a confidence region for a set ..... Hajivassiliou V., McFadden D.L., Ruud P.: Simulation of multivariate normal rectangle ... Ihaka R., Gentleman R.: R: a language for data analysis and graphics.

mean for digital photographers? - Lightroom Killer Tips
BIT-DEPTH FILES POTENTIALLY CONTAIN MUCH MORE INFORMATION THAN 8-BIT FILES. -. -. Laura Shoe is an Adobe Community Professional and Adobe ...

On the horizontal mean curvature flow for axisymmetric ...
Aug 7, 2013 - We study the horizontal mean curvature flow in the Heisenberg group by using the level- set method. We prove the uniqueness, existence and stability of axisymmetric viscosity solutions of the level-set equation. An explicit solution is

Deterministic Performance Bounds on the Mean Square Error for Near ...
the most popular tool [11]. However ... Date of publication November 27, 2012; ... of this manuscript and approving it for publication was Dr. Benoit Champagne.