A Constrained Quadratic Spline as a Model for the Cumulative Hazard Function V.V. Krivtsov*, I.V. Kolmanovsky, T.P. Davis Ford Motor Co., MD 606, 14555 Rotunda Drive, Dearborn, MI 48120

Abstract Probability distributions traditionally used in reliability analysis (e.g., exponential, Weibull, lognormal, normal, gamma and inverse Gaussian) do not always provide enough flexibility to model real world lifetime data. Considered in this paper is a quadratic spline with a single free knot as a model of the cumulative hazard function for increasing failure rate distributions. A parsimonious (essentially closed form) solution with a simple numerical search is proposed for estimating spline parameters. Automotive data examples are discussed as an illustration. Keywords: cumulative hazard function, spline models. Introduction Most parametric modeling work in lifetime data analysis relates to (log-)location-scale distributions and some other suitable distributions such as gamma and inverse Gaussian. Despite (or, arguably, because) of their parsimonious structure, these distributions do not provide enough flexibility in some real world data scenarios. Models, in which the hazard function (or some transform of it) is a low-order polynomial, may provide a more reasonable fit to data in these situations. Notable particular cases of the polynomial hazard models include the Rayleigh distribution with the linear hazard function and the Gompertz distribution with the log-linear hazard function. An alternative way to achieve flexibility in modeling hazard functions is to use so-called spline functions, which are comprised of polynomial pieces connected at special points referred to as knots. Often, spline parameters need to be constrained in order to keep the hazard function non-negative. Rosenberg (1995) considers various spline models for hazard functions and Kooperberg et al. (1995) for

*Corresponding author. Phone/Fax: 313-323-8711. E-mail: [email protected] (Vasiliy Krivtsov) 1

the logarithm of the hazard function. Lawless (2003) discusses applications of cubic splines for a wide range of lifetime data. Working with automotive warranty data, we noticed that a spline of the form 2b1t ,

h( t ) =

0≤t≤k

2b1t + 2b2 ( t − k ), t > k

with b1, b2, k > 0 fits quite well for many data sets exhibiting a linearly increasing hazard function. As an example, in Figure 1, consider the cumulative hazard function for an automotive chassis component data. It is evident from the figure that the proposed spline provides a better fit to the data than some common probability distributions estimated via a non-linear least square method. 0.006 Nonparameteric (actual data) Quadratic Spline

0.005

H(t)

0.004

Normal

0.003

Weibull Lognormal

0.002 0.001 0 0

10000

20000

30000

40000

50000

t, miles

Figure 1. Nonparametric estimate of cumulative hazard function versus normal, lognormal, Weibull and the quadratic spline fits. For estimation convenience, we will proceed with the respective cumulative hazard function, which takes the form of a quadratic spline with a single free knot: H( t ) =

b1t 2 ,

0≤t≤k

b1t + b2 ( t − k ) 2 , 2

t>k

where b1, b2 and k > 0 need to be estimated to best fit the nonparametric estimates of H(t) available in the form of (ti; yi), i = 1, … , N; 0 ≤ t1 < t 2 < … < t N . Considered in this paper is a restricted least2

squares procedure limited to point estimation only. (Due to considerably large sample sizes of field data, the respective interval estimation is of a lesser interest.) The rest of the paper is structured as follows. First, the objective function that corresponds to the sum of squared residuals is analyzed and decomposed into a sum of fixed cost (which is not affected by the values of b 1 , b 2 and k) and variable cost (which is affected by the values of b 1 , b 2 and k). Then, we analyze the monotonicity and convexity constraints and translate them into conditions on b 1 , b 2 and k. The optimization is reduced to a two-stage process, where in the inner stage b 1 and b 2 are optimized using Lagrange multiplier rule (the closed form solution to the inner stage problem is given in a separate section) and in the outer stage the value of k is optimized via one dimensional grid search. In the final section, we consider practical examples illustrating applications examples of the proposed model. Objective Function Let n(k)=max{i : ti ≤ k}. In the least-squares fit, our objective is to minimize the sum of squares: f=

n(k ) i =1

(b1t i2 − y i ) 2 +

N

(b1t i2 + b2 (t i − k ) 2 − y i ) 2

i = n ( k ) +1

After algebraic transformations we obtain, f = ff + fv , where ff =

N

yi2

,

i =1

is the fixed cost (independent of k, b 1 , b 2 ) and fv = β 1 b 22 + β 2 b 1 b 2 + β 3 b 1 + β 4 b 1 + β 5 b 2 ,

β1 = β2 =

N

N i =1

t i4 ,

(t i − k ) 4 ,

i = n ( k ) +1 N

β3 = 2

t i2 ( ti − k )2

i = n( k )+1

β 4 = -2

N i =1

3

y i t i2 ,

,

min.

β 5 = -2

N

yi (t i − k ) 2 ,

i = n ( k ) +1

is the variable cost. In the sequel, we analyze the problem of minimizing fv with respect to b1 and b2 for a fixed k, subject to the monotonicity constraints discussed in the following section. This optimization problem can be solved essentially in closed form, resulting in the optimal cost fmin(k), see Section 4. The function fmin is then minimized with respect to k via one-dimensional grid search to find an optimal value of k. The grid search can be replaced by a more efficient, interval sub-division algorithm (Fibonacci division or Golden Section), but this assumes that fmin is unimodal, which may not always be the case. Constraints

The constraint that H(t) be monotonically non-decreasing in interval [0, k u ] (where k u may be + ∞ ) translates into non-negativity condition for the derivative: 2b 1 t ≥ 0; if t ≤ k, 2b 1 t + 2b 2 (t-k) ≥ 0, if t>k. Since the derivative is a piecewise linear function in t, it is sufficient to enforce the constraint at t=0, k, k u that results in b 1 ≥ 0, b 1 k u + b 2 (k u - k) ≥ 0, If the monotonicity is desired on an infinite time interval [0, + ∞ ] (i.e., k u

+ ∞ ), then the constraints

become b 1 ≥ 0, b 1 + b 2 ≥ 0, To summarize both of these cases, the constraints can be written in the form: b 1 ≥ 0, α 1 b 1 + α 2 b 2 ≥ 0, where α 1 = k u , α 2 =(k u - k) if k u < + ∞ , k u > k > 0; and α 1 = 1, α 2 = 1 if k u = + ∞ . The constraint that H(t) be convex translates into a requirement that the second derivative be nonnegative, i.e., b 1 ≥ 0, b 1 + b 2 ≥ 0. This is the same as the monotonicity requirement on the infinite time interval and thus can be enforced by setting k u =+ ∞ . Lagrange Multiplier Rule Based Optimization of b1 and b2 (Inner Stage)

The problem of minimizing fv with respect to b 1 and b 2 for a fixed k subject to the constraints b1 ≥ 0,

4

α 1 b 1 + α 2 b 2 ≥ 0 is a constrained optimization problem (in fact, it is a quadratic programming problem). To solve it we use the Lagrange multiplier rule. The Lagrangian function has the form L = λ0 fv+λ1 (-b 1 )+λ2 (-α 1 b 1 -α 2 b 2 ), where λ 0 , λ 1 , λ 2 are Lagrange multipliers. The optimal b 1* , b *2 must satisfy the following conditions for some λ 0 ∈ {0, 1}, λ 1 ≥ 0, λ 2 ≥ 0, λ 0 +λ 1 +λ 2 >0, ∂L ∂L |b =b* ,b =b* = 0, | * * = 0, ∂b1 1 1 2 2 ∂b2 b1 =b1 ,b2 =b2

and

λ 1 (− b 1* ) = 0, λ 2 ( α 1 b 1* + α 2 b *2 ) = 0, Note that ∂L = λ0 (2 β 1b1 + β 3 b2 + β 4 ) − λ1 − α 1λ 2 , ∂b1 ∂L = λ0 (2 β 2b2 + β 3b1 + β 5 ) − α 2 λ 2 . ∂b2

In the sequel, we analyze in more detail different cases that these conditions generate. Each of the cases results in a potential candidate solution; the final solution is selected from these candidate solutions. It is the one that minimizes fv while satisfying the constraints b 1 ≥ 0, α 1 b 1 + α 2 b 2 ≥ 0. Case 1: λ 0 = 1; λ 1 > 0; λ 2 > 0 In this case, b 1* = 0, b *2 = 0 is the candidate solution. Case 2: λ 0 = 1; λ 1 = 0; λ 2 > 0 In this case, α 1 b 1* +α 2 b *2 =0. Since α 1 > 0, b 1* = -

α 2 b2* . From α1

∂L = 0 , i = 1,2 , and ∂bi

system of two linear equations in b *2 , λ 2 : ( 2 β1 ( − (2β 2 (−

α2 ) + β 3 )b2* − α 1λ 2 = − β 4 , α1

α2 α ) − β 3 2 )b2* − α 2 λ 2 = − β 5 . α1 α1

α 2 b2* These linear equations are solved for b , λ 2 and then b = . α1 * 2

* 1

5

λ 1 = 0, we obtain a

When solving this system of linear equations, it may turn out (as a rare possibility) that the determinant is zero (i.e., the system is singular). Although the analysis can be continued, in the actual code, we simply discard the candidate solution for this case. Case 3: λ0 = 1 , λ1 > 0 , λ 2 = 0

In this case, b1* = 0 and the two linear equations

β 3 b2* − λ1 = − β 4 , 2 β 2 b2* + 0λ1 = − β 5 ,

are solved for b2* , λ0 . Clearly, b1* = 0 , b2* = −

β5 is the candidate solution. 2β 2

Case 4: λ0 = 1 , λ1 = 0 , λ 2 = 0

In this case, the following two linear equations are solved for b1* , b2* 2 β 1b1* + β 3b2* = − β 4 , 2 β 2 b2* + β 3b1* = − β 5 ,

to obtain a candidate solution. Case 5: λ0 = 0 , λ1 > 0 , λ 2 > 0

In this case, b1* = 0 , α 1b1* + α 2 b2* = 0 implying b2* = 0 form the candidate solution. Case 6: λ0 = 0 , λ1 = 0 , λ 2 > 0

In this case b1* = −

α 2 b2* ∂L and = −λ1 − α 1λ 2 = 0 . Hence, λ 2 = 0 which is not possible, since α1 ∂b1

λ0 + λ1 + λ 2 > 0 . This case does not provide candidate solutions. Case 7: λ0 = 0 , λ1 > 0 , λ 2 = 0

In this case,

∂L = 0 and, hence, λ1 = 0 , which is not possible. This case does not provide candidate ∂b1

solutions. Case 8: λ0 = 0 , λ1 = 0 , λ 2 = 0

This case is not possible, since λ0 + λ1 + λ 2 > 0 . Practical Examples

We would like to first consider a fleet battery data set (Davis, 2003). This example pertains to the ability of the battery to have enough power to turn over the starter motor and start the engine. Note that 6

time in service is used as the life variable. This is considered as a surrogate for engine starting cycles, which might have been more appropriate, but the investigating engineer did not know the number of times each vehicle had been started. Time in service was considered a more suitable surrogate than, perhaps, mileage. The cumulative hazard function estimated from data collected from a large fleet of captive vehicles is illustrated in Figure 2. The quadratic spline parameter estimates are: bˆ1 =1.51·10-4, bˆ2 = 2.88·10-5, kˆ

= 13.97. Weibull distribution also provides a reasonable approximation for this data set, although,

H(t)

with a larger value of the cost function (the sum of residual squares) than the quadratic spline. 0.200 0.180 0.160 0.140 0.120 0.100

Nonparametric (actual data) Weibull Lognormal

0.080 0.060 0.040

Normal Q-Spline

0.020 0.000 0

10

20

30

40

t, months in service

Figure 2. A fleet battery example.

Another example we would like to share involves an axle seal component, for which mileage is believed to be a better life variable for obvious engineering reasons. The data are obtained from a warranty database system. The estimated quadratic spline ( bˆ1 = 5.65·10-12, bˆ2 =1.43·10-11, kˆ

= 20,468) shows a better approximation to the actual data than some conventionally used probability

distributions.

7

0.02

H(t)

0.018 0.016 0.014

Nonparametric (actual data) Q-Spline

0.012 0.01 0.008

Weibull Lognormal

0.006 0.004

Normal

0.002 0 -

10,000

20,000 30,000

40,000

50,000

t, miles

Figure 3. An axle seal example. References

1. Davis, T.P. (2003). "Reliability improvement in automotive engineering." In Global Vehicle Reliability - Prediction and Optimization Techniques, edited by JE Strutt and PL Hall. Professional

Engineering Publications, IMechE, London. 2. Kooperberg, C.; Stone, C.; Truong, Y. (1995). "Hazard regression." Journal of American Statistical Association, 90, pp.78-94.

3. Lawless, J.F. (2003). Statistical models and methods for lifetime data, Wiley, New Jersey. 4. Rosenberg, P.S. (1995). "Hazard function estimation using B-splines." Biometrics, 51, pp. 874-887.

8

A constrained quadratic spline as a model for the ...

Probability distributions traditionally used in reliability analysis (e.g., exponential, Weibull, ..... to be a better life variable for obvious engineering reasons.

93KB Sizes 0 Downloads 166 Views

Recommend Documents

The Viable System Model as a Framework for Understanding ...
The Viable System Model as a Framework for Understanding Organizations.pdf. The Viable System Model as a Framework for Understanding Organizations.pdf.

Constrained Efficiency in a Human Capital Model
Jul 29, 2017 - the degree of underaccumulation is much smaller — the ..... time, and the previous studies show that the optimal tax and education policy can be .... In the online appendix E, we discuss the impact of the modeling choice (time vs. mo

A Newton method for shape-preserving spline ...
http://www.siam.org/journals/siopt/13-2/39312.html. †Mathematical Reviews, Ann Arbor, MI 48107 ([email protected]). ‡School of Mathematics, University of New ...

ketamine as a pharmacological model
Jun 29, 2007 - Not for commercial use or unauthorized distribution. .... Delusions then, are erroneous conclusions that two unrelated stimuli or events are ...

Inference complexity as a model-selection criterion for ...
I n Pacific Rim International. Conference on ArtificialIntelligence, pages 399 -4 1 0, 1 998 . [1 4] I rina R ish, M ark Brodie, Haiqin Wang, and ( heng M a. I ntelligent prob- ing: a cost-efficient approach to fault diagnosis in computer networks. S

Quiz Games as a model for Information Hiding
Nov 17, 2015 - Email address: [email protected]. 2Departamento de Computación .... combination of polynomial equations from k[X], namely as a finite union of sets of solutions of equalities and ..... quizmaster evaluates v := µ (u) and c

A Behavioural Model for Client Reputation - A client reputation model ...
The problem: unauthorised or malicious activities performed by clients on servers while clients consume services (e.g. email spam) without behavioural history ...

The subspace Gaussian mixture model – a structured model for ...
Aug 7, 2010 - We call this a ... In HMM-GMM based speech recognition (see [11] for review), we turn the .... of the work described here has been published in conference .... ize the SGMM system; we do this in such a way that all the states' ...

A quadratic string adapted barrier exploring method for ...
Feb 8, 2007 - This paper proposes a novel method for locating relevant transition states that contain crucial information on rare events of transition from the energy barriers, such as ionic diffusion in a crystalline material with vacancies and adat

An Adaptive Tradeoff Model for Constrained ...
and Engineering, Central South University, Changsha, Hunan, China (e-mail: ... As we know, the ultimate goal of COEAs is to find the fea- sible optimal solution. .... and the method [22] based on the Pareto archived evolutionary strategy ...

Glucose Control as a Model for Implementation of a ...
performed using full blood samples (venous and arterial), which are determined at a nearby (approximately 25m) laboratory, where the sample is analyzed and the values are automatically entered into the Hospital Information. System. There is an approx

A spatially constrained clustering program for river ... - Semantic Scholar
Availability and cost: VAST is free and available by contact- ing the program developer ..... rently assigned river valley segment, and as long as its addition ..... We used a range of affinity thresholds ..... are the best set of variables to use fo

Time-Constrained Services (TiCS): A Framework for ...
successful/erroneous invocations, downtime/uptime information for relevant hosts). – the wiring of IPCs, PLCs, and manufacturing devices for service deployment. Details of the Framework Monitor are discussed in a previous paper [11]. The developmen

Oscillation Theory for a Quadratic Eigenvalue Problem
Sep 15, 2008 - For example, Roach and Sleeman [19, 20] recast (1.1. - 1.3) as a linked two parameter system in L2(0, 1)⊗C2 and set their completeness results in this space. Binding [2] establishes the equivalence of L2(0, 1)⊗C2 with L2(0, 1)⊕L2

i Edge Replacement as a Model of Causal ...
Figure 20: Graph classes, data, and model predictions, for Mayrhofer et al (2010). .... auto mechanic has detailed knowledge about which intermediate events ...

Nonparametric Panel Data Models A Penalized Spline ...
In this paper, we study estimation of fixed and random effects nonparametric panel data models using penalized splines and its mixed model variant. We define a "within" and a "dummy variable" estimator and show their equivalence which can be used as

Forwardflow: A Scalable Core for Power-Constrained ...
scale, like centralized register files and bypassing networks. Instead ...... 2D Mesh, 16B bidirectional links, one transfer per cycle, 1-cycle 5-ary routers, 5 virtual channels per link. ED ..... donations from Microsoft and Sun Microsystems/Oracle.

A Gradient Based Method for Fully Constrained Least ...
IEEE/SP 15th Workshop on. IEEE, 2009, pp. 729–732. [4] J. Chen, C. Richard, P. Honeine, H. Lantéri, and C. Theys, “Sys- tem identification under non-negativity constraints,” in Proc. of. European Conference on Signal Processing, Aalborg, Denma

Executive vetoes as electoral stunts: a model with ...
Feb 7, 2007 - plaide alors sa cause, et fait entendre ses raisons.” —Alexis de Tocqueville, De la démocratie en Amérique1. Veto gates of one sort or another are found in all democracies (Lijphart 1999). Such. “internal checks” to decision-m