2013.5.7.(minor edit: 2015.8.28)

Implications of KKT conditions in quantile regression Kengo Kato Let y = (y1 , . . . , yn ) ∈ R and X = (x1 , . . . , xn ) ∈ R be a pair of a vector of dependent variables and a design matrix. Consider to solve the following minimization problem: T

n

T

(QR) minp β∈R

n ∑

n×p

ρτ (yi − xTi β),

i=1

where ρτ (u) = (τ − 1(u ≤ 0))u is the check function used in quantile regression. This minimization problem reduces to the following linear programming problem: (QR-LP)

min

u,v∈Rn ,β∈Rp

τ 1Tn u + (1 − τ )1Tn v

s.t. u − v = y − Xβ, u ≥ 0n , v ≥ 0n , where 1n = (1, . . . , 1)T ∈ Rn and 0n = (0, . . . , 0)T ∈ Rn . The inequalities u ≥ 0 and v ≥ 0 are interpreted coordinatewise. In the problem (QR), the set of feasible solutions is non-empty and the objective function is non-negative on the set of feasible solutions. Hence there exists at least one optimal solution to the problem (QR-LP) and so to (QR). The purpose of this note is to to prove the following lemma, which roughly describes “first order conditions” for the problem (QR), by using the KKT theorem. The following lemma is a small modification of Lemma 2.1 in [2] where only the LAD case (i.e. the case with τ = 1/2) is handled. Lemma 1. Let β ∗ be an optimal solution to the problem (QR). Let I ∗ = {i ∈ {1, . . . , n} : yi = xTi β ∗ }. Then there exist ai ∈ [−1, 0], i ∈ I ∗ such that n ∑ ∑ ai xi . {τ − 1(yi ≤ xTi β ∗ )}xi = i∈I ∗

i=1

Hence we have

n ∑ {τ − 1(yi ≤ xTi β ∗ )}xi ≤ Card(I ∗ ) max |xi |. 1≤i≤n i=1

Proof. Let u∗i = max{yi − xTi β ∗ , 0}, vi∗ = max{−yi + xTi β ∗ , 0}. Then u∗ − v ∗ = y − Xβ ∗ and (u∗ , v ∗ , β ∗ ) is an optimal solution to the problem (QR-LP). Defining f (u, v, β) = τ 1Tn u + (1 − τ )1Tn v, g(u, v, β) = (g1 (u, v, β), . . . , g2n (u, v, β))T = (−uT , −v T )T , h(u, v, β) = (h1 (u, v, β), . . . , hn (u, v, β))T = u − v − y + Xβ, the problem (QR-LP) can be written as min

u,v∈Rn ,β∈Rp

f (u, v, β)

s.t. g(u, v, β) ≤ 02n , h(u, v, β) = 0n . 1

2

Let ei ∈ Rn denote the vector of which only the i-th element is 1 and the other elements are all zero. Then the gradient vectors of f (u, v, β), gi (u, v, β), gn+i (u, v, β) and hi (u, v, β) are given by     τ 1n −ei ∇f (u, v, β) = (1 − τ )1n  , ∇gi (u, v, β) =  0n  , 0p 0p     0 ei ∇gn+i (u, v, β) = −ei  , ∇hi (u, v, β) = −ei  , i = 1, . . . , n. 0p xi Since all the constraints are linear, by the KKT theorem [1, Proposition 3.3.7], there exist µ1 , . . . , µ2n ≥ 0 and λ1 , . . . , λn such that         n n n τ 1n −ei 0n ei ∑ ∑ ∑ (1 − τ )1n  + µi  0n  + µn+i −ei  + λi −ei  = 02n+p , i=1 i=1 i=1 0p 0p 0p xi and µi u∗i = 0, µn+i vi∗ = 0, i = 1, . . . , n. Recall I ∗ = {i ∈ {1, . . . , n} : yi = xTi β ∗ }. Let ∗ ∗ I+ = {i ∈ {1, . . . , n} : yi > xTi β ∗ }, I− = {i ∈ {1, . . . , n} : yi < xTi β ∗ }.

Observe that ∗ ⇒ u∗i > 0 ⇒ µi = 0 ⇒ λi = −τ, i ∈ I+ ∗ i ∈ I− ⇒ vi∗ > 0 ⇒ µn+i = 0 ⇒ λi = 1 − τ,

Therefore,



τ xi + (τ − 1)

∗ i∈I+





xi =

∗ i∈I−

λi xi .

i∈I ∗

The left side is expressed as ∑

{τ − 1(yi ≤ xTi β ∗ )}xi =

∗ ∪I ∗ i∈I+ −

so that

n ∑ ∑ λi xi , {τ − 1(yi ≤ xTi β ∗ )}xi + (1 − τ ) i∈I ∗

i=1 n ∑ ∑ (λi − 1 + τ )xi . {τ − 1(yi ≤ xTi β ∗ )}xi = i=1

i∈I ∗

For i ∈ I ∗ , we have τ − µi + λi = 0 ⇒ λi ≥ −τ, 1 − τ − µn+i − λi = 0 ⇒ λi ≤ 1 − τ, so that λi ∈ [−τ, 1 − τ ], i ∈ I ∗ . This completes the proof.



The next question is how large Card(I ∗ ) is. Suppose that y is random but X is fixed (if X is random, consider the conditional distribution of y given X). Lemma 2. Suppose that n ≥ p and the distribution of y is absolutely continuous with respect to the Lebesgue measure on Rn . Then Card(I ∗ ) ≤ p, a.s.

3

Proof. For I ⊂ {1, . . . , n}, let SI = {y ∈ Rn : yi = xTi β, ∀i ∈ I, ∃β ∈ Rp }. Then as long as Card(I) ≥ p + 1, the set SI is a linear subspace of Rn of dimension at most n − 1, so that its Lebesgue measure in Rn is zero. The conclusion follows from the fact that ∪ {Card(I ∗ ) ≥ p + 1} ⊂ {y ∈ SI }, I⊂{1,...,n}

Card(I)≥p+1

and the absolute continuity of the distribution of y.



References [1] Bertsekas, D. (1999). Nonlinear Programming (2nd edition). Athena Scientific. [2] El-Attar, R.A., Vidyasagar, M. and Dutta, S.P.K. (1979). An algorithm for l1 -norm minimization with application to nonlinear l1 -approximation. SIAM J. Numer. anal. 16 70-86.

Implications of KKT conditions in quantile regression

May 7, 2013 - Let y = (y1,..., yn)T ∈ Rn and X = (x1,..., xn)T ∈ Rn×p be a pair of a .... l1-norm minimization with application to nonlinear l1-approximation.

42KB Sizes 1 Downloads 207 Views

Recommend Documents

Quantile Regression
The prenatal medical care of the mother is also divided into four categories: those ..... Among commercial programs in common use in econometrics, Stata and TSP ... a public domain package of quantile regression software designed for the S.

Online Supplement to nPredictive Quantile Regression ...
VInference in predictive quantile regressions,V Unpublished. Manuscript. Nadarajah, S., & Kotz, S. (2007). Programs in R for computing truncated t distributions. Quality and. Reliability Engineering International, 23(2), 273'278. Phillips, P. C. B. (

Adaptive penalized quantile regression for high ...
For example, there is no guarantee that K-fold cross-validation would provide a choice of ln with a proper rate. Third, their statistical properties are still uncharted ...

Why you should care about quantile regression - Research at Google
Permission to make digital or hard copies of all or part of this work for personal or .... the beginning of execution and only writes the data to disk once the.

Binary Quantile Regression with Local Polynomial ...
nhc ∑n i=1 Xi Xi kc (Xi b hc ), which can be viewed as a consistent estimator for E (Xi Xi |Xi b = 0)gZb (0). Multiplication by An (b) can be thought of as imposing a penalty term for those values of b with An (b) close to be singular. With the abo

kkt-kt.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Main menu.

Petrenko_Prevention of Secondary conditions in FASD.pdf ...
Page 1 of 10. Prevention of Secondary Conditions in Fetal Alcohol Spectrum. Disorders: Identification of Systems-Level Barriers. Christie L. M. Petrenko • Naira ...

Regression models in R Bivariate Linear Regression in R ... - GitHub
cuny.edu/Statistics/R/simpleR/ (the page still exists, but the PDF is not available as of Sept. ... 114 Verzani demonstrates an application of polynomial regression.

Miscarriages and Congenital Conditions in Offspring of Veterans of ...
Miscarriages and Congenital Conditions in Offspring of ... of the British Nuclear Atmospheric Test Programme.pdf. Miscarriages and Congenital Conditions in ...

Conditions of Application.pdf
Page 1 of 1. ULUDAĞ UNIVERSITY. APPLICATION REQUIREMENTS. CANDIDATES ELIGIBLE FOR APPLICATION IN. ACCORDANCE WITH FOREIGN STUDENT ADMISSION. REGULATION. ARTICLE 6 - On condition that they are senior students at high schools or graduates;. The applic

Steady%State Conditions in Structural Estimation of ...
function of the steady%state earnings distribution, and the completed job durations are ... Firms post wages and maximize profits given a linear production technology. ..... I apply the discrete approach when dealing with the heterogeneity in ...

Necessary and sufficient conditions in the problem of ...
Savings account with zero interest rate. M - family of martingale ... Primal Problem u(x) = sup c,XT : X0=x,X≥0. E. [∫ T. 0. U1(t, ω,ct )dt + U2(ω,XT ). ] . Is u a utility ...

Steady%State Conditions in Structural Estimation of ...
For the individuals in the unemployment state, the residual unemployment spells +) ..... of Income and Program Participation (SIPP) is a four year panel and the ...

Human Rights Implications of Crime Control in the ...
This is an Open Access article distributed under the terms of the Creative ... Data were collected during the Southern Ute Indian Community Safety Survey.

Human Rights Implications of Crime Control in the ...
commercial use, distribution, and reproduction in any medium, provided the ... social construction and policy implications of Internet crime, the dichotomous ...

Implications of action-oriented paradigm shifts in ... - Semantic Scholar
and our use of tools and instruments as part of cognition (see Menary 2007,. 2010). Enactive theories ... Theoretical: Cognition-for-action takes a new look at cognitive func- tions (e.g., perception ...... Automation (ICRA), pp. 1268–1273. Rome: .

Human Rights Implications of Crime Control in the Digital Age
ethical issues of criminological research and possible strategies for novice .... have an influence on the grade they receive in this professor's course (Berg 2004).

Human Rights Implications of Crime Control in the ...
This license does not permit commercial exploitation or the creation of ... Keywords: virtual sex offending; sex offender; cyber crime; Internet enabled pathology;.

Human Rights Implications of Crime Control in the ...
All rights reserved. Under a creative commons Attribution-Noncommercial-Share Alike 2.5 India License. 1 ...... to juvenile courts. Journalism Quarterly, 751, 753.

Human Rights Implications of Crime Control in the ...
discipline their subjects but information technology and the human rights ..... computer they knew that the systems administrator could and likely would monitor ...

Speaker Recognition in Adverse Conditions
cation is performed in the presence of little or no interference. In general, these systems ..... ing the Gaussian kernel is that it does not have finite support. However, for ..... AT&T Bell Labs in Murray Hill, New Jersey, and also for the. Speech 

Human Rights Implications of Crime Control in the Digital Age
Though aspiring journalists undergo some training with respect to ... children's right to freedom of expression (Article 13); to protection of privacy and against.

Implications of Avian Flu in Africa
disease surveillance systems limited capacity to detect transmission of avian influenza to ... affected farms and 45% on unaffected farms lost their jobs (4). .... phone networks throughout Africa could be extremely helpful for reporting unusual ...