COMMUNICATIONS IN STATISTICS Theory and Methods Vol. 33, No. 10, pp. 2299–2305, 2004

A Comparison of Bounds on Sets of Joint Distribution Functions Derived from Various Measures of Association ´ beda-Flores2 Roger B. Nelsen1,* and Manuel U 1

Department of Mathematical Sciences, Lewis & Clark College, Portland, Oregon, USA 2 Departamento de Estadı´stica y Matema´tica Aplicada, Universidad de Almerı´a, Spain

ABSTRACT We find pointwise best-possible bounds on the bivariate distribution function of continuous random variables with given margins and a given value of the medial correlation coefficient, and compare those bounds to those obtained from a given value of Kendall’s tau and Spearman’s rho. Key Words: Blomqvist’s beta; Copulas; Kendall’s tau; Medial correlation coefficient; Spearman’s rho.

*Correspondence: Roger B. Nelsen, Department of Mathematical Sciences, Lewis and Clark College, Portland, OR 97219, USA; E-mail: [email protected]. 2299 DOI: 10.1081/STA-200031367 Copyright # 2004 by Marcel Dekker, Inc.

0361-0926 (Print); 1532-415X (Online) www.dekker.com

ORDER

REPRINTS

´ beda-Flores Nelsen and U

2300

1. INTRODUCTION This article shows that for finding pointwise best-possible bounds on sets of joint distribution functions with given continuous margins and the population version of a nonparametric measure of association, the medial correlation coefficient (also known as Blomqvist’s beta) outperforms both Kendall’s tau and Spearman’s rho for moderate values of these coefficients. In a previous article Nelsen et al. (2001), the authors (with J. J. Quesada Molina and J. A. Rodrı´guez Lallena) illustrated a procedure for finding pointwise best-possible bounds on sets of joint distribution functions with given continuous margins and a given value of the population version of a measure of association, such as Kendall’s tau or Spearman’s rho. The bounds attained are readily evaluated, and hence can be compared (see Sec. 3 below). Before doing so, we use the procedure from Nelsen et al. (2001) to find the bounds on the set of joint distribution functions with given continuous margins and a given value of the population version of the medial correlation coefficient. As is often the case when dealing with bivariate distribution functions, the use of copulas simplifies matters. A copula is a function C : I2 ! I ¼ ½0; 1 that satisfies the boundary conditions Cðt; 0Þ ¼ Cð0; tÞ ¼ 0

and

Cðt; 1Þ ¼ Cð1; tÞt; t 2 I;

ð1Þ

and the two-increasing property Cðb; dÞ  Cða; dÞ  Cðb; cÞ þ Cða; cÞ  0

ð2Þ

for all a, b, c, d in I such that a  b and c  d. Equivalently, a copula is the restriction to the unit square of a bivariate distribution function whose margins are uniform on I. Recall from Sklar’s theorem that any bivariate distribution function H with marginal distribution functions F and G can be written as Hðx; yÞ ¼ CðFðxÞ; GðyÞÞ, where C is a copula. Finally, each copula C satisfies the Fre´chet– Hoeffding inequality Wðu; vÞ ¼ maxð0; u þ v  1Þ  Cðu; vÞ  minðu; vÞ ¼ Mðu; vÞ for u; v in I; furthermore, the Fre´chet–Hoeffding bounds W and M are themselves copulas. For further details, see Nelsen (1999).

ORDER

REPRINTS

Comparison of Bounds on Distribution Functions

2301

2. BOUNDS WHEN THE MEDIAL CORRELATION COEFFICIENT IS KNOWN The population version of the medial correlation coefficient for a pair X,Y of continuous random variables, which we denote as bX;Y , was first ~ and ~y denote medians of X and Y , discussed by Blomqvist (1950). If x respectively, then ~ÞðY  ~yÞ > 0  P½ðX  x ~ÞðY  ~yÞ < 0: bX;Y ¼ P½ðX  x Note that 1  bX;Y  þ1 and that the bounds are sharp. When H denotes the joint distribution function of X and Y , it readily follows that x; ~yÞ  1: bX;Y ¼ 4Hð~ Letting C denote the copula of X and Y , we have Hð~ x; ~yÞ ¼ CðFð~ xÞ; Gð~yÞÞ ¼ Cð1=2; 1=2Þ, and hence, bX;Y ¼ bðCÞ ¼ 4Cð1=2; 1=2Þ  1: For any t in ½1; 1, let Bt denote the set of copulas with a common value t of the medial correlation coefficient; that is, Bt ¼ fC j Cis a copula; bðCÞ ¼ tg: Let Bt and Bt denote, respectively, the pointwise infimum and supremum of Bt , i.e., for each (u; v) in I2 , Bt ðu; vÞ ¼ inf fCðu; vÞ j C 2 Bt g and Bt ðu; vÞ ¼ supfCðu; vÞ j C 2 Bt g:

ð3Þ

The bounds Bt and Bt for Bt are given in Theorem 1. Theorem 1. Let Bt and Bt denote the pointwise infimum and supremum (3) of Bt , for t in [1,1]. Then for any (u; v) in I2 ,   Bt ðu; vÞ ¼ max 0; u þ v  1; ðt þ 1Þ=4  ðð1=2Þ  uÞþ  ðð1=2Þ  vÞþ ð4Þ

ORDER

REPRINTS

´ beda-Flores Nelsen and U

2302

and   Bt ðu; vÞ ¼ min u; v; ðt þ 1Þ=4 þ ðu  ð1=2ÞÞþ þ ðv  ð1=2ÞÞþ ;

ð5Þ

where xþ ¼ maxðx; 0Þ. Hence, if X and Y are continuous random variables with joint distribution function H and marginal distribution functions F and G, respectively, and such that bX;Y ¼ t, then the best-possible bounds for H are Bt ðFðxÞ; GðyÞÞ  Hðx; yÞ  Bt ðFðxÞ; GðyÞÞ

ð6Þ

for all (x,y) in ð1; 1Þ2 . Proof. Let C 2 Bt . Then, for all (u; v) in I2 , the defining properties (1) and (2) for copulas readily yield the inequalities ðð1=2Þ  uÞþ  Cðu;vÞ  Cð1=2;vÞ  ðu  ð1=2ÞÞþ and ðð1=2Þ  vÞþ  Cð1=2;vÞ  Cð1=2; 1=2Þ  ðv  ð1=2ÞÞþ ; hence, ðt þ 1Þ=4  ðð1=2Þ  uÞþ  ðð1=2Þ  vÞþ  Cðu;vÞ  ðt þ 1Þ=4 þ ðu  ð1=2ÞÞþ þ ðv  ð1=2ÞÞþ . Furthermore, Wðu; vÞ  Cðu; vÞ  Mðu; vÞ; thus, Bt ðu; vÞ  Cðu; vÞ  Bt ðu; vÞ, where Bt and Bt satisfy (4) and (5), respectively. However, functions on I2 of the form maxf0; u þ v  1; y  ða  uÞþ  ðb  vÞþ g and min½u; v; y þ ðu  aÞþ þ ðv  bÞþ g are copulas for any (a; b) in I2 and y in ½Wða; bÞ; Mða; bÞ (Nelsen, 1999, Theorem 3.2.2), so Bt and Bt are copulas. Because bðBt Þ ¼ bBt Þ ¼ t; Bt and Bt are both in Bt ; hence, Bt and Bt are the pointwise best-possible bounds for Bt . Because Bt and Bt are copulas, the bounds for H in (6) are distribution functions. In the following corollary, whose proof is straightforward, we present some additional facts about the bounds Bt and Bt . Corollary 2. Let Bt and Bt be as in Theorem 1. Then, (a) (b) (c) (d)

Bt and Bt are continuous and nondecreasing in t. Bt ¼ W if and only if t ¼ 1; Bt ¼ M if and only if t ¼ þ1. Bt (u,v) ¼ u – Bt (u,1 – v)¼ v – Bt (1 – u, v), and similarly for Bt . Both Bt and Bt are radially symmetric, that is, Bt (u,v) ¼ u þ v – 1 þ Bt (1 – u,1 – v), and similarly for Bt . 3. A COMPARISON OF THE BOUNDS

Bounds on sets of copulas with a common value of the population versions of the measures of association known as Kendall’s tau (t) and

ORDER

REPRINTS

Comparison of Bounds on Distribution Functions

2303

Spearman’s rho (r) also exist. Analogous to (3), we let T t ; T t ; P t , and P t denote the pointwise best-possible lower and upper bounds on Tt ¼ fC j C is a copula, tðCÞ ¼ tg and Pt ¼ fC j C is a copula, rðCÞ ¼ tg, respectively. These bounds are given explicitly in Nelsen et al. (2001), and like Bt and Bt , are copulas. Which of the coefficients, b; t, or r, is more effective in narrowing the Fre´chet–Hoeffding bounds? To measure the effectiveness of the coefficients for this purpose, we use the function ZZ ma ðtÞ ¼ 1  6

 2

I

 At ðu; vÞ  At ðu; vÞ du dv;

ð7Þ

where a denotes a measure of association such as b; t; or r, and At and At are the bounds on the set At ¼ fC j C is a copula, aðCÞ ¼ tg. The double integral in (7) represents the volume between the surfaces z ¼ At ðu; vÞ and z ¼ At ðu; vÞ in I3 , and ma is scaled so ma ðtÞ ¼ 0 when there is no improvement in the bounds (i.e., At ¼ W and At ¼ M ), and ma ðtÞ ¼ 1 when the bounds coincide. In the first four columns of Table 1, we present the values of mb ðtÞ, mt ðtÞ, and mr ðtÞ for jtj 2 ½0; 1 (note that ma ðtÞ ¼ ma ðtÞ for a ¼ b; t; or rÞ. Although mb ðtÞ can be computed explicitly [mb ðtÞ ¼ 3ð3t2 þ 1Þ=16], mt ðtÞ, and mr ðtÞ must be computed numerically. All results have been rounded to four places. Comparing column (b) with columns (c) and (d), we see that the medial correlation coefficient is dramatically better than either Kendall’s tau

Table 1. A comparison ma ðtÞ for a ¼ b; t, and r. (a) jtj

(b) mb ðtÞ

(c) mt ðtÞ

(d) mr ðtÞ

(e) mr ðt Þ

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0.1875 0.1931 0.2100 0.2381 0.2775 0.3281 0.3900 0.4631 0.5475 0.6431 0.7500

0 0.0013 0.0078 0.0228 0.0495 0.0922 0.1562 0.2497 0.3860 0.5929 1

0.0295 0.0327 0.0425 0.0596 0.0851 0.1209 0.1700 0.2386 0.3389 0.5019 1

0.0295 0.0367 0.0588 0.0971 0.1535 0.2308 0.3332 0.4646 0.6271 0.8175 1

ORDER

REPRINTS

´ beda-Flores Nelsen and U

2304

Table 2. A second comparison of b, t, and r. jtj

mb ðtÞ

mt ð8t=9Þ

mr ð4t=3Þ

0 0.05 0.1 0.15 0.2 0.25

0.1875 0.1889 0.1931 0.2002 0.2100 0.2227

0 0.0002 0.0010 0.0028 0.0058 0.0103

0.0295 0.0309 0.0352 0.0425 0.0531 0.0671

or Spearman’s rho for values of jtj  0.9. Furthermore, mr ðtÞ is greater than mt ðtÞ for jtj  0.6, but the inequality is reversed for jtj  0.7. However, this comparison is specious because the three coefficients are rarely equal for a given copula. For example, when C is the copula associated with a standard bivariate normal distribution, tðCÞ ¼ bðCÞ; however, rðCÞ ¼ ð6=pÞarcsin½ð1=2Þ sinðpbðCÞ=2Þ (see Kruskal, 1958). If we set t ¼ ð6=pÞarcsin½ð1=2Þ sinðpt=2Þ and compare columns (b), (c), and (e) of Table 1, we see that when the dependence structure is described by a normal copula, Spearman’s rho is more effective than Kendall’s tau in narrowing the bounds for all t, but better than the medial correlation coefficient only for jtj  0:7. Other families of copulas yield different relationships among b, t, and r. Consider, for example, the Farlie–Gumbel–Morgenstern (FGM) family of copulas: for y in [1, 1] and all ðu; vÞ in I2 , Cy ðu; vÞ ¼ uv þ yuvð1  uÞð1  vÞ. For members of this family, b ¼ y=4; t ¼ 2y=9, and r ¼ y=3, so t ¼ 8b=9 and r ¼ 4b=3. To compare the effectiveness of b, t, and r for copulas with this relationship among b; t, and r, we compare mb ðtÞ, mt ð8t=9Þ, and mr ð4t=3Þ for jtj in [0, 1/4] in Table 2. As with normal copulas, the medial correlation coefficient is substantially better (for jtj in [0, 1/4]) than either Kendall’s tau or Spearman’s rho in narrowing the bounds. A similar relationship among b; t, and r holds, at least approximately for values of y near 0, in families of copulas for which FGM copulas are first-order approximations. Examples of such families include the Ali–Mikhail–Haq family, the Frank family, and the Plackett family. See Nelsen (1999) for details.

4. CONCLUSION There is a simple geometric explanation for the effectiveness of the medial correlation coefficient in narrowing the Frechet–Hoeffding

ORDER

REPRINTS

Comparison of Bounds on Distribution Functions

2305

bounds. Recall that all six of the bounds, Bt , Bt , T t , T t , P t , and Pt , are copulas, and hence, as a result of Nelsen et al. (2001), coincide on the boundary of the unit square I2 ; and as a result of Nelsen (1999), are uniformly continuous. However, only Bt and Bt also coincide at the center of the square, i.e., Bt ð1=2; 1=2Þ ¼ Bt ð1=2; 1=2Þ ¼ ðt þ 1Þ=4. Because mb , mt , and mr are each based on the volume between the graphs of the lower and upper bounds, it is reasonable to expect a smaller volume between the graphs of Bt and Bt than between T t and T t , or between P t and P t . REFERENCES Blomqvist, N. (1950). On a measure of dependence between two random variables. Ann. Math. Statist. 21:593–600. Kruskal, W. H. (1958). Ordinal measures of association. J. Am. Statist. Assoc. 53:814–861. Nelsen, R. B. (1999). An Introduction to Copulas. New York: Springer. ´ beda Nelsen, R. B. Quesada Molina, J. J., Rodrı´guez Lallena, J. A. U Flores, M. (2001). Bounds on bivariate distribution functions with given margins and measures of association. Commun. Statist.—Theory Methods 30:1155–1162.

Request Permission or Order Reprints Instantly! Interested in copying and sharing this article? In most cases, U.S. Copyright Law requires that you get permission from the article’s rightsholder before using copyrighted content. All information and materials found in this article, including but not limited to text, trademarks, patents, logos, graphics and images (the "Materials"), are the copyrighted works and other forms of intellectual property of Marcel Dekker, Inc., or its licensors. All rights not expressly granted are reserved. Get permission to lawfully reproduce and distribute the Materials or order reprints quickly and painlessly. Simply click on the "Request Permission/ Order Reprints" link below and follow the instructions. Visit the U.S. Copyright Office for information on Fair Use limitations of U.S. copyright law. Please refer to The Association of American Publishers’ (AAP) website for guidelines on Fair Use in the Classroom. The Materials are for your personal use only and cannot be reformatted, reposted, resold or distributed by electronic means or otherwise without permission from Marcel Dekker, Inc. Marcel Dekker, Inc. grants you the limited right to display the Materials only on your personal computer or personal wireless device, and to copy and download single copies of such Materials provided that any copyright, trademark or other notice appearing on such Materials is also retained by, displayed, copied or downloaded as part of the Materials and is not removed or obscured, and provided you do not edit, modify, alter or enhance the Materials. Please refer to our Website User Agreement for more details.

Request Permission/Order Reprints Reprints of this article can also be ordered at http://www.dekker.com/servlet/product/DOI/101081STA200031367

A Comparison of Bounds on Sets of Joint Distribution ...

1Department of Mathematical Sciences, Lewis & Clark College,. Portland, Oregon, USA. 2Departamento de Estadıstica y Matemلtica Aplicada, Universidad.

125KB Sizes 10 Downloads 134 Views

Recommend Documents

Upper Bounds on the Distribution of the Condition ...
be a numerical analysis procedure whose space of input data is the space of arbitrary square complex .... The distribution of condition numbers of rational data of.

A Winnerls Curse for Econometric Models: On the Joint Distribution of ...
)Qη50* 7 η. ...... the value of shrinkage becomes a trade$off between the positive effect it has on η ... 4.1 An Empirical Illustration: VAR for the US Term Structure.

A Winnerls Curse for Econometric Models: On the Joint Distribution of ...
fit the data in$sample, while the contrary tends to be true out$of$sample. See, e.g. ...... For simplicity we focus on the case without a risk$free asset. The average in$sample ..... tion, model averaging, and the effects of data mining. Much caution

On Decomposability of Multilinear Sets - Semantic Scholar
May 28, 2016 - Examples include quadratic programs, polynomial programs, and multiplicative ... †Graduate Program in Operations Research and Industrial ...

A Note on Separation of Convex Sets
A line L separates a set A from a collection S of plane sets if A is contained in one of ... For any non-negative real number r, we denote by B, the disk with radius r.

Guidance-Material-on-Comparison-of-Surveillance-Technologies ...
Page 2 of 47. Guidance Material on Surveillance Technology Comparison. Edition 1.0 September 2007 Page 2. TABLE OF CONTENTS. 1. Introduction.

On the contact domain method: A comparison of ...
This work focuses on the assessment of the relative performance of the so-called contact domain method, using either the Lagrange multiplier or the penalty ...

A Comparison of Scalability on Subspace Clustering ...
IJRIT International Journal of Research in Information Technology, Volume 2, Issue 3, March ... 2Associate Professor, PSNA College of Engineering & Technology, Dindigul, Tamilnadu, India, .... choosing the best non-redundant clustering.

Views on Abortion: A Comparison of Female Genetic Counselors ...
regarding abortion, followed by mainline Protestants, those .... A Comparison of Female Genetic Counselors and Women from the General Population.pdf.

Bounds on the Lifetime of Wireless Sensor Networks Employing ...
each sensor node can send its data to any one of these BSs (may be to the ... deployed as data sinks along the periphery of the observation region R.

Bounds on the Lifetime of Wireless Sensor Networks Employing ...
Indian Institute of Science. Bangalore – 560012. INDIA .... deployed as data sinks along the periphery of the observation region R. – obtaining optimal locations ...

on the probability distribution of condition numbers of ...
βm := (0,..., 0,Xdm. 0. ). Observe that the first m coordinates of any system h′ := [h′. 1,...,h′ m] ∈ Hm. (d) in this basis are exactly h′(e0)=(h′. 1(e0),...,h′.

THE NUMBER OF B3-SETS OF A GIVEN ...
Feb 7, 2014 - In view of the fact that Fh(n) = Θ(n1/h), one sees that c0hn1/h log ...... (55). It follows by (23) and Proposition 3.6 that. 360. eC. (A). eeCG0. (A).

On upper bounds for high order Neumann eigenvalues of convex ...
Let Ω be a bounded convex domain in Euclidean space Rn. By considering a manifold ... where f1/(n−1) is a positive and concave function on the interval (0,1).

Uniform bounds on the number of rational points of a ...
−log | |p, where either p = ∞ and |F|p := edeg(F), or p runs over the set of. 6 ..... Now we are going to express these estimates in terms of the height of N/D. Let g be the gcd ...... monodromy, volume 40 of AMS Colloquium Publications, American

Lower Bounds on the Minimum Pseudo-Weight of ...
Nov 30, 2003 - indices are in Vr. We call C a (j, k)-regular code if the uniform column weight ..... Proof: In App. E of [14] the above lower bound for the minimum ...

On Apéry Sets of Symmetric Numerical Semigroups
In this paper, we give some results on Apéry sets of Symmetric. Numerical Semigroups with e(S)=2. Also, we rewrite the definitions n(S) and H(S) by means of ...

Comparison of Square Comparison of Square-Pixel and ... - IJRIT
Square pixels became the norm because there needed to be an industry standard to avoid compatibility issues over .... Euclidean Spaces'. Information and ...

Bounds on the Lifetime of Wireless Sensor Networks Employing ...
Wireless Research Lab: http://wrl.ece.iisc.ernet.in ... Key issues in wireless sensor networks ... NW lifetime can be enhanced by the use of multiple BSs. – deploy ...

A Comparison of Medium-Chain
From the Department of Surgery, Peking Union Medical College Hospital, Beijing, China, and the ..... An infusion of fat-free nutrition solution was started at. 8:00 A.M. and .... using standard software StatView SE.25 Results were ex- pressed as ...

A Comparison Study of Urban Redevelopment Strategies_final.pdf ...
Whoops! There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. A Comparison Study of Urban Redevelopment Strategies_final.pdf. A Comparison Study of Urban Redevelo

Inferring bounds on the performance of a control policy from a ... - ORBi
eralizations) of the following discrete-time optimal control problem arise quite frequently: a system, ... to high-enough cumulated rewards on the real system that is considered. In this paper, we thus focus on the evaluation of ... interactions with

Inferring bounds on the performance of a control policy from a ... - ORBi
The main philosophy behind the proof is the follow- ing. First, a sequence of .... Athena Scientific, Belmont, MA, 2nd edition, 2005. [2] D.P. Bertsekas and J.N. ...