Judicial precedent as a dynamic rationale for axiomatic bargaining theory Marc Fleurbaeyy, John E. Roemerz August 23, 2010

Abstract Axiomatic bargaining theory (e.g., Nash’s theorem) is static. We attempt to provide a dynamic justi…cation for the theory. Suppose a Judge or Arbitrator must allocate utility in an (in…nite) sequence of two-person problems; at each date, the Judge is presented with a utility possibility set in R2+ . He/she must choose an allocation in the set, constrained only by Nash’s axioms, in the sense that a penalty is paid if and only if a utility allocation is chosen at date T which is inconsistent, according to one of the axioms, with a utility allocation chosen at some earlier date. Penalties are discounted with t, and the Judge chooses any allocation, at a given date, that minimizes the penalty he/she pays at that date. Under what conditions will the Judge’s chosen allocations converge to the Nash allocation over time? We answer this question for three canonical axiomatic bargaining solutions: Nash’s, Kalai-Smorodinsky’s, and the ‘egalitarian’ solution, and generalize the analysis to a broad class of axiomatic models. We are grateful to Emeric Henry, Alain Trannoy, and Debraj Ray and three referees for comments and suggestions. y CNRS, Université Paris-Descartes, Sciences Po, CORE (Université catholique de Louvain) and IDEP. Email: marc.‡[email protected]. z Yale University. Email: [email protected].

1

Keywords: axiomatic bargaining theory, judicial precedent, dynamic foundations, Nash’s bargaining solution. JEL codes: C70, C78, K4.

1

Introduction

Axiomatic bargaining theory is timeless. In Nash’s (1950) original conception, the apparatus is meant to model a bargaining problem between two individuals, each of whom initially possesses an endowment of objects, and von Neumann-Morgenstern (vNM) preferences over lotteries on the allocation of these objects to the two individuals. An impasse point is de…ned as the pair of utilities each receives if no trade takes place, that is, if no bargain is reached (here, particular vNM utility functions are employed). Nash quickly passes to a formulation of the problem in utility space, where a bargaining problem becomes a convex, compact, comprehensive utility possibilities set, containing the impasse point. He then imposes the axioms of Pareto e¢ ciency, symmetry, independence, and scale invariance, and proves that the only “solution” satisfying these axioms on an unrestricted domain of problems is the Nash solution — for any problem, the utility point which maximizes the product of the individual gains from the threat point.1 We say the theory is timeless, because of the independence axiom. For this axiom requires consistency in bargaining behavior between pairs of problems. What kind of experience might lead the bargainers to respect the independence axiom? Presumably, if they bargained for a su¢ ciently long period of time, facing many di¤erent problems, they might come across a pair of problems related as the premise of the independence axiom requires: problem S is contained in problem Q (as utility possibilities sets), the bargainers 1

Axiomatic bargaining theory has two major applications: one to bargaining, and the other to distributive justice. Of course, Nash (1950) pioneered the …rst interpretation, and the second was pioneered by Thomson and Lensberg (1989), who showed that many of the classical bargaining solutions (Nash, Kalai-Smorodinsky, egalitarian) could be characterized by sets of axioms with ethical interpretations. See Roemer (1996) for a history of the subject in its two variants.

2

faced problem Q last year and chose allocation q 2 Q, and it so happens that q 2 S . It is certainly reasonable, they reason, to agree upon q when facing S this year, because of something like Le Chatelier’s principle. (“If we chose q when all those allocations in QnS were available, we e¤ectively had decided to restrict our bargaining to S last year anyway, so let’s choose q 2 S again now.”) But if this is the way that bargainers might “learn”how independence bears on decisions, then Nash’s theory seems quite unrealistic. For with an unrestricted domain of problems, how often will bargainers face two problems related as the premise of the independence axiom requires? Almost never. Notice the same argument of timelessness does not apply to the scale invariance axiom, even though that axiom compares the behavior of the solution on pairs of problems, because that axiom is meant to model the idea that only von Neumann Morgenstern preferences count, not their particular representation as utility functions. While the independence axiom can be viewed as a behavioral axiom, the scale invariance axiom is an informational axiom. The other axioms — symmetry and Pareto — are also behavioral but not timeless in our sense. It is not a mystery why bargainers should learn to cooperate (Pareto), or that two bargainers with the same preferences (and the same strengths) and the same endowments should end up at a symmetric allocation. Thus, the critique we are proposing of Nash bargaining theory is that one of the behavioral axioms (independence) has no apparent justi…cation via some kind of learning through history, in the presence of another axiom (unrestricted domain), which essentially precludes that learning could ever take place. Our goal in this article is to replace the timelessness of axiomatic bargaining theory with a dynamic approach, in which decision makers learn from history. Indeed, there is, we think, an obvious judicial practice, which provides a way of rendering the theory dynamic. Suppose a judge or a court or an arbitrator faces a number of cases over time. There is a constitution that prescribes what the judicial decision must be in certain clear 3

and polar cases. But most cases do not …t the speci…cations of these constitutionally described cases. So judges rely on judicial precedent or case law: they look for a case in the past that is similar in important respects, or related, to the one at hand, and decide the present case in like manner. Thus judicial precedent is a procedure providing a link to the past that is similar to the links between problems that the independence axiom — and, indeed, from a formal viewpoint, the scale invariance axiom — impose. Of course, there is a possibility that the case being considered at present time, i, has two precedent cases j and k each of which is related to i in some important way, but which were decided di¤erently. In general, the judge cannot decide the present case in a way to satisfy both precedents, and we will represent this con‡ict in our formal model.2 Imagine, then, that there is a domain of “cases” D, which is some set of Nash-type bargaining problems (convex, compact, comprehensive sets in R2+ ). Suppose that the domain is rich enough that there are pairs of cases related by the scale invariance axiom, and pairs of cases related by the independence axiom; there are also some symmetrical cases in D. At each date t = 1; 2; 3; : : : a case is drawn randomly by Nature, according to some probability distribution on D. This in…nite sequence of cases is called a history. The judge must decide each case sequentially (here, how to choose a feasible utility allocation), and he is restricted to obey the Nash axioms. What does this mean? If the case is symmetric, he must choose a symmetric point in the case, or pay a penalty of one; for every case, he must choose a Pareto e¢ cient point, or pay a penalty of one. If a case is related to a prior case in the history by the scale-invariance or independence axiom, and he does not choose the allocation in the present case which is consistent with his prior choice according to the salient axiom, he must pay a penalty of appeared t periods ago, where 0 <

t

, if the prior case

< 1 is a given discount factor. (Thus, paying a

penalty of one if a Pareto e¢ cient point is not chosen in the case at hand is just a special 2

Real judges tend to decide which precedent …ts the case at hand more closely, arguments revolving around the proximity of various precedents to the case at hand; but we will not follow this tack.

4

case of this rule, because

0

= 1.) If a case comes up which is not symmetric and is not

related to any prior case by scale invariance or independence, he can choose any Pareto e¢ cient point with zero penalty. At each date, the judge must choose an allocation which minimizes his penalty. In general, at a given date, he may end up paying penalties with respect to a number of cases in the past which are precedents, and so his penalty would P be a sum of the form i2P t , for some set of non-negative integers P . Now suppose that we consider a domain D where Nash’s theorem is true: that is,

any solution ' : D ! R2+ satisfying '(i) 2 i for all i 2 D that satis…es the Nash axioms on D is, in fact, the Nash solution on D, denoted N . Call such a domain a Nash domain. (The simplest Nash domain consists of precisely one symmetric set. Any solution on this domain must obey the symmetry axiom and Pareto. Thus any solution obeying the axioms coincides with N on this domain.) Our question is this: When is it the case that a judge who plays by the above rules, and faces an in…nite history of cases, will converge over time almost surely to prescribing the Nash solution to the cases he faces? To be precise, consider a super-domain HD of all possible histories over a given Nash domain, D, endowed with the product probability measure induced on histories by the given probability measure on D. When would the judge almost surely converge to prescribing the Nash solution as time passes on histories in HD ? We prove, under some simple additional assumptions, that convergence to the Nash solution occurs almost surely for every set of histories HD , where D is a …nite Nash domain satisfying a speci…c condition, if and only if 0 <

1=3 — that is, if and only if history is discounted

at a su¢ ciently high rate. (Recall that the discount factor are related by the formula

and the discount rate r

= 1= (1 + r) :) This is our dynamic justi…cation of Nash’s

theorem. However, we also show that there are Nash domains for which convergence to the Nash solution does not occur almost surely. In that sense, we can say that the Nash characterization theorem is dynamically imperfect. In contrast, we show that Kalai and Smorodinsky’s (1975) characterization of their alternative solution, as well as Kalai’s 5

(1977) characterization of the egalitarian solution, are dynamically perfect in the sense that for every …nite domain on which the theorem is true, almost sure convergence to the solution is obtained for appropriate values of : We extend the results to more general penalty systems and to a general class of axiomatic theorems. The rest of the paper is structured as follows. Section 2 introduces the axiomatic framework. Sections 3–4 successively deal with the Nash solution, the KalaiSmorodinsky solution, and the egalitarian solution. Section 5 shows how such results can be generalized and applied to any characterization theorem in a general axiomatic framework. Section 6 considers the possibility for the Judge to make decisions not only on the basis of penalties currently incurred but also on the basis of future possible penalties. Section 7 concludes.

2

Framework and axioms

A domain D = fi; j; k; : : :g contains problems, namely, subsets of R2+ which are compact, convex, and comprehensive.3 We restrict attention throughout the paper to …nite domains. For simplicity we also restrict attention to sets having a non-empty intersection with R2++ : Let @i denote the upper frontier of i; i.e.4

@i = fx 2 i j @y 2 i; y

xg ;

and let @ i denote the subset of Pareto-e¢ cient points of i :

@ i = fx 2 i j @y 2 i; y > xg : 3 4

A set i is comprehensive when for all x 2 i; all y Vector inequalities are denoted ; >; :

6

x; one has y 2 i:

Let I(i) denote the vector of ideal points, i.e.

I(i) = (max fx1 2 R+ j 9x2 ; (x1 ; x2 ) 2 ig ; max fx2 2 R+ j 9x1 ; (x1 ; x2 ) 2 ig) : For any

2 R2++ ; a set j is an –rescaling of i if j = x 2 R2+ j 9y 2 i; x1 =

1 y1 ;

x2 =

2 y2

;

A solution ' : D ! R2+ is a mapping such that for all i 2 D; '(i) 2 i: The following axioms appear in the landmark theorems by Nash (1950), Kalai and Smorodinsky (1975) and Kalai (1977): Weak Pareto (WP): 8i 2 D; '(i) 2 @i: Symmetry (Sym): 8i 2 D; if i is symmetric, then '1 (i) = '2 (i): Scale Invariance (ScInv): 8i; j 2 D; if j is an –rescaling of i for some '(j) = (

1 '1 (i);

Nash Independence (Ind): 8i; j 2 D; if i Monotonicity (Mon): 8i; j 2 D; if i

2 '2 (i)) :

j and '(j) 2 i; then '(i) = '(j):

j, then '(i)

'(j):

Individual Monotonicity (IMon): 8i; j 2 D; p 2 f1; 2g ; if i then '3 p (i)

2 R2++ ; then

j and Ip (i) = Ip (j);

'3 p (j):

Consider a domain D and an in…nite number of periods t = 1; 2; : : : A history H is a sequence of problems and chosen points

H = ((i1 ; x1 ) ; (i2 ; x2 ) :::) 7

such that at every period t; xt 2 it : At each t a random process picks it 2 D: For any given i 2 D; the probability that it = i may depend on the previous part of the history ((i1 ; x1 ) ; :::; (it 1 ; xt 1 )) : We assume throughout the paper that the random process is regular in the sense that it never ascribes a zero probability (or a probability converging to zero) to any given problem, i.e., if for every i 2 D; there exists

i

> 0 such that for

every t 2 N; and for every past history ((i1 ; x1 ) ; :::; (it 1 ; xt 1 )) ; the probability that it = i is at least

i:

At each period t the Judge chooses xt 2 it : His objective at each period is to minimize the penalty for this period, which is the sum of penalties incurred for a violation of each axiom. Each violation of an axiom implies a penalty of one unit. However, the penalty for violating an axiom involving a reference to past problems is discounted by a factor 2 (0; 1) : the farther back in the past the reference problem is, the lower the penalty. Let r denote the corresponding discount rate:

= 1= (1 + r) :

In order to avoid any ambiguity, it is useful to specify what a violation of an axiom is exactly. Choosing xt 2 it may entail the following penalties: (WP) 1 if xt 2 = @it : (Sym) 1 if it is symmetric and '1 (it ) 6= '2 (it ): s

(ScInv) (Ind)

s

if it is a –rescaling of it

and '(it ) 6= (

if '(it ) 6= '(it s ) and either [it s

(IMon)

s

it

s

and '(it )

2 '2 (it s )) :

s

it ; '(it ) 2 it s ]:

it ; Ip (it+1 ) = Ip (it ); and '3 p (it+1 )

it+1 ; Ip (it+1 ) = Ip (it ); and '3 p (it )

if either it

1 '1 (it s );

it s ; '(it s ) 2 it ] or [it

if for some p 2 f1; 2g ; either it+1

'3 p (it ); or it (Mon)

s

'(it s ); or it

'3 p (it+1 ): s

it and '(it s )

'(it ):

One restriction of this system of penalties is that the violation of any axiom involving the past always counts less than the violation of any axiom that does not refer to the past. We will examine more general systems of penalties in Section 5. 8

Given a domain D and a random process selecting problems we say that the Judge converges almost surely to the solution ' if with probability 1 there is a date T such that for all t

3

T; the Judge chooses '(it ):

Nash

The Nash solution, denoted N; is de…ned by

N (i) = fx 2 i j 8y 2 i; x1 x2

y1 y2 g :

The domain D is called a Nash domain if Nash’s theorem is holds on D; i.e., if N (:) is the only solution satisfying WP, Sym, ScInv and Ind on D: We will be interested in domains satisfying the following condition. Condition (CN ) For all i 2 D, there exists a sequence j1 ; : : : jn 2 D such that j1 = i; jn is symmetric and for all t = 1; : : : ; n (i) jt

1; either

jt+1 and N (jt+1 ) 2 jt ; or

(ii) 9 2 R2++ , jt+1 is an –rescaling of jt : Call such a sequence a special chain beginning at i. Proposition 1 D is a Nash domain if it satis…es Condition (CN ). The converse is not true. Proof. If: Let ' be any solution on D satisfying Nash’s axioms. Let i 2 D: By Condition (CN ) there is a special chain j1 ; : : : jn beginning at i: By Sym and WP, '(jn ) = N (jn ): One can now roll back along the special chain to i; and at each step '(jk ) = N (jk ) either by Ind (case (i)) or by ScInv (case (ii)). For k = 1, we have '(i) = N (i): It follows that ' = N on D:

9

Converse: Let D = fi; j; k; l; mg ; for i = co f(0; 0) ; (3; 0) ; (2; 2) ; (0; 4)g j = co f(0; 0) ; (3; 0) ; (2; 2) ; (0; 3)g k = co f(0; 0) ; (2; 0) ; (2; 2) ; (0; 4)g l = co f(0; 0) ; (4; 0) ; (4; 4) ; (0; 8)g m = co f(0; 0) ; (8; 0) ; (0; 8)g : By WP and Sym, '(j) = N (j) = (2; 2) and '(m) = N (m) = (4; 4): By Ind, due to l

m;

'(l) = N (l) = (4; 4): By ScInv, as k is a rescaling of l; '(k) = N (k) = (2; 2) : Now consider i: There is no special chain beginning at i: It is not symmetric, it is not the rescaling of another set and it is not included in another set for which the Nash point is in i: Yet, one must have '(i) = N (i) = (2; 2): By WP, '(i) must belong either to the segment (3; 0) (2; 2) or to the segment (2; 2) (0; 4) : Suppose one took '(i) from a point x of the segment (3; 0) (2; 2) di¤erent from (2; 2) : Then, as j

i; by Ind one should have

'(j) = x; a contradiction. Suppose one took '(i) from a point y of the segment (2; 2) (0; 4) di¤erent from (2; 2) : Then, as k

i; by Ind one should have '(k) = y; a contradiction.

Therefore Nash’s theorem holds on D even though Condition (CN ) does not hold. We can now study the convergence of the Judge’s decisions toward the Nash solution. The following proposition states that with probability one the Judge’s decisions will exactly coincide with the Nash solution within a …nite number of periods. The argument is that, when Condition (CN ) holds for D; with probability one there will be some …nite time at which all the elements of D will appear in a row, each preceded by the special chain beginning at it, in reverse order: jn ; jn 1 ; :::; j2 ; i. When encountering jn ; the Judge will choose N (jn ) in order to avoid the penalties for violation of WP and Sym, and this

10

will induce him to choose the Nash point in the subsequent problems, in order to avoid the penalties for violation of Ind or ScInv. This happens, however, only if earlier possible “mistakes”, and the related penalties, are not overwhelming. Therefore this requires the past to be su¢ ciently discounted. When the past is strongly discounted, however, one may fear that once this particular sequence is past, the Judge may err again when confronted with an arbitrary following sequence of problems. We prove, however, that the particular sequence of special chains is powerful enough to impose the Nash solution on all subsequent problems. Theorem 1 The Judge converges almost surely to the Nash solution on every domain satisfying Condition (CN ) if and only if

1=3:

Proof. If. Let D be a domain satisfying Condition (CN ). Recall that by assumption D is …nite and the random process is regular. 1. Enumerate the problems in D as 1; 2; : : : ; M . For each problem i de…ne the special chain beginning at i as i; j2 (i); :::; jn(i) (i): Consider the following sequence of problems:

jn(1) (1); jn(1) 1 (1); :::; 1; jn(2) (2); ::::; 2; jn(3) (3); ::::; 3; :::; jn(M ) (M ); :::; M:

At every period, the probability that this sequence will occur at the next period is, by the assumption that the random process is regular, at least

jn(1) (1) jn(1)

1 (1)

:::

1 jn(2) (2) ::: 2 ::: jn(M ) (M ) ::: M

> 0:

Therefore, with probability one, this sequence occurs at a …nite date T . If N (jn(1) (1)) is not chosen, the penalty will be at least 1, since either WP or Sym will be violated. If, however, N (jn(1) (1)) is chosen, this will entail at most two violations with respect to all previous choices — namely, for any previous date, a violation of ScInv

11

and/or Ind. So the worst penalty that can be incurred is 2

2

T 1 X

t

<2

t=1

Since

1 X

t

=2

t=1

PT

1 t=1

t

: As

> 0;

:

1

1=3; one has 2

1;

1

so the Judge will choose N (jn(1) (1)). Indeed, this argument shows that any symmetric problem will be assigned the Nash point by the Judge, when it occurs. 2. Now consider a later element jn(1)

k

in the sequence, for k = 1; :::; n(1)

1. If the

Judge does not choose N (jn(1) k (1)), he violates either ScInv or Ind w.r.t. the previous date, so the penalty is at least . If he does choose N (jn(1) k (1)), he is penalized at most

2

T X

t

<2

1 X

k+1 t

=2

t=k+1

t=k+1

1

:

As one has k+1

2

k

1

;

the Judge chooses N (jn(1) k (1)): In this way we see that we have the Nash choice on the whole sequence. 3. Now let the element that occurs after this sequence be i. If the Judge does not choose N (i), he violates two axioms with respect to the previous occurrence of i in the sequence — namely, ScInv and Ind. The penalty is therefore at least 2 t for some PM PM 1 t j=2 n(j) + 1. (The lowest penalty is when i = 1.) Let Q = j=2 n(j) + 1.

On the other hand if he chooses N (i), he at most violates ScInv and Ind with respect to

all problems preceding the sequence (from the beginning of the history till Q + 1 periods

12

before) and is therefore penalized by no more than 1 X

2

Q+1 t

=2

t=Q+1

:

1

So he will choose N (i) as long as Q+1

2

This is equivalent to

2

1

Q

:

1=2; which holds true.

4. Assume that the Judge has chosen the Nash point for S periods after the end of the sequence (in the previous step we have shown this to be true for S = 1). Let the element that occurs at S + 1 be i. If the Judge does not choose N (i), he violates at least ScInv and Ind with respect to the previous occurrence of i in the sequence and the penalty is therefore at least 2

t

for some S + 1

S + Q. If he chooses N (i), he at

t

most violates ScInv and Ind with respect to all problems preceding the sequence (from the beginning of the history till S + Q + 1 periods before) and is penalized by no more than 1 X

2

S+Q+1 t

=2

t=S+Q+1

:

1

So he will choose N (i) as long as S+Q+1

2

which is equivalent to

2

1

S+Q

;

1=2:

By induction he chooses Nash henceforth. Only if. Suppose 1=3 <

< 1:

Let D = fi; jg ; as described in Fig. 1. The problem j is symmetric.

13

x

i j

Figure 1: Example: the domain D = fi; jg The fact that

> 1=3 is equivalent to 1 2

1

< 1:

Let T be an integer satisfying

T >

1 2

ln 1

There is a positive probability (at least

1

ln T i )

:

that history starts with T occurrences of i:

Suppose the Judge picks point x in i for t = 1; :::; T: Let j occur at t = T + 1: If the Judge chooses N (j) he violates ScInv and Ind w.r.t. P the previous T periods and the penalty is 2 Tt=1 t : If he chooses x he violates Sym and P the penalty is 1. If he chooses another point the penalty is 1 + 2 Tt=1 t : This last option

14

is therefore dominated by N (j): We have

2

T X t=1

t

>1 ,2

T

1 1

,T >

>1 ln 1

b c+d

1

ln

;

which is true by assumption. Therefore the judge picks x. Consider a period S > T + 1 and assume that x has been chosen at all times before (we know this to be true for S = T + 2). If i occurs; x is picked again without any penalty while any other point costs a penalty. If j occurs, picking x costs 1 while picking N (j) P P costs 2 St=11 t > 2 Tt=1 t : So, again x is chosen. By induction, at no period in the future can the Nash point be chosen.

Note that the result holds only if, as assumed in this paper,

> 0: When

= 0 the

Judge is tied only by WP and Sym and this is clearly insu¢ cient to make him converge to the Nash solution. Remark 1 Th. 1 remains true if we assume that the Judge takes his o¢ ce at a certain point in time, after an arbitrary history has unfolded, and feels bound by the previous decisions and the attached penalties. No matter how far from the Nash solution the antecedent decisions have been, he will converge almost surely to the Nash solution under the conditions of the theorem. Remark 2 These results depend on the Judge being myopic. For instance, in the second part of the proof of Th. 1 the Judge could anticipate that j will occur at some date and that the only way not to incur any penalty is to take N (i) right from the beginning. More on this issue will be said in Section 6. The main limitation of Th. 1 is that it applies only to domains for which Condition (CN ) holds. By Prop. 1, this is a strict subset of the set of Nash domains. It is easy to 15

weaken Condition (CN ) in such a way that Th. 1 remains valid over the corresponding larger set of domains. But the next proposition shows that Th. 1 does not generalize to the full set of all Nash domains. Moreover, this problem is independent of the particular system of penalties adopted. (This result does not even require the random process to be regular.) Proposition 2 There exists Nash domains such that, whatever ; whatever the value of the penalty attached to each axiom, and whatever the random process, convergence to N does not occur almost surely on such domains. The proof involves a tedious example with a ten-problem domain and is available on the authors’websites. This negative result is due to the particular way in which Ind may work in the characterization of the Nash solution for some domains. Observe that in the example given in the proof of Prop. 1, one must have '(i) = N (i) because j; k

i and the

constraints known about ' (j) and ' (k) force '(i) to belong to two di¤erent segments of @i; the intersection of which is fN (i)g : This is the static form of the axiomatic analysis. In the dynamic setting in which the Judge operates, this kind of constraint may be too weak to force him to choose N (i):5 This does not happen in this particular example because the constraints on ' (j) and ' (k) are ' (j) = ' (k) = N (i), so that, given the shape of these sets, a violation of Ind in j or k would occur if the Judge chose any non-Nash point in i: The proof therefore requires a more complicated example in which the constraints on the smaller sets are less precise so that the Judge may pick other points than the Nash point in these sets, and then also pick non-Nash points in the large set. One can see from the example proving Prop. 2 that the failure of convergence is not a convergence to another solution, but an oscillation between several solutions. One may then wonder if a stronger form of failure can occur, namely, convergence to another 5

When Ind imposes a penalty on the Judge only when the past set is the larger set (as discussed in fn ??), this constraint simply vanishes.

16

solution. The answer is, fortunately for the Nash approach, negative. (We assume again that the random process is regular.) Proposition 3 If

1=3 and convergence to a particular solution ' occurs with positive

probability in a Nash domain, then ' = N: Proof. Let D be a Nash domain and assume that convergence to a particular solution ' occurs with positive probability. This means that there is a set H of histories occurring with positive probability such that, for every history h 2 H, there is a …nite Th such that for all t

Th ; '(it ) is chosen in every it :

De…ne the following subset of H:

H0 = fh 2 H j 9i; j 2 D; the sequence (i; j) occurs only a …nite number of times in hg : H0 is a set of histories of measure zero because the process is regular. Thus, the set of histories H 0 = H n H0 is not empty (it has the same mass as H), and for every h 2 H 0 , for every i; j 2 D, the sequence (i; j) occurs an in…nite number of times. A fortiori, note that every i also occurs an in…nite number of times. We will now prove that ' obeys all the Nash axioms on D; since D is a Nash domain, it must be that ' = N: First, ' must satisfy Sym, because

1=3 and therefore, as shown in the proof of

Th. 1, the Judge always selects the Nash point (which is symmetric) in symmetric sets. Second, suppose ' does not satisfy WP. Let h 2 H 0 , and let date t be the …rst date in h at which '(it ) 2 = @it . By the argument of the previous paragraph, it is not symmetric. If the Judge selects '(it ) the penalty is at least 1 (for a violation of WP). If the Judge selects a point in @it ; he does not violate WP or Sym but may at worst violate ScInv and Ind with respect to all t

1 periods, so that the penalty is less than

2

+

2

+ ::: = 2 17

1

:

The penalty is therefore less than 1, as

1=3: Therefore the Judge will never choose

'(it ): As h 2 H 0 ; it occurs an in…nite number of times, which contradicts the assumption that convergence to ' occurs in every h 2 H 0 . Third, suppose that ' violates ScInv with respect to a particular pair (i; j): Let h 2 H 0 : As ' selects the Nash point in symmetric sets, and i and j occur in…nitely many times in h; i and j are not symmetric if convergence to ' is obtained in h. Moreover, h contains in…nitely many occurrences of (i; j): When such a sequence occurs, the fact that the combination of '(i) and '(j) violates ScInv implies that choosing '(j) costs at least : Choosing a point x 2 @j n '(j) costs less than 2

2

which is less than

as

2

+

3

+ ::: = 2

1

;

1=3: Therefore it is impossible for the Judge to choose '(j)

from j when (i; j) occurs. Convergence to ' cannot occur in h; a contradiction. Fourth, ' must satisfy Ind. Suppose that it violates it with respect to a particular pair (i; j): As ' selects the Nash point in symmetric sets, necessarily one of them is not symmetric, say j: One can then repeat the rest of the argument developed for ScInv and derive a contradiction.

4

Other solutions

We now examine how similar results can be obtained for the other two classical solutions of bargaining theory, the Kalai-Smorodinsky solution and the Egalitarian solution. They reveal interesting di¤erences with the Nash solution. One di¤erence is that special chains can now be found which exactly delineate the domains for which the characterization theorems hold true. Another di¤erence is that the theorem characterizing the Egalitarian solution has a smaller number of axioms.

18

4.1

Kalai-Smorodinsky

The Kalai-Smorodinsky solution is denoted KS: One has

KS(i) = fx 2 @i j x1 =x2 = I1 (i)=I2 (i)g : A domain D is called a Kalai-Smorodinsky domain if the Kalai-Smorodinsky theorem holds on D; i.e., if KS(:) is the only solution satisfying WP, Sym, ScInv and IMon on D: Condition (CKS ) For all i 2 D; there exists a sequence j1 ; : : : jn such that j1 = i; jn is symmetric and for all t = 1; : : : ; n (i) jt

jt+1 (or jt

1; either

jt+1 ) and I(jt ) = I(jt+1 ) and KS(jt+1 ) 2 @jt ; or

(ii) 9 2 R2++ , jt+1 is an –rescaling of jt : Again, and without risk of confusion with the previous section, let us call such a sequence a special chain beginning at i. Proposition 4 A domain D is a Kalai-Smorodinsky domain if and only if it satis…es Condition (CKS ). The proof of this proposition is tedious and is available on the authors’ websites. This result makes it possible to obtain the following theorem. Theorem 2 The Judge converges to KS almost surely on all Kalai-Smorodinsky domains if and only if

1=3:

Proof. The proof closely mimics the proof of Th. 1, with IMon replacing Ind.

4.2

Egalitarian solution

The Egalitarian solution is denoted E: One has

E(i) = fx 2 @i j x1 = x2 g : 19

A domain D will be called an E domain if the egalitarian solution is the only solution satisfying WP, Sym, and Mon on D: This egalitarian theorem is a variant of Th. 1 in Kalai (1977) and can be found in Thomson and Lensberg (1989, Th. 2.5) and Peters (1992, Th. 4.31). Condition (CE ) For all i 2 D; there exists a sequence j1 ; : : : jn such that j1 = i; jn is symmetric and for all t = 1; : : : ; n jt+1

1; E(jt ) = E(jt+1 ) and either jt

jt+1 or

jt :

Again the sequence j1 ; : : : jn will be called a special chain beginning at i: Proposition 5 A domain D is an E domain if and only if it satis…es Condition (CE ): Proof. If: Let ' be any solution on D satisfying the axioms of the egalitarian theorem. Let i 2 D: By Condition (CE ) there is a special chain j1 ; : : : jn beginning at i: By Sym, E(jn ) is chosen from jn and one rolls back along the special chain by applying Mon. This implies '(i) = E(i): Only if: Let D+ be the subset of D containing the problems i with a special chain. We must show that D+ = D if the egalitarian theorem holds on D: Suppose that there exists a problem k 2 D n D+ . Let Z = x 2 R2+ j 9i 2 D+ ; x = E(i) : Construct a monotone path P from zero for which the intersection with the 45 line coincides with Z on R2++ : More precisely, P is the graph of an increasing function f such that f (0) = 0; and fx 2 P j x1 = x2 g = Z [ f0g: Let ' be de…ned by: for all i 2 D; f'(i)g = P \ @i: By construction ' satis…es WP and Mon. It satis…es Sym because all symmetric problems are in D+ ; and ' coincides with E on D+ : But ' 6= E unless D = D+ ; which proves the “only if”part of the proposition. 20

The next theorem displays a more favorable threshold for ; thanks to the presence, in the egalitarian theorem, of fewer axioms involving a reference to past decisions. Theorem 3 The Judge converges to E almost surely on all E domains if and only if 1=2: The “if” part is a corollary of Th. 4. The converse is an immediate adaptation of the second part of the proof of Th. 1.

5

Generalization

The similarity between the results of the previous sections suggest an underlying common structure. In this section we provide a general result that covers more theorems and other frameworks than the bargaining model. Consider an abstract setting in which a problem is a subset i of a general set O of options and a solution '; de…ned on a domain D; has to pick an element of this set: '(i) 2 i. The axioms of a characterization theorem have two general forms and will be labelled 1k for k = 1; :::; K1 and 2k for k = 1; :::; K2 ; respectively. The …rst type of axiom, “unary” axioms, require the solution to be chosen from a speci…c subset of i whenever i is of a particular sort. Let D1k be a subset of the domain D and let Gk1 be a correspondence from D to 2O such that for all i 2 D; Gk1 (i)

i.

Axiom 1k: 8i 2 D; if i 2 D1k ; then '(i) 2 Gk1 (i): The second type of axiom, “binary”axioms, require the points chosen by the solution for two sets i; j to stand in a particular relation whenever these two sets are themselves related in a speci…c way. Let D2k be a subset of D2 that contains the pair (i; i) for all i 2 D; and let Gk2 be a correspondence from D2 to 2O Gk2 (i; j)

i

O

such that for all (i; j) 2 D2 ;

j. Moreover, we impose that for all i 2 D; Gk2 (i; i) contains no (x; y) such

that x 6= y: 21

Axiom 2k: 8 (i; j) 2 D2 ; if (i; j) 2 D2k ; then ('(i); '(j)) 2 Gk2 (i; j): Let us illustrate these general formulations with the axioms introduced in Section 2. Weak Pareto and Symmetry are of the …rst kind. For Weak Pareto, D1k = D and Gk1 (i) = @i: For Symmetry, D1k is the subset of symmetric problems and Gk1 (i) is the intersection of i with the 45 line. The other axioms are of the second kind. For Scale Invariance, D2k is the subset of pairs such that one set is a rescaling of the other, and Gk2 (i; j) is the set of pairs in i

j

such that one point is the rescaling of the other in the same proportion as for the sets i; j: For Nash Independence, D2k is the subset of pairs (i; j) such that i is the set of pairs (x; y) 2 i

j such that if y 2 i then x = y:

Gk2 (i; j) = f(x; y) 2 i

j j x = y or y 2 = ig :

For Monotonicity, D2k is also the subset of pairs (i; j) such that i set of pairs (x; y) 2 i

j; and Gk2 (i; j)

j such that x

j; and Gk2 (i; j) is the

y: And so on.

The binary axioms used in the previous sections all satisfy the restriction that for all i 2 D; Gk2 (i; i) contains no (x; y) such that x 6= y: This restriction is not needed in static axiomatics because by de…nition, '(i) is only one element of i: But in the sequential framework of the Judge, it is possible for him to choose di¤erent elements of i at di¤erent occurrences of i: It is then important that binary axioms give him incentives to choose consistently. One could imagine other types of axioms, involving a greater number of problems, such as: '(i) = '(j) =) '(i [ j) = '(i): This would require de…ning a system of penalties when the Judge violates such an axiom involving two problems having been treated at two di¤erent periods in the past. This

22

extension is left for future research. Let D be given, with a set of K1 unary axioms and K2 binary axioms. A special chain for a solution ' beginning at i in D is a sequence of problems j1 ; :::; jn 2 D such that j1 = i and: (i) for a subset K

f1; :::; K1 g ; jn 2

T

D1k and

k2K

k2K

(ii) for all t = 1; :::; n 1; there is a subset K and

(

x 2 jt j (x; '(jt+1 )) 2

T

\

k2K

Gk1 (jn ) = f' (jn )g;

f1; :::; K2 g such that (jt ; jt+1 ) 2 )

Gk2 (jt ; jt+1 )

T

D2k

k2K

= f' (jt )g :

What these conditions say is simple: for any solution '0 satisfying all the axioms, '0 (jn ) = ' (jn ) is imposed by the unary axioms, while for all pairs (jt ; jt+1 ) ; '0 (jt ) = ' (jt ) is imposed by the binary axioms if '0 (jt+1 ) = '(jt+1 ). One then sees that, rolling back the sequence from jn to j1 ; '0 (i) = '(i) is imposed by the combination of all the axioms. Note that in condition (ii) one could incorporate constraints on '0 (jt ) imposed by unary axioms in conjunction with binary axioms. One could also consider additional T constraints from binary axioms based on the symmetrical situation: (jt+1 ; jt ) 2 D2k k2K

and the set

(

x 2 jt j ('(jt+1 ); x) 2

\

k2K

)

Gk2 (jt+1 ; jt )

= f' (jt )g :

Such possibilities were actually used in the special chains de…ned in the previous sections. We ignore them here because it would not alter the results obtained in this section, but it would complicate the presentation. By the “rolling back”argument we have obtained the …rst part of the following result. Proposition 6 A solution ' is the only one that satis…es all the K1 + K2 axioms on a …nite domain D if for all i 2 D there is a special chain for ' beginning at i: The converse does not hold in general. We do not need to prove the second part of this statement because from Prop. 1 we 23

already know that the converse is not true in general. Indeed, in general there are many other ways of forcing a precise value of '(i) than by a special chain beginning at i; and it is somewhat surprising that we could obtain the converse for the Kalai-Smorodinsky and the Egalitarian solutions. Let us assume that the minimal (undiscounted) penalty for the violation of any axiom in the Judge’s court is a; and that for a binary axiom, the average penalty is b. The key number in the following theorem is the ratio of penalties K2 b=a; which is a lower bound for the “interest rate”with which the Judge discounts the past. The critical interest rate never increases when a penalty involving a unary axiom increases. Indeed, a greater weight for these axioms reinforces the right choice when a set jn occurs and never encourages the Judge to preserve past “mistakes”. The role of the binary axioms and their penalties is more subtle. The critical interest rate increases with a penalty for a binary axiom if it is greater than another penalty, because this raises b without altering a; but r decreases if the penalty for a binary axiom is lower than all other penalties, because K2 b and a then increase by the same increment. This pattern can be explained as follows. When a binary axiom has heavy relative weight, this may give too much in‡uence to past “mistakes”. However, when its associated penalty is small relative to the others, it is good to increase it in order to force the Judge to take account of the good decisions that have been made under the stronger pressure of the other axioms. Theorem 4 Assume that for every i 2 D there is a special chain for ' beginning at i: The Judge converges almost surely to the solution ' if

r

K2 b : a

(1)

Proof. The structure of the proof is similar to the proof of Th. 1. The quantity K2 b2 is the greatest penalty that the Judge may incur for a violation of binary axioms. The

24

second inequality in (1) is equivalent to

K2 b

a:

1

1. Enumerate the problems in D as 1; 2; : : : ; M . For each problem i de…ne the special chain beginning at i as i; j2 (i); :::; jn(i) (i): Consider the following sequence of problems:

jn(1) (1); jn(1) 1 (1); :::; 1; jn(2) (2); ::::; 2; jn(3) (3); ::::; 3; :::; jn(M ) (M ); :::; M:

With probability one, this sequence occurs at a …nite date T . If '(jn(1) (1)) is not chosen, the penalty will be at least a, since a unary axiom will be violated. If, however, '(jn(1) (1)) is chosen, this will entail at most K2 violations of binary axioms with respect to all previous choices. So the worst penalty that can be incurred is P K2 b Tt=11 t : As > 0; K2 b

T 1 X

t

< K2 b

t=1

1 X

t

= K2 b

t=1

:

1

Since by (1) K2 b

a;

1

the Judge will choose '(jn(1) (1)). For the same reason, the Judge will choose '(jn(t) (t)) for t = 2; :::; M: 2. Consider another element jn(1) k (1); k = 1; :::; n(1)

1; in the sequence. If the

Judge does not choose '(jn(1) k (1)), he violates at least one binary axiom with respect to the previous date, so the penalty is at least a . If he does choose '(jn(1) k (1)), he is penalized at most K2 b

T X

t=k+1

t

< K2 b

1 X

t=k+1

25

k+1 t

= K2 b

1

:

As k+1

K2 b

a ;

1

the Judge chooses '(jn(1) k (1)): Therefore ' is chosen throughout the sequence. 3. Let the element that occurs after this sequence be i. If the Judge does not choose '(i), he violates all binary axioms with respect to the previous occurrence of i in the P sequence. The penalty is therefore K2 b t for some 1 t Q = M j=2 n(j) + 1. On the other hand if he chooses '(i), he at most violates K2 binary axioms with respect to all

problems preceding the sequence (from the beginning of the history till Q + 1 periods before) and is therefore penalized by not more than 1 X

K2 b

Q+1 t

= K2 b

t=Q+1

:

1

So he will choose '(i) as long as Q+1

K2 b

K2 b

1

Q

;

which is satis…ed because a K2 b

1

1:

4. Assume that the Judge has chosen the ' point for S periods after the end of the sequence. Let the element that occurs at S + 1 be i. If the Judge does not choose '(i), he violates K2 binary axioms with respect to the previous occurrence of i in the sequence and the penalty is therefore K2 b

t

for some S + 1

t

S + Q. If he chooses '(i), he

at most violates K2 binary axioms with respect to all problems preceding the sequence (from the beginning of the history till S + Q + 1 periods before) and is penalized by less than K2 b

1 X

S+Q+1 t

= K2 b

t=S+Q+1

26

1

:

So he will choose '(i) as long as S+Q+1

K2 b

K2 b

1

S+Q

:

This is equivalent to the condition obtained in step 3. It seems di¢ cult to obtain a converse to Th. 4 because the counterexamples constructed in the previous sections relied on the speci…cs of the models and solutions under consideration. Remark 3 In the previous sections we assumed a = b = 1, in which case the premise in Th. 4 becomes r

K2 ; or equivalently,

1 . 1+K2

This explains why the upper bound for

with the egalitarian solution (1=2, for K2 = 1) di¤ers from the bound for the Nash and Kalai-Smorodinsky solutions (1=3; for K2 = 2). Remark 4 We noticed in Section 2 that the assumption that the violation of a binary axiom never counts for more than ; which is less than the penalty for a unary axiom, appears restrictive. It is di¢ cult to escape this pattern, though. The more general system of penalties considered in this section allows for a relative penalty for binary axioms, b=a; that is as large as one wishes. However, the inequality r

K2 b=a implies that one always

has K2 b < a; because a 1 < : K2 b 1 + Ka2 b The inequality K2 b < a means that all the binary axioms together always have a lower discounted penalty than any unary axiom. It is intuitive that this must hold if one wants the Judge to always make the right choice in every set jn of a special chain.

27

6

Foresight

It would be against the philosophy of our approach to endow the Judge with foresight, according to which he would compute the e¤ect of his present decision on penalties he is likely to incur in the future. For our approach is one of bounded rationality and learning, not full rationality. In addition, foresight is not an important aspect of the doctrine of real-world judicial precedent, because the judges typically focus on consistency with past judgments rather than on the constraints their current decisions will impose on future related cases, so our approach is not far-fetched. Even with foresight, however, the problem does not become trivial if the Judge has to live with an arbitrary set of precedents which he inherits upon taking o¢ ce, and which will determine penalties he incurs in the future. It is then possible that historical errors will continue to in‡uence his decisions, and prevent convergence to the “correct”solution. In this section, we present an example showing that this can indeed occur when the Judge has foresight. We adopt the framework of Section 3 (focusing on the Nash solution in the axiomatic bargaining model) and assume that the Judge knows the probability law governing the occurrence of successive problems. He discounts the future penalties with a factor : Suppose he starts his job at time 0; after an arbitrary sequence of decisions have been made for periods

T; :::; 1: He faces a problem i0 and devises a conditional

strategy x0 ; x1 (i1 ); x2 (i1 ; i2 ); :::; xt (i1 ; :::; it ); ::: When a particular history of problems i1 ; i2 ; ::: is realized he must pay the total discounted P penalty t 0 t pt ; where pt is the penalty paid in t for violations of unary axioms in t and violations of binary axioms in t with respect to past decisions (with the discount

factor ). Knowing the probability of occurrence of all possible histories he can then P compute the expected value of t 0 t pt for a given conditional strategy, and select the

conditional strategy that minimizes this quantity. When history unfolds, he only has to 28

follow the conditional strategy. Note that the conditional strategy, and the computation P of the expected value of t 0 t pt ; can incorporate the fact that the observed sequence of problems up to t may alter the probability of occurrence of future problems for t+1; t+2; :::

What has been done in the previous sections corresponds to the special case in which = 0: The Judge then only has to choose xt so as to minimize pt ; and it su¢ ces that he does so sequentially for the actual sequence of problems, ignoring the counterfactual problems. Consider for a moment that history does start at period 0, i.e., there is no arbitrary sequence of precedents. If the domain satis…es the chain condition (i.e., a special chain begins at every member), and the random process is regular, then the only way to avoid penalties in the future is to follow the solution characterized by the axioms. Whenever > 0; the Judge then always follows the solution. We now show that, in contrast, when an arbitrary sequence of precedents encumbers the Judge’s decisions, a positive

may not su¢ ce to converge to the solution. Consider

the example of Th. 1. Suppose that the past history consists of T times x (it does not matter whether i or j was the set). Let j occur at period 0. Suppose the Judge knew that the history that will occur beginning at t = 0 is an in…nite sequence of j’s. We will calculate in a moment the condition under which it would minimize his total discounted penalty to continue playing x forever, and hence never converge to the Nash solution. Now if this condition holds, it must be the case that, not knowing what the sequence will be except that j has occurred at t = 0, his best strategy is to play x forever. For when playing x forever, the largest total penalty he can ever incur is when the sequence beginning at t = 0 is an in…nite sequence of j’s. (He would never pay a penalty when i occurs in a history under this strategy, but he pays a penalty whenever j occurs.) And if under this special history it is rational for him to stick to playing x, then it must be the optimal conditional strategy as well. Let us prove that, if the Judge thinks that only j will occur from t = 0 on, at period 29

0 he adopts the strategy to retain x forever. If he retains x he pays an expected penalty of 1= (1

) : If he switches to N (j) he pays

2

T X

t

+ 2

t=1

T +1 X

t

t=2

T

1 + ::: = 2 1

1 1

:

The latter is greater than the former if 11 2 1

ln 1 T >

1

:

ln

This formula requires 11 2 1

1

< 1;

which is true if < The condition on

1

1 1

(1

1 2

)

1 2

and

1 > : 3

is the same as in the counterexample of Th. 1, which is interesting

because it shows that the presence of foresight does not radically alter the constraints on : To illustrate, one obtains a lack of convergence with, e.g.,

= 0:8;

= 0:95; and

T = 5:

7

Conclusion

An interesting fact is that, in all the results of this paper, we get convergence to the solution precisely when discounting the future is large. This is somewhat counterintuitive: one might have thought that convergence to the solution would occur only for intermediate values of the discount rate, because even if the past decisions must be easily forgotten when they are bad, they must also retain some force when they are good. As it turns

30

out, for the latter concern it is enough if the past is not completely ignored ( > 0). This can be understood by the fact that when convergence is taking place, the good decisions are typically more recent than the bad decisions. Forgetting the latter is then at least as important as remembering the former, and is obtained with a low . However, the analysis of general systems of penalties in Section 5 has shown that it is indeed bad for convergence if some binary axiom induces too low a penalty relatively to the other axioms. This indeed creates the risk that the recent good decisions are binding only through this “feeble”axiom and their in‡uence on the current decision may be overwhelmed by the previous bad decisions which may bind through other axioms. It is in this mechanism that the intuition that the past must retain some power is vindicated. The approach proposed in this paper may suggest a ranking of characterization theorems. Suppose we have a set T of axiomatic theorems of the type we have discussed here, and for each theorem

2 T we prove that in the benchmark case, almost-sure convergence

to the appropriate solution occurs if and only if

2 (0;

]. This provides a way of ranking

the axiomatic theorems in terms of plausibility: the greater is

, the more plausible is

the theorem, in the sense that the dynamic version of the theorem (as developed here) holds for a larger set of discount factors. Thus, we would say that the egalitarian theorem is more plausible than Nash’s theorem or Kalai and Smorodinsky’s. To be precise, we are saying that if we observe societies which abide by an egalitarian constitution and societies which abide by a Nash constitution, and discount factors vary across societies randomly, then it is more likely that we will observe allocations looking like the egalitarian solution in the egalitarian societies than allocations looking like the Nash solution in Nash societies, because (0; 1=3]

(0; 1=2].

An issue that we did not explore in this paper is the speed of convergence. Almostsure convergence is obtained in our results with the help of a particular sequence of problems, all special chains for all members of the domain in a row, which is a rather unlikely event. For a domain with n problems, each having a special chain of average 31

length m; this requires a particular arrangement of nm problems, with n! acceptable permutations of this arrangement. The expected number of periods needed for one of these arrangements to occur is large. For n = 10; m = 2; and assuming a random process with iid draws and equiprobable problems, the expected number of periods is around 7:6

1026 :6 Convergence can nevertheless occur in other cases, for instance if all special

chains occur in a sequence, but without a repetition of problems (i.e., if a special chain has appeared, its elements do not appear again in the arrangement; if two or more sets share the end of their special chains but not the beginning, one chain is followed by the remaining part of the other chains). The length of the special sequence of problems is then reduced from nm to n: In the above example with n = 10; m = 2; assuming that there are four special chains the expected time of convergence is reduced to 1:7

1017 ; still a large

number but signi…cantly less so. The adaptation of our proofs of almost-sure convergence to this shorter sequence is straightforward. This, however, provides only a very rough upper bound of the expected time of convergence. We leave this issue for future research. A related issue, also left for future research, is the computation of the probability of convergence, which may be high without being equal to one when

is greater than the

threshold identi…ed here.

References [1] Kalai E. 1977, “Proportional solutions to bargaining situations: interpersonal utility comparisons”, Econometrica 45: 1623-1630. [2] Kalai E., M. Smorodinsky 1975, “Other solutions to Nash’s bargaining problem”, Econometrica 43: 513-518. [3] Nash J.F. 1950, “The bargaining problem”, Econometrica 18: 155-162. 6

The probability of one of the acceptable arrangements to start at any given time is p = n!=nnm ; and the expected time of occurrence is (1 p) =p2 :

32

[4] Peters H.J.M. 1992, Axiomatic Bargaining Game Theory, Dordrecht: Kluwer. [5] Roemer J.E. 1996, Theories of Distributive Justice, Cambridge: Harvard University Press. [6] Thomson W., T. Lensberg 1989, Axiomatic theory of bargaining with a variable number of agents, Cambridge: Cambridge University Press.

Appendix (made available on the authors’websites) Proof of Prop. 2. Let D = fi; j; 3j; k; 2k; l; 3l; m; 2m; ng ; for i = co f(0; 0) ; (12; 0) ; (0; 12)g j = co f(0; 0) ; (3; 0) ; (2; 2) ; (0; 2)g k = co f(0; 0) ; (3; 0) ; (2; 2) ; (0; 2:5)g l = co f(0; 0) ; (2; 0) ; (2; 2) ; (0; 3:5)g m = co f(0; 0) ; (2:5; 0) ; (2; 2) ; (0; 3:5)g n = co f(0; 0) ; (6; 0) ; (4; 4) ; (0; 7)g : The ten domains are illustrated in Figure 2. 1. The Nash theorem holds on D: By WP and Sym, '(i) = N (i) = (6; 6): By Ind, '(3j) = N (3j) = (6; 6) and '(3l) = N (3l) = (6; 6): By ScInv, '(j) = N (j) = (2; 2) and '(l) = N (l) = (2; 2): By Ind, '(k) must be on the segment (2; 2) (0; 2:5) and '(m) must be on the segment (2:5; 0) (2; 2) : By ScInv, '(2k) must be on the segment (4; 4) (0; 5) and '(2m) must be on the segment (5; 0) (4; 4) : Therefore, by Ind, '(n) must be on the segment (4; 4) (0; 7) and on the segment (6; 0) (4; 4) ; which implies that '(n) = N (n) = (4; 4):

33

6

j

6

s

s s

n s

3j

s

-

6

2k

s

-

ss

-

s

k

6 s

s

s

i

s s

s

l

s

3l

n s

i -

s

s s

m 2m

Figure 2: The domain used in the proof of Prop. 2 Then, by Ind, '(2k) = N (2k) = (4; 4) and '(2m) = N (2m) = (4; 4). By ScInv, '(k) = N (k) = (2; 2) and '(m) = N (m) = (2; 2), which completes the proof. 2. Consider the following strategy for the Judge (it is illustrated in Fig. 2): - for i; pick N (i) or (9; 3), whichever has lower penalty (this clause applies to all choices below); - for j; pick N (j) or (2:5; 1); - for 3j; pick N (3j) or (7:5; 3); - for k; pick (1; 2:25) or (2:5; 1); - for 2k; pick (2; 4:5) or (5; 2); - for l; pick N (l) or (1; 2:75); - for 3l; pick N (3l) or (3; 8:25); 34

- for m; pick (2:25; 1) or (1; 2:75); - for 2m; pick (4:5; 2) or (2; 5:5); - for n; pick (5; 2) or (2; 5:5). Note that for the sets k; 2k; m; 2m and n; the Nash point is not one of the points possibly selected by this strategy. We now prove that whatever the previous sequence of problems (it )t=1;:::;T

1

(including the case in which there is no previous sequence and

T = 1); if this strategy has been followed on t = 1; :::; T

1 the Judge can always minimize

penalty by sticking to the strategy in T . We consider a general system of penalties: the penalty attached to WP, Sym, ScInv, Ind is, respectively, a; b; c; d (with discounting by for the last two). In order to check that the strategy is optimal, we have to check that its recommendations minimize the penalty in every set of the domain. In every set, we will ignore the points which do not belong to the upper frontier, as every such point always entails a greater penalty (due to WP) than some point on the frontier. - For i : the choice in @i is constrained only by Sym, and by choices made in i; 3j and 3l: Let

T1 = ft < T j (it ; xt ) = (3j; (7:5; 3)) or (3l; (3; 8:25))g ; T10 = ft < T j (it ; xt ) = (i; N (i))g ; T100 = ft < T j (it ; xt ) = (i; (9; 3))g : P P The penalty for choosing N (i) is t2T1 T t d + t2T 00 T t (c + d) : The penalty for 1 P T t choosing (9; 3) is b + t2T 0 (c + d) : The penalty for choosing any other point on @i 1 P is b + t2T 0 [T 00 T t (c + d) : The last value is never lower than the penalty for (9; 3); and 1

1

therefore the Judge minimizes the penalty by choosing either N (i) or (9; 3); as dictated by the strategy. Observe that the reasoning can be simpli…ed. The penalties due to previous choices 35

made in i push toward making the same choices as previously and therefore reinforce the contemplated strategy. In the sequel we will ignore the penalties due to previous choices in the same set. If we can prove that the strategy minimizes the sum of the remaining penalties, a fortiori it minimizes the complete sum of penalties. - For j : the choice in @j is constrained only by choices made in 3j and in k (ignoring j itself). Let T2 = ft < T j (it ; xt ) = (3j; (7:5; 3))g ; T3 = ft < T j (it ; xt ) = (k; (2:5; 1))g ; T4 = ft < T j (it ; xt ) = (3j; N (3j))g :

P P The penalty for choosing N (j) is t2T2 T t c+ t2T3 T t d: The penalty for choosing P (2:5; 1) is t2T4 T t c: The penalty for choosing any other point on @j is the sum of these penalties. Whichever of the …rst two values is lower, it can be obtained by the strategy.

- For k : the choice in @k is constrained only by choices made in j and in 2k (ignoring k itself). Let T5 = ft < T j (it ; xt ) = (2k; (5; 2))g ; T6 = ft < T j (it ; xt ) = (2k; (2; 4:5))g ; T7 = ft < T j (it ; xt ) = (j; N (j))g :

P The penalty for choosing (1; 2:25) is t2T5 T t c: The penalty for choosing (2:5; 1) P P T t is c + t2T7 T t d: The penalty for choosing any other point is at least t2T6 P T t c: This is necessarily as least as great as the lowest of the …rst two valt2T5 [T6 ues because for non-negative numbers x; y; z one always has x + y

min fx; y + zg :

- For 3j : the choice in @j is constrained only by choices made in i and in j (ignoring 3j itself). Let T8 = ft < T j (it ; xt ) = (j; (2:5; 1))g ; T9 = ft < T j (it ; xt ) = (j; N (j))g ; T10 = ft < T j (it ; xt ) = (i; N (i))g :

P The penalty for choosing N (3j) is t2T8 T t c: The penalty for choosing (7:5; 3) is P P T t c + t2T10 T t d: The penalty for choosing any other point on @3j is the sum t2T9 P P of the previous two values, t2T8 [T9 T t c + t2T10 T t d: - For 2k : the choice in @2k is constrained only by choices made in k and in n (ignoring 36

2k itself). Let T12 = ft < T j (it ; xt ) = (k; (1; 2:25))g ; T13 = ft < T j (it ; xt ) = (k; (2:5; 1))g ; T14 = ft < T j (it ; xt ) = (n; (5; 2))g :

P The penalty for choosing (5; 2) is t2T12 T t c: The penalty for choosing (2; 4:5) is P P P T t T t c at least, c + t2T14 T t d: Any other point on @2k incurs t2T12 [T13 t2T13 which is not less than the lowest of the …rst two values.

- The cases of l; m; 3l; 2m are similar to, respectively, j; k; 3j; 2k: - For n : the choice in @n is constrained only by choices made in 2k and in 2m (ignoring n itself). Let T15 = ft < T j (it ; xt ) = (2k; (2; 4:5))g ; T16 = ft < T j (it ; xt ) = (2m; (4:5; 2))g : P T t The penalty for choosing (5; 2) is d: The penalty for choosing (2; 5:5) t2T15 P is t2T16 T t d: Any other point on @n incurs at least one of the two penalties. This concludes the proof.

Proof of Prop. 4. If: Let ' be any solution on D satisfying the axioms WP, Sym, ScInv and IMon. Let i 2 D: By Condition (CKS ) there is a special chain j1 ; : : : jn beginning at i: By Sym, '(jn ) = KS(jn ) and one can roll back along the special chain to i; and at each step '(jk ) = KS(jk ) either by IMon (case (i)) or by ScInv (case (ii)). For k = 1, we have '(i) = KS(i): It follows that ' = KS on D: Only if: 1. For all solutions ' and all problems k; i 2 D such that either k k

i or

i; let C' (k; i) denote the constraint imposed on '(k) by '(i); IMon and WP, i.e.,

C' (k; i) is the subset of @k such that IMon is not violated if '(k) 2 C' (k; i); given '(i). Speci…cally, there are eight cases: (i) I(k) = I(i) and '(i) 2 @k: Then C' (k; i) = f'(i)g : (ii) I(k) = I(i) and '(i) 2 k n @k: Then C' (k; i) = fx 2 @k j x (iii) I(k) = I(i) and '(i) 2 = k: Then C' (k; i) = fx 2 @k j x

'(i)g :

'(i)g :

(iv) I1 (k) = I1 (i) and I2 (k) < I2 (i): Then C' (k; i) = fx 2 @k j x2

'2 (i)g :

(v) I1 (k) = I1 (i) and I2 (k) > I2 (i): Then C' (k; i) = fx 2 @k j x2

'2 (i)g :

37

(vi) I1 (k) < I1 (i) and I2 (k) = I2 (i): Then C' (k; i) = fx 2 @k j x1

'1 (i)g :

(vii) I1 (k) > I1 (i) and I2 (k) = I2 (i): Then C' (k; i) = fx 2 @k j x1

'1 (i)g :

(viii) I1 (k) 6= I1 (i) and I2 (k) 6= I2 (i): Then C' (k; i) = @k: Note that if ' satis…es WP and '(k) 2 C' (k; i); necessarily '(i) 2 C' (i; k): This can be checked for each case: (i) This also corresponds to case (i) for i : '(k) = '(i) and C' (i; k) = C' (k; i): (ii) This corresponds to case (iii) for i : As '(i) 2 k n @k, '(k) > '(i); '(k) 2 = i and C' (i; k) = fx 2 @i j x

'(k)g :

(iii) This corresponds to case (ii) for i : As '(i) 2 = k; '(k) < '(i); '(k) 2 i n @i and C' (i; k) = fx 2 @i j x

'(k)g :

(iv) This corresponds to case (v) for i fx 2 @i j x2

'2 (i) and C' (i; k)

=

: '2 (k)

'2 (i) and C' (i; k)

=

'2 (k)g :

(v) This corresponds to case (iv) for i fx 2 @i j x2

'2 (k)

:

'2 (k)g :

(vi),(vii) are treated similarly. (viii): This also corresponds to case (viii) for i and C' (i; k) = @i: 2. Let D+ denote the subset of D containing the problems i at which a special chain begins. We must show that D+ = D if the Kalai-Smorodinsky theorem holds on D: Suppose that D n D+ is not empty. For every k 2 D n D+ : 1) k is not symmetric because symmetric problems are in D+ ; 2) k is not a rescaling of a problem i 2 D+ because otherwise k; i; ::: starts a special chain beginning at k, implying that k 2 D+ ; 3) k can be related by IMon to a problem i 2 D+ . But necessarily, either I(k) 6= I(i) or KS(i) 2 = @k; otherwise, a special chain beginning at k can be formed and thus k 2 D+ : The set D n D+ can be partitioned into the equivalence classes of the equivalence relation “is a rescaling of”. Say that two equivalence classes E; E 0 are “linked” if there is i 2 E; i0 2 E 0 such that I(i) = I(i0 ); KS(i) = KS(i0 ); and either i 38

i0 or i

i0 :

The relation “is linked to” is not transitive in general because it may happen that, for a particular triple E; E 0 ; E 00 ; E and E 0 are linked via two problems i; i0 ; E 0 and E 00 are linked via j 0 ; j 00 ; and E and E 00 are not linked. We will also be interested in its transitive closure, called “directly or indirectly linked to”. 3. Pick one particular equivalence class E (for the rescaling relation) and all the equivalence classes that are directly or indirectly linked to it. Call the union of these classes D : This is a subset of D n D+ (not necessarily a proper subset). Pick a member of E , i : E and i will play a special role in the rest of the proof. If D 6= E ; let E be any other equivalence class in D : Consider …rst the case in which the equivalence classes E and E are linked by two problems i 2 E; j 2 E : Therefore there exists (not necessarily in E) a rescaling of i; denoted iE ; such that I(i ) = I(iE ); KS(i ) = KS(iE ); and either i

iE or i

iE : Consider now the case in which E and

E are only indirectly linked (which means that they are not linked but are directly or indirectly linked). One can then pick an arbitrary i in E and construct a rescaling of i; denoted iE ; such that I(i ) = I(iE ): (In this case there is no guarantee that KS(i ) = KS(iE ); i

iE or i

iE :)

For every equivalence class E in D ; one can construct such a iE following the approach described for each of the two cases in the previous paragraph. The problems iE may or may not belong to D : Let D

be the (possibly empty) subset of these problems

iE that do not belong to D : Note that D

\ D+ = ?; because if one had iE 2 D+ ;

any k 2 E; being a rescaling of iE ; would then be the beginning of a special chain, contradicting the fact that E \ D+ = ?: 4. Consider any two problems k 2 D ; i 2 D n D : Suppose that k is related to i by IMon, which implies that k

i or k

i. Necessarily, either I(k) 6= I(i) or KS(i) 2 = @k;

as we now show. First, suppose that i 2 D+ : Then I(k) = I(i) and KS(i) 2 @k would imply that k 2 D+ ; a contradiction. Second, suppose that i 2 D n D+ : In this case, I(k) = I(i) and KS(i) 2 @k would imply that the equivalence classes of k and i are 39

linked, contradicting the fact that k 2 D and i 2 D n D : We now derive consequences from the fact that either I(k) 6= I(i) or KS(i) 2 = @k whenever k 2 D

and i 2 D n D

are related by IMon.

Consider …rst the

case I(k) 6= I(i): Four subcases are possible: CKS (k; i) = fx 2 @k j x2 CKS (k; i) = fx 2 @k j x2 fx 2 @k j x1

KS2 (i)g ; CKS (k; i) = fx 2 @k j x1

KS2 (i)g ;

KS1 (i)g ; CKS (k; i) =

KS1 (i)g : Focus on the …rst subcase, the other subcases being similar. Be-

cause this …rst subcase corresponds to k

i; I1 (k) = I1 (i) and I2 (k) < I2 (i); one then

has KS2 (k)=KS1 (k) < KS2 (i)=KS1 (i): As k (which belongs to CKS (k; i)) satis…es x^1

i; the point x^ 2 @k such that x^2 = KS2 (i) KS1 (i) and thus x^2 =^ x1

KS2 (i)=KS1 (i):

Therefore KS(k); which is obviously an element of CKS (k; i); satis…es KS2 (k)=KS1 (k) < x^2 =^ x1 and is not an extreme point of CKS (k; i). Consider the second case, KS(i) 2 = @k: As the subcase in which I(k) 6= I(i) has already been examined, we can focus on the subcase in which I(k) = I(i): One then has either KS(k)

KS(i) or KS(k)

KS(i)

and again KS(k) is not an extreme point of CKS (k; i). T 5. For every k 2 D ; let C(k) = i2DnD CKS (k; i): This set contains KS(k) and KS(k) is not an extreme point of it, because these two properties are satis…ed by each of

the CKS (k; i); of which there is a …nite number. Take any k 2 D : Let E be its equivalence class. For every k 2 E; there is

2 R2++

such that k is an -rescaling of k: Let Cjk (k) denote the -rescaling of C(k): This is a subset of @k : Note that KS(k ) is an -rescaling of KS(k): As KS(k) is a non-extreme element of C(k); then KS(k ) is a non-extreme element of Cjk (k): Therefore the subset T k2E Cjk (k) contains KS(k ) and KS(k ) is not an extreme point of this subset. Let C (k ) denote this subset. Note that when k is a rescaling of k 0 ; C (k) is a rescaling of C (k 0 ):

iE

Take any iE 2 D ; where E is an equivalence class in D : Recall that iE 2 = D but T is a rescaling of each member of E: The subset k2E CjiE (k) contains KS(iE ) and

KS(iE ) is not an extreme point of this subset. Let C (iE ) denote this subset. Note that 40

for all k 2 E; C (iE ) is a rescaling of C (k): 6. Now let us look again at the particular i and all the iE (that may belong to D or D ) that were introduced in step 3. One has I(i ) = I(iE ) and therefore KS2 (i )=KS1 (i ) = KS2 (iE )=KS1 (iE ) for all E in D ; while KS(i ) is not an extreme point of C (i ) just as KS(iE ) is not an extreme point of C (iE ): Therefore there is 2 R2++ ;

6= KS2 (i )=KS1 (i ) such that the point x 2 @i such that x2 =x1 =

to C (i ) and for all E in D ; the point x 2 @iE such that x2 =x1 =

belongs

belongs to C (iE ):

Now we are ready to de…ne a solution ' as follows. On D n D ; it coincides with KS: 2 R2++ and E in D such that k is an -rescaling of iE (or i ); then

For k 2 D ; there is

'(k) is the point x 2 @k such that x2 =x1 = (

2= 1)

: Note that, as C (k) is a -rescaling

of C (iE ) (or of C (i )), this implies that '(k) 2 C (k): 7. It is obvious that ' satis…es WP, Sym and ScInv. It also obviously satis…es IMon on D n D . Consider two problems k 2 D ; i 2 DnD that are related by IMon. By construction, '(k) 2 C (k)

C(k)

CKS (k; i) = C' (k; i); where the last equality is due to '(i) =

KS(i): And conversely this implies '(i) 2 C' (i; k): Consider two problems i; k 2 D that are related by IMon. First case: If they belong to the same equivalence class, IMon is satis…ed because it is implied by ScInv in this case. Second case: Suppose i is a rescaling of iE ; k a rescaling of iE 0 (one problem among iE ; iE 0 may be i ). One has '2 (iE )='1 (iE ) = '2 (iE 0 )='1 (iE 0 ) = : Without loss of generality, suppose that i

k and I1 (i) = I1 (k): If I2 (i) = I2 (k); then i and k are

-rescalings of iE and iE 0 ; respectively, for the same Then '2 (i)='1 (i) = '2 (k)='1 (k) = ( then '2 (i)='1 (i) = ( 2

<

0 2:

2= 1)

2= 1)

(recall that I(iE ) = I(iE 0 ) = I(i )).

and IMon is satis…ed. If I2 (i) < I2 (k);

and '2 (k)='1 (k) = (

0 2= 1)

for some

1;

2;

0 2

such that

IMon would be violated if one had '2 (k) < '2 (i): This inequality is equivalent

41

to

'2 (k) = '1 (k)(

0 2= 1)

'1 (k)(

One would then obtain '(i)

< '1 (i)(

0 2= 2)

2= 1)

= '2 (i);

< '1 (i):

'(k); contradicting the fact that i

k and that ' satis…es

WP. Therefore IMon is satis…ed. The solution ' coincides with KS only on D n D ; which shows that if D n D+ is not empty, the Kalai-Smorodinsky theorem does not hold on D. This achieves the proof of the "only if" part of the proposition.

42

Judicial precedent as a dynamic rationale for axiomatic ...

Aug 23, 2010 - k % і, let C#(k;і) denote the constraint imposed on φ(k) by φ(і), IMon and WP, ... C#(k;і) is the subset of дk such that IMon is not violated if φ(k) .

285KB Sizes 1 Downloads 139 Views

Recommend Documents

One Click Youre Guilty- A Troubling Precedent for Internet Ch.pdf ...
One Click Youre Guilty- A Troubling Precedent for Internet Ch.pdf. One Click Youre Guilty- A Troubling Precedent for Internet Ch.pdf. Open. Extract. Open with.

Rating Rationale for JKCL Ltd - Brickwork Ratings
Aug 19, 2015 - Mr. Vijay Biyani and Mr. Rakesh Biyani assuring the performance by FRL & FLFL of their respective obligations under the MLA and Escrow ...

background and rationale
14 Sep 2007 - Nepal mean not that federalism is a burden for us; rather a thorough .... terms of caste/ethnicity, linguistic group, religion and geographical disparity. ...... Caste/Ethnicity and Religion in. Nepal", Population Monograph of Nepal, Vo

Encouraging Essentials for a Dynamic Ministry
Keep your eyes open, hold tight to your convictions, give it all you've got, .... its own way. ... For these and related resources, visit www.insightworld.org/store.

background and rationale
Sep 14, 2007 - The country has three big rivers that flow north to south, originating from the Himalayas, with numerous small rivers, streams and fountains connected to them. These rivers and streams have the capacity to produce 840,000 mega watts of

Google+ Social Media Marketing Pre-Campaign Report Rationale for ...
Google+ Social Media Marketing Pre-Campaign Report. Rationale for ... The biggest part of the content (65 %) will consist of relevant information about spinal ... + 10 new followers. Average of. 2. +1's per post. Average of. 2 comments per post.

European judicial systems
2015, in order to receive new data at the end of 2015, using the electronic ...... In some States or entities, the court budget comprises a very large share ...... prosecutors participate in the planning of surveillance necessary to combat and detect

Judicial Discretion.pdf
filling up of 74 vacant posts of Assistant Prosecuting. Officers without delay. 5. The High Court allowed the Writ Petition. The High. Court relied on Uttar Pradesh Prosecuting Officers. Service Rules, 1991 (hereinafter referred to as 'the. 3. WWW.LI

European judicial systems
by using, amongst other things, common statistical criteria and means of ... allowing each national correspondent to access a secure website to transmit their .... a dynamic data base opened to the public on the Internet, including a data ...

An Empirical Rationale for Foreign Languages in ...
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission,

Lecture: Judicial Activism vs. Judicial Restraint
1979 Carter got to add 152 judges. •. Override Supreme Court - amendment. •. Stare decisis, precedents. •. Constitution. •. Public Opinion. •. Restrictions on Judicial Power. 4. Notes by Vicky Zhang. Lecture: Judicial Activism vs. Judicial

Axiomatic Set Theory - Patrick Suppes.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Axiomatic Set ...

Judicial Branch - USCIS
The judicial branch can probably be covered in one lesson. The handouts focus on ... about all the levels for the test, but you should explain that there are many ...

Axiomatic Set Theory - Patrick Suppes.pdf
Axiomatic Set Theory - Patrick Suppes.pdf. Axiomatic Set Theory - Patrick Suppes.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying Axiomatic Set ...

Axiomatic Foundations of Multiplier Preferences
observable choice data and provides a useful tool for applications. .... the value of θ: a big difference x − y reveals that the decision maker has low trust ... s ∈ S, will be realized, but the decision maker has to choose the course of action