Focus, Then Compare Doron Ravid∗ August, 2015

Abstract I study the following random choice procedure. First, the agent focuses on an option at random from the choice set. Then, she compares the focal option to each other alternative in the set. Comparisons are binary, random and independent of each other. The agent chooses the focal option if it passes all comparisons favorably. Otherwise, the agent draws a new focal option with replacement. I characterize the procedure’s revealed preference implications, and show that it can accommodate the Attraction effect, Compromise effect and Choice overload. I then show that the procedure can approximate some deterministic models from the literature.

1

Introduction

The fundamental assumption of the classic theory of utility maximization is that agents have well defined tastes. These tastes are inherent to the agent and do not dependent on the particular set of options confronting her. Thus, the classical theory provides a clear separation between objectives (utility) ∗

Department of Economics, University of Chicago, [email protected]. This paper was written as part of a Ph.D thesis in economics at Princeton University. I would like to thank my adviser, Faruk Gul, for his incredible guidance throughout the writing of this paper. I am also grateful to my committee members, Roland Benabou and Wolfgang Pesendorfer for their insightful comments and suggestions. I have also benefited from the excellent comments of Sylvan Chassang, Anthony Marley and Stephen Morris.

1

and constraints (choice set). Despite its appeal, utility maximization is at odds with a large body of experimental papers published in recent decades1 . The current paper suggests that much of the experimental evidence can be accounted for by boundedly rational agents focusing only on two alternatives at a time while making their choices. The current paper uses random choices as a tool for describing boundedly rational behavior. Many studies of boundedly rational agents assume that the agent’s choices are random (See for example Luce (1959), Matějka and McKay (2013), Manzini and Mariotti (2014), Gul et al. (2014), Woodford (2014) and Ke (2014)). The origins of random choice models come from empirical studies. Such studies use random choices to deal with choice variability (McFadden, 2001). To explain this variability, the analyst usually assumes that choice is influenced by unobserved variables. When fully rational, the decision maker is aware of these variables and takes them into account when making her decisions. However, the decisions of a boundedly rational agent may not be as well-informed. In particular, a boundedly rational agent may be influenced by variables that she herself is unaware of. Thus, the agent herself may see her decisions as random, making random models appropriate for studying boundedly rational behavior. The proposed bondedly rational procedure based on agents focusing only on two alternatives at a time. There are reasons why an agent may avoid considering all alternatives at once. One reason is that alternatives can be difficult to compare. Comparing alternatives requires trading off conflicting motivations, a situation people would like to avoid (Tversky and Shafir, 1992; Janis and Mann, 1977; Festinger, 1964; Knox and Inkster, 1968). In order to avoid such conflict, people may attempt to compare only a small set of options at any given moment. A second reason why agents would avoid considering all alternatives simultaneously is bounded cognitive capacity (e.g. Miller, 1956). Because of this bounded capacity, people might not be able to process all the relevant information. To reduce this informational burden, people often consider only a subset of the available alternatives at any given moment in 1

See Camerer et al. (2011) for a recent survey of violations of classic utility maximization

2

time (For a survey of the limited consideration literature, see Roberts and Lattin (1997)). This paper analyzes the following simple random choice procedure which I call Focus, Then Compare (FTC). Before comparing alternatives, an agent applying the procedure begins by selecting a focal option at random. The focal option is then compared sequentially to each of the other alternatives in the set through a sequence of pair-wise comparisons. The outcome of each comparison is random and independent of the agent’s other comparisons. The agent chooses the focal option if and only if it wins against each of the other available alternatives. Otherwise, the agent restarts the process with replacement. The reader may wonder why the agent does not discard alternatives that lose during the comparison stage. After all, one may think that discarding losing alternatives would lead to better choices. There are, however, a couple of reasons for making this assumption. First, unlike utility maximization, Focus Then Compare does not require the agent to have a well defined preference. Without preferences, it is hard to designate choices as being good or bad. Second, even if the agent had preferences, she may be reluctant to discard losing alternatives. This reluctance may arise due to the randomness in the agent’s comparison process. Such randomness naturally leads to superior alternatives losing to inferior ones by mistake. A heuristic approach to dealing with such mistakes is to give the losing alternatives a second chance by keeping rather than discarding them. I analyze the procedure’s behavioral implications. My approach is similar to the one employed by several recent papers on boundedly rational choice procedures (e.g. Manzini and Mariotti (2007); Salant and Rubinstein (2008); Cherepanov et al. (2010); Manzini and Mariotti (2014)). Thus, I look at several questions. First, I ask how one could test whether an agent is using the FTC procedure using only choice data. Second, I study how well does the FTC procedure address experimental findings. Third, I explore in which way decisions made by FTC differ from choices that originate from other procedures. Hence, this paper centers around the behavior of agents who follow FTC. I show that the FTC procedure is completely characterized by a single, 3

yet simple, testable condition on choice probabilities. This condition is a relaxation of Luce (1959)’s famous Independence of Irrelevant Alternatives (IIA) axiom. Luce (1959)’s IIA states that the likelihood ratio between x and y is does not depend on the choice set. The IIA axiom, which characterizes Luce (1959)’s rule, does not hold for the FTC procedure. In other words, for the FTC procedure switching the choice set will often change the choice likelihood between two given alternatives. However, I show that if the proportional change in this ratio is the same when switching the choice set from A to B as it is when switching the set from A\B to B\A, then the agent’s behavior can be described by the FTC procedure, and vice versa (Proposition 1). Hence, to test for the FTC procedure one need not go far beyond generalizing the tests for the IIA axiom. Agents that follow the FTC procedure can behave in a way that is consistent with a wide range of experimental findings. These experimental findings include choice overload (Iyengar and Lepper, 2000), the attraction effect (Huber, Payne and Puto, 1982), the compromise effect (Simonson, 1989) and intransitive choice (Tversky, 1969). What is common to all these findings is that they suggest that agent’s tastes depend on the choice set. While this dependency contradicts the theory of utility maximization, it is consistent with FTC. In addition to accommodating experimental findings, I show that FTC can approximate the behavior of seemingly unrelated procedures. Such procedures are usually motivated by the same experimental findings that motivate the current work. Intransitivity, for example, can be explained by agents constructing a short list of alternatives before making their choices (see Manzini and Mariotti, 2007). We show that agents who choose by constructing a short list can be approximated by FTC (Proposition 7). FTC can also approximate any agent who chooses by weighting each option’s advantages and disadvantages relative to the other available alternatives. Such a procedure was suggested by Tversky and Simonson (1993). Turns out that the Tversky and Simonson procedure includes all the deterministic models that can be approximated by FTC. In other words, FTC can approximate a deterministic model if and only 4

if the model is a version of Tversky and Simonson’s procedure (Proposition 6). I also study the connection between the FTC procedure and other random models of choice. As mentioned earlier, FTC is characterized by a generalization of Luce (1959)’s IIA axiom, which characterizes the Luce rule. Therefore, the Luce rule is a special case of FTC. Another special case of FTC is the extension of the random consideration sets procedure studied in Manzini and Mariotti (2014) (Section 5.1). While both of these models are special cases of the random utility maximization procedure, I show that random utility maximization differs from FTC: neither procedure nests the other. FTC is, however, a special case of a collection of models known as the binary advantage models (Marley, 1991). Binary advantage models are described, however, in terms of the formula generating their choice probabilities. It is unclear how these choice probabilities may arise from a well defined procedure. I expand on the above-mentioned connections in section 5.

2

Focus, Then Compare

Let X be a finite set of all possible alternatives. A choice set is a non-empty subset of X. I take Ω to denote the set of all possible choice sets. A random choice rule is a function ρ : X × Ω → R satisfying the following conditions for every A ∈ Ω: 1. ρ (x, A) ≥ 0 for all x, with the inequality being strict only if x ∈ A. 2.

P

x∈A

ρ (x, A) = 1.

I assume random choice rules have full support. That is, x is in A only if ρ (x, A) > 0. Given a full support random choice rule ρ and any A ⊂ X, I take ρy (x, A) :=

ρ (x, A ∪ {x, y}) ρ (y, A ∪ {x, y})

to denote the choice likelihood ratio between x and y when the environment is A. ρy (x, A) measures the number of times the agent chooses x for each time she chooses y. A high ρy (x, A) means the agent chooses x more frequently 5

than y. Such behavior will suggest that the agent finds x have a higher value than y. As such, I will often interpret ρy (x, A) as measuring the relative attractiveness of x compared to y in the set A. I will be centering my analysis around random choice rules consistent with the FTC procedure. An agent following the FTC procedure chooses first by randomly selecting a focal option. The focal option is then compared to each other alternative in the choice set through a sequence of stochastic, independent, binary comparisons. If all comparisons turn in favor of the focal option, the procedure terminates with the agent choosing the focal option. Otherwise, the agent restarts the process with replacement, meaning that the same option could be chosen as the focal option more than once2 . Hence, given a choice set A ∈ Ω, the agent chooses x ∈ A if x is selected as the first focal option and x compares favorably to each other alternative in A or that the first focal option is not chosen, x is selected as the second focal option, and x compares favorably to each other alternative in A, and so on. Assume the agent selects each option as focal with equal probability, and define π (x, y) > 0 as the probability of x passing the comparison against y when x is focal. Then the probability that the agent chooses x ∈ A when applying the FTC procedure can be written as: " #! X 1 Y 1 Y 1 Y π (x, y) + 1 − π (z, y) π (x, y) ρ (x, A) = |A| y∈A |A| y∈A |A| y∈A z∈A " #!2 X 1 Y 1 Y + 1− π (z, y) π (x, y) + . . . |A| y∈A |A| y∈A z∈A which, by applying the formula for geometric sums gives: Q π (x, y) hy∈A i ρ (x, A) = P Q π (z, y) z∈A y∈A 2

(1)

Note that this process will end in finite time with probability 1 as long as there is at least one option that passes all of the comparisons with positive probability.

6

I therefore characterize the FTC procedure via equation 1 above. This is formalized in Definition 1 below. Definition 1. A function π : X × X → (0, 1] satisfying π (x, x) = 1 for all x is a FTC representation of ρ if ρ and π satisfy equation 1 for all (x, A). A random choice rule ρ is a FTC rule if it has a FTC representation. It is useful to note that the FTC representation need not be symmetric. More precisely, the sum of π (x, y) and π (y, x) can be different than 1. This potential asymmetry is consistent with experimental evidence showing that evaluations often vary with the direction of comparison. For example, in the study conducted by Mantel and Kardes (1999) participants choices often depended on whether they were asked if they prefer x over y or if they prefer y over x. While discussing such experiments is beyond the scope of this paper, the experiments do suggest that allowing for asymmetry in the primitive may be a desirable property. The following are two examples of the FTC procedure. Example 1. Let X be a subset of R2+ . Thus, alternatives can be described by their values on two attributes. For any two distinct alternatives x and y, set π (x, y) = η and π (y, x) = 1 − η if y1 ≥ x1 and y2 ≥ x2 , with at least one of the inequalities being strict, and π (x, y) = 21 otherwise. Hence, if x is dominated by y the agent has a probability of η of making a mistake and thinking that x compares favorably to y (and that y compares unfavorably to x). However, if neither of the alternatives dominates the other, the agent chooses between them in a random fashion. Example 2. Alternatives are characterized by their values on k attributes, X ⊂ Rk+ . The agent compares x and y via the following formula: (

) X

π (x, y) = exp −

ϕ (yk − xk )

k:yk >xk

where ϕ is some strictly increasing function satisfying ϕ (0) = 0. Thus, if xk ≥ yk for all k, then x always wins against y. Otherwise, the probability 7

that x loses to y when x is focal is decreasing with the difference between yk and xk . Hence, the decision maker has a hard time choosing options that are inferior on some dimensions to one of the other alternatives. As such: ( !) X X ρ (x, A) ∝ exp − ϕ (yk − xk ) y∈A

k:yk >xk

for example, if k = 2 , ϕ (a) = a, x = (1, 2) and y = (3, 1) then: ρ (x, {x, y}) =

1 e−2 = −2 −1 e +e 1+e

if z = (1, 1), then: ρ (x, {x, y, z}) =

e−2 e = −2 −1 −3 e +e +e 1 + e + e2

I now turn to the question of what restrictions does the FTC procedure impose on behavior. My analysis follows the lines of that first conducted by Luce (1959). Luce (1959) studied a random choice model in which the probability that an alternative is chosen is proportional to its value. Formally, ρ is a Luce Rule if there exists a function v : X → R+ such that: ρ (x, A) = P

v (x) y∈A v (y)

In his work, Luce showed that ρ is a Luce rule if and only if it satisfies the Independence of Irrelevant Alternatives axiom (IIA): ρy (x, A) = ρy (x, B) for all x, y ∈ X and A, B ⊂ X. If we think of choice likelihood ratios as measures of relative attractiveness, the IIA implies that the revealed relative value of x against y does not depend on the other alternatives in the choice set.

8

In the next proposition I show that a relaxation of the IIA axiom, Independence of Shared Alternatives (ISA), completely characterizes the FTC procedure. Hence, the FTC procedure can serve as an adequate description of an agent’s behavior if and only if that agent satisfies ISA. Formally, ISA is satisfied by some random choice rule ρ if and only if: ρy (x, A \ B) ρy (x, A) = ρy (x, B) ρy (x, B \ A)

(2)

for all x, y,A and B.3 To understand ISA, remember the interpretation of the choice likelihood ratio,ρy (x, B), as a measure of x’s value relative to y’s in the (x,A) set B. Interpreting ρy (x, B) in this way means that the ratio ρρyy (x,B) measures the change in the x’s value relative to y due replacing the set B with A. ISA says that this change is independent of the alternatives shared by both A and B. The proposition below shows that requiring a random choice rule to satisfy this condition is equivalent to the existence of a FTC representation. Proposition 1. A random choice rule ρ has a FTC representation π if and only if it satisfies ISA. Proof. See appendix. It is straightforward to show the manner in which ISA is implied by IIA. If ρ satisfies IIA then: ρy (x, A) ρy (x, A \ B) =1= ρy (x, B) ρy (x, B \ A) and therefore ISA is satisfied. We therefore have the following corollary: Corollary 1. Every Luce rule has a FTC representation. To further clarify the connection between the FTC and Luce rules, assume that ρ is a Luce rule for some function v. Then we can use v to recover π by 3

Remember that we assume that ρ has full support and that we defined: ρy (x, A) :=

ρ(x,A∪{x,y}) ρ(y,A∪{x,y}) .

9

setting π (x, y) = 1/v (y) for all x 6= y and 1 whenever x = y. Applying the Luce rule formula then gives: v (x) y∈A v (y) Q −1 Q π (x, y) v (x) z∈A v (z) y∈A  Q =P −1 P Q π (z, y) v (z) v (y) z∈A y∈A z∈A y∈A

ρ (x, A) = P =

Reversing the above derivation gives the following simple result. Corollary 2. A FTC rule ρ is a Luce rule if and only if π (x, y) = π (z, y) whenever y ∈ / {x, z}. Corollary 2 states that the difference between the FTC procedure and the Luce rule is that the FTC procedure allows the result of the comparison between the focal option and each alternative to depend on the identity of the focal option. While Proposition 1 shows that satisfying the ISA axiom is tantamount to having a FTC representation, the FTC representation itself is not unique. In fact, if π is a FTC representation of some random choice rule ρ, then the 0 0 0 function π satisfying π (x, y) = απ (x, y) for all x 6= y and π (x, x) = 1 for all x is also an FTC representation of ρ for every α ∈ (0, 1). This is because: Q Q Q α|A|−1 y∈A π (x, y) π (x, y) y∈A απ (x, y) i= i=P i hQ hQ hy∈A Q P P |A|−1 απ (z, y) π (z, y) π (z, y) α z∈A y∈A z∈A y∈A z∈A y∈A The following proposition shows that multiplication by a constant is, in fact, the only available degree of freedom in the FTC representation. 0

Proposition 2. The two FTC representations π and π yield the same random 0 choice rule ρ if and only if there exists α > 0 such that π (x, y) = απ (x, y) whenever x 6= y.

10

3

Matching Experimental Findings

The past few decades have seen the emergence of a large body of experimental regularities that cannot be explained by utility maximization. In this section, I explore the connection between FTC and four well established regularities: Choice overload, attraction effect, compromise effect, and, finally, intransitive choice. I explain each regularity in turn, and show how these can be accounted for by the FTC procedure.

3.1

Choice Overload

In this section we show that the FTC procedure can match findings related to choice overload. I begin by providing a very brief explanation of the effect and the conditions that lead to it. I then provide behavioral definitions aimed at capturing these conditions and show that an agent following the FTC procedure will exhibit behaviors consistent with the choice overload whenever these conditions hold (Proposition 3). Choice overload challenges the notion that adding more alternatives to the choice set can only increase welfare. Perhaps the first to publish a study presenting a violation of this notion were Iyengar and Lepper (2000). The authors set a tasting booth that displayed either 6 (limited selection) or 24 (extensive selection) different flavors of jam at a local grocery store. Consumers who approached the booth received a discount coupon for purchasing one of the jams on display. The authors found that almost 30% of the approaching consumers in the limited selection condition used the coupon. In contrast, the coupon was used by only 3% of the approaching consumers in the extensive selection condition. Similar findings have been documented with respect to a wide variety of objects, including chocolates (Chernev, 2004), pens (Shah and Wolford, 2007) and participation in 401(k) plans (Iyengar, Jiang and Huberman, 2003). Findings of this kind are commonly referred to in the literature as choice overload. A central feature of experiments documenting choice overload is that they present participants with choice sets consisting of very similar alternatives. 11

This similarity usually makes it hard to differentiate between the alternatives. Indeed, there is evidence that this difficulty plays an important role in the over-choice effect (Chernev (2004); White and Hoffrage (2009)). For example, Chernev (2004) replicated the choice overload effect in an experiment involving different flavors of Godiva chocolates. The chocolates were used before the experiments to construct a list of attributes that these chocolates contain. Chernev (2004) found that choice overload disappears when participants articulate their preferences about these attributes before the choice task. To capture the over-choice effect, fix some φ in X and interpret it as the default option. For example, in the above-mentioned study by Iyengar and Lepper (2000), φ represents not using the coupon. Given φ, fix another alternative x. I will analyze what happens to the choice probability of φ as one 0 adds more and more duplicates of x. More precisely, x is an x duplicate if:  0 0 1. ρ (x, A) = ρ x , A whenever A contains both x and x .  0   0 2. ρ (z, B ∪ {x}) = ρ z, B ∪ x for every z ∈ B, where B ∩ x, x = ∅. 0

Thus, x and x are duplicates if they satisfy two conditions. First, the choice 0 probability of x and x in every choice set that contains both alternatives. Sec0 ond, replacing x with x never changes the probability that the agent chooses any other alternative. The key assumption generating choice overload is that the outside option gains some advantage from being focal. More precisely, φ has a focal advantage 0 0 over x if for every y indistinguishable from x: π φ, x > π x, x . Thus, conditional on being focal, φ is more likely to win against an x duplicate than x is4 . Intuitively this means that the agent is less likely to reconsider once she has focused on the the default option. Proposition 3 below says that agents applying the FTC procedure exhibit choice overload in choice sets that contain many indistinguishable alternatives. 4

One should note that it is possible for φ to both have a focal advantage over x, and 1 for x to be superior   in pairwise choices. That is, one can have: ρ (x,  {x, φ}) < 2 and 0 0 0 π φ, x > π x, x . This will occur whenever π (x, φ) > π (φ, x) > π x, x .

12

Proposition 3. Let ρ be a FTC rule. Suppose that φ has a focal advantage over x and that {xn }∞ n=1 is a sequence of distinct x duplicates. Then ρ (φ, {φ, x1 , . . . , xn }) goes to 1 as n goes to ∞. Proof. In the appendix I show that x1 and x are copies only if for all y ∈ / {x, xn } we have both π (x, y) = π (xn , y) and π (y, x) = π (y, xn ). This implies that for all n: [π (φ, x)]n  ρ (φ, {φ, x1 , . . . , xn }) = π (φ, x)n + n π (x, φ) [π (x, x1 )]n−1 1 =   n−1 π(x,φ) π(x,x1 ) 1 + n π(φ,x) π(φ,x1 ) But since φ enjoys a focal advantage over x: π (x, x1 ) /π (φ, x1 ) < 1. Therefore: n (π (x, x1 ) /π (φ, x1 ))n−1 → 0 as n goes to infinity. The conclusion follows. Proposition 3 shows that the FTC procedure is consistent with choice overload. In other words, an agent applying the FTC procedure will be choosing the null option with a probability very close to 1 whenever the choice set contains enough indistinguishable alternatives.

3.2

Attraction Effect

In this subsection I demonstrate how the FTC procedure matches findings related to the attraction effect. I proceed by first providing an explanation of the effect itself and the experimental conditions that generate it. I then show in Proposition 4 that the FTC procedure not only matches the effect, but also parametrizes it in an intuitive way. The attraction effect is usually demonstrated by comparing participants choices across two choice sets. The first choice set contains two options, neither of which is obviously superior to the other. These two options are also available in the second choice set. However, the second choice set also contains a third alternative that is clearly dominated by one of first two options, but not by the

13

Time from 0 to 60mph

y (55%)

y (21%) x (45%)

x (77.5%) z (1.5%) Miles per gallon

Figure 1: An example of a typical attraction effect experiment. Alternatives are cars that differ only in miles per gallon and time from 0 to 60 mph. Each graph represents a different experimental condition. Numbers in parenthesis next to each alternative represent percentage of people who choose that option. other. Surprisingly, the presence of the third alternative results in participants choosing the option that dominates it more frequently. To demonstrate the attraction effect, consider an experiment in which participants need to choose which car to buy. The cars are known to be identical in all respects except for their fuel consumption and acceleration speed. Acceleration speed is measured in the time it takes to accelerate from 0 to 60 mph, while fuel consumption is measured in miles per gallon. To construct the first choice set one would use two cars, one which is more fuel efficient, call it x, and one which accelerates better, denoted by y. y will also accelerate better than the third option, z. While z will still be more fuel efficient than y, it will be dominated by x on both attributes. Typically, x will be chosen at a higher frequency in the second choice set, {x, y, z}, than in the set {x, y}. This example is illustrated in figure 1. The attraction effect has proven to be incredibly robust. The first studies to note the attraction effect were conducted by Huber, Payne and Puto (1982) and Huber and Puto (1983). These studies have inspired a large number of researchers to replicate the effect using a wide variety of choice objects. Objects involved in attraction effect experiments include cameras and computers (Simonson and Tversky, 1992), political candidates (Pan, O’Curry and Pitts, 1995), medical decision making (Schwartz and Chapman, 1999) and job candidates (Slaughter, Sinar and Highhouse, 1999). While most of these studies 14

involved questionnaires, the effect persists even when subjects are given monetary incentives (Simonson and Tversky, 1992). Hence, it seems that the effect is not particularly sensitive to the experimental setting. While the attraction effect is inconsistent with utility maximization, it is consistent with the FTC procedure. To see this, consider example 1 from section 2. Take ρ to be the implied FTC rule, and consider three alternatives, − x, y and x− such that : x1 > x− 1 > y1 , y2 > x2 > x2 . Then for η small enough:   ρ x, x, y, x− =

1/2 (1 − η) 1 − η) = > = ρ (x, {x, y}) 1/2 (1 − η) + 1/4 + (1/2) η 1/2 + 1/4 2 1/2 (1

One can generalize example 1 by examining FTC representations that match the experimental setup used to generate the attraction effect. Recall the attraction effect is attained in experiments in which x clearly dominates x− , but neither the comparison between x and y nor the comparison between x− and y is obvious. I interpret these conditions as suggesting first that π (x, x− ) > π (y, x− ) and second that π (x− , x) is very small. The following proposition shows that these conditions are both necessary and sufficient for the attraction effect to occur in agents following the FTC procedure. Proposition 4. Let ρ be a FTC rule with FTC representation π. Then ρ (x, {x, y, x− }) > ρ (x, {x, y}) if and only if π (x, x− ) > π (y, x− ) and π (x− , x) is sufficiently small. Proof. The inequality ρ (x, {x, y, x− }) > ρ (x, {x, y}) holds if and only if: π (x, y) π (x, y) π (x, x− ) > π (x, y) π (x, x− ) + π (y, x) π (y, x− ) + π (x− , x) π (x− , y) π (x, y) + π (y, x) which is equivalent to: −

π x, x





> π y, x





 + π x ,x



π (x− , y) π (y, x)

Define  as follows: =

  π (y, x) π x, x− − π y, x− − π (x , y) 15



and note that π(x− , x) must be strictly less than  for the first inequality to hold. Proposition 4 characterizes the parametric restrictions required for an agent following the FTC procedure to exhibit the attraction. As explained before, these restrictions coincide with the experimental conditions leading to the effect in practice.

3.3

Compromise Effect

Another experimental regularity that is often considered at odds with utility maximization is the compromise effect. The current subsection describes the experimental setup that leads to the compromise effect and shows how the effect can be accommodated by the FTC representation. I conclude by presenting an example of a FTC rule that exhibits the compromise effect. The compromise effect, first documented by Simonson (1989), refers to people’s tendency to choose the middle option. The typical experiment studying this tendency compares the probability that the middle option is chosen in two different choice sets. The first choice set contains the middle option and two additional, more extreme alternatives. The second choice set contains only one of the two extreme alternatives in addition to the middle option. Typically the likelihood ratio between the middle option and any of the extreme alternatives is higher when both extreme alternatives are present. To illustrate, consider the setting used to demonstrate the attraction effect in subsection 3.2. Participants need to choose which car they prefer to purchase, knowing that all the cars presented to them differ only in their fuel consumption and their acceleration speed. One of the extreme options has a very good fuel consumption, while the other extreme option has excellent acceleration. The middle option provides a compromise between those two extremes and has average values in both attributes. Denoting the extreme alternatives by y and z, and taking x to be the middle option, the compromise effect will happen if ρy (x, {x, y}) < ρy (x, {x, y, z}). It is not difficult to see the conditions that may lead the FTC procedure to 16

exhibit the compromise effect. Let π be the FTC representation of the FTC rule ρ. Then one can easily see that: ρy (x, {x, y, z}) =

π (x, z) ρy (x, {x, y}) π (y, z)

meaning the compromise effect will occur if and only if π (x, z) > π (y, z). To illustrate when such a situation may arise, consider an agent that applies the FTC rule presented in example 2 to choose between alternatives in R2+ . For every three alternatives, x, y and z that satisfy y1 > x1 > z1 and y2 < x2 < z2 we will have: π (x, z) = e−ϕ(z2 −x2 ) > e−ϕ(z2 −y2 ) = π (y, z) thereby giving rise to the compromise effect.

3.4

Intransitive Choice

In the current section I show that the FTC procedure can violate stochastic transitivity. Stochastic transitivity is a generalization of the classic transitivity axiom that allows for people to choose inconsistently across situations, making it easier to test in the lab. A random choice rule ρ is said to satisfy stochastic transitivity5 if for every three distinct alternatives x, y and z: ρ (x, {x, y}) ≥ 1/2 and ρ (y, {y, z}) ≥ 1/2 implies ρ (x, {x, z}) ≥ 1/2

(3)

The following paraphrasing of a famous example due to L.J. Savage6 shows a setting in which stochastic transitivity is violated. Bob, who lives in Chicago, is considering whether he should go on a vacation to New York, n, or to Los Angeles, l. For illustrative purposes assume that Bob is indifferent between the two, and chooses either with equal probability, i.e. ρ (n, {n, l}) = ρ (l, {n, l}) = 1/2. When sharing his thoughts with Ann, his friend, she mentions that there is an online coupon that Bob could use to get a free glass of cheap wine on 5

The random choice literature often refers to the condition in equation 3 as weak stochastic transitivity. 6 see Luce and Suppes (1965), pp. 334-335.

17

the flight to New York. Since Bob enjoys wine, even if it is cheap, he would strictly prefer to go to New York with the coupon, denoted by n+ , than without, i.e. ρ (n+ , {n, n+ }) > 1/2. However, one glass of cheap wine is one not enough to tilt Bob towards preferring New York over Los Angeles, meaning that ρ (n+ , {n+ , l}) = 1/2, thereby violating stochastic transitivity. Similar violations have been documented in experiments based on this example (e.g. Tversky, 1969). FTC rules can violate stochastic transitivity. To see how, assume that Bob is using the FTC procedure to make his choices. Assume further that his choices use a FTC representation satisfying π (n, l) = π (l, n) = 1/2, π (n+ , l) = π (l, n+ ) = 1/2, π (n+ , n) = 1 and π (n, n+ ) = 1/10. It is easy to see that such a random choice rule will satisfy ρ (n, {n, l}) = ρ (n+ , {n+ , l}) = 1/2. In addition: ρ (n+ , {n+ , n}) =

10 1 1 = > 1 + 1/10 11 2

violating stochastic transitivity. One question that may arise at this stage is whether imposing stochastic transitivity implies anything on the FTC procedure. The following proposition shows that stochastic transitivity is equivalent to a very natural restriction on the FTC rule. Proposition 5. A FTC rule ρ satisfies stochastic transitivity if and only if every FTC representation of ρ satisfies: π (x, y) ≥ π (y, x) and π (y, z) ≥ π (z, y) implies π (x, z) ≥ π (z, x)

(4)

Proof. Let π be the FTC representation of some FTC rule ρ. The proposition follows from the fact that: ρ (x, {x, y}) ≥ 21 holds if and only if π (x, y) ≥ π (y, x). Hence, proposition 5 shows that a FTC rule satisfies stochastic transitivity if and only if each of its FTC representations is transitive in the sense of equation 4. Indeed, it is exactly this condition that is violated in the above example since π (n, l) ≥ π (l, n) and π (l, n+ ) ≥ π (n+ , l) but π (n, n+ ) < π (n+ , n). 18

Thus, at least as far as FTC rules are concerned, violations of stochastic transitivity are equivalent to non-transitivity of the FTC representation.

4

Approximating Other Choice Procedures

The experimental evidence contradicting utility maximization has served as motivation for studying a variety of alternative choice procedures. While many of the algorithms in the literature may seem unrelated to the procedure studied in this paper, this section shows that they are actually very similar in the type of behaviors that they generate. However, many of the known models are deterministic, which makes it difficult to relate to the random FTC procedure. To bridge this gap, I ask which deterministic models can be approximated by the random FTC rule. Formally, a choice function is a mapping γ : Ω → X satisfying the condition: γ (A) ∈ A for all A ∈ Ω. I say that FTC can approximate γ if there exists a sequence of FTC procedures {ρn }n≥1 that converges to some ρ satisfying ρ (γ (A) , A) = 1 for all A. The section’s main result, Proposition 6, shows FTC can approximate a deterministic model if and only if the model is the componential context model of Tversky and Simonson (1993). By applying Proposition 6, I show that FTC can approximate any acyclic rational short list method (Manzini and Mariotti, 2007, Apesteguia and Ballester, 2013), but not certain forms of limited attention models (Masatlioglu et al., 2012). In the future one could further use this result to relate FTC to other deterministic models.

4.1

Componential Context Maximization

The attraction and compromise effects have lead Tversky and Simonson (1993) to suggest the Componential Context Maximization (CCM) procedure. Agents following the CCM procedure choose the option with the highest value, similar to agents applying utility maximization. However, unlike utility maximization, the composition of the choice set affects each option’s assigned value. The

19

connection between the composition of the choice set and each option’s value can lead agents applying the CCM procedure to exhibit behaviors such as the attraction and compromise effects. More precisely, a choice function γ is a CCM choice function if there exist two functions, u : X → R and v : X × X → R such that:

γ (A) = arg max x∈A

 

u (x) +



X

v (x, y)

y∈A\{x}

 

(5)



Hence, the value of an option x is equal to its intrinsic value, represented by P u (x), plus a relative advantage component, v (x, y). The following proposition shows that the set of CCM choice functions is identical to the set of deterministic models that FTC can approximate. Proposition 6. FTC can approximate a choice function γ if and only if γ is a CCM choice function. Proof. I show here that FTC can approximate γ only if γ is a CCM. The other direction is in the appendix. Let {ρn }n≥1 be a sequence of FTC procedures that converges to some ρ with ρ (γ (A) , A) = 1 for all A. Since Ω is finite, there exists N large enough such that for all n ≥ N : γ (A) = arg maxx ρn (x, A). Fix one such n, and let πn be the FTC representation of ρn . Note that for every Q x and y in A ρn (x, A) is larger than ρn (y, A) if and only if z∈A π (x, z) is P Q ln π (x, z) |x ∈ A . larger than z∈A π (y, z). Hence: γ (A) = arg max z∈A Setting v (x, y) = ln (x, y) and u (x) = 0 for all x, y ∈ X implies that γ is a CCM choice function. Proposition 6 says that in order to check whether FTC can approximate a deterministic model, one need only to check whether the model can be recast as CCM.

4.2

The Rational Short List Method

Manzini and Mariotti (2007) were the first to study the deterministic choice procedure known as the rational short list method. This method is character20

ized by two asymmetric binary relations which are applied in two stages. In the first stage an agent applying the method discards all the alternatives that do not maximize the first binary relation. In the second stage the agent chooses the option that maximizes the second binary relation among the alternatives that survived the first stage. Hence, in the first stage the agent applies one relation to construct a short list, from which the agent chooses in the second stage using a second relation. Here I will focus on the case in which both relations are acyclic7 . For a given binary relation R over X, define M (A|R) to be the set of all x in A such that there is no y ∈ A satisfying yRx. A choice function γ is said to be an acyclic rational short list (A-RSM) choice function if there are two acyclic binary relations, P1 and P2 over X such that γ (A) = M (M (A|P1 ) |P2 ). Thus, P1 is employed in the first stage to generate the short list, M (A|P1 ). Among the options in M (A|P1 ) the agent chooses in the second stage the ones that maximize P2 . Proposition 7 below shows that any A-RSM choice function is a special case of the CCM model and can therefore be approximated by FTC. Proposition 7. FTC can approximate any RSM choice function γ. Proof. We will use Proposition 6. Since X is finite and P2 is acyclic there exists a complete and transitive relation R with strict part R∗ such that xP2 y implies xR∗ y. By finiteness of X, there is a function u : X → [0, 1] such that M (A|R∗ ) = arg maxx∈A u (x). Define:  −2 if xP y 1 v (x, y) = 0 otherwise and let γ˜ : Ω ⇒ Ω be the correspondence implied by u and v as in the CCM model (equation 5). I will show that γ = γ˜ . Fix any A, and let B be equal P to M (A|P1 ). Then for every x ∈ B and y ∈ A\B: u (y) + z∈A\{x} v (y, z) ≤ −1 < u (x). Therefore: γ˜ (A) ⊂ B. Now take any y ∈ B\γ (A) and note that 7

As mentioned before, Manzini and Mariotti (2007) do require P2 only to be asymmetric. However, P1 must be acyclic for the RSM procedure to be well defined. Apesteguia and Ballester (2013) study the acyclic RSM in more depth.

21

γ (A) P2 y. But this implies γ (A) R∗ y, meaning that u (γ (A)) > u (y). Hence we have y ∈ / γ˜ (A), implying γ˜ (A) = M (M (A|P1 ) |P2 ), as required.

4.3

Necessary Condition for Approximation

Propositions 6 and 7 suggest that the FTC procedure can approximate a wide range of deterministic choice behaviors. However, there are some deterministic behaviors that can not be approximated by the FTC procedure. In particular, one can show that the FTC procedure cannot approximate some types of Limited Attention choice functions of Masatlioglu et al. (2012). Limited attention choice functions can violate the following property, which must be satisfied by CCM models, and is therefore for approximation by FTC: Condition 1. If γ (A) = x and γ (A ∪ {z}) = y for y 6= z, there is no B such that γ (B) = y and γ (B ∪ {z}) = x. Corollary 3. FTC can approximate γ only if γ satisfies Condition 1 Proof. If FTC can approximate γ we have by Proposition 6 that γ is a CCM choice function. Let the functions u and v be γ’s CCM representation as in equation 5. The result follows from the fact that γ (A) = x and γ (A ∪ {z}) = y for y 6= z only if v (y, z) > v (x, z). The following example is a limited attention choice function that violates Condition 1. Example 3. Let X = {w, x, y, z}, and define the strict preference relation  by y  x  w  z. Let Γ : Ω → Ω be the consideration set mapping defined by Γ {x, y} = {x}, Γ {w, x, y, z} = {w, x, z}, and Γ (A) = A whenever A ∈ / {{x, y} , {x, y, w, z}}. Let γ (A) = M (Γ (A) | ). Note that Γ is an attention filter as defined in Masatlioglu et al. (2012), and therefore γ is a choice with limited attention in the sense of Masatlioglu et al. (2012). However, Condition 1 is violated since γ {x, y} = x and γ {x, y, z} = y, but γ {x, y, w} = y and γ {x, y, w, z} = x. Therefore, γ does not satisfy condition 1 and therefore cannot be approximated by the FTC procedure. 22

The above example shows that despite FTC’s flexibility, it is not more permissive than some of other the models studied in the literature.

5

Relation to other random choice procedures

In this section I explore the position of the FTC procedure compared to some existing random choice models. I’ve already shown in section 2 that the FTC procedure nests the Luce model. Here I show that another existing model, namely random consideration sets of Manzini and Mariotti (2014) is also a special case of the FTC procedure. The FTC procedure, in turn, is a special case of the Relative Advantage Model of Marley (1991), but is neither a special case nor a generalization of the random utility or the weighted attribute (Gul, Natenzon and Pesendorfer (2014)) models.

5.1

Random Consideration Sets

The FTC procedure is only one way in which attention and focus can influence choice. Another way attention can influence choice is through the agent’s consideration set, i.e. the set of options that the agent takes into consideration. If the consideration set is formed in a stochastic fashion, one can obtain a random choice rule even if the agent’s underlying preferences are deterministic. In this subsection we show that a certain random consideration set rule studied by Manzini and Mariotti (2014) is, in fact, a special case of the FTC procedure. In the model studied by Manzini and Mariotti (2014), the agent maximizes a deterministic transitive and antisymmetric binary relation, . However, unlike a classic utility maximizer, the agent does not consider all of the available options. Instead, the agent only considers options which appear in her consideration set. The probability that an option x is in the agent’s consideration set is σx ∈ (0, 1), independently of the other alternatives in the set. Thus, the probability that the agent chooses x is the probability that: (1) she considers x, and (2) she does not consider any better alternative. In many cases one may expect the choice set to contain a default option,

23

φ, such as walking out of the store, abstaining from a vote, etc. The idea is that if φ is available, then the agent considers it with probability 1. That is, σφ = 1. However, Manzini and Mariotti (2014) assume that φ is ranked lower than any other alternatives, i.e. x  φ whenever x is different than φ, and restrict attention only to choice sets A that contain φ. Q For sets that do not contain φ there is a probability of x∈A (1 − σx ) that the agent’s consideration set is empty. To avoid this, Manzini and Mariotti (2014) suggest to have the agent redraw the consideration set whenever it turns out empty (see section 7.3 of their paper). More precisely, let A be any subset of X. Then the probability that the agent considers x when the choice set is A is the probability x is in the first consideration set drawn by the agent, plus the probability that the first consideration set is empty and x is in the second consideration set drawn by the agent, and so on. Thus, the agent considers x with probability: σx

∞ X Y n=0

!n (1 − σz )

=

z∈A

1−

Q

σx z∈A (1 − σz )

As such, for any set A, the probability that the agent considers x and does not consider any better alternative is equal to: Q σx y∈A:yx (1 − σy ) Q ρ (x, A) = 1 − z∈A (1 − σz )

(6)

I will say that a random choice rule ρ is an Independent Random Consideration Set (ICS) rule if there exists a mapping: σ : X → (0, 1] and a binary relation  over X such that: 1. σx = 1 if and only if x = φ. 2.  is transitive, antisymmetric and satisfies: x  φ for all x 6= φ 3. ρ satisfies equation 6 for all x and A. Given the definition of ICS rules, one can easily obtain the following corollary.

24

Corollary 4. A random choice rule ρ is an Independent Random Consideration Set rule only if it has a FTC representation. Proof. Let ρ be an ICS rule. Fix any x, y and A, B with x, y ∈ / A ∪ B. Assume without loss of generality that x  y. Then note that: ρy (x, A) =

Y σx (1 − σz ) σy (1 − σx ) z∈A:xzy

therefore: Q Q ρy (x, A\B) ρy (x, A) z∈A\B:xzy (1 − σz ) z∈A:xzy (1 − σz ) =Q =Q = ρy (x, B) ρy (x, B\A) z∈B:xzy (1 − σz ) z∈B\A:xzy (1 − σz ) hence, Random Consideration Set rules satisfy ISA, and therefore admit to a FTC representation by Proposition 1.

5.2

Random Utility Maximization

Perhaps the most well known and most widely used random choice rules are the random utility maximization (RUM) procedures. In this subsection we briefly discuss the connection RUM and FTC. A random utility maximizing agent chooses the option with the highest utility, very much like in the standard model of utility maximization. However, unlike in deterministic utility maximization, the utility function of the random utility maximizing agent is stochastic. More specifically, the utility of each alternative is drawn from some fixed distribution which does not depend on the choice set. Once utility is determined, the agent chooses the option with the highest realized value. Formally, a random choice rule ρ is a RUM rule if there exists some probability space (Ω, F, µ) and a measurable function: U : X × Ω → R such that: ρ (x, A) = µ {ω : Uω (x) ≥ Uω (y) ∀y ∈ A} A famous example of a RUM model is the multinomial logit model. The 25

multinomial logit model is equivalent to the Luce rule discussed in section 2. As such, the multinomial logit model serves as an example of a random choice rule that is both a RUM model and a FTC rule. Not every FTC rule is a RUM model. This is because every RUM model must satisfy a condition known as regularity (Tversky and Simonson (1993)). A random choice rule ρ is regular if the choice probability of existing options in the choice set are monotonically decreasing with the addition of new alternatives. As demonstrated in subsections 3.2, FTC rules often violate this condition. Therefore, there are FTC rules that cannot be recast as RUM models. The opposite is also true: There are RUM models that do not have a FTC representation. One such example is the Weighted Attribute Rule studied by Gul, Natenzon and Pesendorfer (2014). In their paper, Gul, Natenzon and Pesendorfer (2014) show, among other things, that the weighted attribute rule can be represented as a RUM model. Below is a specific example of a weighted attribute rule that does not have a FTC representation. Example 4. Consider an agent that needs to choose a means of commuting. Let X = {b1 , b2 , b3 , t}, where bi represent buses of different colors while t represents a train. We assume that the agent is indifferent to the bus’s color, and therefore ρ (bi , A) = ρ (bj , A) whenever {bi , bj } ⊂ A. Moreover, the agent is generally indifferent between going on a bus or a train, and therefore ρ (t, A) = 1 whenever t ∈ A and A 6= {t}. This RCR does not have a FTC representation 2 since it does not satisfy ISA: ρt (b1 , {b1 , b3 , t}) 1 1 ρt (b1 , {b1 , b2 , b3 , t}) = 6= = ρt (b1 , {b1 , t}) 2 3 ρt (b1 , {b1 , b2 , t}) However, this example is a weighted attribute rule in the sense of Gul, Natenzon and Pesendorfer (2014). The weighted attribute representation is achieved by setting the attributes to β = {b1 , b2 , b3 } (bus) and τ = {t} and uniform attribute values and intensities, i.e. w (β) = w (τ ), and the ηβ (bi ) = 1 for all i and ητ (t) = 1. Thus, while the FTC and RUM procedures may coincide, neither procedure 26

is more general than the other.

5.3

Binary Advantage Rules

Marley (1991) studied a class of random choice rules called binary advantage models. In these models, choice probabilities depend on a measure of binary advantage. Formally, the binary advantage measure is a function β (x, y) that maps each pair of distinct alternatives to a positive number representing how attractive x is compared to y. In addition to β, the binary advantage rule depends on an aggregation function, θ (A), which maps each non-empty choice set A to a strictly positive number. Together, (β, θ) are a binary advantage representation of ρ if: Q ρ (x, A) = P

z∈A

y∈A\{x}

Q

β (x, y)θ(A)

θ(A) y∈A\{z} β (z, y)



It is easy to see that every FTC rule has a binary advantage representation with β = π 8 and θ (A) = 1 for all A. Therefore, the FTC rule is a specialization of the binary advantage rules developed by Marley. However, unlike binary advantage rules, the FTC procedure is connected to a precise cognitive procedure. Hence, the FTC procedure can be seen as a cognitive foundation for a special class of binary advantage rules.

6

Concluding Remarks

6.1

FTC as revealing a fuzzy preference

There is a large literature that attempts to model preferences as fuzzy relations. A fuzzy relation is a mapping R : X × X → [0, 1], where R (x, y) represent the degree to which x relates to y. Hence, a standard relation can be seen as a fuzzy relation in which R (x, y) is equal either to 0 or to 1 for all x, y. The advantage of fuzzy relations is that they incorporate vagueness and conflicting 8

Since X is finite, one can always normalize β such that it is always below 1.

27

feelings into the agent’s preferences. Allowing preferences to be vague has proven especially useful in the context of social choice. See Barrett and Salles (2011) for a review of fuzzy preferences and of the usage of the concept in social choice. The FTC procedure can be seen as one way in which fuzzy preferences can be translated into choice. Indeed, the literature is unclear as to how agents are supposed to choose when their underlying preferences are fuzzy. The FTC procedure provides one way of doing so: simply use R as the FTC representation. Thus, one can use the FTC procedure as a way of applying, and also revealing, a fuzzy preference relation. The connection between the behavior of the FTC procedure the properties of underlying fuzzy relation remains a topic for future research.

6.2

The duplicates problem, FTC and the over-choice effect

In his critique of Luce (1959)’s classic RCR, Debreu (1960) asked the question of how a random choice rule should treat duplicates. It is often argued that the choice probability of an option should not depend on the number of duplicates available of the other alternatives. To illustrate, consider the random choice rule presented in example 4. In this example, the probability that the agent chooses to travel by train rather than by bus is always one half, no matter how many different colored buses are available. Hence, the RCR presented in example 4 accommodates duplicates in a way that is consistent with the traditional view in the literature. However, there is a tension between the way agents should treat duplicates and the way that participants treat duplicates in choice overload experiments. This is because the choice overload is, in a sense, an increase in the choice probability of a distinct option, the default, resulting from an increase in the number of duplicates of an alternative (jams, chocolates, pens, etc). In other words, duplicates mattering is one of the main findings of over-choice experiments.

28

Indeed, the over-choice effect is exactly the reason why FTC rules cannot possibly represent a random choice rule as in example 4. To see this, let ρ be a FTC rule, and assume that Bn = {b1 , b2 , . . . , bn } is a set of b-duplicates (different colored buses) for some generic option b. Assume, further, that there is an option t (a train), not in Bn , such that: ρ (t, {t, bi }) = ρ (t, {t, bi , bj }) holds for all i and j. Then one can show that t must have a focal advantage over b9 . As such, one can apply Proposition 3 to obtain that ρ (t, Bn ∪ {t}) must converge to 1 as n goes to infinity. Thic clearly cannot hold under Debreu (1960)’s approach to duplicates. Thus, the FTC procedure stresses the tension between the rational treatment of duplicates and choice overload.

6.3

Estimation

Random choice rules are often used in empirical applications. This is because conducting empirical analysis requires having a flexible model that can account for the variance seen in the data. Since the variance inherent in most data sets is very high, it is hopeless to try to match it with a deterministic model. As such, it is helpful to add a stochastic element to the model, thereby leading empiricists to use random choice rules. The FTC procedure should lend itself to easy estimation because of its simplicity. In its most general form, the procedure contains n2 − n − 1 parameters, where n is the number of elements in X. Adding more structure can reduce the number of parameters even further. Reducing the number of parameters allows one to use the model even when the set of all possible alternatives is infinite, such as when X = Rk+ . A model that can be rephrased as a specific parametrization of the FTC procedure has already been estimated by Kivetz, Netzer and Srinivasan (2004) from experimental data. Participants in their experiments were faced with 9

To see why t has a focal advantage over b note that we must have: π (t, bj ) ρb (t, {t, bi , bj }) = i <1 π (bi , bj ) ρbi (t, {t, bi })

where the inequality follows from ρ (t, {t, bi }) = ρ (t, {t, bi , bj }), bi , bj being b-duplicates, and ρ (bi , {t, bi }) > ρ (bi , {t, bi , bj }). where the third follows from the first two.

29

alternatives which can be seen as being elements in a subset X of Rk+ . Kivetz, Netzer and Srinivasan (2004) tried to fit various random choice rules, among them was the following model: n o P exp bV (x) + q y∈A\{x} R (x, y) n o ρ (x, A) = P P z∈A exp bV (z) + q y∈A\{z} R (z, y)

(7)

where b and q are two positive real numbers, and V : X → R+ and R : P X × X → R+ are real valued functions specified as V (x) = kl=1 v (xk ) for some bounded real function v : R+ → R+ and: P R (x, y) = Pk

l:v(xl )>v(yl )

l=1

|v (xl ) − v (yl )| +

P

(v (xl ) − v (yl ))

l:v(yl )>v(xl )

λl (v (xl ) − v (yl ))ψl

where λl and ψl are both strictly bigger than 0. Kivetz, Netzer and Srinivasan (2004) refer to random choice rules satisfying equation 7 as relative advantage models. The relative advantage model estimated by Kivetz, Netzer and Srinivasan (2004) can be readily transformed into a FTC rule as long as: sup {qR (x, y) − bV (y) |x, y ∈ X, x 6= y} < ∞

(8)

To see this, suppose that the above condition holds. Set: β (x, y) = exp {qR (x, y) − bV (y)} for every x 6= y. Taking θ (A) = 1 for all A, one readily obtains that every relative advantage model can be represented as a Binary Advantage rule (see section 5.3) using the pair (β, θ). When equation 8 holds, one can go further to obtain a FTC representation by using the following formula: π (x, y) =

β (x, y) sup {β (x, y) : x, y ∈ X, x 6= y}

30

(9)

for all x 6= y. The study by Kivetz, Netzer and Srinivasan (2004) suggests that estimating specific parametrization of the FTC procedure is possible when using experimental data. The use of experimental data, however, allows Kivetz, Netzer and Srinivasan (2004) to avoid dealing with complicated issues that arise in other economic situations such as endogeneity. It would be interesting to explore whether one can estimate the FTC algorithm in such settings as well.

References Apesteguia, Jose and Miguel a. Ballester, “Choice by sequential procedures,” Games and Economic Behavior, 2013, 77 (1), 90–99. Barrett, Richard and Maurice Salles, “Social Choice with Fuzzy Preferences,” Handbook of Social Choice and Welfare, 2011, 2 (10), 367–389. Camerer, CF, George Loewenstein, and Mathew Rabin, Advances in behavioral economics, Princeton University Press, 2011. Cherepanov, Vadim, Timothy Feddersen, and Alvaro Sandroni, “Rationalization,” Unpublished, June 2010. Chernev, Alexander, “When More Is Less and Less Is More : The Role of Ideal Point Availability and Assortment in Consumer Choice,” Journal of Consumer Research, 2004. Debreu, Gerard, “Review of R.D. Luce Individual choice behavior,” American Economic Review, February 1960, 50 (1), 186–188. Festinger, Leon, Conflict, decision, and dissonance, Stanford University Press, 1964. Gul, F, Paulo Natenzon, and W Pesendorfer, “Random Choice as Behavioral Optimization,” Econometrica, 2014, 82 (5), 1873–1912.

31

Huber, Joel and Christopher Puto, “Market Boundaries and Product Choice: Illustrating Attraction and Substitution Effects,” The Journal of Consumer Research, 1983, 10 (1), 31–44. , J.W. Payne, and Christopher Puto, “Adding asymmetrically dominated alternatives: Violations of regularity and the similarity hypothesis,” Journal of Consumer Research, 1982, 9 (1), 90–98. Iyengar, Sheena S. and Mark R. Lepper, “When choice is demotivating: Can one desire too much of a good thing?,” Journal of Personality and Social Psychology, 2000, 79 (6), 995–1006. , Wei Jiang, and Gur Huberman, “How Much Choice is Too Much ?: Contributions to 401 (k) Retirement Plans,” Pension Research Council Working Paper, 2003, 401. Janis, Irving L and Leon Mann, Decision making: A psychological analysis of conflict, choice, and commitment, Free Press, 1977. Ke, Shaowei, “Boundedly Rational Backward Induction,” Working Paper, 2014. Kivetz, Ran, Oded Netzer, and V. Srinivasan, “Alternative Models for Capturing the Compromise Effect,” Journal of Marketing Research, August 2004, 41 (3), 237–257. Knox, R E and J a Inkster, “Postdecision dissonance at post time.,” Journal of personality and social psychology, April 1968, 8 (4), 319–23. Luce, R. Duncan, Individual choice behavior: a theoretical analysis., Wiley, New York, 1959. and P. Suppes, “Preference, utility and subjective probability,” in R. Duncan Luce, R. R. Bush, and E. Galanter, eds., Handbook of mathematical psychology, Vol 3., Wiley, New York, 1965.

32

Mantel, S.P. and F.R. Kardes, “The role of direction of comparison, attribute-based processing, and attitude-based processing in consumer preference,” Journal of Consumer Research, 1999, 25 (4), 335–352. Manzini, Paola and Marco Mariotti, “Sequentially rationalizable choice,” The American Economic Review, 2007, 97 (5), 1824–1839. and , “Stochastic Choice and Consideration Sets,” Econometrica, 2014, 82 (3), 1153–1176. Marley, AAJ, “Context dependent probabilistic choice models based on measures of binary advantage,” Mathematical Social Sciences, 1991, 21, 201–231. Masatlioglu, Yusufcan, Daisuke Nakajima, and Erkut Y Ozbay, “Revealed Attention,” American Economic Review, August 2012, 102 (5), 2183– 2205. Matějka, Filip and Alisdair McKay, “Rational Inattention to Discrete Choices : A New Foundation for the Multinomial Logit Model,” Working Paper, 2013, pp. 1–55. McFadden, D, “Economic choices,” American Economic Review, 2001, 91 (3), 351–378. Miller, GA, “The magical number seven, plus or minus two: some limits on our capacity for processing information.,” Psychological review, 1956, 63 (1). Pan, Yigang, S O’Curry, and Robert Pitts, “The attraction effect and political choice in two elections,” Journal of Consumer Psychology, 1995, 4 (1), 85–101. Roberts, John and James Lattin, “Consideration: Review of research and prospects for future insights,” Journal of Marketing Research, August 1997, 34 (3), 406. Salant, Yuval and Ariel Rubinstein, “(A, f): Choice with Frames,” The Review of Economic Studies, 2008, 75 (4), 1287. 33

Schwartz, Janet A. and Gretchen B. Chapman, “Are More Options Always Better?: The Attraction Effect in Physicians’ Decisions about Medications,” Medical Decision Making, August 1999, 19 (3), 315–323. Shah, Avni M and George Wolford, “Buying behavior as a function of parametric variation of number of choices.,” Psychological science, May 2007, 18 (5), 369–70. Simonson, Itamar, “Choice Based on Reasons : The Case of Attraction and Compromise Effects,” Journal of Consumer Research, 1989, 16 (2), 158–174. and Amos Tversky, “Choice in context: Tradeoff contrast and extremeness aversion.,” Journal of Marketing Research, 1992, 29 (3), 281–295. Slaughter, Jerel E., Evan F. Sinar, and Scott Highhouse, “Decoy effects and attribute-level inferences.,” Journal of Applied Psychology, 1999, 84 (5), 823–828. Tversky, Amos, “Intransitivity of Preferences,” Psychological Review, 1969, 76 (1), 31–48. and Eldar Shafir, “Choice under conflict: The dynamics of deferred decision,” Psychological Science, November 1992, 3 (6), 358. and Itamar Simonson, “Context-dependent preferences,” Management Science, 1993, 39 (10), 1179–1189. White, Chris M. and Ulrich Hoffrage, “Testing the tyranny of too much choice against the allure of more choice,” Psychology and Marketing, 2009, 26 (March 2009), 280–298. Woodford, Michael, “Stochastic Choice: An Optimizing Neuroeconomic Model,” American Economic Review, May 2014, 104 (5), 495–500.

34

A A.1

Proofs Appendix Proposition 1

Assume first that ρ has a FTC representation π. Then: Q

 π (y, z) z∈A∪{x} ρy (x, A)  = Q Q ρy (x, B) z∈B∪{y} π (x, z) / z∈B∪{x} π (y, z) Q Q π (y, z) ρy (x, A \ B) z∈A\B π (x, z) Qz∈B\A = Q = ρy (x, B \ A) z∈B\A π (x, z) z∈A\B π (y, z) z∈A∪{y} π (x, z) /

Q

hence, ρ satisfies ISA. Suppose now ρ satisfies ISA. We will now construct a FTC representation for ρ. Choose two distinct elements: x∗ , y ∗ ∈ X and define the function β : X × X → R+ as following:

β (x, y) =

  ρy∗ (x,{x,y∗ ,x∗ ∗ })

if y = x∗

 ρx∗ (x,{x,x∗ ,y})

if y 6= x∗

ρy∗ (x,{x,y })

ρx∗ (x,{x,x∗ })

In addition, define the function α : X → R+ as: α (x) =

ρx∗ (x, {x, x∗ }) β (x, x∗ )

It is easy to verify that α and β satisfy the following properties: (1)β (x∗ , x) = 1 for all x, (2)β (x, x) = 1for all x, (3) β (y ∗ , x∗ ) = 1, and (4) α (x∗ ) = 1. Lemma 1. For every distinct x, y, z: ρy (x, {x, y, z}) β (x, z) = ρy (x, {x, y}) β (y, z)

35

Proof. Suppose that x∗ ∈ / {x, y, z}. By ISA: ρy (x, {x, y, z}) ρy (x, {x, y, z, x∗ }) = ρy (x, {x, y}) ρy (x, {x, y, x∗ }) ρy (x∗ , {x, y, z, x∗ }) ρx∗ (x, {x, y, z, x∗ }) = ρy (x∗ , {x, y, x∗ }) ρx∗ (x, {x, y, x∗ }) ρy (x∗ , {y, z, x∗ }) ρx∗ (x, {x, z, x∗ }) β (x, z) = = ∗ ∗ ∗ ρy (x , {y, x }) ρx∗ (x, {x, x }) β (y, z) Assume then that y = x∗ . Then: ρy (x, {x, y, z}) ρx∗ (x, {x, x∗ , z}) β (x, z) = = β (x, z) = ρy (x, {x, y}) ρx∗ (x, {x, x∗ }) β (x∗ , z) where the last equality follows from β (x∗ , z) = 1. If x = x∗ then: ρy (x∗ , {x∗ , y, z}) ρx∗ (y, {x∗ , y}) 1 p (x∗ , z) ρy (x, {x, y, z}) = = = = ρy (x, {x, y}) ρy (x∗ , {x∗ , y}) ρx∗ (y, {x∗ , y, z}) p (y, z) p (y, z) as required. Suppose finally that z = x∗ . Then by ISA: ρy (x, {x, y, x∗ }) ρy (x, {x, y, x∗ , y ∗ }) = ρy (x, {x, y}) ρy (x, {x, y, y ∗ }) ρy (y ∗ , {x, y, x∗ , y ∗ }) ρy∗ (x, {x, y, x∗ , y ∗ }) = ρy (y ∗ , {x, y, y ∗ }) ρy∗ (x, {x, y, y ∗ }) ∗ ∗ ∗ ρy (y , {y, x , y }) ρy∗ (x, {x, x∗ , y ∗ }) = ρy (y ∗ , {y, y ∗ }) ρy∗ (x, {x, y ∗ }) β (x, x∗ ) = β (y, x∗ ) as required. Lemma 2. For every x ∈ A ∪ {x∗ }: ρx∗ (x, A) = α (x)

Y

β (x, y)

y∈A∪{x∗ }

Proof. The case of A = ∅ is obvious, while the case of A = {x, x∗ } follows directly from definition of α and β. Suppose by induction the lemma is true 36

for all A with |A| ≤ k. Let B be such that |B| = k + 1. By ISA: ρx∗ (x, B) ρx∗ (x, {x, x∗ , y}) = = β (x, y) ρx∗ (x, B \ {y}) ρx∗ (x, {x, x∗ }) for every distinct x, y ∈ B. Therefore ρx∗ (x, B) = β (x, y) ρx∗ (x, B \ {y}). The conclusion follows from the induction assumption. Lemma 3. For every A and x, y ∈ X: Q α (x) z∈A β (x, z) Q ρy (x, A) = α (y) z∈A β (y, z) Proof. Suppose x∗ ∈ A. Then: Q α (x) z∈A β (x, z) ρx∗ (x, A ∪ {x, y}) Q = ρy (x, A) = ρx∗ (y, A ∪ {x, y}) α (y) z∈A β (y, z) where the second equality follows from Lemma 2. A similar argument can be made when x∗ ∈ {x, y}. Suppose then that x∗ ∈ / A ∪ {x, y}. Then by the above: Q β (x, x∗ ) α (x) z∈A β (x, z) ∗ Q ρy (x, A ∪ {x }) = β (y, x∗ ) α (y) z∈A β (y, z) But by ISA and Lemma 1: ρy (x, A ∪ {x∗ }) ρy (x, {x, y, x∗ }) β (x, x∗ ) = = ρy (x, A) ρy (x, {x, y}) β (y, x∗ ) which along with the previous equation concludes the proof of the lemma. Lemma 4. For every A and x ∈ A: Q α (x) z∈A β (x, z)  Q ρ (x, A) = P y∈A α (y) z∈A β (y, z)

37

Proof. By Lemma 3 we have: Q α (y) z∈A β (y, z) Q ρx (y, A) = α (x) z∈A β (x, z) for all x, y ∈ A. Therefore: !−1 X

ρ (x, A) =

ρx (y, A)

y∈A

Q α (x) z∈A β (x, z)  Q =P z∈A β (y, z) y∈A α (y)

as required. whenever y 6= x, and π ∗ (x, x) = β (x, x) = 1 for Define π ∗ (x, y) = β(x,y) v(y) all x. Then for every non-empty A ⊂ X: Y

π ∗ (x, z) = Q

z∈A

Y v (x) β (x, z) z∈A v (z) z∈A

therefore by Lemma 4: −1 Q Q ∗ v (z) v (x) z∈A β (x, z) z∈A π (x, z)  Q ρ (x, A) = Q =P −1 P Q ∗ (y, z) π β (y, z) v (z) v (y) y∈A z∈A y∈A z∈A z∈A Q

z∈A

to conclude the proof of the proposition, simply set π (x, y) =

π ∗ (x, y) maxw,z π ∗ (w, z)

whenever x 6= y and have π (x, x) = 1 otherwise.

A.2

Proof of Proposition 2 0

Suppose π and π be two FTC representations of the same random choice rule ρ. Note first that for every x 6= y: 0

π (x, y) π (x, y) = ρ (x, {x, y}) = 0 π (x, y) + π (y, x) π (x, y) + π 0 (y, x)

38

0

therefore ∃αx,y > 0 such that π (x, y) = αx,y π (x, y). Moreover: αx,y = αy,x . Note that for every z ∈ / {x, y}: 0

π (x, y) ρz (x, {x, y, z}) π (x, y) = = 0 π (z, y) ρz (x, {x, z}) π (z, y) 0

0

which implies: αz,y = distinct x, y, z, w:

π (z,y) π(z,y)

=

π (x,y) π(x,y)

= αx,y = αy,x . Therefore for every

αx,y = αx,z = αz,x = αz,w  −1 0 0 implying that ∃α > 0 such that π (x, y) = απ (x, y). α ≤ maxX×X π (x, y) follows from π ≤ 1 for all x, y. Sufficiency is obvious.

A.3

The Properties of Copies (used in proposition 3)

In this section I prove the following Lemma: 0

Lemma 5. Let π be the FTC representation of a RCR ρ, and suppose x   0 0 is an x copy. Then for every y ∈ / x, x : (1)π (x, y) = π x , y ; and (2) 0 π (y, x) = π y, x .  0 0 Proof. Since x is an x copy we have for every y ∈ / x, x :     0 0 0 0  0 π x, x ρy x, x, x , y ρy x , x, x , y π x ,x = = = π (y, x0 ) ρy (x, {x, y}) ρy (x0 , {x0 , y}) π (y, x)   0  0  0  0 0 but ρ x, x, x = ρ x , x, x = 12 implies π x, x = π x , x . There 0  0 fore π y, x = π (y, x). ρ (y, {x, y}) = ρ y, x , y then implies π (x, y) =  0 π x ,y .

A.4

Proof of Proposition 6

I prove here that FTC can approximate every CCM choice function. The proof of the other direction in the paper’s main body (section 4.1). Lemma 6. γ is a CCM choice function if andnonly if there existsoa function P v : X × X → R such that: γ (A) = arg maxx∈A y∈A\{x} R (x, y) 39

Proof. Clearly if R exists then γ is a CCM choice function. Suppose γ is a CCM choice function. Set: R (x, y) = v (x, y) − u (y) then note that for every A and x, y ∈ A: X X R (x, z) ≥ R (y, z) z∈A\{x}

z∈A\{y}

is equivalent to: 

 X





v (x, z) − u (y) ≥ 

z∈A\{x}

 X

v (y, z) − u (x)

z∈A\{y}

implying the lemma. We now turn to proving the proposition. Suppose that γ is a CCM. Define the following sequence of FTC representations: πn (x, y) = enR(x,y) . Then for every A: !n P e y∈A\x R(x,y) P ρn (γ (A) , A) = P →1 y∈A\z R(z,y) e z∈A

40

Focus, Then Compare

∗Department of Economics, University of Chicago, [email protected]. This paper .... Given a full support random choice rule ρ and any A ⊂ X, I take ρy (x, A) ...

600KB Sizes 2 Downloads 214 Views

Recommend Documents

Compare--FederalistsAnti.pdf
Sign in. Page. 1. /. 1. Loading… Page 1 of 1. Page 1 of 1. Compare--FederalistsAnti.pdf. Compare--FederalistsAnti.pdf. Open. Extract. Open with. Sign In.

Wifi Compare Wimax.pdf
Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Wifi Compare Wimax.pdf. Wifi Compare Wimax.pdf. Open. Extract.

compare contrast writing.pdf
Whoops! There was a problem loading more pages. Retrying... compare contrast writing.pdf. compare contrast writing.pdf. Open. Extract. Open with. Sign In.

VOIP Compare IPTV.pdf
Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. VOIP Compare IPTV.pdf. VOIP Compare IPTV.pdf. Open. Extract.

Compare Contrast Defarges.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Compare ...

Learning features to compare distributions
Goal of this talk. Have: Two collections of samples X Y from unknown distributions. P and Q. Goal: Learn distinguishing features that indicate how P and Q differ. 2/28 ...

How Does Pennsylvania's Tax Burden Compare? - Commonwealth ...
Aug 25, 2016 - California. 11.00%. 42.81%. 7.67%. Average. 10.76%. 39.74%. 4.81%. Sources: Tax Foundation, US Bureau of Economic Analysis, US Bureau ...

how does pennsylvania's tax burden compare? - Commonwealth ...
Aug 25, 2016 - The 2nd highest corporate income tax rate in the U.S (9.9%). Combined with the ... 8th highest state and local cell phone tax rate (14.14%).

Compare Contrast text str purpose.pdf
Compare Contrast text str purpose.pdf. Compare Contrast text str purpose.pdf. Open. Extract. Open with. Sign In. Main menu.

compare pdf files free tool
Sign in. Loading… Whoops! There was a problem loading more pages. Retrying... Whoops! There was a problem previewing this document. Retrying.

best pdf compare tool
Page 1 of 1. File: Best pdf compare tool. Download now. Click here if your download doesn't start automatically. Page 1 of 1. best pdf compare tool. best pdf ...

compare center-affiliated and unaffiliated
productivity. We evaluate the impact of center affiliation on publication productivity, collaboration, ... 101st US Congressional (1990) call for enhanced support for ...

Compare Human and Chimpanzee Skeleton.pdf
Compare Human and Chimpanzee Skeleton.pdf. Compare Human and Chimpanzee Skeleton.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying ...

DERIVATIVES FOCUS
Feb 17, 2017 - Open Interest F F ก Short Sales F F. F ก F SET50 ก ก กF F 18% F 1,151 F. F ก F ก ก กF F. กก ก ก F Long Index Futures 1,432 F. 6,378 1,250 F ก ...

DERIVATIVES FOCUS
Mar 8, 2017 - ก F F Fก ก. F ก 35.00-35.20 / F F . Single Stock Futures: Long TRUEH17X. Long TRUEH17X ก 6.65-6.70 6.40. TRUEH17X ก ก ก F. F 5 10 ก F ก. F F F ก F F. F ก F 6.65-6.70. ก ก 6.40-6.70. Single Stock Futures:

Daily Focus
Apr 4, 2018 - concerns over its CoF and deposit base which might be affected by other banks' decision to remove their fund transfer fee. However, we believe TMB will see limited impact. The bank is the first and the only one that has never imposed a

Interpreting focus
... instance of what in the presupposition literature has come to be known as the 'binding problem'. ...... Linguistics and Philosophy 19: 259-294. Geurts, B. 1997: ...

DERIVATIVES FOCUS
Jan 9, 2017 - F F F 5 ก 1,911 F Net Long. Index Futures 424 90 F ก ก. ก กก F ก F F F SET50. ก ก F ก Open Interest (OI). Index Futures F ก F Long. F ก OI ก F ก.

beyond compare for ubuntu 1204.pdf
Download. Connect more apps... Try one of the apps below to open or edit this item. beyond compare for ubuntu 1204.pdf. beyond compare for ubuntu 1204.pdf.

Compare Office 365 Education plans - Office.com -
Jan 4, 2014 - and instant messaging. SSiimmppllee ... Android phone, and Windows. Phone. Use the ... 5 PCs/Windows tablets/Macs per user. Desktop ...