Frontier Culture: Historical Roots and Persistence of “Rugged Individualism” in the United States∗ Samuel Bazzi†

Martin Fiszbein‡

Mesay Gebresilasse §

Boston University, CEPR and BREAD

Boston University and NBER

Boston University

July 2017 click here for latest version

Abstract In a classic 1893 essay, Frederick Jackson Turner argued that the American frontier promoted individualism. In this paper, we restate the Frontier thesis and examine its relevance at the subnational level. Using Census data and GIS techniques, we track the frontier throughout the 1790–1890 period and construct a novel, county-level measure of historical frontier experience. We document the sharp demographic differences of frontier locations—high sex ratios, high shares of prime-age adults, high fertility rates, high shares of foreigners, low literacy rates—as well as their higher levels of individualism, proxied by the share of unique names among children. Higher individualism is not driven entirely by selective migration, suggesting that frontier conditions may have shaped behavior, for example, by means of an evolutionary advantage of self-reliance. We offer empirical evidence on this mechanism and consider other potential channels for cultural persistence. We then show that counties with longer frontier experience exhibit greater individualism in the mid-20th century and higher opposition to redistribution and regulation today. The results are robust to accounting for a number of potential confounders as well as an instrumental variable strategy that exploits variation in immigration flows over time.

Keywords: Culture, Individualism, Preferences for Redistribution, American Frontier, Persistence JEL Codes: D72, H2, J11, N31, N91, P16, R11

¨ ¨ We thank Camilo Garcia-Jimeno, Ray Fisman, Oded Galor, Paola Giuliano, Bob Margo, Nathan Nunn, Omer Ozak, Daniele Paserman, and David Weil, as well as seminar participants at ASREC, Boston University, and Brown for helpful comments. Yeonha Jung, Max McDevitt, and Huiren Tan provided excellent research assistance. All errors are our own. † Department of Economics. 270 Bay State Rd., Boston, MA 02215. Email: [email protected]. ‡ Department of Economics. 270 Bay State Rd., Boston, MA 02215. Email: [email protected]. § Department of Economics. 270 Bay State Rd., Boston, MA 02215. Email: [email protected]. ∗

1

By adherence to the principles of decentralization, self-government, ordered liberty, and opportunity and freedom to the individual our American experiment has yielded a degree of well-being unparalleled in all the world. . . . There is a still further road to progress which is consonant with our American system—a method that reinforces our individualism by reducing, not increasing, government interference in business.” —Herbert Hoover (“Rugged Individualism,” 1928) R UGGED INDIVIDUALISM: the practice or advocacy of individualism in social and economic relations emphasizing personal liberty and independence, self-reliance, resourcefulness, self-direction of the individual, and free competition in enterprise.” —Merriam-Webster Dictionary

1

Introduction

Until the late 19th century, the United States underwent an extensive process of westward settlement. Vast tracts of open land characterized its territory. According to the influential historian Frederick Jackson Turner, the presence of a frontier line dividing settled and unsettled locations strongly influenced American culture, fostering the development of unique features. Salient among these was individualism, associated with a strong inclination toward self-reliance and a preference for low levels of government intervention. This paper examines the contemporaneous and long-run effects of the American frontier on culture. First, we show that locations inside the frontier exhibit relatively higher individualism that cannot be entirely explained by selective migration. Evidence suggests that individualism was associated with differential success and survival in the frontier. This individualist advantage enabled an intergenerational mechanism for cultural persistence. We show that locations exposed to the frontier for a longer period historically exhibit higher contemporary levels of individualism, lower support for redistribution, and stronger opposition to government regulation. These findings offer systematic empirical support for a defining and controversial thesis on American history. More importantly, the results shed new light on the persistence of frontier culture and its lasting political legacy across the United States. In the conceptual framework that we propose—based on the classic work of Turner as well as on insights from social psychology and economics—the significance of the frontier resulted from the combination of three factors. First, frontier locations attracted people with distinctive characteristics, both in terms of demographics and individualism. Second, the frontier experience, characterized by isolation and wilderness, fostered the development of self-reliance and related cultural traits known to have a strong relationship with preferences for redistribution and views on the role of government. Finally, the distinctive features of frontier populations affected preferences and social norms at a formative stage, leaving an imprint on local culture. We determine the position of the frontier and track its evolution over time using population data from the Census and applying Geographic Information System (GIS) techniques. Drawing upon Turner’s classic essay and the 1890 Census report, Progress of the Nation (Porter et al., 1890), we identify the frontier

1

line as the line at which population density dropped below two people per square mile. We identify frontier locations at each point in time and define a measure of total frontier experience as the time spent in the frontier during the 1790-1890 period. This provides a precise and comprehensive measurement of the history of the American frontier, and to our knowledge, the first measure of historical intensity of frontier exposure. We provide rigorous new evidence on the demographic and cultural distinctiveness of frontier locations consistent with historical narratives. High sex ratios, high shares of prime-age adults, high fertility rates, low literacy rates, and high shares of foreign-born characterized the frontier. Using semiparametric regressions, we identify sharp breaks in these demographics patterns close to the population density cutoff defining the frontier line in historical accounts. Moreover, event study specifications show how these traits evolve as counties transition out of the frontier and communities grow more permanent roots. These methods reveal similarly sharp patterns of greater individualism inside the frontier as reflected in the prevalence of unique first names for children. This finding holds across various measures of uniqueness and restrictions to native-born (grand)parentage. A core aspect of cultural transmission, name choices not only reflect parental preferences but also shape children’s notions of self-identity with implications for subsequent behavior. A small but growing literature originating in psychology links unique names to other contemporary proxies for individualism as well as social and economic decisions associated with greater self-reliance and less cooperation (see, e.g., Beck Knudsen, 2017; Laham et al., ¨ and Topolinski, 2017). 2012; Twenge et al., 2010; Zurn In order to understand the roots of frontier culture, we need to know why individualism came to thrive in frontier locations. We provide insights using data from a linked sample of households across the 1870 and 1880 Census that allows us to track people across locations. First, we show that selective migration of individualist parents cannot fully explain differential naming patterns inside and outside of the frontier. This suggests that environmental factors in the frontier may have fostered individualism. One potential mechanism is the presence of an evolutionary advantage of individualism in conditions in which people had to rely on themselves for protection and to improve their well-being. We provide suggestive evidence of relatively high returns to individualism in the frontier. Males with unique names, and those that chose unique names for their children, exhibit relatively higher socioeconomic status— both in levels and changes—inside than outside the frontier. Moreover, families with unique names are less likely to leave the frontier for more settled counties to the east during this period. Differential success and persistence of individualism may have influenced the process of cultural transmission across generations, creating a link between initial conditions and modern political preferences. We investigate these long-run effects of frontier exposure on culture using our new measure of total frontier experience (TFE). First, we show that TFE correlates with unique naming patterns several generations after the closing of the frontier. Second, we use a range of survey and voting data to identify significant differences in preferences and policy outcomes across counties with different TFE. These differences are robust to different spatial fixed effects as well as a host of geographic and agroclimatic features that are correlated with TFE and modern culture, including area, latitude and longitude, rainfall and temperature, distance to waterways and the coast, and potential agricultural productivity. We find a robust association of TFE with opposition to redistribution and high levels of government spending. These results hold across three widely used contemporary data sources—the General Social

2

Survey (GSS), the American National Election Survey (ANES), and the Cooperative Congressional Election Study (CCES)—capturing slightly different notions of redistribution and public expenditures. As a benchmark, the GSS-based estimates suggest that each additional decade of TFE is associated with a reduction in support of redistribution on par with the gap in support for redistribution between younger and older individuals 10 years apart. Beyond stated preferences, actual property taxes in 2010 are significantly lower in counties with greater TFE. We further link TFE to growing support for the Republican Party. Each decade of TFE is associated with 3.5 percent more votes for Republican candidates in presidential elections since 2000. This effect ratchets up over the 2000s with each election exhibiting a significantly larger effect of TFE, and the 2016 election exhibiting a particularly large frontier legacy.1 We provide deeper insights into why Republican party support is stronger in areas with higher TFE by identifying a set of contentious policy issues that are at the core of frontier norms and values described in the historical literature. Using the CCES, we show that locations with higher TFE are more likely to support repeal of the Affordable Care Act, less likely to support an increase in the minimum wage, less likely to support a ban on assault rifles, and more likely to oppose government regulation of CO2 emissions. These effects survive even after accounting for the strength of partisan and class cleavages. Each of these issues bears a particular link to salient notions in the frontier: opposition to state intervention, strong belief in effort versus luck in reward, necessity of self-defense, and notions of “manifest destiny.” We take three steps to argue for a causal interpretation of these long-run effects. First, we show that the results survive a progressively richer set of controls aimed at removing important confounders of frontier experience, individualism and preferences for redistribution. These include, among others, population density, diversity, and historical exposure to large-scale public investment such as the railroad. Second, we rule out additional concerns about omitted variables by using the Oster (forthcoming) approach to show that the degree of selection-on-unobservables is relatively limited. Finally, we show that most effects survive an instrumental variables (IV) strategy that isolates plausibly exogenous shifts in the speed of westward expansion through a given location. From the perspective of a given county in the sparsely populated interior, the instrument exploits the coincidental timing of immigration flows to the (coastal) United States and westward movements of the frontier through counties far to the east.2 This paper contributes to a growing economic literature on culture, focusing on preferences for redistribution and individualism. Considerable attention has been paid to preferences for redistribution. Less has been paid to individualism, despite its prominence in the influential work of Greif (1994).3 While 1

As discussed in Section 6.4, the magnitude of this legacy compares favorably to other leading, more proximate explanations for the Republican shift in the American heartland, such as import competition from China. 2 Conceptually, our strategy can be viewed as a combination of Bartik (1991) and spatial lag approaches to identification, avoiding concerns about validity due to reverse causality associated with the latter (see Gibbons and Overman, 2012). 3 An important exception is a series of recent papers by Gorodnichenko and Roland (2011, 2015, 2016), discussed further in Section 2.1. This cross-country analysis argues that individualism fosters economic growth and democratization. Empirically, they rely on a well-established link between pathogen prevalence (i.e., communicable diseases) and the strength of collectivist norms. While this correlation has been found across broad regions of the United States as well (Varnum, 2014), our empirical strategy isolates local variation in the drivers of individualism that is not driven solely by this mechanism. Nevertheless, our findings complement this other work as frontier conditions share many of the same features of ecological and biological stress underlying the pathogen prevalence mechanism. Indeed, we are open to the possibility that part of the way in which the frontier shapes behavior could be through this biology-based, parasite-stress theory of sociability due to Fincher and Thornhill (2012). What is unique here is how these environmental factors interact with the frontier settlement process at a critical juncture of political history.

3

there is extensive work exploring cross-country differences in preferences for redistribution (see Alesina and Giuliano, 2011, for a survey), we are the first to study the role of the American frontier in shaping these preferences, thereby offering support to the conjecture in Alesina et al. (2001) that the frontier helps explain the roots of opposition redistribution in the U.S. today.4 To our knowledge, Garc´ıa-Jimeno and Robinson (2011) is the only other paper that empirically examines the Frontier thesis in economics, assessing the correlation between the size of the frontier and the prevalence of democratic institutions at the country level in the Americas.5 Our study identifies the long-run effects of the American frontier on contemporary preferences and policy outcomes by exploiting subnational variation, using a wealth of data and introducing a novel measure of frontier exposure. Our results show that individualism and opposition to redistribution survive long after the transition from frontier conditions to established settlement, adding new evidence to a growing literature on the deep roots and striking persistence of culture (e.g., Alesina et al., 2013; Fern´andez, 2011; Guiso et al., 2016; Nunn and Wantchekon, 2011; Spolaore and Wacziarg, 2013). Giuliano and Nunn (2017) show how agroclimatic factors shape incentives to maintain culture and traditions across generations. While frontier culture may arise out of particular environmental conditions, its persistence is due in part to the critical juncture of U.S. history in which that culture flourished across the American heartland. Although not explicit, this notion of persistence was arguably at the heart of Turner’s original thesis. Moreover, we add to a very rich literature outside economics that elaborates on Turner’s celebrated work. While many studies in history and sociology attempt to document the demographic traits characterizing the frontier, we offer a more comprehensive and systematic approach to measurement. By identifying sharp structural breaks in these traits at around the seemingly arbitrary population density cutoffs put forward by Turner and his contemporaries, we find renewed support for classic notions of what made the frontier so distinctive. Our analysis of the long-run effects of frontier experience as the outcome of intergenerational transmission of individualism also makes a contribution beyond economics. In the social psychology literature, Vandello and Cohen (1999) and Varnum and Kitayama (2011) document empirical patterns that suggest persistent effects of the American frontier on culture. Using contemporary state-level data, they show that the Western United States exhibit higher levels of individualism. Varnum and Kitayama (2011) use the share of uncommon names to capture a state’s degree of individualism, and explicitly motivate their study as a test of the Frontier thesis (previously discussed at length in Kitayama et al., 2010). We go beyond the broad geographic correlations in these studies by measuring the frontier more precisely, by providing empirical evidence on the underlying mechanisms for persistence, and by linking individualism to actual policy outcomes. Overall, our findings may offer new perspective on the process by which American culture and politics became imbued with high levels of individualism and opposition to redistribution. We cannot directly address the roots of cultural traits in the U.S. as a whole and hence refrain from offering any direct explanation in regards to cross-country cultural variation. However, our findings may help explain 4

In particular, they note on p. 222 that “The open frontier in a country of immigrants strengthened individualistic feelings and beliefs in equality of opportunities rather than equality of outcomes. 5 They find a positive and significant correlation between the percentage of land in the frontier in 1850 and an indicator of democratic quality between 1900 and 2007. They also go beyond the simple Frontier thesis to a conditional one in which the effects of the frontier depend on the quality of initial institutions.

4

the persistent differences in political views across regions of the U.S. From a policy perspective, the cultural persistence that we identify may be helpful in understanding a salient puzzle in American political economy, namely that preferences for redistribution have remained extremely stable despite significant increases in inequality over the last 40 years (see Ashok et al., 2015). The paper is organized as follows. Section 2 provides a general discussion of individualism and opposition to redistribution as well as economic theories about their origins and consequences. We also link these theories to the Frontier thesis and offer a simple conceptual framework to understand its significance. Section 3 explains how we measure frontier exposure historically and over the long run. Section 4 documents the distinctive features of frontier populations. Section 5 offers evidence for why the frontier may have favored individualism while sustaining its persistence across generations. Section 6 then provides estimates of the long-run effects of frontier experience on culture. Section 7 concludes.

2

The Frontier Thesis and Modern Political Economy

This section provides a conceptual background for our analysis by situating Turner’s ideas about the American Frontier within the growing economics literature on culture. We start by discussing some contributions in this literature that are key for our purposes. Then, we restate the Frontier thesis, breaking it down into the potential channels for initial influence and subsequent persistence.

2.1

Individualism and Preferences for Redistribution

The role of culture has generated increasing interest among economists. Following Guiso et al. (2006), culture is often defined as “those customary beliefs and values that ethnic, religious, and social groups transmit fairly unchanged from generation to generation.” A growing number of contributions study the roots and consequences of various cultural traits, including trust, thrift, preferences for redistribution, the strength of family ties, long-term orientation, and attitudes toward fertility and female labor market participation (Alesina and Giuliano, 2011; Alesina et al., 2013, 2014; Figlio et al., 2016; Fern´andez, 2011; ¨ Galor and Ozak, 2016; Luttmer and Singhal, 2011; Tabellini, 2010). This paper focuses on individualism and opposition to government intervention, which were salient in the context of the American Frontier. Individualism has attracted relatively more attention in the social and cultural psychology literature. This literature—a prime source of data and ideas for studies of culture in economics—often portrays individualism as the most important dimension of cross-country variation in culture (e.g., Heine, 2010; Triandis, 1995). The influential work of Hofstede (1980) emphasizes individualism as a concern for oneself and immediate family, an emphasis on personal autonomy and self-fulfillment, and the foundation of identity on personal accomplishments (see also Hofstede, 1991). Individualism is most often defined in contrast with collectivism. Triandis (1988, 1995, 2001) makes the distinction in terms of four major attributes: in individualism, the self is independent rather than interdependent, there is primacy of personal goals rather than group goals, behavior is regulated primarily by personal attitudes rather than social norms, and market exchange is more important than communal relationships.6 6

¨ The classical works of Durkheim (1893), Tonnies (1887), Weber (1905) and Parsons and Shils (1951) had anticipated some aspects of the individualist–collectivist distinction.

5

Individualism is best characterized as a set of interconnected traits and hence is less straightforward to define than other cultural dimensions. Perhaps because of this, the individualist–collectivist dimension of culture has received relatively little attention from economists. There are, however, a few important exceptions. Greif (1994) introduces the individualist–collectivist dimension in his influential contribution on the role of culture in shaping institutional structures governing trade in the Middle Ages. More recently, Gorodnichenko and Roland (2011, 2015) show that individualistic countries have more democratic institutions and higher long-run growth. Gorodnichenko and Roland (2016) build an endogenous growth model in which individualism creates incentives to stand out and innovate. Individualism is closely associated with opposition to redistribution, a cultural trait that has attracted considerable attention in economics. Paul Samuelson once noted that “to an economist the word ‘individualism’ is tied up with laissez faire” (Samuelson, 1965). In fact, Alesina and Giuliano (2011) measure preferences for redistribution using the same question from the General Social Survey that Di Tella et al. (2008) use to measure individualism. Intuitively, the defining characteristics of individualism (e.g., the inclination toward self-reliance, the concern for oneself and immediate family) can be associated with an opposition to redistribution. The connection is explicit in the ideology—often associated with U.S. Republican president Herbert Hoover—of “rugged individualism,” which promotes self-reliance and opposes state intervention through taxes or regulations. In the sociology literature, Celinska (2007) notes that an aspect of “utilitarian individualism and the consequence of a strong belief in self-reliance” is the “opposition toward governmental efforts to equalize citizens’ economic position, to limit private business, and to build strong social programs that provide assistance to the most disadvantaged.”

7

Economists have advanced explanations for variation in preferences for redistribution that are relevant for understanding the roots and persistence of cultural traits characterizing the American Frontier. In a seminal paper, Piketty (1995) analyzes how preferences for tax redistribution are shaped by beliefs about the importance of effort versus luck in the income generating process, and how these beliefs are determined by history. The higher the perceived importance of effort, the higher the expected negative effects of taxes due to adverse incentives, and thus the lower the desired tax rate. Alesina and Angeletos (2005) show that a belief in the importance of fairness—a conviction that income should be determined by effort rather than luck—is highly correlated with social welfare spending as a share of GDP across countries. In a model of tax redistribution, preferences for fairness generate multiple equilibria, one with high redistribution and the other with low.8 The multiplicity of equilibria in models like this may help shed light on the persistent nature of cultural traits, as discussed further in the next section. Benabou and Ok (2001) highlight the role of expectations about future income, examining how the prospects of upward mobility may lead to lower preferences for tax redistribution. Even people with below-average income, who would appear to benefit from redistribution, may oppose it insofar as they expect to move up in the income distribution. Benabou and Ok show that it is possible to have a majority 7

In general, individualism and preferences for redistribution are not necessarily associated in this way. For instance, across countries, the correlation between Hofstede’s measure of individualism and a measure of preferences for redistribution from the World Values Survey goes in the opposite direction. Yet, controlling for country fixed effects, or with data solely from within the United States based on the American National Election Survey, there is a negative association between self-reliance and support for redistribution. 8 The role of fairness considerations and the perceived importance of effort versus luck in income generation emphasized in this framework also sheds light on how perceptions about poverty—in particular, whether poor people are poor due to lack of effort or luck—influence the desired levels of redistribution.

6

of rational expectations agents voting against redistribution. In an empirical study using data from the U.S., Alesina and La Ferrara (2005) find that individuals with greater expected income growth are more likely to oppose redistribution, suggesting that high income mobility in the U.S. might explain its relatively low level of redistribution. These theories explain how preferences for redistribution are affected by expectations about future income and the perceived role of luck versus effort in income generation. The insights also apply more broadly to preferences about the size and scope of government. While we focus on redistribution in line with much of the prior literature, we also investigate other outcomes related to government intervention, spending, and regulation more broadly. These outcomes reflect core policy differences between the United States and Europe that have motivated substantial research in political economy. As noted earlier, Alesina et al. (2001) alludes to a possible role of the frontier. In the next section, we provide further background on this potential connection, laying the groundwork for our empirical analysis that follows.

2.2

The Frontier Thesis: A Restatement

Throughout the 19th century, the United States underwent a process of rapid westward expansion. According to the influential historian Frederick Jackson Turner, the presence of vast tracts of open land throughout a formative period of U.S. history had a strong influence on American society. In his words: “The existence of an area of free land, its continuous recession, and the advance of American settlement westward, explain American Development” (Turner, 1920). More precisely, Turner’s thesis suggests that the presence of the frontier, “the meeting point between savagery and civilization,” was conducive to the formation of individualistic and democratic cultural traits: “These free lands promoted individualism, economic equality, freedom to rise, democracy” (Turner, 1920). This paper can be viewed as a restatement of Turner’s thesis at the subnational level. We argue that the significance of the frontier resulted from the combination of three factors. First, frontier locations attracted people with distinctive characteristics, both in terms of demographics and the prevalence of individualism. Second, the frontier experience, characterized by isolation and wilderness, fostered the development of self-reliance and related cultural traits. Finally, the distinctive features of frontier populations affected preferences and social norms at a formative stage and thus left a persistent imprint on local culture. These three factors—selective migration to the frontier, the effect of place on people, and cultural persistence—were all recognized by Turner (albeit not in a fully precise nor comprehensive way) and explored in many subsequent contributions, from which we draw below. Selective Migration.

In traditional narratives, frontier communities are characterized by the preva-

lence of both young single men, mostly of low socioeconomic status, many of them foreign, and large families seeking access to land (see, e.g., Eblen, 1965). Naturally, these distinctive features reflected the characteristics of individuals that were most willing and able to undertake migration and settle in locations with frontier conditions. The high male to female ratios and the prevalence of prime-age adults may be due to frontier conditions being especially hostile to women and elderly people, both because of the harsh natural environment and higher crime rates. Higher fertility in the frontier may be explained by land availability (Forster and Tucker, 1972; Leet, 1975; Yasuba, 1962). Turner himself devoted attention to selective migration to the frontier, most clearly in his theory of 7

the “safety valve” whereby the frontier provided an outlet for unskilled urban workers, thus relieving pressure in Eastern labor markets and making wages higher than they would have been otherwise. Goodrich and Davison (1935, 1936) and Steckel (1989) do not find strong support for this theory, but Ferrie (1997) shows that unskilled urban workers were a significant part of Westward migratory flows in the 1850s and that the frontier offered benefits to those at the bottom of the ability distribution. Another salient characteristic of frontier people was their high level of individualism. As noted in contemporary social psychology, migrants tend to have independent mindsets—particularly those moving to frontier locations, leaving their social environments to settle in remote and isolated contexts (see Kitayama et al., 2006; Jokela, 2009; Kitayama et al., 2010; Beck Knudsen, 2017). Effects of Place on People.

While frontier locations attracted people with specific traits, the frontier

experience, in turn, influenced the settlers’ values, beliefs, and behavior. In Turner’s words, “a modification of the original stock occurred” in the frontier context. This modification can be explained by the opportunities and challenges created by the unique natural and social context of remote and often isolated frontier locations. The abundance of land and other natural resources offered ample profit opportunities, insofar as they were deftly exploited. On the other hand, frontier settlers often faced harsh climatic conditions and multiple types of danger—e.g., plagues, droughts, blizzards, and crop failure, as well as attacks from wild animals, Native Americans, and other settlers—often without the protection of social infrastructure providing prevention and care. The opportunities and threats faced by frontier settlers may have favored individualism through an evolutionary mechanism. In the frontier context, people had to rely on themselves for protection and prevention, and to improve their living conditions. While the returns to cooperation may have been high, its scope was limited by the difficulties of maintaining reciprocity in a setting with high population mobility.9 Thus, individualistic traits had an adaptive value: beliefs and behavior based on independence and self-reliance made people better suited to cope with the frontier environment (Plaut et al., 2002; Kitayama et al., 2010).10 In addition, resource abundance in frontier locations may have bred attitudes against government intervention because of favorable prospects of upward mobility and a large perceived importance of effort in income generation.11 Based on the theories outlined above, these conditions would tend to generate opposition to redistribution. Turner himself observed that in the frontier the “tax-gatherer is viewed as a representative of oppression,” since the environment “produces antipathy to control.” According to historical narratives, people in the frontier disliked any interference in their exploitation of nature’s resources, particularly from hierarchical institutions like the far-removed central government. Cultural Persistence.

The Guiso et al. (2006) definition of culture underlines persistence, as it refers

to stable beliefs and values that are transmitted across generations. The persistent character of cultural 9

Nisbett and Cohen (1996), Grosjean (2014), Restrepo (2015), and Couttenier et al. (2016) highlight the high levels of violence characterizing contexts of low population density and high population mobility, lack of well-defined property rights, and absence of clear mechanisms for law enforcement. While critics of Turner (e.g. Boatright, 1941) have emphasized the importance of cooperation in the frontier, his supporters have argued that cooperation was not inconsistent with individualism. 10 This view can be framed within a notion of culture as decision-making rules-of-thumb used in uncertain environments, as proposed by evolutionary anthropologists (Boyd and Richerson, 1985, 2005). In their models, a process of natural selection governed by the payoffs from different rules-of-thumb determines which rule prevails. 11 See Appendix A.1 for a more detailed background and supportive evidence on this particular view.

8

traits has been emphasized by Roland (2004), Guiso et al. (2006), Nunn and Wantchekon (2011), and Alesina et al. (2013), among others. The theoretical contributions of Alesina and Angeletos (2005), Guiso et al. (2008), and Tabellini (2008) highlight the possibility of multiple equilibria in cultural traits, suggesting that once certain factors push a society toward a given cultural equilibrium, the outcome may persist in the long-run.12 The idea that distinctive features at early stages of societal formation would have long-lasting effects is emphasized by Zelinsky (1973). His “doctrine of first effective settlement” argues that when “an empty territory undergoes settlement [. . .] the specific characteristics of the first group able to effect a viable, self-perpetuating society are of crucial significance for the later social and cultural geography of the area, no matter how tiny the initial band of settlers may have been.” Our goal in the remainder of the paper is to shed new empirical light on these three forces linking the historical formation of frontier culture to modern political economy outcomes.

3

Mapping the History of the Frontier

This section presents our method for mapping the history of the frontier. First, we explain how to use U.S. Census data and GIS techniques to determine the position of the frontier line at each point in time. Second, we use these tracking algorithms to define frontier counties and measure total exposure to the frontier by the time it is closed in 1890. From colonial times until the late 19th century, America underwent massive population growth and rapid westward expansion. The process was documented in historical sources, like the noteworthy 1890 Census report on the Progress of the Nation (Porter et al., 1890), a key source of inspiration for Turner’s classic 1893 essay. The authors observed that the Thirteen Colonies, already settled communities by 1790, were “the sources of supply for a great westward migration,” as people “swarmed from the Atlantic coast to the prairies, plains, mountains, and deserts by millions during the last century.” The report described in great detail the evolution of population growth, the extent of the settled area, and the westward movement of the center of population from decade to decade throughout the 1790–1890 period.13 It also suggested the notion of a “frontier,” which also appeared, in various ways, in previous works (see Mood, 1945; Juricek, 1966). The population of the Thirteen Colonies was concentrated in close proximity to the East coast. From 1790 to 1890, as the nation’s population increased from 3.9 million to 62.6 million, the total settled area went from under 240,000 square miles to almost 2,000,000. In the same period, the mean center of population shifted westward over 500 miles, from a point in Maryland, just east of Washington D.C., to Decatur, Indiana. The process of frontier settlement advanced continuously throughout the 1800s. However, the speed of westward shifts in the center of population varied considerably from decade to decade. The magnitude of the shift decreased from 1790–1800 to 1800–1810, only to speed up again in the next decade. The 12

Turner himself suggested that the imprint of the frontier was persistent, as frontier societies had “certain common traits, and these traits have, while softening down, still persisted as survivals in the place of their origin, even when a higher social organization succeeded.” 13 The mean center of population as defined by the Census Bureau is an imaginary balance point at which weights of equal magnitude corresponding to the location of each person in an imaginary flat surface representing the U.S. would balance out.

9

maximum east-to-west shift was 127 km in 1850–1860, a decade of large immigrant inflows to the U.S. (2.8 million) and very fast overall population growth (a 35.5 percent increase). In the following decade, which witnessed the Civil War, the westward shift hit a 3-decade low of around 70 km, but it bounced back in the 1870s to around 90 km. The Porter et al. Census report considered that the process of westward expansion was completed, and the frontier closed, by 1890. In a passage famously quoted in Turner’s essay, it stated that “Up to and including 1880 the country had a frontier of settlement, but at present the unsettled area has been so broken into by isolated bodies of settlement that there can hardly be said to be a frontier line.” Turner would go on to discuss some of the key geographic and political forces giving rise to variation in the shape and speed of westward expansion. We use this variation to develop a new measure of frontier exposure next and revisit these underlying forces in Section 6 when developing an empirical strategy for studying the long-run effects of the frontier.

3.1

Locating the Frontier and Tracking its Movements

In empirical studies of the American frontier, researchers have adopted simplifying definitions for specific contexts. In a study of the characteristics of westward migrants in 1850 and 1860, Steckel (1989) defines the frontier as those settlements in Minnesota, Iowa, Kansas, Texas, and those farther west. Ferrie (1997) studies migration to the frontier between 1850 and 1870 and defines the 90◦ west longitude as the frontier‘s eastern boundary. Kitayama et al. (2010) simply associate the frontier with the Western United States. In this paper, we adopt a definition of the frontier line that follows the Porter et al. (1890) report and Turner’s classic essay. While mentioning that the concept was “elastic,” Turner adopted the Porter et al. definition of the frontier line as the margin of settlement with population density of two per square mile or more. We use the location of the frontier line to define frontier counties as those (i) in close proximity to the frontier line—in our baseline, those with a centroid within 100 km of the frontier line—and (ii) with a population density below 6 people per square mile (this definition is also based on the 1890 Census report).14 For each Census year between 1790 and 1890, we calculate county-level population density per square mile. For intercensal years, we interpolate county-level population density by assuming a constant annual population growth rate that matches the decadal growth rate (replacing initial zeros with arbitrarily small values to avoid infinite growth rates). We maintain consistent units of observation over time by harmonizing all data to the 2010 boundaries using an approach suggested in Hornbeck (2010).15 While the harmonization and interpolation procedures may create measurement error, such 14

Consistent with their mapping conventions, Porter et al. (1890) defined six categories of settlement, the first of which was wilderness or vacant land, characterized by densities between 0 and 2 people per square mile, and the second was frontier settlement, characterized by densities between 2 and 6. See also Eblen (1965). The results are qualitatively unchanged if we redefine frontier counties with different distance cutoffs (both higher and lower than 100 km) and without imposing the density restriction of 6 people per square mile. 15 First, we intersect the county shapefiles from each of the decadal census years with the 2010 county shapefile and calculate the area of each intersection (all shapefiles come from the National Historical Geographic Systems database). When the 2010 county falls in one or more counties of the earlier shapefile, each piece of the 2010 county is assigned value equal to the share of the area of the piece in the earlier county multiplied by the total value of the data for the earlier county. Then, the data for each county in 2010 is the sum of all the pieces falling within its area. This harmonization procedure would be exact if all the data from the various years are evenly distributed across county areas.

10

error is largely confined to classical noise in county-level outcome measures and hence works against finding systematic effects of the frontier. Using the annual county-level population densities between 1790–1890, we locate the frontier line for each year by drawing contour lines that divide counties with population densities above and below two people per square mile. Figure 1 plots the resulting lines for 1790, 1820, 1850, and 1890. Meanwhile, Appendix Figure A.1 illustrates how closely our estimation of population density and the corresponding frontier lines match the historical map for 1860 in Porter et al. (1890).16 In constructing the primary frontier line in each year, we further discard line segments less than 500 km in length. This effectively discards several of the isolated pockets of dense populations beyond the main frontier line and pockets of sparse populations inside that line. Neither of these types of pockets correspond to the historical notion of the frontier as “margins of civilization”, and hence we retain only the core, connected frontier line emerging from east-to-west expansion. We further discard a second major frontier line emerging in California in the latter part of the 19th century, as the process of settlement on the West Coast was marked by the Gold Rush and distinct historical forces that are beyond the scope of the original Frontier thesis.17 The evolution of the resulting, primary frontier line from 1790 to 1890 can be seen in Figure 2, and it is this frontier expansion that forms the core of our analysis.

3.2

Measuring Total Frontier Experience

The uneven speed of westward expansion gives rise to differences across counties in time exposed to the frontier that we seek to exploit in our investigation of the long-run effects of frontier culture. To measure the intensity of historical frontier experience at the local level, we calculate the number of years each county spent within the frontier belt during the period of frontier expansion from 1790 and 1890 as defined by the Census Bureau. We exclude from our analysis the counties that were already settled (i.e., east of the frontier line) by 1790, since the available data do not allow us to adequately measure total frontier experience. Furthermore, our sample excludes counties that were west of the 1890 main frontier line demarcating east-to-west expansion. For each year between 1790–1890, we assign each county a dummy variable equal to one if it is considered to be in the frontier using our definition. In our main specification, we assign a county frontier status in a given year if its centroid (using 2010 county boundaries) is within 100 km of the frontier line in that year. Results are robust to other distance cutoffs. Then, the total frontier experience (TFE) for each county is the sum of indicators of frontier status from 1790 to 1890. Figure 3 shows the spatial distribution of TFE, measured in years and using the 100 km frontier cutoff, for the counties included in our final sample. Total frontier experience ranges from 0 to 66 years, with a mean of 19.4 years and a standard deviation of 12 years, and displays substantial variation both across and within states and bands of latitude and longitude more broadly. Whether and how the cross-county differences in TFE matter today are the key questions we aim to answer.

16

Comparison maps for other years also demonstrates that the use of harmonized 2010 county boundaries has little effect on the location of the frontier relative to an approach based on contemporaneous county boundaries historically. 17 From 1850 onward, the boundary between Arkansas and the Indian Territory (currently Oklahoma), where settlement did not advance, remained a part of the frontier line. Our results are robust to dropping from the sample the counties around this border.

11

4

The Distinctive Features of the Frontier

This section thoroughly documents the distinctive features of frontier locations. We do so with two complementary strategies: (i) a cross-sectional comparison across counties with different population densities reported below, and (ii) a within-county, longitudinal comparison of the years before and after a given county exits the frontier. The latter approach reinforces the former, delivering broadly similar insights, and hence we report this in Appendix A.2. We focus on a set of demographic characteristics associated with the frontier in historical accounts: sex ratios, age distributions, fertility and literacy rates, and immigrant population shares.18 Moreover, we show that frontier locations were sharply different in levels of individualism as proxied by the share of children with uncommon names. We draw upon historical Census data on gender, age, birthplace, and literacy from Haines and ICPSR (2010). For each of these variables, we use data for all available years between 1790 and 1890. For some variables we consider the white population only as this helps to attain consistency in the construction of variables across time periods and ensures that observed differentials in the demographic characteristics of interest are not driven by variation in the share of whites in the population. Further details on the construction of all variables used in this section can be found in Appendix B. Our measure of individualism is based on the names that parents give to their children. As suggested by the social psychology literature, individualistic types are prone to give their children unique names, reflecting a desire to stand out, as opposed to common names, which reflect a desire to fit in (Twenge et al., 2010). These choices have profound implications for children’s self-identity and hence social behavior. As such, the local prevalence of unique names captures important information about culture, an underlying theme across sociology (e.g., Lieberson and Bell, 1992), economic (e.g., Abramitzky et al., 2016; Fryer Jr. and Levitt, 2004), and psychology (e.g., Gureckis and Goldstone, 2009; Varnum and Kitayama, 2011) research on different aspects of unique names. In our setting, names offer a direct mechanism for persistence, linking parental preferences and socioeconomic behavior in the next generation. We use full-count, historical Census data from several decades beginning 1850 to measure the share of children in a given county under 10 years of age with unique names and consider various approaches to quantifying uniqueness.19 Appendix B offers examples of common names in 1850 and 1880, which include John, William, Sarah, Elizabeth, among others. Beck Knudsen (2017) validates this type of unique names proxy by showing that it is strongly correlated with other commonly used measures of individualism both at the national and subnational level. Section 5.2 below provides evidence on the economic content of unique names in our setting. The results in this section provide new empirical evidence in support of historical narratives about the rugged individualists settling the American frontier. While confirming that the frontier was different 18

Historians and sociologists have devoted considerable attention to analyzing the demographics of frontier locations, in particular sex ratios, fertility rates, and age distributions (Bowen, 1978; Coombs, 1993; Demos, 1968; Easterlin et al., 1978; Eblen, 1965; Modell, 1971; Moller, 1945; Smith, 1928; Schaefer, 1985). While the findings often provide some support for stylized facts captured in traditional narratives, these studies are focused on specific places at particular points in time, making it difficult to establish general regularities about frontier demographics. This section offers a new step in that direction, representing the first systematic documentation of frontier demographics. 19 Another potentially confounding naming practice lies in the passing on of parental first names to children. Using data discussed in Section 5.1, we show that while only around three (five) percent of girls (boys) have such matronymics (patronymics), this practice is less common inside the frontier and significantly so for boys. Choosing novel names for one’s children rather than one’s own arguably reflects a desire to instill independence. As such, this finding is consistent with our broader claim that the higher prevalence of unique names inside the frontier reflects greater individualism.

12

along dimensions previously noted in the literature, our results also document just how different it was. Our results provide the first evidence of (i) a break in certain demographic characteristics at around the frontier boundary, and (ii) a higher prevalence of individualistic traits inside the frontier. Distinctive Frontier Populations.

We identify the distinctive features of the frontier using cross-

sectional regressions relating population characteristic x in county c in Census division d at time t to the population density in that county. In particular, we estimate pooled, semiparametric regressions of the following form: xcdt = α + g(population densitycdt ) + θd + θt + εcdt ,

(1)

where g(·) is some nonlinear function recovered using the partially linear (Robinson, 1988) estimator with Census division and year fixed effects, θd and θt , respectively. While we estimate g(·) across all counties in the sample, we restrict the graphs presented in this section to counties with less than 50 people/mi2 in order to focus on the local changes at the frontier threshold. In 1840, the midpoint of our study period, in the sample there are 437 counties in the range of 0 to 2 people/mi2 , 303 counties from 2–6 people/mi2 , 201 from 6–10, 540 from 10–50 people/mi2 , and 41 from 50–100 people/mi2 .20 The graphs in Figure 4 described below plot 95 percent confidence intervals around g(·). Demographics.

Figure 4(a) shows the distribution of the sex ratio (for whites) by level of popula-

tion density. Males significantly outnumber females in less densely populated counties. The sex ratio displays a peak of around 1.6 in the most sparsely populated counties and declines sharply with population density until around levels of 3–4 people/mi2 . The slope of g(·) then flattens out, and the sex ratio stabilizes at around 1.05–1.1 males for every female. Moreover, frontier settlers tended to be prime-age adults, with relatively fewer children and old individuals. Figure 4(b) bears this out by plotting estimates of g(·) in equation (1) for the share of individuals in the age 15–49 range. This prime-age adult share declines sharply as we move towards density levels of 2–3 people/mi2 and eventually levels off at around 0.37. More dependent groups younger than 15 and older 49 are underrepresented on the frontier relative to the more densely settled areas to the east. High fertility rates are another salient feature of the frontier borne out in our historical data. The availability of cheap land may have attracted families with children and also promoted childbearing among settler families. Unfortunately, direct measures of the number of children ever-born per woman are not available. Instead, we consider the ratio of total (white) children aged 0 to 5 over total (white) women aged 15 to 39 as a rough proxy for prevailing fertility within a given decade. Based on this proxy in Figure 4(c), we find that fertility rises from low levels in the most sparsely populated counties, peaks at around 2–3 people/mi2 and then secularly declines thereafter. Figure 4(d) highlights another important feature of the types of individuals found on the frontier: literacy rates for white adults are significantly lower in more sparsely populated areas. Below 2–3 20

See Appendix Figure A.3 for the full distribution of population density across counties in different decades. Note that, in any given time period, the frontier does not comprise all the sparsely settled counties in the sample. In 1880, for instance, among the counties in the sample with population densities below 6 people/mi2 , over 60 percent were within 100 km from the frontier line, but the rest fall outside the strict definition of the frontier. While our aim is to establish the distinctive features of the frontier, we choose not to impose any specific definition in the baseline empirical strategy. Instead, we simply examine how key population characteristics vary with population density. Similar insights arise from a complementary approach examining how the same population characteristics vary with distance to the frontier.

13

people/mi2 , around one-quarter of the population in the average county is illiterate. The low levels of literacy observed in the frontier may be partly explained by the negative selection of westward migrants in the United States during this period, in line with Turner’s safety valve theory and the results in Ferrie (1997). Finally, Panel (e) of Figure 4 presents evidence on the extent to which foreign-born immigrants contributed to frontier settlement over the 1800s. The nonlinear shape of g(·) shows that the immigrant population share was significantly higher in more sparsely populated counties. This is the visual evidence of the common historical account of long-distance migrants from abroad settling the frontier. Here, the g(·) gradually levels out at higher levels of density, of around 5–10 people/mi2 . Looking across these outcomes, we consistently find evidence that demographic characteristics exhibit a sharp change roughly around our demarcation of the frontier line, consistent with the historical literature (see Section 3). For example, using the Chow (1960) test, we can easily reject the null hypothesis of a constant effect of population density (on all x) above and below 6 people/mi2 (the upper bound of frontier settlement according to Eblen, 1965; Porter et al., 1890), or above and below any cutoff in the 2–6 range. Naturally, the precise turning points vary across outcomes. More agnostic Zivot and Andrews (2002) tests help identify unknown structural break points in each decade. In 1850, for example, we find a break in the sex ratio at 2.7, the prime-age adult share at age 2.0, and the share of foreign born at 3.4. Individualism. The foregoing patterns provide robust evidence consistent with the historical accounts of the frontier discussed in Section 2.2. These demographic turning points had profound implications for the ensuing patterns of settlement along the frontier. However, in understanding the long-run effects of frontier exposure, it is important to determine whether the early settlers along the frontier fit the individualistic mold at the heart of prevailing historical narratives. The remainder of this section provides among the first rigorous empirical evidence that indeed the early settlers were more individualistic as captured by naming patterns recorded in the the full count 1850 and 1880 Census data. Figure 5 displays a stark pattern in the uniqueness of names across counties that suggests a break in the relationship between individualism and population density. In particular, in the most sparsely populated counties, more than 65 (80) percent of boys (girls) aged 0–10 with native-born parents have names that are outside the top 10 most popular in that decade’s birth cohort within the Census Division. However, this ratio decreases sharply as we move towards counties with more than 2–3 people/mi2 , leveling off thereafter. The Zivot and Andrews (2002) test identifies a structural break in names for boys (girls) at 3.2 (2.6) people/mi2 in 1850. We find broadly similar patterns when including children with foreign-born parents, restricting to children with native-born grandparents as seen in Appendix Figure A.9, and using other measures of uniqueness (explored further in the long-run analysis in Section 6.3). The unique names of children on the frontier are consistent with the selective migration of individualistic people to the frontier and with the adoption of individualistic traits in these locations.

5

Frontier Roots of “Rugged Individualism”

In this section, we discuss the mechanisms through which the conditions of life in frontier settlements may have created a culture characterized by an emphasis on self-reliance and opposition to redistribution and to government intervention more broadly. The creation and subsequent persistence of frontier 14

culture depended crucially on the initial generation of individualistic settlers, who shaped the local culture of their communities. We saw above that individualism proxied by unique names was more prevalent in the frontier. This may have resulted from the self-selection of individualists to the frontier where environmental factors then further promoted individualism, as discussed in Section 2.2. In Section 5.1, we provide evidence that selection can explain only part of the cultural differentiation in the frontier. Section 5.2 then document empirical patterns consistent with environmental factors playing a role as well. Appendix Section A.1 provides further, descriptive evidence on why frontier conditions implied favorable prospects of upward mobility and a large perceived importance of effort in income generation. Combined with the adaptive value of individualism, these conditions could have fostered cultural opposition to redistribution and government intervention more broadly.21 The empirical results presented below, which we view as suggestive rather than conclusive, provide intuition and motivation for the long-run relationship between frontier experience and culture that we identify in Section 6. The roots of this long-run impact can be traced to the way in which individualism came to thrive in the frontier.

5.1

Selective Migration and Environmental Factors

An important question about our overarching interpretation concerns the role of self-selection in shaping observed cultural patterns on the frontier. It is difficult if not impossible to precisely disentangle the effect of people on a place versus the effect of a place on people. Yet, we can say something about whether self-selection of individualist types into the frontier can explain all of the observed differences in culture between frontier and non-frontier locations. We do so here by estimating a set of regressions that clarify how much of the contemporaneous frontier differential in unique naming patterns (seen, for example, in Figure 5) can be explained by selective migration. The results are consistent with selection being important but unable to fully account for the pervasiveness of unique names. Table 1 reports estimates of the frontier differential in unique naming patterns based on versions of the following equation: P(child has uncommon nameic,1880 ) = α + β f rontierc,1880 + x0ic ζ + εic,1880 ,

(2)

where the binary dependent variable equals one if child i residing in county c in 1880 has a name that falls within the top 10 nationally within that decade (similar results obtain with other measures of uniqueness), and the frontier indicator equals one if that county falls within the frontier according to our base21

Another potentially relevant mechanism we consider is the parasite-stress theory of collectivism detailed in footnote 3. We view this mechanism as important but less directly relevant to our findings for at least three reasons. First, using a 10 percent sample from the 1880 Population Census, which is the first to record information on sickness and disease, we can show that the prevalence of pathogens—associated with, among others, tuberculosis, malaria, and typhoid—does not exhibit the same sort of nolinearities or structural breaks in population density as seen for demographics and individualism in the prior section. Second, our long-run empirical strategy detailed in Section 6 isolates variation in frontier experience that is arguably more localized than the main variation in pathogen prevalence at the core of parasite-stress explanations for individualism. Indeed, controlling for state fixed effects and/or a parsimonious set of geographic controls as we do below effectively removes all the variation in disease prevalence between frontier and non-frontier locations. Finally, for this mechanism to bind over a relatively short time period such as the century we consider, it should be the case that individuals have sufficient knowledge of the person-to-person nature of disease transmission to warrant behavioral responses favoring members of one’s own group, giving rise to collectivist culture.

15

line definition. We restrict attention to white children age 0 to 10 with native-born parents and cluster standard errors at the county level. The x vector includes age×gender and birth order fixed effects as well as indicators for whether the parents have unique names. However, these controls do not significantly affect estimates of β. The inclusion of county fixed effects and various sample restrictions will help clarify how much of the β coefficient is plausibly due to selection. To estimate selection patterns, we need to track households across time. For this purpose, we use full count data from the 1870 and 1880 Censuses available from ancestry.com (including location, names, and demographics), and link households across rounds using an algorithm from Feigenbaum (2016).22 The overall differential in column 1 of Table 1 suggests that children in frontier locations in 1880 are around 10 percent more likely to have an uncommon name. Note that this includes children whose parents have lived in the frontier for at least 10 years as well as those who migrated at some time during the 1870s. Is this effect completely accounted for by the differential individualism between migrants from non-frontier to frontier counties and stayers in non-frontier counties? Columns 2–5 provide an initial answer by restricting the overall sample to children who parents resided in a non-frontier county in 1870. The coefficient β then identifies the differential naming behavior of parents who emigrate from eastern, non-frontier counties to frontier counties relative to those who stay in the non-frontier origins. The β coefficient falls from 0.075 to 0.052 in column 2 and then further to 0.023 in column 3 when including origin county fixed effects, which provides a sharper and more credible comparison to stayers in one’s county of origin rather than all non-frontier origins as in column 2. Columns 4 and 5 go a step further, restricting the sample to include only those children whose parents migrated to the frontier or stayed in the same non-frontier origin, effectively discarding parents who migrated across non-frontier counties or whose counties transitioned into or out frontier status during the 1870s. Across columns, we find that β is statistically significantly lower than in column 1. Together, these results suggest that selection cannot fully explain the frontier differential in our leading proxy of individualism. It is of course still possible that selective migration before 1870, which cannot be observed in this data, help explain some of the differential in column 1. However, the results in this table help rule out a first-order, within-decade effect of selection, which is important given the high rates of migration to the frontier.23 In sum, recent migrants to the frontier exhibit greater propensity to give their children uncommon names compared to those who never left their non-frontier county of origin. However, this differential is smaller than the overall differential between frontier and non-frontier residents, which suggests some scope for other environmental factors to explain why individualism appears to be more pervasive on the frontier. One explanation that we explore next is that individualism may be differentially rewarded on the frontier. 22

The base sample in 1880 is restricted to male household heads, native-born, aged 30–50, white, and who have at least one child (biological) aged 0–10, white, and native born. The target year is 1870. The set of potential matches for these men are first identified based on name, birth state and birth year. A random training sample is then drawn from among the potential matches and manually trained. The importance of each match feature is quantified using a probit model, and used to estimate a probability score for each link. A true match is defined as one with a sufficiently high score both in absolute and relative terms. The match rate was 26.4%. Further details can be found in Appendix B. 23 Another useful comparison can be made by restricting the sample to stayers. We can show that among children whose parents did not move counties between 1870 and 1880, those born on the frontier are around 25 percent more likely to have an uncommon name. This effect suggests that the place differential, net of recent selective migration, is even larger.

16

5.2

The Returns to Individualism

The opportunities and threats faced by frontier settlers may have favored individualism through an evolutionary mechanism. In the frontier context, people had to rely on themselves for protection and prevention, and to advance their living conditions, so independence and self-reliance made people better suited to cope with the environment. In other words, individualistic traits had an adaptive value. This section presents empirical patterns consistent with higher socioeconomic returns to individualism in frontier locations. In particular, we show that households exhibiting greater individualism tended to be more successful economically and to remain longer in frontier areas. We use data from the abovementioned linked sample, which contains information on occupation and education in the 1880 round. Our goal then is to understand whether unique names are associated with differential economic well-being inside versus outside the frontier. We do this in two steps. First, we investigate the relationship between father i’s economic well-being in county c in 1880, yic,1880 , and unique names according to the following difference-in-difference type specification: yic,1880 = α + β own unique namei + η(own unique namei × f rontierc,1880 )

(3)

+ δ children unique namei + ζ(children unique namei × f rontierc,1880 ) + FEc + εic , where β captures the hedonic returns to the father’s own unique name outside the frontier and η the differential effect inside the frontier. At the same time, δ captures the association of unique name choices for children born during the 1870s and the father’s economic well-being outside the frontier and ζ the differential effect inside the frontier. We restrict attention to white, native-born fathers that did not change counties between 1870 and 1880 and that have at least one child. As above, we define uniqueness as the father having a name that falls outside of the top 10 nationally, but other definitions deliver very similar results. We measure status using either the Duncan (1961) socioeconomic index (sei) or the occupational score measure provided by the NAPP (occscore). Both measures range from 0 to 100 and capture the income returns associated with specific occupations in the 1950 Census while the sei measure additionally captures notions of prestige as well as educational attainment. These measures are widely used in the economic history literature (see Olivetti and Paserman, 2015, for a discussion) and for our purposes mostly serve to capture broad differences in well-being across individuals. Finally, we cluster standard errors at the county level, and the county fixed effects, FEc , account for all differences in outcomes that are common across individuals within the same county. The estimates in Panel A of Table 2 are consistent with significant differential returns to individualism inside the frontier. Across all specifications, fathers with unique names outside the frontier exhibit socioeconomic status that is nearly 0.05 standard deviations lower than fathers with more common names. This is roughly the typical difference between a farmer and a blacksmith or a blacksmith and a carpenter. While we remain open to multiple interpretations, this apparent economic penalty could be due to various types of discrimination—albeit not based on race or citizenship—as seen in later periods within the United States. However, this penalty is more than offset inside the frontier where unique names exhibit a differentially positive association with well-being. We observe a similar differential for unique names of children, which exhibit a positive correlation with father’s well-being outside the frontier and an even stronger positive correlation inside the frontier. To the extent that unique names truly capture 17

individualism, these results, which hold for both sei and occscore, suggest that individualist types are relatively better off inside the frontier. In Appendix Table A.1, we go a step further and show that this differential effect of individualism inside the frontier arises not only in levels but also in growth of socioeconomic well-being. The NAPP linked sample for 1870–1880 allows us to investigate changes in occupational standing for 1 percent of the entire population (i.e., a small subset of all individuals in the prior analyses). The results suggest that fathers with uncommon names exhibit significantly faster growth in sei and occscore—25–75 log points depending on the specification—in the frontier but not outside the frontier. While we are not making any causal claims, one concern with our interpretation is that these results are based on the selected sample of households that remained in the same county between 1870 and 1880. However, as we show in Appendix Table A.2, individualist types that migrate to frontier counties are no more likely to exhibit higher economic well-being than those that migrate to non-frontier counties during this period. Among this population of movers, those with unique names for themselves and their children exhibit higher status, but do so equally when moving to frontier and non-frontier locations. Together, these findings suggest that there may be something particular to the frontier that rewards the types of behavior more commonly observed in people exhibiting a high degree of individualism. There are of course other explanations, but the patterns line up with the historical narrative fairly well. Moreover, in Panel B of Table 2, we provide a second piece of evidence consistent with an individualist advantage inside the frontier. We estimate the following equation linking unique names to migration choices for household h living in frontier county c in 1870: P(outmigratehc ) = α + βf father has unique nameh + βm mother has unique nameh

(4)

+ η any children with unique nameh + FEc + εhc , where outmigratehc is a binary outcome indicating whether the household moved from a frontier county in 1870 to a non-frontier county to the east by 1880. The key explanatory variables are as defined above with the mother’s unique name status defined similarly. The results suggest that, within a given frontier county, households in which fathers have unique names are around 4 percentage points less likely to leave the frontier for a more settled county to the east by 1880. This is a large effect given that 40 percent of linked households in our sample left the frontier for eastern, settled counties over this period.24 We observe little relationship to mother’s names but find that households with uniquely named children are also significantly less likely to leave the frontier. Of course, this latter result is less straightforward to interpret as we do not observe the timing of migration within the decade. Nevertheless, together, these findings suggest that both inherited and chosen individualism are associated with a higher likelihood of survival inside the frontier. By revealed preference, such migration choices are consistent with the economic differential for individualist types inside the frontier. Overall, these findings point to a short-run evolutionary mechanism whereby individualist types were more likely to thrive in the frontier, thereby giving rise to incentives for maintaining this key cultural trait across generations. Indeed, we show in Appendix Table A.4 that the within-family, in24

In Appendix Table A.3, we show that this migration accounts for the lion share of departures from frontier counties during this period. The other direction of migration—from frontier counties to other frontier counties—is not associated with unique names.

18

tergenerational persistence in unique names is significantly higher inside the frontier.25 Recall that this persistence If this mechanism survived multiple generations, then it may help explain why one still finds such a distinct imprint of frontier culture in the political preferences and policies adopted by individuals in these areas more than a century later.

6

Long-Run Effects of Frontier Experience on Culture

We now turn to an investigation of how the early settlement process shaped culture in those areas that remained along the frontier for longer than others. There are several ways in which the individualistic early settlers identified above could have exerted lasting effects on the culture of these newly settled areas of the country over the next century. Our goal in the rest of the paper is to identify some of these effects as they relate to modern political economy debates. We start by presenting our empirical framework to examine the effects of historical frontier exposure on contemporary outcomes, discuss key data sources, and then move on to our main cross-county results. Finally, we discuss and present evidence on potential channels for long-run persistence.

6.1

Empirical Strategy

We aim to relate historical frontier exposure defined in Section 3.2 to modern measures of individualism and preferences for individuals in county c. In particular, our main county-level, cross-sectional estimating equation is given by: yc = α + β total frontier experiencec + x0c γ + FE + εc ,

(5)

where yc is some long-run outcome capturing cultural traits (for example, preferences for redistribution). Total frontier experience (TFE) is the amount of time (in decades) a given county remained on the frontier according to our baseline definition, restricting to those counties inside the extent of the historical frontier as seen in Figure 3. Robustness to alternative measures is considered in Section 6.5. The vector xc contains a set of county and where possible individual characteristics, and FE are state or Census division fixed effects. In addition to these fixed effects, xc often includes latitude, longitude, and distance to rivers, lakes, and the coast.26 The coefficient β therefore identifies a localized differential effect of TFE. Following the approach suggested by Bester et al. (2011), standard errors in all specifications are clustered on 60-square-mile grid squares that completely cover counties in our sample.27 When estimating effects for several correlated outcomes, we also report mean effects based on the widely used Kling et al. (2007) approach.28 25

Recall from footnote 19 that this persistence is not driven by patronymic or matronymic naming behavior, which is actually less common in the frontier. 26 Further controls are discussed below, but it is important to note that all results are robust to the exclusion of controls. In fact, these controls often work against finding large effects of total frontier experience especially when using the instrumental variables approach detailed below, but we retain them as a baseline in order to be conservative. 27 Inference remains unchanged when using the computationally more intensive Conley (1999) spatial HAC estimator with a bandwidth of 300 km. We retain the arbitrary grid-cell approach as it is considerably easier to implement and less subject to instabilities, which becomes important as we move to an instrumental variables specification. 28 This approach offers one way to address the multiple inference problem. It effectively takes a weighted average of the estimates of β for each of K related outcome variables (placed on the same scale) with the weights equal to inverse sample

19

The main threat to causal identification of β lies in omitted variables. We need to rule out a host of factors that may be correlated with TFE as well as contemporary culture. For example, areas with higher TFE may have had unfavorable endowments (e.g., mountainous terrain, low levels of agricultural productivity), high levels of conflict with Native Americans, or less exposure to public infrastructure investments such as the railroad, all of which may have independently affected cultural traits. Additionally, regulations facilitating access to land may have reduced time in the frontier and caused favorable views of government intervention. We address these concerns in three ways. First, we progressively add a richer set of controls aimed at removing as much of the confounded variation as possible. Second, we rule out additional concerns about omitted variables by using the Oster (forthcoming) approach to show that the degree of selection-on-unobservables is relatively limited. Finally, we pursue an instrumental variables (IV) strategy as a further robustness check. We propose an IV approach that exploits the timing of immigration inflows to isolate plausibly exogenous variation in total frontier experience. The basic idea is that, while the process of westward movement may be thought of as a wave with time-varying speed, the time that a given location spends in the frontier depends on how fast the wave of westward expansion is going when passing through that location. From the perspective a given county in the American heartland, the instrument attains validity via the coincidental timing of immigration flows to the United States as a whole and westward expansions of the frontier in counties far to the east. We construct the instrument as follows. First, we identify the relevant time frame for each county. The beginning of that period is the year in which the location is just west of the frontier, i.e., 110 km west of the frontier line. At this time, the county is still unsettled, and its local conditions do not affect the contemporaneous process of westward movement. We then consider the average annual immigrant inflow (in logs) as an exogenous predictor for the speed of the frontier wave in the next 60 years (recall that the max of frontier experience in the sample is around 60 years). Our approach can be thought of as a Bartik (1991)-inspired strategy, since we interact variation in local initial conditions—the year at which the relevant time frame starts for each county—with a national shock exogenous to local conditions. Given the central role of immigration in shaping the westward evolution of the frontier, it is perhaps unsurprising that the instrument delivers a strong first stage. One way to visualize the strength of this plausibly exogenous frontier wave is to compare the longitude of the population center of the U.S. (see Section 3 and the green dot in Figure A.1(b) for 1860) with the average annual immigrant inflow in the preceding 60 years. The sources of this strong linear relationship across decades can be seen in Appendix Figures A.7 and A.8. In the baseline results that follow, we focus on OLS estimates of equation (5) and leave the IV results and other robustness checks to Section 6.5.

6.2

Data on Contemporary Culture and Political Economy

We measure contemporary individualism and preferences for government policy using data from multiple rounds of three widely used, nationally representative surveys—the Cooperative Congressional Election Study (CCES), the General Social Survey (GSS), and the American National Election Survey (ANES). standard deviation of that variable for a suitable control group. In our case, the choice of that control group affects the mean effect size albeit not significance. We consider a few alternative approaches, including the few counties with zero frontier experience, those with less than a decade, and those with below the median. Results are very similar across all three.

20

These surveys are staples in the social science literature on political preferences and social norms.29 The CCES is a phone-based survey conducted every two years around elections in November since 2006, the ANES is an in-person survey conducted annually since 1948, and the GSS is an in-person survey conducted annually since 1972. One advantage of working with all three datasets is that we can corroborate findings across surveys that ask questions about the same underlying preferences in different ways. For example, the CCES asks respondents if and how respondents would like state-level welfare spending to change whereas the ANES asks respondents if and how spending on the poor should change. The CCES also includes a set of questions on policy issues such as gun ownership that are particularly relevant to some of the mechanisms driving the persistence of frontier culture. Despite their rich level of detail, these surveys have one important limitation for our purposes, namely the limited geographic scope. While the CCES has broad spatial coverage, the GSS and ANES do not (as seen in Appendix Figures B.3–B.2). This proves especially challenging given our focus on spatial variation across a limited swathe of the United States outside of major coastal population centers. Nevertheless, for key outcomes across the three surveys, we are able to identify relatively precise effects. These sample coverage limitations do not extend to the measure of unique names from the postfrontier Population Censuses noted above or to measures of actual policy outcomes in terms of Republican vote shares and reported property tax rates. The former come from Leip’s Atlas of Presidential Elections, and we focus on recent elections in the 2000s for simplicity. The latter are estimated from the American Community Survey in 2010 by the National Association of Home Builders. For all of these measures, we link county-level identifiers in the underlying data to the 2010 county boundaries as seen in Figure 3. Together, these multiple rounds of survey data and local voting records allow us to paint a rich picture of persistent culture of individualism and small-government norms in areas exposed to the frontier for a longer time during the 19th century.

6.3

Total Frontier Experience and Persistent Individualism

We begin by documenting a long-run link between the duration of total frontier exposure (TFE) and contemporary individualism. First, nearly half a century after the closing of the frontier, unique name choices for children are more pervasive in counties exposed to the frontier for a longer historical period. In Table 3, we report the effect of TFE (β from equation 5) on the share of white boys and girls age 0–10 given unique names in the 1930s. This data from the full count 1940 Population Census captures naming choices at least one generation and as many as five generations after counties exited the frontier. Our baseline measure of uniqueness is the top 10 within the county’s Census division, and we report a battery of robustness checks based on different measures of uniqueness and sample restrictions on familial genealogy.30 In the average county, 73 percent of boys and 80 percent of girls have unique names with standard deviations of 0.07 and 0.04, respectively. In this and subsequent tables, we normalize the 29

See Acharya et al. (2016) for a recent example of using CCES and ANES in a related methodological setting and Alesina and Giuliano (2011) for a rigorous investigation of the determinants of preferences for redistribution in the GSS. Further details on all of the data sources discussed in the section can be found in Appendix B. 30 One important dimension of robustness that we corroborate below is that the results are not sensitive to the cutoff for defining uniqueness (10, 25, 100, . . . ). This is likely due to the fact that naming frequencies in the United States follow a power law (see, e.g., Hahn and Bentley, 2003; Gureckis and Goldstone, 2009), and hence the mean above different cutoffs is identical up to a scaling constant in that cutoff.

21

dependent variables so that standard deviation effect sizes can be read directly from the coefficients. Taking the most demanding OLS specification in column 4 of Table 3 suggests that each additional decade of TFE is associated with 0.13 and 0.20 standard deviations higher share of unique names for boys and girls, respectively. These results are quantitatively meaningful. For example, moving from counties with the minimum to the maximum TFE implies nearly 10 percent more children with unique names. The main specification in column 4 is the result of progressively building on the specification in column 1 with no controls. Columns 2 and 3 add Census division and state fixed effects, respectively, to rule out broad regional differences in TFE and culture. Finally, column 4 captures several additional county-level confounders including latitude, longitude, county area, average rainfall and temperature, elevation, distance to rivers and lakes, distance to the coast, and predetermined potential agricultural yield.31 Several of these factors may have given rise to particular cultural norms and also determined the speed of frontier transitions historically. Comparing across columns 1–4, the coefficient remains relatively stable despite large changes in the R2 . This pattern is consistent with relatively limited selectionon-unobservables according to the coefficient stability test parameters δ reported at the bottom of the table based on Oster (forthcoming), who suggests that values of |δ| > 1.3 imply a limited role for unobservables in explaining our results.32 In Section 6.5, we show that these findings hold up to a further set of (non-predetermined) controls for population density, diversity, and year of connection to the railroad. Furthermore, these results are not sensitive to the particular measure of uniqueness or the national background of the parents assigning names. We document this robustness for both OLS and IV specifications in Appendix Tables A.5 and A.6. In both tables, column 1 replicates the full control result from Table 3. In columns 2-4, respectively, we restrict to children with native-born fathers, native-born parents, native-born grandparents, which, together, help address concerns about immigrants having unique name preferences and being more likely to settle in frontier areas historically. Column 5 defines uniqueness based on the top 10 names nationally while columns 6 and 7 do so at the state and county level, respectively. Columns 8 and 9 expands uniqueness to those outside the top 25 and top 100 names, respectively. Finally, column 10 restricts to non-biblical names to account for the fact that religiosity may be confounded with TFE and naming choices. All of these results reveal a similar persistent effect of TFE on this leading proxy for individualism. Together with the findings in Section 4, these results suggest that unique name choices were not only more common in frontier areas historically but also persist over multiple generations in those areas exposed to the frontier for longer. Indeed, the effect of TFE on unique name choices can be seen in the early 1900s with little change thereafter. Appendix Table A.7 shows that the effect sizes are relatively stable across the Census rounds from 1910 to 1940. This is consistent with the persistence of the early frontier culture of individualism.33 31 32

See Appendix B for details on each of these measures. This procedure builds on the original test in Altonji et al. (2005) by accounting for the possibility that the additional covariates beyond the do   baseline   not add much explanatory power to the regressions. Specifically, we compute measures of δ = βc (βu −βc )

×

2 2 Rc −Ru 2 η×Rc

where βu is the coefficient in the baseline regression with no controls and βc is the coefficient with

additional covariates, Ru2 and Rc2 are the corresponding R2 measures, and η is a scaling parameter that we set equal to 0.3 following common practice, but insights are similar for larger values. 33 Ideally, we could carry these results through to the contemporary period, but, unfortunately, the 1940 Census is the latest round that provides information on names. Although the Social Security Administration releases baby name counts by state,

22

Beyond unique names, we draw upon a well-suited measure from the ANES data to provide further evidence of the link between frontier history and high levels of individualism. Specifically, we draw upon the 1990 ANES round in which respondents were asked whether (1) “it is more important to be a cooperative person who works well with others”, or (2) “it is more important to be a self-reliant person able to take care of oneself.”34 Appendix Table A.8 provides evidence that self-reliant preferences are stronger today in counties with longer exposure to the frontier historically. Around 55 percent of individuals respond in support of the cooperative answer. However, across different specifications, each decade of additional historical frontier experience is associated with around 2–6 percentage points (p.p.) lower support of cooperation over self-reliance. While the results with the full set of controls are noisy, we nevertheless view these findings as at least suggestive of longstanding claims about the rugged individualism born on the frontier. We turn now to investigate the closely related link between frontier exposure and opposition to government intervention.

6.4

Total Frontier Experience and Opposition to Redistribution and Regulation

This section uses a wide array of survey data to identify a long-run effect of total frontier experience (TFE) on contemporary preferences for small government. First, longer exposure to the frontier is associated with opposition to redistribution, preferences for limited government intervention, and low levels of local taxation. Second, Republican Party support is significantly higher in counties with higher TFE. Finally, TFE is associated with strong opposition to government regulations surrounding issues that were salient in frontier culture historically. We view all of these outcomes as closely connected measures capturing the same underlying opposition to government. In all cases, we report estimates of equation (5) including individual demographic controls (age, age squared, gender, and race dummies), survey wave fixed effects, and the full set of controls in column 4 of Table 3. We continue reporting supportive Oster (forthcoming) tests for selection-on-unobservables while instrumental variables results and other robustness checks are discussed in Section 6.5. Redistribution and Limited Government.

Our first set of results in Table 4 shows that longer frontier

exposure historically is associated with stronger opposition to income redistribution today. In column 1, we draw upon ANES data from 1992 and 1996, which asks respondents whether they would like to see “federal spending on poor people be increased, decreased (or cut entirely) or kept about the same.” Around nine percent of individuals would like to see such redistributive spending decreased. Each additional decade of frontier exposure is associated with an additional p.p. increase in support of cuts. Columns 2 provides further evidence linking TFE to lower preferences for redistribution using the CCES. We find that each decade of TFE is associated with one p.p. higher support for cutting state spending on welfare. Although seemingly small relative to the mean of 40 percent, this effect size is roughly equivalent to the gap in opposition to such spending between individuals five years apart (with the older respondents usually more in favor of welfare spending cuts, reflecting well-known cohort differences in fiscal conservatism). We find similar effect sizes for a more extreme desire to “greatly” cut state spending on welfare. 34

it does not do so at the county level as required for our empirical strategy. The question was designed explicitly for studies of American individualism (see Markus, 2001). Unfortunately, though, this question was only asked in a single round.

23

Following Alesina and La Ferrara (2005), column 3 uses a measure from the GSS indicating the intensity of preferences for redistribution on a scale from 1 to 7 with 1 indicating that the government should not be engaged in redistribution and 7 indicating the government should reduce income differences through redistribution. We normalize this measure but find qualitatively similar results when transforming it to a binary outcome equal to one for values ≥ 5. Each additional decade of TFE is associated with around 0.03 standard deviations lower support for redistribution. This effect size is roughly equivalent to the difference between individuals that differ in age by 10 years (with older individuals again reporting less support for redistribution). Turning to broader measures of opposition to big government, we show in columns 4 and 5 that current residents of areas with higher TFE are more prone to fiscal conservatism. Column 4 uses a CCES question on whether individuals would like to cut domestic spending or raise taxes to balance the federal budget. Column 5 uses an index based on the principal components of a set of questions from the GSS on whether the government spends too much on an array of different public goods and social transfers. In both cases, we find that individuals are significantly more opposed to high levels of government spending in areas with greater exposure to the frontier historically. The Kling et al. (2007) mean effects analysis yields similar insights as the individual outcomes, e.g., combining the two CCES measures into a single index yields a statistically significant effect of around 0.02. Finally, we show in column 6 of Table 4 that these reported preferences are not a mere artifact of reporting bias but line up with actual policy differences across counties. In particular, we find that each decade of TFE is associated with around 3.5 percent lower reported property tax rates, which range from 0.1 to 2.9 percent across counties in our study.35 Given that much of the variation in tax rates lies across rather than within states, this is not a small effect. In fact, it is roughly akin to the within-state difference between counties that are 10 percent more versus less aligned with the Republican party, another policy outcome we revisit below. Together, the findings in Table 4 suggest that historical frontier exposure is associated with significantly lower support for big government in the form of redistribution, spending, and taxation. These patterns line up with the conceptualization of persistent norms along the frontier as described in Section 2. To shed further light on the legacy of frontier culture, we next examine voting outcomes and preferences over a set of policies closely linked to frontier history. Party Identification.

Having documented systematic associations between total frontier experience,

individualism and opposition to big government, we now turn to identify the relationship with conservative political party support. In particular, we show that these persistent effects of the frontier have strong implications for the growing strength of the Republican party, the contemporary bearer of conservative political platforms. We focus on Presidential elections since 2000. During this period, Republican party platforms have been clearly associated with a broad opposition to government intervention and aversion to tax redistribution. In fact, previous work by Rodriguez (1999) and Kerr (2014) has considered the Republican vote share as a proxy for preferences for redistribution. Table 5 shows that individuals in areas with higher TFE are more likely to exhibit stronger preferences for the Republican party. Averaging across the five elections through 2016, column 1 shows that each 35

More than half of the variation in tax rates lies across rather than within states, and hence when including division instead of state fixed effects, we find an even larger effect size around 5.5 percent (see column 1 of Table A.20).

24

decade of TFE is associated with around a 2 p.p. greater Republican vote share relative to the mean of 60 percent. We average for simplicity, but results are similar when pooling and including year fixed effects. This effect size is plausible and in line with individual-level regressions using degree of stated support for the Republican party in CCES.36 For perspective, though, the 2 p.p. effect is roughly the difference in population-weighted, average county-level vote shares in Iowa (48.4 percent) and Wisconsin (46.3 percent) over these five elections. That the results survive the demanding set of controls helps rule out concerns that our frontier measure is simply capturing regional variation within states, and in Section 6.5, we discuss further robustness to measures of county-level population density. This average effect across the 2000s masks an interesting ratcheting up over time as seen in columns 2–6. In particular, the effect of an additional decade of TFE is associated with significantly higher Republican vote share with each additional election based on cross-equation tests of the relative effect sizes (i.e., coefficients divided by means). Moreover, each of the prior years’ effects is significantly smaller than the effect in 2016. In other words, the 2016 election exhibited a uniquely large frontier effect but was still part of a growing trend in that differential. Putting these results together, the estimates in columns 7 and 8 provide marked evidence of the relatively larger shift towards the Republican party in areas with longer total frontier experience (TFE). The average heartland county in our long-run analysis exhibits a nine percent shift towards the Republican candidates between 2000 and 2016. Each additional decade of TFE is associated with an additional 1.6 p.p. relative to that mean. Alternatively, comparing a county at the 25th percentile of TFE (11 years) to a county at the 75th percentile of TFE (26 years), implies an additional 2.4 p.p. Republican party swing. As a benchmark, Autor et al. (2016) find that a similar interquartile shift in exposure to import competition from China implies a 1.7 p.p. Republican shift relative to a mean of -0.6 p.p. in their full-country sample of commuting zones. A similarly large shift can be seen in column 8, which shows the frontier effect on the Mitt Romney–Donald Trump differential between 2012 and 2016. Overall, these findings offer suggestive evidence of a potential link between frontier culture and the growing strength of the Republican Party in certain swathes of the American heartland.37 In sum, the results in Table 5 suggest relatively more conservative political preferences in areas of the United States today that were part of the frontier for a longer period of time in the 19th century. In practice, individuals identify with and vote Republican for a number of reasons that are arguably too numerous to unpack in this paper alone. However, one consistent theme across Republican party platforms is the view that government should not be too heavily relied upon and hence government should be small. While this notion is somewhat vague and arguably selectively applied in political discourse, it does bear an interesting relationship to the individualistic norms described in historical accounts of the 36

Using the CCES 2007, 2012, and 2014 survey rounds, we construct an indicator equal to one if the respondent identifies as a “strong Republican” on a seven point scale ranging from “strong Democrat” to “strong Republican” with around 17 percent of individual–years reporting the latter. This individual-level measure of Republican party support is useful as it allows for us to control for potential confounders of political preferences such as age and race that may also be correlated with TFE. The estimates imply that an additional decade of TFE is associated with around 4.5 percent greater intensity of strong Republican support. As a benchmark, consider that with each additional year of age, individuals are around 2 percent more likely to report strong Republican support. 37 We refrain from taking a strong stand on the role of the frontier legacy in explaining the popularity of Donald Trump. Nevertheless, we would be remiss if we did not mention that there seems to be a segment of the (libertarian) right that has indeed made such a connection, however speculative. Some go so far as to directly liken his stances and peculiarities to the behaviors described in Turner’s original thesis (see, e.g., Mendenhall, 2017).

25

frontier. It is in this respect that we view these voting outcomes as reflecting actions taken by individuals with preferences shaped by frontier culture as seen in earlier results.38 As a coarse, summary takeaway, the Kling et al. (2007) mean effect on individualism (unique name share) and conservative political preferences (Republican vote share) and policy (property tax rate) suggests that each decade of historical frontier exposure leads to around 0.15 standard deviation more frontier culture today. Contentious Policy Issues.

We provide here additional insight into the nature of modern political

preferences along the historical frontier. In Table 6, we use CCES data to relate total frontier experience to measures of (1) opposition to the Affordable Care Act (ACA or Obamacare), (2) support of an increase in the minimum wage, (3) support of a ban assault rifles, and (4) opposition to Environmental Protection Agency (EPA) regulations on pollution. For each of these outcomes, there is considerable variation across individuals in terms of support as gleaned from a simple comparison of means across outcomes. These policy issues have been sharply contentious, with the main political parties adopting polarized positions. Moreover, they can be connected with a set of norms and beliefs pervasive on the frontier in terms of the link between effort and reward (ACA and minimum wage), the salience of manifest destiny (EPA regulations), and the right to bear arms of one’s choosing (ban on assault rifles). The results in columns 1 and 2 suggest that individuals in frontier-exposed areas are significantly less likely to support the ACA and minimum wage hikes, respectively. The estimates are stable relative to possible controls (based on the Oster test) and again point to the fairly strong opposition to government intervention. For comparison, the effect sizes are on par with the roughly ten year age differences in support for these contemporary, bellwether conservative causes. Meanwhile, in columns 3 and 4, we find further evidence that residents of frontier areas are more opposed to government regulation. Column 3 shows that each additional decade of exposure to the frontier is associated with around 3 percent less support for a ban on assault rifles. Initiated by President Clinton in 1994, the Federal Assault Weapons Ban was subsequently passed by several Democratic-leaning states such as California and Massachusetts. It remains a hot-button issue for gun rights advocates and may be especially salient in frontier areas with a history of violent confrontation with Native Americans and frontier justice more generally. Indeed, using the same CCES data, we can show that residents of counties with an additional decade of TFE are 2 p.p. more likely to own a gun today relative to a mean of 43 percent. Finally, in column 4, we find that frontier-exposed areas exhibit less support for environmental regulation. In particular, we consider an indicator equal to one if the respondent is opposed to EPA regulation of CO2 emissions. The effect sizes are on par with if not slightly larger than those in preceding columns. Presuming older individuals have higher discount rates, a useful benchmarking exercise suggests that this effect is akin to the reduction in support for pollution regulation with each additional five years of age. Combining all of these estimates into a single index implies a mean effect of around 0.04 that is significant at the 1 percent level. These results are suggestive about the reasons why frontier experience is associated with stronger support for the Republican party. But the effects of frontier experience go beyond party lines. While the 38

Consistent with this view, individuals that identify as Republican in the ANES data from 1990 are around 15–20 percent more likely to believe that it is better to be a self-reliant than a cooperative person, based on this proxy for individualism discussed in Section 6.3.

26

main parties are defined in part on these contentious policy issues, we argue in the following section that the findings here are not driven purely by tribal party affiliation but rather capture something deeper. Overall, the findings in Tables 4–6 paint a rich picture of the cultural legacy of historical frontier exposure on political preferences. While we are unable to observe these modern outcomes throughout the 20th century, it is plausible that the early settlers left a lasting imprint on frontier locations and that the degree of that imprint increased with duration of exposure.

6.5

Instrumental Variables and Other Robustness Checks

This section presents several results aimed at strengthening the credibility of the main OLS findings in Tables 3–6 on the long-run legacy of frontier culture. First, we demonstrate the robustness of our findings to alternative measures of total frontier experience. Second, we show that most of our findings hold using the instrumental variables (IV) specification discussed in Section 6.1. Third, we show robustness to a host of controls for important confounders of frontier experience and modern preferences. Finally, we show that the effects of frontier experience on views about government cut across party and class lines. Alternative Measures of Total Frontier Experience (TFE).

Our baseline measure of TFE aimed to

closely follow definitions in the historical literature as discussed in Sections 3.1 and 3.2. In Tables A.9– A.13, we demonstrate the robustness of measure of TFE to three relevant margins of adjustment to what it means for county c to be inside the frontier at time t. First, we reduce the catchment area from 100 km to 75 and then to 50 km in proximity to the frontier line. Second, we adjust the density restriction to include counties with > 2 people/mi2 but still less than 6 and then remove the population density restriction altogether. Finally, we consider defining the frontier line as including only the main, westernmost extent of all counter lines identified by the GIS algorithm. While all outcomes do not survive all TFE definitions, the overwhelming message from Tables A.9–A.13 is that our particular choice of the frontier definition based on the historical record is not driving the findings on unique names or opposition to state intervention, redistribution and regulation. IV Estimates.

Table 7 presents instrumental IV estimates for the main outcomes. We report results

based on the full control, baseline specification from column 4, and the instrument is strong across all specifications as seen in Kleibergen and Paap (2006) first stage Wald statistics.39 For a majority of the outcomes, the estimated effects of frontier experience remain statistically and quantitatively significant. While most of the significant IV estimates are statistically indistinguishable from the OLS estimates, a few—e.g., unique names in columns 1–2, property tax rate in column 8, and Republican vote share in column 13—are relatively larger perhaps due to the potential sources of downward bias mentioned above. Yet, some of the survey-based outcomes are not as robust. Several but not all of the estimates for the CCES outcomes in particular lose statistical and in some cases quantitative significance. However, it is worth noting that this lack of significance is driven by (a subset of) the demanding geographic and agroclimatic control variables x, which apparently leave little scope for the instrument to identify residual covariation between TFE and the given CCES outcome. The OLS results do not exhibit similar 39

Corresponding Lagrange-Multiplier tests for underidentification easily pass conventional significance thresholds, and weakinstrument robust confidence intervals deliver qualitatively identical results as conventional inference reported in the table.

27

sensitivity to the x controls.40 Overall, these IV estimates largely corroborate the findings from the simpler OLS specifications. The overarching patterns continue to support a strong link between TFE and modern culture as reflected in political preferences and policy outcomes. Potential Mediators of Long-Run Effects.

Table 8 goes a step further towards addressing plausi-

ble mediators of the long-run effect of frontier experience on modern preferences. One salient concern about our long-run results is that they reflect the fact that places with greater frontier experience are less densely populated today, leading our results merely to capture a well-known urban–rural divide in preferences. We address this concern by including quadratic polynomials in county-level population density both at the time the frontier closed in 1890 as well as in 2010. Another concern is that frontier experience is associated with greater diversity in residents’ birthplaces, both domestic and abroad. This was the case in the contemporaneous historical period as we saw in Section 4 (and Appendix A.2) and could have persisted through the 20th century with direct implications for individualism and preferences over redistribution. We capture both historical and modern diversity with birthplace fractionalization indices in 1870 and 2000, respectively.41 Finally, one may concerned that total frontier experience is higher in counties that were connected to the railroad later in the 19th century. This later exposure to major public investments associated with the railroad may have in turn directly shaped views about the role of government that persisted over the long-run. We address this concern by controlling for the year in which each county was first connected to a rail line according to data from Atack et al. (2010). Although each of these measures are co-determined with frontier exposure historically, these “bad controls” do not significantly change the key takeaways from the baseline OLS or IV estimates as seen in Panels A and B, respectively. Several of the estimated effects of frontier exposure become smaller, which is consistent with the plausible channel of mediation, but most of the implied effects remain sizable and important. This suggests that the legacy of the frontier operates in channels that go beyond these density, diversity, and railroads. In Appendix Tables A.14–A.26, we report simpler OLS specifications with these controls (entered linearly) to illustrate the underlying direction of mediation. Overall, this robustness is reassuring and broadly consistent with the limited selection-on-unobservables revealed by the Oster (forthcoming) tests discussed in the previous sections.42 Frontier Effects Go Beyond Party and Class Lines.

A related exercise helps clarify the findings on

individual preferences for redistribution and size of government. As noted earlier, many of the policies elicit strong partisanship within the U.S. with Republicans and Democrats hewing closely to the party line. However, as seen in Tables A.27 and A.28, higher total frontier experience is associated with stronger opposition to government intervention even after controlling for the strength of Republican party support in the CCES. Moreover, these long-run effects of the frontier survive further controls for individual education and family income. Again, although these covariates are potentially “bad controls”, their inclusion helps rule out the concern that all of the observed effects are driven by prolonged 40

Moreover, all of the IV results for CCES outcomes, excluding the welfare preference outcome in column 6, become significant once we drop controls, with latitude, longitude, distance to rivers, and temperature being key factors. P 41 That is, we take 1 − o (birthplaceoc /populationc )2 , which is simply 1 minus the Herfindahl concentration index for origin o—each U.S. state and country abroad—birthplace diversity in county c. Birthplaces are defined as either a given state within the U.S. or a given country or country grouping abroad. We lose some counties on account of missing coverage in the 1870 Census, but results in Table 7 are unchanged when restricted to the smaller sample in Panel B. 42 The results are also robust to estimating the effects of frontier experience for former slave- and non-slave-states separately.

28

frontier experience simply leading to tribal party- and class-based identity unrelated to the deep roots of frontier culture.

7

Conclusion

The American frontier has long been a prominent theme in the cultural and political history of the United States. This paper examines the contemporaneous and long-run effects of the American frontier on culture at the subnational level. First, we show that U.S. locations in the frontier were characterized by strikingly different demographics and higher prevalence of individualistic traits. We offer new evidence that these traits conferred differential socioeconomic returns on the frontier. Second, we show that locations with longer frontier experiences exhibit higher contemporary levels of inclination toward self-reliance and stronger opposition to government intervention in the form of taxation, redistribution, and other regulations. This provides one of the first pieces of systematic evidence consistent with Turner’s famous thesis about the cultural traits nurtured on the frontier. Our research offers advances in the study of a prominent dimension of American history both in terms of results and methods. We provide a comprehensive account of the history of the frontier by locating its position and tracking its movements from 1790 to 1890. In addition, we introduce a new approach to measuring historical frontier exposure that may be useful in other attempts to understand the legacy of the frontier. Moreover, our findings establish the deep roots of some salient aspects of American culture, shedding light on contemporary political debates. Indirectly, our findings may help to shed new light on the historical origins and persistence of the uniquely high degree of individualism and opposition to redistribution in the United States. Nevertheless, this study is best suited to identifying the effects of frontier experience at the local level, so any extrapolation to higher levels of political and social organization is speculative. Understanding the wider implications of frontier culture remains an important topic for future research.

29

References Abramitzky, Ran, Leah Platt Boustan, and Katherine Eriksson, “Cultural assimilation during the age of mass migration,” Technical Report, National Bureau of Economic Research 2016. Acharya, Avidit, Matthew Blackwell, and Maya Sen, “The political legacy of American slavery,” The Journal of Politics, 2016, 78 (3), 621–641. Alesina, Alberto and Eliana La Ferrara, “Preferences for redistribution in the land of opportunities,” Journal of public Economics, 2005, 89 (5), 897–931. and George-Marios Angeletos, “Fairness and redistribution,” The American Economic Review, 2005, 95 (4), 960–980. and Paola Giuliano, “Preferences for Redistribution,” Handbook of Social Economics, 2011. , Edward L Glaeser, and Bruce Sacerdote, “Why Doesn’t the United States Have a European-Style Welfare State?,” Brookings Papers on Economic Activity, 2001, 2001 (2), 187–277. , Paola Giuliano, and Nathan Nunn, “On the origins of gender roles: Women and the plough,” The Quarterly Journal of Economics, 2013, 128 (2), 469–530. ,

et al., “Family Ties,” Handbook of Economic Growth, 2014, 2, 177–215.

Altonji, Joseph G., Todd E. Elder, and Christopher R. Taber, “Selection on observed and unobserved variables: Assessing the effectiveness of Catholic schools,” Journal of political economy, 2005, 113 (1), 151–184. Ashok, Vivekinan, Ilyana Kuziemko, and Ebonya Washington, “Support for redistribution in an age of rising inequality: New stylized facts and some tentative explanations,” Technical Report, National Bureau of Economic Research Working Paper 21529 2015. Atack, Jeremy, Fred Bateman, Michael Haines, and Robert A. Margo, “Did railroads induce or follow economic growth?,” Social Science History, 2010, 34 (2), 171–197. Autor, David, David Dorn, Gordon Hanson, and Kaveh Majlesi, “A Note on the Effect of Rising Trade Exposure on the 2016 Presidential Election,” Unpublished Manuscript, 2016. Bartik, T. J., Who Benefits from State and Local Economic Development Policies?, W.E. Upjohn Institute for Employment Research, 1991. Benabou, Roland and Efe A Ok, “Social mobility and the demand for redistribution: the POUM hypothesis,” The Quarterly Journal of Economics, 2001, 116 (2), 447–487. Bester, C Alan, Timothy G Conley, and Christian B Hansen, “Inference with dependent data using cluster covariance estimators,” Journal of Econometrics, 2011, 165 (2), 137–151. Billington, Ray Allen, America’s frontier heritage, University of New Mexico Press, 1974. Boatright, Mody C, “The myth of frontier individualism,” The Southwestern Social Science Quarterly, 1941, pp. 14–32. Bowen, William A, The Willamette Valley: migration and settlement on the Oregon Frontier, University of Washington Press, 1978. Boyd, R and PJ Richerson, Culture and the Evolutionary Process. Chicago, University of Chicago Press, 1985. Boyd, Robert and Peter J Richerson, The origin and evolution of cultures, Oxford University Press, 2005. Celinska, Katarzyna, “Individualism and collectivism in America: The case of gun ownership and attitudes toward gun control,” Sociological Perspectives, 2007, 50 (2), 229–247. Chow, Gregory C., “Tests of Equality Between Sets of Coefficients in Two Linear Regressions,” Econometrica, 1960, 28 (3), 591–605. Conley, T. G., “GMM estimation with cross sectional dependence,” Journal of Econometrics, 1999, 92 (1), 30

1–45. Coombs, Jan, “Frontier patterns of marriage, family, and ethnicity: Central Wisconsin in the 1880s,” Journal of Family History, 1993, 18 (3), 265–282. Couttenier, Mathieu, Pauline Grosjean, and Marc Sangnier, “The Wild West is Wild: The Homicide Resource Curse,” Journal of the European Economic Association, 2016. Demos, John, “Families in colonial Bristol, Rhode Island: An exercise in historical demography,” The William and Mary Quarterly: A Magazine of Early American History, 1968, pp. 40–57. Duncan, Otis Dudley, “A socioeconomic index for all occupations,” Class: Critical Concepts, 1961, 1, 388–426. Durkheim, Emile, De la division du travail social: e´tude sur l’organisation des soci´et´es sup´erieures, Alcan, 1893. Easterlin, Richard A, George Alter, and Gretchen A Condran, “Farms and farm families in old and new areas: The northern states in 1860,” Quantitative Studies in History, 1978, pp. 22–84. Eblen, J. E., “An Analysis of Nineteenth-Century Frontier Populations,” Demography, 1965, 2, 399–413. Feigenbaum, James J., “Automated census record linking: A machine learning approach,” Unpublished Manuscript, 2016. Fern´andez, Raquel, “Does culture matter?,” Handbook of Social Economics, 2011. Ferrie, J. P., “Migration to the Frontier in Mid-Nineteenth Century America: A Re-Examination of Turner’s’ Safety-Valve’,” Department of Economics, Northwestern University Manuscript, 1997. ¨ Figlio, David, Paola Giuliano, Umut Ozek, and Paola Sapienza, “Long-term orientation and educational performance,” NBER Working Paper 22541, 2016. Fincher, Corey L. and Randy Thornhill, “Parasite-stress promotes in-group assortative sociality: The cases of strong family ties and heightened religiosity,” Behavioral and Brain Sciences, 2012, 35 (2), 61–79. Forster, Colin and Graham Shardalow Lee Tucker, Economic Opportunity and White American Fertility Ratios, 1800-1860, Yale University Press, 1972. ¨ ¨ Galor, Oded and Omer Ozak, “The agricultural origins of time preference,” The American Economic Review, 2016, 106 (10), 3064–3103. Garc´ıa-Jimeno, Camilo and James A. Robinson, “The Myth of the Frontier,” in Dora L Costa and Naomi R Lamoreaux, eds., Understanding Long-Run Economic Growth: Geography, Institutions, and the Knowledge Economy, University of Chicago Press, 2011, pp. 49–89. Gibbons, Stephen and Henry G. Overman, “Mostly pointless spatial econometrics?,” Journal of Regional Science, 2012, 52 (2), 172–191. Giuliano, Paola and Nathan Nunn, “Understanding cultural persistence and change,” Technical Report, Harvard Universtity, mimeo 2017. Goodrich, Carter and Sol Davison, “The Wage-Earner in the Westward Movement I,” Political Science Quarterly, 1935, 50 (2), 161–185. and , “The wage-earner in the westward movement II,” Political Science Quarterly, 1936, 51 (1), 61–116. Gorodnichenko, Yuriy and Gerard Roland, “Which dimensions of culture matter for long-run growth?,” American Economic Review: Papers & Proceedings, 2011, 101 (3), 492–498. and

, “Culture, institutions and democratization,” NBER Working Paper 21117, 2015.

and

, “Culture, institutions and the wealth of nations,” Review of Economics and Statistics, 2016.

Greif, Avner, “Cultural beliefs and the organization of society: A historical and theoretical reflection on collectivist and individualist societies,” Journal of political economy, 1994, 102 (5), 912–950.

31

Grosjean, Pauline, “A history of violence: The culture of honor and homicide in the US south,” Journal of the European Economic Association, 2014, 12 (5), 1285–1316. Guiso, Luigi, Paola Sapienza, and Luigi Zingales, “Does culture affect economic outcomes?,” The journal of economic perspectives, 2006, 20 (2), 23–48. , , and , “Alfred marshall lecture social capital as good culture,” Journal of the European Economic Association, 2008, 6 (2-3), 295–320. , , and 1436.

, “Long-term persistence,” Journal of the European Economic Association, 2016, 14 (6), 1401–

Gureckis, Todd M. and Robert L. Goldstone, “How you named your child: Understanding the relationship between individual decision making and collective outcomes,” Topics in Cognitive Science, 2009, 1 (4), 651–674. Hahn, Matthew W. and R. Alexander Bentley, “Drift as a mechanism for cultural change: an example from baby names,” Proceedings of the Royal Society of London B: Biological Sciences, 2003, 270 (Suppl 1), S120–S123. Haines, Michael R. and ICPSR, Historical, Demographic, Economic, and Social Data: The United States, 1790-2002, Interuniversity Consortium for Political and Social Research, 2010. Heine, Steven J, “Cultural Psychology,” In Fiske, Gilbert, and Gardner (eds.), Handbook of social psychology. Vol. 2. John Wiley & Sons., 2010, 1 (2), 254–266. Hofstede, Geert, Culture’s consequences: International differences in work-related values 1980. , Cultures and Organizations: Software of the Mind 1991. Hornbeck, R., “Barbed wire: Property rights and agricultural development,” The Quarterly Journal of Economics, 2010, 125 (2), 767–810. IIASA/FAO, “Global Agro-ecological Zones (GAEZ v3.0),” Unpublished Manuscript, 2012. Jokela, Markus, “Personality predicts migration within and between US states,” Journal of Research in Personality, 2009, 43 (1), 79–83. Jr., Roland G. Fryer and Steven D. Levitt, “The causes and consequences of distinctively black names,” The Quarterly Journal of Economics, 2004, 119 (3), 767–805. Juricek, J. T., “American Usage of the Word ”Frontier” from Colonial Times to Frederick Jackson Turner,” American Philosophical Society, 1966, 110 (1), 10–34. Kerr, William R, “Income inequality and social preferences for redistribution and compensation differentials,” Journal of Monetary Economics, 2014, 66, 62–78. Kitayama, Shinobu, Keiko Ishii, Toshie Imada, Kosuke Takemura, and Jenny Ramaswamy, “Voluntary settlement and the spirit of independence: Evidence from Japan’s” northern frontier.”,” Journal of personality and social psychology, 2006, 91 (3), 369. , Lucian Gideon Conway III, Paula R Pietromonaco, Hyekyung Park, and Victoria C Plaut, “Ethos of independence across regions in the United States: the production-adoption model of cultural change.,” American Psychologist, 2010, 65 (6), 559. Kleibergen, Frank and Richard Paap, “Generalized reduced rank tests using the singular value decomposition,” Journal of Econometrics, 2006, 133 (1), 97–126. Kling, Jeffrey R., Jeffrey B. Liebman, and Lawrence F. Katz, “Experimental analysis of neighborhood effects,” Econometrica, 2007, 75 (1), 83–119. Knudsen, Anne Sofie Beck, “Historical Individualism: Selective Migration and Cultural Persistence,” 2017. Laham, Simon M., Peter Koval, and Adam L Alter, “The name-pronunciation effect: Why people like Mr. Smith more than Mr. Colquhoun,” Journal of Experimental Social Psychology, 2012, 48 (3), 752–756.

32

Leet, Don R, “Human fertility and agricultural opportunities in Ohio counties: From frontier to maturity, 1810-1860,” Essays in Nineteenth Century Economic History: The Old Northwest, 1975, pp. 138–158. Lieberson, S. and E.O. Bell, “Children’s First Names: An Empirical Study Of Social Taste,” American Journal of Sociology, 1992, 98 (3), 511–554. Luttmer, Erzo FP and Monica Singhal, “Culture, context, and the taste for redistribution,” American Economic Journal: Economic Policy, 2011, 3 (1), 157–179. Markus, Gregory B., “American individualism reconsidered,” Citizens and politics: Perspectives from political psychology, 2001, pp. 401–32. Mendenhall, Allen, “Donald Trump The Cowboy,” The Daily Caller, January 2017. Migration Policy Institute, “Tabulations of U.S. Department of Homeland Security, Office of Immigration Statistics, Yearbook of Immigration Statistics,” 2016. Minnesota Population Center, “National Historical Geographic Information System: Version 2.0.,” Minneapolis, MN: University of Minnesota, 2011. Modell, John, “Family and fertility on the Indiana frontier, 1820,” American Quarterly, 1971, 23 (5), 615– 634. Moller, Herbert, “Sex composition and correlated culture patterns of colonial America,” The William and Mary Quarterly: A Magazine of Early American History, and Culture, 1945, pp. 114–153. Mood, F., “The Concept of the Frontier, 1871-1898: Comments on a Select List of Source Documents,” Agricultural History Society, 1945, 19 (1), 24–30. Nisbett, RE and D Cohen, “Culture of Honor: The Psychology of Violence in the South,” 1996. Nunn, Nathan and Leonard Wantchekon, “The slave trade and the origins of mistrust in Africa,” American Economic Review, 2011, 101 (7), 3221–3252. Olivetti, Claudia and M Daniele Paserman, “In the name of the son (and the daughter): Intergenerational mobility in the United States, 1850–1940,” American Economic Review, 2015, 105 (8), 2695–2724. Oster, E., “Unobservable selection and coefficient stability: Theory and validation,” Journal of Business and Economic Statistics, forthcoming. Overmeyer, Philip H, “Westward Expansion before the Homestead Act,” Williamson (Ed.), The Growth of the American Economy, 1944, pp. 83–112. Parsons, Talcott and Edward Shils, Toward a General Theory of Action, Harvard University Press, 1951. Piketty, Thomas, “Social mobility and redistributive politics,” The Quarterly journal of economics, 1995, 110 (3), 551–584. Plaut, Victoria C, Hazel Rose Markus, and Margie E Lachman, “Place matters: consensual features and regional variation in American well-being and self.,” Journal of personality and social psychology, 2002, 83 (1), 160. Porter, Robert, Henry Gannett, and William Hunt, “Progress of the Nation, including the Map of the Population of 1870,” Report on Population of the United States at the Elventh Census, 1890, pp. 13–30. Restrepo, Pascual, “The Mounties and the Origins of Peace in the Canadian Prairies,” 2015. Robinson, Peter M, “Root-N-Consistent Semiparametric Regression,” Econometrica, 1988, 56 (4), 931–54. Rodriguez, FC, “Does distributional skewness lead to redistribution,” Evidence from the United States, qEconomics and Politics, 1999, 11 (2). Roland, Gerard, “Understanding institutional change: Fast-moving and slow-moving institutions,” Studies in Comparative International Development, 2004, 38 (4), 109–131. Samuelson, Paul, “Modern economic realities and individualism,” in G. Mills (ed.), Innocence and power: individualism in twentieth-century America, University of Texas Press, 1965, pp. 51–71.

33

Schaefer, Donald F, “A Statistical Profile of Frontier and New South Migration: 1850-1860,” Agricultural History, 1985, 59 (4), 563–578. Schaffner, Brian F Ansolabehere Stephen;, “General Social Surveys, 1972-2014,” CCES Common Content, 2006-2016, 2017. Smith, Guy-Harold, “The populating of Wisconsin,” Geographical Review, 1928, 18 (3), 402–421. Smith, Tom W, Peter Marsden, Michael Hout, and Jibum Kim, “General Social Surveys, 1972-2014,” 2015. Spolaore, Enrico and Romain Wacziarg, “How deep are the roots of economic development?,” Journal of Economic Literature, 2013, 51 (2), 325–369. Steckel, R. H., “Household migration and rural settlement in the United States, 18501860,” Explorations in Economic History, 1989, 26 (2), 190–218. Tabellini, Guido, “The scope of cooperation: Values and incentives,” The Quarterly Journal of Economics, 2008, 123 (3), 905–950. , “Culture and institutions: economic development in the regions of Europe,” Journal of the European Economic Association, 2010, 8 (4), 677–716. Tella, Rafael Di, Juan Dubra, and Robert MacCulloch, “A Resource Belief-Curse? Oil and Individualism,” NBER Working Paper 14556, 2008. The American National Election Studies, “THE ANES GUIDE TO PUBLIC OPINION AND ELECTORAL BEHAVIOR.” Tonnies, ¨ Ferdinand, Gemeinschaft und Gesellschaft: Abhandlung des Communismus und des Socialismus als empirischer Culturformen, Fues, 1887. Triandis, Harry, Collectivism v. Individualism: A Reconceptualisation of a Basic Concept in Cross-cultural Social Psychology, London: Palgrave Macmillan UK, , Individualism & collectivism., Westview press, 1995. , “Individualism and collectivism: Past, present, and future,” The handbook of culture and psychology, 2001, pp. 35–50. Turner, Frederick Jackson, “The Frontier in American History (New York, 1920),” The Significance of Sections in American History, 1920. Twenge, Jean M, Emodish M Abebe, and W Keith Campbell, “Fitting in or standing Out: Trends in American parents’ choices for childrens names, 1880–2007,” Social Psychological and Personality Science, 2010, 1 (1), 19–25. Vandello, Joseph A and Dov Cohen, “Patterns of individualism and collectivism across the United States.,” Journal of personality and social psychology, 1999, 77 (2), 279. Varnum, Michael E.W., “Sources of regional variation in social capital in the United States: Frontiers and pathogens.,” Evolutionary Behavioral Sciences, 2014, 8 (2), 77. Varnum, Michael EW and Shinobu Kitayama, “What’s in a name? Popular names are less common on frontiers,” Psychological Science, 2011, 22 (2), 176–183. Weber, Max, Die protestantische Ethik und der Geist des Kapitalismus 1905. Yasuba, Yasukichi, Birth rates of the white population in the United States, 1800-1860, Johns Hopkins Press, 1962. Zelinsky, W., The Cultural Geography of the United States, Pearson Education, Limited, 1973. Zivot, Eric and Donald W. K. Andrews, “Further evidence on the great crash, the oil-price shock, and the unit-root hypothesis,” Journal of Business & Economic Statistics, 2002, 20 (1), 25–44. Zurn, ¨ Michael and Sascha Topolinski, “When trust comes easy: Articulatory fluency increases transfers in the trust game,” Journal of Economic Psychology, 2017, 61, 74–86. 34

Figures Figure 1: Population Density and the Frontier for Selected Years

35 Notes: Based on county level data from National Historical Geographic Information System: Version 11.0. Population allocated across years and counties based on the procedure described in Section 3.1, which builds upon Hornbeck (2010).

Figure 2: The Evolution of the Frontier, 1790 to 1890

36 Notes: Based on county level data from National Historical Geographic Information System: Version 11.0. The frontier lines demarcate the counties with below and above population density equal to 2 people per square mile. The frontier lines presented above are only those resulting from westward expansions and after removing smaller “island frontiers” lines in the interior using the rules described in Appendix B.

Figure 3: Total Frontier Experience

37 Notes: Based on county level data from National Historical Geographic Information System: Version 11.0. Total frontier experience is the total number of years the county was within 100 km of the frontier and its population density was below 6 people/mi2 , between 1790–1890.

Figure 4: Distribution of Demographic Characteristics by Population Density (a) Sex Ratio

(b) Prime-Age Adult Share .5 E(prime-age adult share | year & division FE)

E(sex ratio | year & division FE)

2

1.5

1

.5

.45

.4

.35 0

10

20

30

40

50

0

10

20

30

40

50

40

50

Population Density

(c) Fertility

(d) Illiteracy .3

1.2

1.1

E(illiteracy | year & division FE)

E(White Children Aged 0-5/White Women Aged 15-39 | year & division FE)

Population Density

1

.9

.25

.2

.15

.1 .8 0

10

20

30

40

50

0

10

Population Density

20

30

Population Density

(e) Foreign-Born Share

E(immigrant shares | year & division FE)

.3

.25

.2

.15

.1 0

10

20

30

40

50

Population Density

Notes: These figures plot semiparametric estimates of the relationship between population density and five demographic characteristics prominent in historical accounts of the frontier. We estimate these nonparametric curves g(·) based on the Robinson (1988) partially linear approach, pooling across all years 1790–1890 for each county c. The specification includes Census division and year fixed effects and are based on an Epanechnikov kernel and rule-of-thumb bandwidth. The dashed lines are 95 percent confidence intervals. The estimates are recovered over all counties, but the figure zooms in on those with less than 50 people/mi2 for presentational purposes. All data come from the county level records from National Historical Geographic Information System: Version 11.0. (a) Sex ratio for whites is the ratio of the number of white males over white females. (b) Share white 15–49 is the fraction of whites aged 15–49 over the total number of whites. (c) Our proxy for fertility is simply equal to the number of children aged 0–5 per women aged 15–39 (for whites). (d) Share of whites aged 20+ Illiterate is the number of illiterate whites aged 20 or older over the number of whites aged 20 or older. (e) Share of Pop Foreign Born is the ratio of the the number of foreign born persons over total population.

38

Figure 5: Distribution of Uncommon Children Names (for Whites Aged 0-10 with NativeBorn Parents) by Population Density (a) Boys

(b) Girls .85 E(uncommon girl names | year & division FE)

E(uncommon boy names | year & division FE)

.7

.68

.66

.64

.8

.75

.62 .7 0

10

20

30

40

50

0

Population Density

10

20

30

40

50

Population Density

Notes: Based on county level data from National Historical Geographic Information System: Version 11.0 Database. The dependent variables on the left (right) is the county level share of girls (boys) aged 0-10 with native parents and have names outside of the top 10 popular names in their corresponding Census division. The semiparametric specification is otherwise the same as in Figure 4. See the notes therein for details.

39

Tables

Table 1: Selective Migration Does not Fully Explain Unique Names on the Frontier Dependent Variable: Child Given Unique Name in 1870s (1) (2) (3) (4) (5) Sample Restriction: White, Native-Born Parents + none Resident of Non-Frontier County in 1870 stayers non-frontier migrants to frontier 1870 County of Residence Fixed Effects

No

No

Yes

No

Yes

resident of frontier county in 1880

0.075*** (0.018)

0.052*** (0.015)

0.023*** (0.007)

0.037*** (0.012)

0.010* (0.005)

Number of Children, Age 0–10 Mean Unique Name Share R2

1,239,513 0.708 0.02

1,224,068 0.708 0.02

1,224,068 0.708 0.03

647,479 0.714 0.02

647,479 0.714 0.03

Yes Yes Yes

Yes Yes Yes

Yes Yes Yes

Yes Yes Yes

Yes Yes Yes

Gender×Age Fixed Effects Birth Order Fixed Effects Mother and Father Unique Name Status

Notes: This table reports estimates of equation (2) based on the linked sample of households in the 1870 and 1880 full county Population Census. The linking is done using the procedure described in Section 5.1. The sample is restricted to children age 0–10 with white, native-born parents. Columns 2-5 restrict to those households living in non-frontier counties in 1870. Columns 4 and 5 further restrict to those individuals who migrated from a non-frontier county in 1870 to a frontier county in 1880 or who remained inside the same non-frontier county in both decades. Children have a unique name if their name falls outside the top 10 nationally during that decade. Similar results obtain for the top 20. The explanatory variable equals one if the household resides in a county that is inside the frontier as defined using the distance and population density cutoffs described in Section 3. Standard errors are clustered at the county level in 1870. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

40

Table 2: Unique Names, Socioeconomic Success and Survival in the Frontier (1)

(2)

(3)

(4)

(5)

Panel A: Father’s Economic Status in 1880 Dep. Var.: normalized seius ocscorus father has unique name

-0.045*** (0.005)

father has unique name × frontier county

-0.047*** (0.005) 0.072** (0.033)

at least one child 0-10 has unique name

-0.048*** (0.005)

0.081*** (0.007)

at least one child 0-10 has unique name × frontier county

Number of Individuals R2 Sample Restriction County Fixed Effects

-0.049*** (0.005) 0.069** (0.033) 0.080*** (0.007) 0.073* (0.042)

-0.044*** (0.005) 0.075** (0.033) 0.065*** (0.007) 0.066 (0.042)

264,038 264,038 264,038 264,038 264,038 0.09 0.09 0.09 0.09 0.10 Native-Born, Age 30–50, White, Non-Migrant 1870–1880 Yes Yes Yes Yes Yes Panel B: Dep. Var.: Migrates from Frontier County in 1870 to Non-Frontier Eastern County in 1880

father has unique name

-0.042*** (0.009)

mother has unique name

-0.010 (0.007)

at least one child 0-10 has unique name

-0.026** (0.011)

Number of Individuals Mean of Dependent Variable R2 Sample Restriction

27,066 0.410 0.08

Origin County Fixed Effects

Yes

-0.041*** (0.009) -0.009 (0.007) -0.023** (0.011)

27,066 27,066 27,066 0.410 0.410 0.410 0.07 0.07 0.08 Native-Born, Age 30–50, White Frontier Resident in 1870 Yes Yes Yes

Notes: This table reports estimates of equation (3) in Panel A and equation (4) in Panel B based on the same linked sample of households from the 1870 and 1880 Census described in the notes to the previous table and at length in Section 5.1. Unique name measures are as defined elsewhere and based on the top 10 nationally for all family members. The frontier dummy in both panels is based on our core time-varying measure described in Section 3.1. In Panel A, the dependent variable in columns 1-4 is the normalized Duncan (1961) socioeconomic index (seius) and in column 5 is the normalized occupational score (ocscorus), both provided by the North Atlantic Population Project (NAPP). The sample in Panel A includes all male household heads (fathers) that did not migrate across counties between 1870 and 1880. Standard errors are clustered at the county level. The sample in Panel B is restricted to households residing in frontier counties in 1870, and the dependent variable equals one if the household moved to a non-frontier county in the east by 1880. Standard errors are clustered at the origin county. All regressions include dummies for the number of children born in the 1870s. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

41

Table 3: Total Frontier Experience and Unique Name Choices Persist into the 20th Century Sample: White Children Aged 0-10, 1940 Census (1) (2) (3) (4) Panel A: Share of Boys with Unique Names frontier experience

Oster δ for β = 0 Number of Counties R2

0.209*** (0.031)

0.139*** (0.024)

0.131*** (0.024)

0.128*** (0.020)

2,039 0.06

3.67 2,039 0.50

2.37 2,039 0.56

2.26 2,035 0.63

Panel B: Share of Girls with Unique Names frontier experience

Oster δ for β = 0 Number of Counties R2 Division Fixed Effects State Fixed Effects Geographic/Agroclimatic Controls

0.201*** (0.031)

0.182*** (0.028)

0.198*** (0.029)

0.194*** (0.023)

2,039 0.06

6.69 2,039 0.27

5.02 2,039 0.33

5.06 2,035 0.44

No No No

Yes No No

Yes Yes No

Yes Yes Yes

Notes: This table reports estimates of equation (5) for our leading proxy of individualism in the 20th century, the share of boys and girls age 0–10 with unique names in the 1940 Census. The dependent variable is normalized so that the coefficient indicates the standard deviation effect of each additional decade of frontier exposure historically. The sample is restricted to counties with some frontier experience based on the measure described in Section 3.2. The baseline measure of uniqueness is given by the share of children in county c with a name that falls outside the top 10 names for children born in the same Census division within the given decade. Other measures of unique names are considered in Appendix Table A.5. Frontier experience is expressed in decades. Column 1 is the simple bivariate regression. Columns 2 and 3 add Census division and state fixed effects. Column 4 adds the following controls: county area; county centroid latitude and longitude; distance to oceans, lakes and rivers from county centroid; mean county temperature and rainfall; elevation; average potential agricultural yield. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Inference remains unchanged when using the computationally more intensive Conley (1999) spatial HAC estimator with a bandwidth of 300 km. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

42

Table 4: Total Frontier Experience and Opposition to Government Intervention and Redistribution Data Source: Dependent Variable:

Scale:

frontier experience

Oster δ for β = 0 Mean of Dependent Variable Number of Individuals Number of Counties R2 Survey Wave Fixed Effects Individual Demographic Controls State Fixed Effects Geographic/Agroclimatic Controls

ANES Prefers Cut Public Spending on Poor binary (1)

CCES Prefers Cut Public Spending on Welfare binary (2)

GSS Believes Gov’t Should Redistribute normalized (3)

CCES Prefers Reduce Debt by Spending Cuts binary (4)

GSS Index of Preferences for Spending Cuts normalized (5)

ACS County Property Tax Rate, 2010 [0, 100] (6)

0.010** (0.004)

0.011*** (0.004)

-0.022* (0.012)

0.013*** (0.002)

0.034*** (0.010)

-0.032*** (0.006)

5.6 0.09 2,322 95 0.04 Yes Yes Yes Yes

6.5 0.40 27,749 1,732 0.04 Yes Yes Yes Yes

6.3 0.00 9,085 255 0.06 Yes Yes Yes Yes

2.4 0.43 169,536 2,000 0.04 Yes Yes Yes Yes

2.6 0.00 5,739 253 0.07 Yes Yes Yes Yes

1.4 1.02 – 2,028 0.82 – – Yes Yes

43

Notes: This table reports estimates of equation (5) for several measures capturing preferences for redistribution and state spending as well as actual property tax rates. Full details on the outcomes can be found in Appendix B. We use all available survey rounds with the given outcome, and in all cases, we restrict to those counties in our main sample as described in the notes to Table 3. The ANES measure in column 1 equals one if the respondent prefers that federal government spending on poor people be cut. The CCES measure in column 2 equals one if the respondent would prefer to cut public spending on welfare programs. The GSS measure in column 3 is a normalized measure of intensity of support on a 7 point scale of the statement that the government should reduce income differences in society through redistribution. The CCES question in column 4 equals one if the household would prefer that the state budget be balanced through spending cuts rather than tax increases. The GSS measure in column 5 is a normalized first principal component analysis (PCA) index based on a series of questions about whether the government spends too much on different public goods and transfer programs. The measure of county-level property tax rates in column 6 is estimated from American Community Survey data from 2010. Combining estimates from different columns and related outcomes across subsequent tables yields mean effects estimates based on the Kling et al. (2007) approach that retain statistical significance of the frontier effect. All columns are based on the specification in column 4 of Table 3 with additional individual-level controls for age, age squared, gender, and race. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table 5: Total Frontier Experience and the Rising Republican Vote Share Dependent Variable:

Republican Vote Share in . . . Presidential Election 2000 2004 2008 2012 2016 ∆ ’16–’00 (2) (3) (4) (5) (6) (7)

2000–16 (Avg.) (1) frontier experience

Oster δ for β = 0 Number of Counties Mean of Dependent Variable R2

∆ ’16–’12 (8)

1.887*** (0.353)

1.276*** (0.326)

1.530*** (0.345)

1.719*** (0.367)

2.057*** (0.391)

2.852*** (0.412)

1.576*** (0.261)

0.795*** (0.124)

14.24 2,035 60.0 0.22

-17.20 2,035 56.6 0.19

24.66 2,035 60.3 0.20

5.87 2,035 57.4 0.27

9.17 2,035 60.6 0.24

26.62 2,035 65.4 0.21

9.12 2,035 8.9 0.21

-5.20 2,035 4.9 0.26

Notes: This table reports estimates of equation (5) for measures of the county-level Republican vote share in the last five presidential elections with data from the Leip Atlas. Column 1 averages across all five elections. Columns 2–6 report yearspecific effects. The sample and measure of frontier experience are as described in the notes to Table 3, and all estimates are based on the specification in column 4 from that table. Cross-equation tests reveal that the effect sizes are statistically different in each subsequent year and each year is statistically different from 2016.Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table 6: Total Frontier Experience and Preferences Over Contentious, “Frontier-Relevant” Policy Issues Dependent Variable:

frontier experience

Oster δ for β = 0 Number of Individuals Number of Counties Mean of Dependent Variable R2 Survey Wave Fixed Effects Individual Demographic Controls State Fixed Effects Geographic/Agroclimatic Controls

Prefers Repealing Affordable Care Act (1)

Supports Increasing Minimum Wage (2)

Supports Banning Assault Rifles (3)

Opposes EPA Regulation of CO2 Emissions (4)

0.023*** (0.004)

-0.023*** (0.007)

-0.017*** (0.004)

0.016*** (0.004)

3.0 29,432 1,727 0.535 0.06 Yes Yes Yes Yes

3.2 5,134 1,066 0.688 0.06 Yes Yes Yes Yes

2.7 29,390 1,722 0.630 0.09 Yes Yes Yes Yes

2.2 29,203 1,717 0.317 0.08 Yes Yes Yes Yes

Notes: This table reports estimates of equation (5) for four measures of support for conservative issues that are particularly relevant to the frontier setting in historical accounts. These are all binary indicators based on questions in the CCES across different years. The measure in Column 1 equals one if the individual in 2014 believes that the Affordable Care Act (ACA) should be repealed, in Column 2 equals one if the individual in 2007 supports an increase in the minimum wage, in Column 3 equals one if the individual in 2014 supports a ban on assault rifles, and in Column 4 equals one if the individual in 2014 opposes regulation of pollution by the Environmental Protection Agency (EPA). The set of specifications are otherwise the same as in Table 3; see the notes therein for details. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

44

Table 7: Instrumental Variables Estimates for Main Outcomes Data Source: Dependent Variable:

1940 Pop. Census Unique Name Share Boys Girls (1)

(2)

ANES CCES Prefers Cut Public Spending on Poor on Welfare (3)

(4)

GSS Government Should Redistribute (5)

CCES Prefer Cut Debt by Spending Cuts (6)

GSS Index of Preferences for Cut Spending (7)

ACS County Property Tax Rate, 2010 (8)

Prefers Repealing ACA (9)

Supports Increasing Min. Wage (10)

CCES Supports Banning Assault Rifle (11)

Opposes EPA Regulation of CO2 (12)

Leip Republican Vote Share 2000–16 (Avg.) (13)

Panel A: Baseline Specifications from Tables 3–5 frontier experience

KP Wald Statistic Number of Individuals Number of Counties Mean of Dependent Variable Survey Wave Fixed Effects Individual Demographic Controls State/Division Fixed Effects Geographic/Agroclimatic Controls

0.224*** (0.048)

0.260*** (0.049)

0.021*** (0.008)

0.002 (0.006)

-0.027 (0.021)

0.010** (0.005)

0.043* (0.023)

-0.046*** (0.015)

0.012 (0.007)

-0.027** (0.012)

-0.002 (0.008)

0.009 (0.007)

2.854*** (0.724)

103.5 – 2,035 0.00

103.5 – 2,035 0.00

18.5 2,322 95 0.090

30.8 27,749 1,732 0.402

52.2 9,085 255 0.00

33.4 169,536 2,000 0.434

45.2 5,739 253 0.00

103.5 – 2,028 1.02

34.1 29,432 1,727 0.535

38.9 5,134 1,066 0.688

34.2 29,390 1,722 0.630

34.5 29,203 1,717 0.317

103.5 – 2,035 60.0

– – Yes Yes

– – Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

– – Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

– – Yes Yes

45

Notes: This table reports instrumental variables estimates of equation (5) based on the Bartik (1991)-inspired instrument described in Section 6.1. We report results for several of the long-run outcomes in Tables 3–6 using the same specifications therein. Panel A reports the baseline IV estimates while Panel B reports estimates that additionally include a set of bad (i.e., not predetermined) controls: population density in 1890 and 2010, birthplace diversity (fractionalization) in 1870 and 2000, and the first year the county was connected to the railroad. The sample size is slightly reduced here due to coverage gaps in the 1870 Census. Results look similar in Panel A restricted to this smaller sample. The KP Wald Statistic is due to Kleibergen and Paap (2006). Given the large Wald stats, similar inference arises using weak-instrument robust estimators. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table 8: Robustness to Additional Controls as Mediators of Main Outcomes Data Source: Dependent Variable:

1940 Pop. Census Unique Name Share Boys Girls (1)

(2)

ANES CCES Prefers Cut Public Spending on Poor on Welfare (3)

(4)

GSS Government Should Redistribute (5)

CCES Prefer Cut Debt by Spending Cuts (6)

GSS Index of Preferences for Cut Spending (7)

ACS County Property Tax Rate, 2010 (8)

Prefers Repealing ACA (9)

Supports Increasing Min. Wage (10)

CCES Supports Banning Assault Rifle (11)

Opposes EPA Regulation of CO2 (12)

Leip Republican Vote Share 2000–16 (Avg.) (13)

Panel A: Baseline OLS Specifications from Tables 3–5 + Controls for Pop. Density in 1890/2010, Birthplace Diversity in 1870/2000, Year Connected to Rail frontier experience

0.065*** (0.018)

0.127*** (0.021)

0.013*** (0.004)

-0.0001 (0.003)

-0.023* (0.013)

0.003 (0.002)

0.025 (0.013)

-0.002 (0.005)

0.011* (0.004)

-0.021** (0.008)

-0.004 (0.003)

0.006* (0.003)

1.068*** (0.333)

– 1,838

– 1,838

2,188 87

26,577 1,606

8,466 242

169,020 1,819

5,382 240

– 1,836

29,432 1,727

4,905 1,004

28,114 1,588

27,929 1,581

– 1,838

0.159*** (0.039)

0.231*** (0.040)

0.016*** (0.006)

-0.002 (0.028)

0.018 (0.030)

0.004 (0.025)

0.031 (0.023)

-0.026* (0.015)

0.012* (0.007)

-0.027** (0.012)

-0.001 (0.007)

0.008 (0.006)

1.947*** (0.724)

KP Wald Statistic Number of Individuals Number of Counties

140.5 – 1,838

140.5 – 1,838

69.3 2,188 87

43.6 26,577 1,606

49.9 8,466 242

46.9 169,020 1,819

45.2 5,382 240

140.4 – 1,836

47.3 29,432 1,727

51.9 4,905 1,004

47.3 28,114 1,588

47.8 27,929 1,581

140.5 – 1,838

Mean of Dependent Variable

0.00

0.00

0.090

0.402

0.00

0.434

0.00

1.02

0.535

0.688

0.630

0.317

60.0

Survey Wave Fixed Effects Individual Demographic Controls State/Division Fixed Effects Geographic/Agroclimatic Controls

– – Yes Yes

– – Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

– – Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

Yes Yes Yes Yes

– – Yes Yes

Number of Individuals Number of Counties

Panel B: Baseline IV Specifications from Tables 3–5 + Controls for Pop. Density in 1890/2010, Birthplace Diversity in 1870/2000, Year Connected to Rail frontier experience

46

Notes: This table reports instrumental variables estimates of equation (5) based on the Bartik (1991)-inspired instrument described in Section 6.1. We report results for several of the long-run outcomes in Tables 3–6 using the same specifications therein. Panel A reports the baseline IV estimates while Panel B reports estimates that additionally include a set of bad (i.e., not predetermined) controls: population density in 1890 and 2010, birthplace diversity (fractionalization) in 1870 and 2000, and the first year the county was connected to the railroad. The sample size is slightly reduced here due to coverage gaps in the 1870 Census. Results look similar in Panel A restricted to this smaller sample. The KP Wald Statistic is due to Kleibergen and Paap (2006). Given the large Wald stats, similar inference arises using weak-instrument robust estimators. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Appendix A

Further Empirical Results

We report here additional results mentioned in the paper.

A.1

Effort as the Road to Riches

Frontier conditions of wilderness and isolation presented both opportunities and challenges. Insofar as these conditions implied favorable prospects of upward mobility and a large perceived importance of effort in income generation, they would be prone to generate opposition to redistribution and government intervention more broadly, as suggested by the literature reviewed in Section 2.2. The abundance of land and other natural resources offered ample profit opportunities provided they were deftly exploited. In the words of one of Turner’s most prominent students, the frontiersman “harbored the belief that his self-sufficiency would increase, knowing that his own abilities would assure him a prosperous future as he exploited the natural resources about him” (Billington, 1974). In this now classic narrative, the frontier was characterized by “ widespread property holdings” and by a “belief that every man would achieve affluence.” The relative ease of access to land, combined with the pervasive threats to success, weakened class distinctions and stimulated economic mobility on the frontier. In the account of Overmeyer (1944), because “life was rough, crude, hard, and dangerous, few people of culture and means were attracted to the frontier, and hence the pioneers in general represented the great body of lower middle class, some of whom through their own efforts were able to amass comfortable fortunes.” And given that “all had to face the same hardships and dangers,” the frontier was a “great leveling institution.” As Billington (1974) noted, the frontier conditions also implied that “A man’s capacities, not his ancestry, determined his eventual place in the hierarchy, to a greater degree than in older societies.” These pervasive views had merit as seen in data on inequality. Consistent with the idea that frontier locations offered a level playing field, and thus were prone to generate a large perceived importance of effort, Appendix Figure A.6(a) shows that levels of land inequality, as captured by the Gini index, were much lower in sparsely populated locations. Appendix Figure A.6(b) shows that land inequality rose sharply as counties exited the frontier. These aspects of frontier life were bound to instill an opposition to government intervention. Citing again from Billington (1974), the frontier man “wanted not government interference with his freedom as he followed the road to riches.” In the prevalent view, “every man was a self-dependent individual, capable of caring for himself without the fostering care of society,” which “seemed just in a land that provided equal opportunity for all to ascend the social ladder.”

47

A.2

Demographics and Individualism in the Transition from Frontier to Post-Frontier

As noted in Section 4, our second strategy for identifying distinctive traits along the frontier is based on a panel event-study specification of the following form: xcdt = α +

40 X

γj 1(years since exiting frontier = j) + θd + θt + εcdt ,

(A.1)

j=−20

where the γj coefficients identify the average x for counties that have exited or will exit the frontier (as defined in Section 3.1) j years prior or in the future, respectively. We plot 95 percent confidence intervals for the γ terms, each of which are estimated with reference to the decade in which the county transitioned out of the frontier. These event study figures provide similar insights as the semiparametric ones in Section 4 with the added advantage that they capture some degree of within-county differences over time rather than static cross-sectional differences in demographics and individualism.1 Demographics.

The event study in Figure A.4(a) provides evidence of the abrupt shift in the sex ratio

as counties exit the frontier. Relative to the decade in which counties exit the frontier, the preceding two decades have around 0.25 higher male to female sex ratios whereas those decades thereafter have slightly lower ratios. These patterns line up with the typical view of frontier settlements displaying a majority of males. The results in Figure A.4(b) provide similar evidence of convergence towards a more stable age distribution as counties exit the frontier. For fertility rates, the transition takes a different shape, as seen in Figure A.4(c). The number of young children per adult (child-bearing age) woman remains fairly stable in the decades prior to exiting and then sharply declines thereafter, falling by around 0.3 as we approach 30 years after exiting. We find a similar albeit less pronounced transition path for illiteracy rates in Figure A.4(d). For the foreign-born population share, we see a steady decline after counties exit the frontier in Figure A.4(e) . In the two decades prior to exiting, the foreign-born account for a share of the population around 10–15 percentage points (p.p.) larger than in the years after exiting. Individualism. Turning to our proxy for individualism, the event study results in Figure A.5 highlight the homogenization in naming patterns as counties approach the decade in which they exit the frontier. Thereafter, we see those naming patterns stabilize around that more conformist equilibrium in the ensuing decades as communities put down deeper roots.

1

We do not use county fixed effects as the unbalanced nature of the panel leads to very noisy estimates. Nevertheless, the descriptive results here offer additional insight on the process of demographic and cultural change along the frontier.

48

Figures Figure A.1: Comparing Our Population Density Estimates for 1790 and 1860 with the Census Report from 1890

49 Notes: This figure compares our estimates on the right of population density in 1790 and 1860 at the (2010) county level using the harmonization procedure described in Section 3 to the historical estimates in on the left based on the the noteworthy Progress of the Nation Census report on which Turner based his thesis.

Figure A.2: The Frontier Using Contemporaneous vs 2010 County Boundaries for selected years

50 Notes: Based on county level Population Census data from 1790-1880 and NHGIS county shapefiles. The figures provide the county boundaries for selected years and the frontier lines for the corresponding years drawn using the contemporaneous county boundaries as well as the 2010 county boundary. The frontier lines delineate the counties that had population density of two persons or higher. The frontier lines in blue are drawn using the contemporaneous county boundaries where as the frontier lines in red are drawn using the 2010 county boundaries (after the data harmonization discussed in Section 3.1 ).

Figure A.3: Population Density Across Counties, 1800–1880 800

800 800

600

600

400

200

0

0

10

20 30 Population Density, 1800

40

0

50

800

Frequency

Frequency

600

400

0

20

40 60 Population Density, 1830

0

20

40 Population Density, 1810

60

0

80

800

800

600

600

400

0

80

400

200

200

200

0

400

Frequency

200

Frequency

Frequency

Frequency

600

600

600

400

400

0

20

40 60 Population Density, 1820

80

400

200

0

20

40 60 Population Density, 1840

80

0

100

0

20

40 60 Population Density, 1850

80

100

0

20

40 60 Population Density, 1880

80

100

400

200

Frequency

Frequency

Frequency

300

200

200 100

0

0

20

40 60 Population Density, 1860

80

100

0

0

20

40 60 Population Density, 1870

80

100

0

Notes: Based on county level Population Census data from 1800–1880. Population density is measured in number of persons per square mile.

51

Figure A.4: Demographic Characteristics Along the Transition out of the Frontier Event Study Estimates with Respect to Year of Exiting the Frontier (a) Sex Ratio

(b) Prime-Age Adult Share prime-age adult share: effect relative to decade of frontier exit

sex ratio: effect relative to decade of frontier exit

.6

.4

.2

0

-.2 -20

-10

0 10 20 30 Year Before/After Leaving the Frontier

.06

.04

.02

0

-.02

40

-20

-10

(c) Fertility .1 illiteracy: effect relative to decade of frontier exit

fertility: effect relative to decade of frontier exit

40

(d) Illiteracy

.1

0

-.1

-.2

-.3

.05

0

-.05 -20

-10 0 10 20 30 Year Before/After Leaving the Frontier

40

-20

.05

0

-.05

-.1

-.15 -20

-10 0 10 20 30 Year Before/After Leaving the Frontier

-10

0 10 20 30 Year Before/After Leaving the Frontier

40

(f) Foreign- and Out-of-State Born Share out-of-state born share: effect relative to decade of frontier exit

(e) Foreign-Born Share immigrant share: effect relative to decade of frontier exit

0 10 20 30 Year Before/After Leaving the Frontier

40

.1

0

-.1

-.2

-.3 -20

-10 0 10 20 30 Year Before/After Leaving the Frontier

40

Notes: This figure plots coefficients from the event study regressions of the form in equation (A.1) for each of the outcomes in the semiparametric regressions presented in Figure 4. The decade-specific point estimates and 95 percent confidence intervals are each with reference to the county-specific decade of exiting the frontier. All regressions include division and year fixed effects. Standard errors are clustered using the arbitrary grid cell approach of (Bester et al., 2011) as described in Section 6.1.

52

Figure A.5: Uncommon Names Among White Children With Native Parents Along the Transition out of the Frontier Event Study Estimates with Respect to Year of Exiting the Frontier (b) Girls

.06

uncommon girl names: effect relative to decade of frontier exit

uncommon boy names: effect relative to decade of frontier exit

(a) Boys

.04

.02

0

-.02

-.04 -20

-10

0 10 20 30 Year Before/After Leaving the Frontier

.1

.05

0

-.05

40

-20

-10

0 10 20 30 Year Before/After Leaving the Frontier

40

Notes: This figure replicates the event study specification in Figure A.4 for the unique names measure defined and used in Figure 5 but extended to include data up to 1930.

Figure A.6: Inequality is Lower on the Frontier (a) Semiparametric

(b) Event Study .1 land gini: effect relative to decade of frontier exit

E(land gini | year & division FE)

.45

.4

.35

.3 0

10

20

30

40

.05

0

-.05

50

-20

Population Density

-10

0 10 20 30 Year Before/After Leaving the Frontier

40

Notes: Based on county level data from National Historical Geographic Information System: Version 11.0 Database from 1790-1890. Land inequality is measured using the county level gini coefficient based on the number of farms in seven bins of farm size. The semiparametric specification in (a) is the same as in Figure 4. The event study specification in Figure A.4. See the notes therein for details.

53

Figure A.7: Annual Migration Inflows

Annual Inflow of Immigrants to the US

800000

600000

400000

200000

0 1820

1830

1840

1850 1860 year

1870

1880

1890

Notes: This figure plots the total number of migrants entering the United States, 1820-1890. The data for 1820-1940 is available from the Migration Policy Institute (2016).

Figure A.8: Immigration and Westward Expansion Westward Shift in Center of Population (in KM)

140 1850-1860

120

100 1830-1840

80

1870-1880 1840-1850 1880-1890

1810-1820 1860-1870 1790-1800 1820-1830 1800-1810

60 11

12

13 14 15 Log of Total Decadal Migration Inflow

16

Notes: This figure plots the length of the decadal westward shift of the center of population (in km) against the total immigrant inflow in the decade (in logs). Due to lack of migration inflow data for earlier years, the values for 1790-1819 are imputed. See Appendix B for details. However, note that the line fit is calculated restricting to the data from 1820-1890 obtained from the Migration Policy Institute (2016).

54

Figure A.9: Unique Names Among White Children With Native-Born Grand Parents Along the Transition out of the Frontier (a) Semiparametric Cross-Sectional Regressions (a) Boys

(b) Girls .82 E(uncommon girl names | year & division FE)

E(uncommon boy names | year & division FE)

.7

.68

.66

.64

.62

.8

.78

.76

.74 0

10

20

30

40

50

0

10

Population Density

20

30

40

50

Population Density

(b) Event Study Regressions (b) Girls uncommon girl names: effect relative to decade of frontier exit

uncommon boy names: effect relative to decade of frontier exit

(a) Boys .04

.02

0

-.02

-.04

-.06 -10

0 10 20 30 Year Before/After Leaving the Frontier

40

.04

.02

0

-.02

-.04 -10

0 10 20 30 Year Before/After Leaving the Frontier

40

Notes: This figure replicates the semiparametric specification from Figure 5 and the the event study specification in Figure A.5 for the same unique names measure therein but restricted to children with native-born grandparents.

55

Tables Table A.1: Differential Growth Effects of Unique Names Inside the Frontier Dependent Variable: ∆ log seius ocscorus (2) (3) (4) (5)

(1) has unique name has unique name × frontier county

Number of Individuals Mean of Dependent Variable R2 County Fixed Effects Age, Age Squared Control Number of Children Fixed Effects Baseline seius or ocscorus

(6)

-0.021 (0.032) 0.646** (0.319)

-0.023 (0.032) 0.650** (0.312)

-0.040 (0.028) 0.738** (0.339)

0.006 (0.018) 0.213 (0.150)

0.005 (0.018) 0.221 (0.168)

-0.005 (0.016) 0.272* (0.163)

3,898 0.052 0.24

3,898 0.052 0.26

3,898 0.052 0.41

3,897 0.030 0.26

3,897 0.030 0.28

3,897 0.030 0.39

Yes No No No

Yes Yes Yes No

Yes Yes Yes Yes

Yes No No No

Yes Yes Yes No

Yes Yes Yes Yes

Notes: This table reports estimates analogous to those in Panel A of Table 2 but based on the NAPP Linked Sample based on a 1 percent of the population in the 1870 and 1880 Population Censuses. The sample is restricted relative to the one in Table 2 as we are interested in the change in socioeconomic status (seius) and occupational standing (ocscorus) between 1870 and 1880, and this information is only available in both years for this smaller linked sample. The estimates are based on white, male household heads that reside in the same county in 1870 and 1880. The dependent variable is the change in log seius or ocscorus, and the frontier dummy equals one if the given county lies in the frontier in 1880. The unique name measure is based on the top 10 names nationally, but we find similar results for the top 25 names. The controls listed at the bottom of the table include the baseline dependent variable in columns 3 and 6. Standard errors are clustered at the county level. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table A.2: Unique Names Are Not Associated with Differential Success Among Migrants to the Frontier versus Non-Frontier

(1) father has unique name

0.016*** (0.005)

father has unique name × frontier county

Dependent Variable: Father’s Economic Status in 1880 seius ocscorus (2) (3) (4) (5) 0.017*** (0.005) -0.004 (0.017)

at least one child 0-10 has unique name

0.012** (0.005)

0.125*** (0.007)

at least one child 0-10 has unique name × frontier county

Number of Individuals R2

239,638 0.11

239,638 0.11

239,638 0.11

0.013** (0.005) -0.003 (0.017) 0.127*** (0.007) -0.027 (0.024)

0.005 (0.005) 0.005 (0.018) 0.101*** (0.007) -0.020 (0.025)

239,638 0.11

239,638 0.12

Notes: This table reports estimates analogous to those in Panel A of Table 2 but restricting to individuals that move counties between 1870 and 1880 with the frontier dummy indicating equal to one for those migrants that arrived in a frontier county some time between 1870 and 1880. The dependent variable is normalized socioeconomic status (seius) or occupational standing (ocscorus). All columns include fixed effects for the number of children born in the decade, include 1880 county fixed effects, and cluster standard errors at that same level. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

56

Table A.3: Unique Names and Survival in the Frontier (1)

(2)

(3)

(4)

Dependent Variable: Panel A: Emigrated from Frontier County in 1870 father has unique name

-0.051*** (0.009)

mother has unique name

27,066 0.583 0.07

27,066 0.583 0.07

-0.005 (0.007)

at least one child 0-10 has unique name

Number of Individuals Mean of Dependent Variable R2

-0.024** (0.010)

-0.050*** (0.009) -0.005 (0.007) -0.020* (0.010)

27,066 0.583 0.07

27,066 0.583 0.07

Panel B: Return Migrated from Frontier County in 1870 to Non-Frontier Eastern County in 1880 father has unique name

-0.042*** (0.009)

mother has unique name

27,066 0.410 0.07

27,066 0.410 0.08

-0.010 (0.007)

at least one child 0-10 has unique name

Number of Individuals Mean of Dependent Variable R2

-0.026** (0.011)

-0.041*** (0.009) -0.009 (0.007) -0.023** (0.011)

27,066 0.410 0.08

27,066 0.410 0.07

Panel C: Onward Migrated from Frontier County in 1870 to Frontier Western County in 1880 father has unique name

-0.006 (0.006)

mother has unique name

27,066 0.167 0.11

27,066 0.167 0.11

0.006 (0.005)

at least one child 0-10 has unique name

Number of Individuals Mean of Dependent Variable R2

0.001 (0.008)

-0.006 (0.006) 0.006 (0.005) 0.001 (0.008)

27,066 0.167 0.11

27,066 0.167 0.11

Notes: This table reproduces the estimates from Panel B of Table 2 alongside other outcomes in Panels A and C that clarify that the “return migration” effect comprises the full effect on outmigration destinations discussed with respect to that finding. Standard errors are clustered at the origin county level. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

57

Table A.4: Intergenerational Persistence in Unique Names is Higher in the Frontier

(1) resident of frontier county father has unique name

0.185*** (0.034) 0.058*** (0.002)

0.049*** (0.002) -0.026 (0.049) 0.017*** (0.001) 0.113*** (0.029)

0.089*** (0.003) -0.009 (0.081) 0.016*** (0.002) 0.115*** (0.041)

0.008*** (0.002) -0.003 (0.045) 0.018*** (0.002) 0.127*** (0.049)

645,406 0.714 0.02

645,406 0.714 0.02

645,406 0.714 0.04

327,464 0.665 0.04

317,942 0.765 0.03

Yes No

Yes No

Yes Yes

Yes Yes

Yes Yes

0.024*** (0.002)

mother has unique name × frontier county

Number of Individuals Mean Dependent Variable R2 (Gender×)Age Fixed Effects County Fixed Effects

Girls (5)

-0.104* (0.062) 0.058*** (0.002) 0.102* (0.053) 0.024*** (0.002) 0.218*** (0.033)

father has unique name × frontier county mother has unique name

Dependent Variable: Child Has Unique Name Both Genders Boys (2) (3) (4)

Notes: This table reports estimates of the intergenerational persistence in unique names inside and outside the frontier based on the linked sample data for 1870 and 1880 used in Table 2. The sample is restricted to white children age 0–10 with native-born parents that did not move counties between 1870 and 1880. Unique name for fathers, mothers, and children is based on the top 10 nationally. All columns include fixed effects for birth order and the number of children siblings born in the decade and cluster standard errors at the 1880 county level. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

58

Table A.5: Robustness to Other Measures of Unique Names (OLS Estimates)

Further Sample Restriction

(1)

(2)

(3)

None

Native Father

Native Parents

Unique Measure

(4)

(5)

(6)

(7)

(8)

(9)

(10)

None

None

Top 25 Division

Top 100 Division

Top 10 National Non-Biblical

Sample: White Children Aged 0-10, 1940 Census Native None None None None Grandparents

Top 10 Division

Top 10 National

Top 10 State

Top 10 County

Panel A: Share of Boys with Unique Names frontier experience

Number of Counties R2

0.128*** (0.020)

0.133*** (0.020)

0.132*** (0.020)

0.103*** (0.022)

0.142*** (0.019)

0.140*** (0.020)

0.111*** (0.032)

0.118*** (0.021)

0.130*** (0.020)

0.116*** (0.016)

2,035 0.63

2,035 0.62

2,035 0.62

2,035 0.26

2,035 0.63

2,035 0.61

2,035 0.20

2,035 0.64

2,035 0.68

2,035 0.73

Panel B: Share of Girls with Unique Names

59

frontier experience

Number of Counties R2

0.194*** (0.023)

0.188*** (0.023)

0.103*** (0.022)

0.101*** (0.021)

0.188*** (0.022)

0.197*** (0.021)

0.054* (0.032)

0.175*** (0.019)

0.178*** (0.019)

0.172*** (0.020)

2,035 0.44

2,035 0.44

2,035 0.26

2,034 0.12

2,035 0.49

2,035 0.42

2,035 0.22

2,035 0.59

2,035 0.65

2,035 0.54

Notes: This table reports analogous estimates of Table 3 for alternative measures of the prevalence of individualistic naming patterns. The dependent variable is normalized across all columns. Column 1 reproduces the baseline estimates from that main table based on the full specification in column 4. Column 2 restricts the measure to children with native-born fathers, column 3 restricts to those with native-born mothers and fathers, and column 4 restricts to those with native-born grandparents. Column 5 changes the definition of uniqueness to be based on the top 10 nationally, column 6 changes to the top 10 at the state level, and column 7 to the top 10 at the given county level. Column 8 increases the uncommon threshold to the top 25, and column 9 increases that to the top 100. Column 10 restricts to top names that are do not have biblical roots. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table A.6: Robustness to Other Measures of Unique Names (IV Estimates)

Further Sample Restriction

(1)

(2)

(3)

None

Native Father

Native Parents

Unique Measure

(4)

(5)

(6)

(7)

(8)

(9)

(10)

None

None

Top 25 Division

Top 100 Division

Top 10 National Non-Biblical

Sample: White Children Aged 0-10, 1940 Census Native None None None None Grandparents

Top 10 Division

Top 10 National

Top 10 State

Top 10 County

Panel A: Share of Boys with Unique Names frontier experience

60 Number of Counties KP Wald Stat R2

0.224*** (0.048)

0.225*** (0.049)

0.222*** (0.049)

0.150*** (0.054)

0.195*** (0.046)

0.223*** (0.048)

0.200** (0.085)

0.201*** (0.047)

0.210*** (0.044)

0.152*** (0.040)

2,035 103.5 0.02

2,035 103.5 0.02

2,035 103.5 0.02

2,035 103.5 0.01

2,035 103.5 0.04

2,035 103.5 0.03

2,035 103.5 0.01

2,035 103.5 0.02

2,035 103.5 0.03

2,035 103.5 0.04

Panel B: Share of Girls with Unique Names frontier experience

Number of Counties KP Wald Stat R2

0.260*** (0.049)

0.243*** (0.048)

0.150*** (0.054)

0.093* (0.050)

0.193*** (0.047)

0.257*** (0.050)

0.058 (0.074)

0.248*** (0.042)

0.241*** (0.042)

0.205*** (0.047)

2,035 103.5 0.06

2,035 103.5 0.05

2,035 103.5 0.01

2,034 103.5 0.01

2,035 103.5 0.06

2,035 103.5 0.06

2,035 103.5 0.00

2,035 103.5 0.06

2,035 103.5 0.07

2,035 103.5 0.06

Notes: This table reports analogous instrumental variables estimates of the OLS specifications in Table A.5. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table A.7: Unique Naming Choices Persist (Intergenerationally) Through the Early 20th Century 1910 OLS (1)

Sample: White Children Aged 0-10 in . . . Census 1920 1930 1940 IV OLS IV OLS IV OLS IV (2) (3) (4) (5) (6) (7) (8)

Panel A: Share of Boys with Unique Names in Top 10 Nationally frontier experience

Number of Counties KP Wald Stat R2

0.181*** (0.026)

0.136** (0.059)

0.183*** (0.023)

0.176*** (0.053)

0.148*** (0.021)

0.185*** (0.047)

0.142*** (0.019)

0.195*** (0.046)

2,028

2,028 105.1 0.04

2,035

2,035 103.5 0.06

2,035

2,035 103.5 0.04

2,035

2,035 103.5 0.04

0.30

0.44

0.53

0.63

Panel B: Share of Girls with Unique Names in Top 10 Nationally frontier experience

Number of Counties KP Wald Stat R2

0.162*** (0.019)

0.128*** (0.041)

0.172*** (0.018)

0.175*** (0.043)

0.163*** (0.022)

0.151*** (0.046)

0.188*** (0.022)

0.193*** (0.047)

2,028

2,028 105.1 0.05

2,035

2,035 103.5 0.08

2,035

2,035 103.5 0.06

2,035

2,035 103.5 0.06

0.56

0.64

0.56

0.49

Notes: This table reports analogous OLS and IV estimates of Table 3 but for each year since 1910. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table A.8: Total Frontier Experience and Contemporary Cooperation vs. Self-Reliance

frontier experience

Oster δ for β = 0 Number of Individuals Number of Counties Mean of Dependent Variable KP Wald Stat KP Underidentification Test p-value R2 Individual Demographic Controls State Fixed Effects Division Fixed Effects Geographic/Agroclimatic Controls

OLS (1)

OLS (2)

OLS (3)

OLS (4)

IV (5)

-0.019* (0.009)

-0.024** (0.009)

-0.038*** (0.014)

-0.020 (0.013)

-0.057 (0.041)

-2.79 567 48 0.549

-2.74 567 48 0.549

-20.45 567 48 0.549

22.49 567 48 0.549

0.01 Yes No No No

0.01 Yes No Yes No

0.02 Yes Yes Yes No

0.03 Yes No Yes Yes

567 48 0.549 8.4 0.014 0.01 Yes No Yes Yes

Notes: This table reports estimates for a dependent variable based on a proxy for individualism in the 1990 round of ANES, covering 567 individuals in 48 counties across 17 states in our sample. The measure asks individuals whether (1) “it is more important to be a cooperative person who works well with others”, or (2) “it is more important to be a self-reliant person able to take care of oneself.” The dependent variable equals one if they answer (1). We report the same set of specifications in columns 1–4 as in Table 3 to demonstrate the statistically and economically significant effect sizes despite the coverage limitations. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

61

Table A.9: Alternative Measures of Total Frontier Experience (TFE) and Unique Names Dependent Variable: Share of . . . with Names Outside Boys Girls Boys Girls Boys Girls Top 10 Top 100 Top 10 in Division in Division in County (1) (2) (3) (4) (5) (6) TFE: 100 km, ≤ 6/mi2 , all contour lines (baseline)

0.128*** (0.020)

0.194*** (0.023)

0.130*** (0.020)

0.178*** (0.019)

0.111*** (0.032)

0.054* (0.032)

TFE: 75 km, ≤ 6/mi2 , all contour lines

0.135*** (0.021)

0.202*** (0.023)

0.141*** (0.020)

0.189*** (0.019)

0.112*** (0.034)

0.052 (0.032)

TFE: 50 km, ≤ 6/mi2 , all contour lines

0.140*** (0.022)

0.214*** (0.025)

0.143*** (0.021)

0.183*** (0.021)

0.102*** (0.037)

0.040 (0.034)

TFE: 100 km, 2-6/mi2 , all contour lines

0.121*** (0.031)

0.177*** (0.041)

0.114*** (0.032)

0.154*** (0.034)

0.139** (0.060)

0.063 (0.057)

TFE: 75 km, 2-6/mi2 , all contour lines

0.111*** (0.032)

0.168*** (0.043)

0.107*** (0.033)

0.151*** (0.036)

0.152** (0.061)

0.071 (0.059)

TFE: 50 km, 2-6/mi2 , all contour lines

0.095*** (0.036)

0.161*** (0.047)

0.090** (0.037)

0.137*** (0.039)

0.140** (0.069)

0.077 (0.066)

TFE: 100 km, no density restriction, all contour lines

0.063*** (0.020)

0.078*** (0.024)

0.067*** (0.020)

0.092*** (0.020)

0.104*** (0.033)

0.063** (0.029)

TFE: 75 km, no density restriction, all contour lines

0.070*** (0.019)

0.091*** (0.022)

0.076*** (0.019)

0.100*** (0.019)

0.095*** (0.034)

0.060** (0.028)

TFE: 50 km, no density restriction, all contour lines

0.080*** (0.020)

0.116*** (0.024)

0.083*** (0.019)

0.102*** (0.020)

0.077** (0.035)

0.039 (0.031)

TFE: 100 km, ≤ 6/mi2 , main contour line

0.087*** (0.026)

0.120*** (0.033)

0.094*** (0.026)

0.123*** (0.026)

0.141*** (0.040)

0.099*** (0.034)

TFE: 75 km, ≤ 6/mi2 , main contour line

0.080*** (0.026)

0.108*** (0.033)

0.091*** (0.026)

0.122*** (0.028)

0.125*** (0.040)

0.089*** (0.032)

TFE: 50 km, ≤ 6/mi2 , main contour line

0.081*** (0.028)

0.116*** (0.034)

0.094*** (0.028)

0.119*** (0.030)

0.108** (0.045)

0.073** (0.035)

Notes: This table reports estimates of equation (5) for three measures of unique names for white children, age 0–10 in the 1940 Census. Each cell is a different regression based on the given dependent variable in the column and the measure of total frontier experience in the given row. The measures of frontier experience vary (i) the catchment area from 100 to 75 to 50 km from the contour lines, (ii) the density restriction from ≤ 6 people/mi2 to 2≥people/mi2 ≤ 6 to no restriction, and (iii) including all contour lines identified by the GIS algorithm or only the main, westernmost one. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

62

Table A.10: Alternative Measures of Total Frontier Experience (TFE) and Opposition to Government Intervention and Redistribution Prefers Cut Public Spending on Poor (ANES) (1)

Dependent Variable: Prefers Cut Prefers Reduce Public Spending Public Debt by on Welfare Spending Cuts (CCES) (CCES) (2) (3)

County Property Tax Rate, 2010 (ACS) (4)

TFE: 100 km, ≤ 6/mi2 , all contour lines (baseline)

0.012*** (0.004)

0.008*** (0.003)

0.013*** (0.002)

-0.032*** (0.006)

TFE: 75 km, ≤ 6/mi2 , all contour lines

0.016*** (0.005)

0.009*** (0.003)

0.015*** (0.003)

-0.034*** (0.006)

TFE: 50 km, ≤ 6/mi2 , all contour lines

0.016*** (0.006)

0.011*** (0.004)

0.016*** (0.003)

-0.035*** (0.007)

TFE: 100 km, 2-6/mi2 , all contour lines

0.028*** (0.007)

0.019*** (0.006)

0.029*** (0.005)

-0.016** (0.008)

TFE: 75 km, 2-6/mi2 , all contour lines

0.029*** (0.007)

0.017*** (0.006)

0.031*** (0.006)

-0.017** (0.008)

TFE: 50 km, 2-6/mi2 , all contour lines

0.057*** (0.015)

0.019*** (0.007)

0.029*** (0.006)

-0.014* (0.008)

TFE: 100 km, no density restriction, all contour lines

0.012** (0.005)

0.003 (0.003)

0.006*** (0.002)

-0.013** (0.007)

TFE: 75 km, no density restriction, all contour lines

0.014*** (0.005)

0.004 (0.003)

0.009*** (0.002)

-0.016** (0.006)

TFE: 50 km, no density restriction, all contour lines

0.017** (0.007)

0.004 (0.003)

0.007*** (0.003)

-0.020*** (0.006)

TFE: 100 km, ≤ 6/mi2 , main contour line

0.001 (0.008)

0.004 (0.004)

0.011*** (0.004)

-0.038*** (0.008)

TFE: 75 km, ≤ 6/mi2 , main contour line

0.011 (0.007)

0.006 (0.005)

0.014*** (0.005)

-0.038*** (0.008)

TFE: 50 km, ≤ 6/mi2 , main contour line

0.009 (0.008)

0.009 (0.006)

0.017*** (0.005)

-0.043*** (0.008)

Notes: This table reports estimates of equation (5) for the outcomes in Table 4. Each cell is a different regression based on the given dependent variable in the column and the measure of total frontier experience in the given row. The measures of frontier experience vary (i) the catchment area from 100 to 75 to 50 km from the contour lines, (ii) the density restriction from ≤ 6 people/mi2 to 2≥people/mi2 ≤ 6 to no restriction, and (iii) including all contour lines identified by the GIS algorithm or only the main, westernmost one. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

63

Table A.11: Alternative Measures of Total Frontier Experience (TFE) and Opposition to Government Intervention and Redistribution in GSS Dependent Variable: Government Index of Should Preferences for Redistribute Cutting Spending (normalized) (normalized) (1) (2) TFE: 100 km, ≤ 6/mi2 , all contour lines (baseline)

-0.025* (0.014)

0.046*** (0.015)

TFE: 75 km, ≤ 6/mi2 , all contour lines

-0.017 (0.015)

0.046*** (0.015)

TFE: 50 km, ≤ 6/mi2 , all contour lines

-0.021 (0.018)

0.055*** (0.018)

TFE: 100 km, 2-6/mi2 , all contour lines

-0.014 (0.032)

0.042 (0.032)

TFE: 75 km, 2-6/mi2 , all contour lines

0.019 (0.032)

0.030 (0.034)

TFE: 50 km, 2-6/mi2 , all contour lines

0.035 (0.032)

-0.009 (0.037)

TFE: 100 km, no density restriction, all contour lines

-0.036*** (0.012)

0.018 (0.014)

TFE: 75 km, no density restriction, all contour lines

-0.027** (0.012)

0.027* (0.015)

TFE: 50 km, no density restriction, all contour lines

-0.028* (0.016)

0.025 (0.018)

TFE: 100 km, ≤ 6/mi2 , main contour line

-0.049*** (0.018)

0.030 (0.021)

TFE: 75 km, ≤ 6/mi2 , main contour line

-0.034* (0.017)

0.027 (0.021)

TFE: 50 km, ≤ 6/mi2 , main contour line

-0.038* (0.022)

0.042 (0.025)

Notes: This table reports estimates of equation (5) for the GSS outcomes in Table 4. Each cell is a different regression based on the given dependent variable in the column and the measure of total frontier experience in the given row. The measures of frontier experience vary (i) the catchment area from 100 to 75 to 50 km from the contour lines, (ii) the density restriction from ≤ 6 people/mi2 to 2≥people/mi2 ≤ 6 to no restriction, and (iii) including all contour lines identified by the GIS algorithm or only the main, westernmost one. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

64

Table A.12: Alternative Measures of Total Frontier Experience (TFE) and the Rising Republican Vote Share 2000–16 (Avg.) (1)

Dependent Variable: Republican Vote Share in . . . Presidential Election 2000 2004 2008 2012 2016 ∆ ’16–’00 (2) (3) (4) (5) (6) (7)

∆ ’16–’12 (8)

TFE: 100 km, ≤ 6/mi2 , all contour lines (baseline)

2.047*** (0.320)

1.150*** (0.295)

1.491*** (0.308)

1.974*** (0.334)

2.386*** (0.356)

3.231*** (0.380)

2.080*** (0.254)

0.845*** (0.127)

TFE: 75 km, ≤ 6/mi2 , all contour lines

2.286*** (0.308)

1.392*** (0.287)

1.717*** (0.301)

2.184*** (0.323)

2.647*** (0.341)

3.493*** (0.368)

2.100*** (0.263)

0.846*** (0.130)

TFE: 50 km, ≤ 6/mi2 , all contour lines

2.099*** (0.335)

1.103*** (0.325)

1.452*** (0.334)

2.056*** (0.351)

2.517*** (0.369)

3.368*** (0.381)

2.265*** (0.271)

0.852*** (0.136)

TFE: 100 km, 2-6/mi2 , all contour lines

2.237*** (0.447)

1.741*** (0.430)

1.839*** (0.438)

2.197*** (0.475)

2.348*** (0.512)

3.059*** (0.545)

1.318*** (0.446)

0.711*** (0.170)

TFE: 75 km, 2-6/mi2 , all contour lines

2.292*** (0.455)

1.846*** (0.448)

1.943*** (0.449)

2.210*** (0.483)

2.369*** (0.520)

3.093*** (0.556)

1.246*** (0.466)

0.724*** (0.182)

TFE: 50 km, 2-6/mi2 , all contour lines

2.081*** (0.486)

1.633*** (0.478)

1.740*** (0.478)

2.024*** (0.513)

2.173*** (0.556)

2.832*** (0.586)

1.199** (0.477)

0.659*** (0.195)

TFE: 100 km, no density restriction, all contour lines

1.237*** (0.333)

0.829*** (0.305)

1.046*** (0.315)

1.219*** (0.347)

1.438*** (0.371)

1.654*** (0.377)

0.825*** (0.226)

0.216* (0.119)

TFE: 75 km, no density restriction, all contour lines

1.395*** (0.311)

0.988*** (0.284)

1.163*** (0.297)

1.339*** (0.326)

1.616*** (0.347)

1.868*** (0.357)

0.880*** (0.223)

0.253** (0.119)

TFE: 50 km, no density restriction, all contour lines

1.218*** (0.326)

0.640** (0.305)

0.827*** (0.319)

1.213*** (0.340)

1.517*** (0.360)

1.893*** (0.368)

1.252*** (0.232)

0.376*** (0.123)

TFE: 100 km, ≤ 6/mi2 , main contour line

1.867*** (0.437)

0.938** (0.390)

1.355*** (0.405)

1.731*** (0.458)

2.286*** (0.492)

3.023*** (0.529)

2.085*** (0.346)

0.737*** (0.173)

TFE: 75 km, ≤ 6/mi2 , main contour line

2.078*** (0.424)

1.127*** (0.374)

1.555*** (0.395)

1.934*** (0.443)

2.531*** (0.477)

3.244*** (0.532)

2.117*** (0.367)

0.714*** (0.180)

TFE: 50 km, ≤ 6/mi2 , main contour line

1.788*** (0.460)

0.732* (0.420)

1.208*** (0.436)

1.691*** (0.477)

2.250*** (0.517)

3.063*** (0.551)

2.331*** (0.366)

0.813*** (0.182)

Notes: This table reports estimates of equation (5) for the voting outcomes in Table 5. Each cell is a different regression based on the given dependent variable in the column and the measure of total frontier experience in the given row. The measures of frontier experience vary (i) the catchment area from 100 to 75 to 50 km from the contour lines, (ii) the density restriction from ≤ 6 people/mi2 to 2≥people/mi2 ≤ 6 to no restriction, and (iii) including all contour lines identified by the GIS algorithm or only the main, westernmost one. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

65

Table A.13: Alternative Measures of Total Frontier Experience (TFE) and Preferences Over Contentious, “Frontier-Relevant” Policy Issues

TFE: 100 km, ≤ 6/mi2 , all contour lines (baseline)

Prefers Repealing Affordable Care Act (1) 0.023*** (0.004)

Dependent Variable: Supports Increasing Supports Banning Minimum Wage Assault Rifles (2) (3) -0.023*** -0.017*** (0.007) (0.004)

Opposes EPA Regulation of CO2 Emissions (4) 0.016*** (0.004)

TFE: 75 km, ≤ 6/mi2 , all contour lines

0.028*** (0.004)

-0.024*** (0.009)

-0.020*** (0.004)

0.019*** (0.004)

TFE: 50 km, ≤ 6/mi2 , all contour lines

0.030*** (0.004)

-0.022** (0.009)

-0.021*** (0.005)

0.021*** (0.005)

TFE: 100 km, 2-6/mi2 , all contour lines

0.038*** (0.008)

-0.036** (0.014)

-0.036*** (0.008)

0.027*** (0.007)

TFE: 75 km, 2-6/mi2 , all contour lines

0.041*** (0.008)

-0.026* (0.015)

-0.036*** (0.008)

0.027*** (0.007)

TFE: 50 km, 2-6/mi2 , all contour lines

0.041*** (0.008)

-0.023 (0.017)

-0.037*** (0.009)

0.031*** (0.007)

TFE: 100 km, no density restriction, all contour lines

0.013*** (0.004)

-0.018*** (0.007)

-0.007* (0.004)

0.007** (0.003)

TFE: 75 km, no density restriction, all contour lines

0.017*** (0.004)

-0.015* (0.008)

-0.011*** (0.004)

0.012*** (0.004)

TFE: 50 km, no density restriction, all contour lines

0.018*** (0.004)

-0.010 (0.008)

-0.012*** (0.004)

0.008** (0.004)

TFE: 100 km, ≤ 6/mi2 , main contour line

0.021*** (0.006)

-0.020* (0.011)

-0.013** (0.005)

0.014*** (0.005)

TFE: 75 km, ≤ 6/mi2 , main contour line

0.024*** (0.007)

-0.022* (0.012)

-0.017*** (0.006)

0.018*** (0.005)

TFE: 50 km, ≤ 6/mi2 , main contour line

0.031*** (0.007)

-0.020 (0.013)

-0.022*** (0.006)

0.021*** (0.006)

Notes: This table reports estimates of equation (5) for the regulation policy view outcomes in Table 6. Each cell is a different regression based on the given dependent variable in the column and the measure of total frontier experience in the given row. The measures of frontier experience vary (i) the catchment area from 100 to 75 to 50 km from the contour lines, (ii) the density restriction from ≤ 6 people/mi2 to 2≥people/mi2 ≤ 6 to no restriction, and (iii) including all contour lines identified by the GIS algorithm or only the main, westernmost one. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

66

Table A.14: Robustness of Unique Names Results to Other County-Level (Bad) Controls (1)

Sample: White Children Aged 0-10, 1940 Census (2) (3) (4) (5)

(6)

Panel A: Share of Boys with Unique Names frontier experience

0.128*** (0.020)

population density 1890

0.121*** (0.020) -0.048** (0.023)

population density 2010

0.098*** (0.020) 0.009 (0.008) -0.055*** (0.008)

first year connected to rail

0.087*** (0.021) -0.038** (0.018)

-0.787*** (0.148) -1.764*** (0.454)

0.073*** (0.019) 0.011** (0.005) -0.042*** (0.008) 0.009*** (0.001) -0.685*** (0.136) -0.364 (0.468)

1.5 1,910 0.69

0.9 1,838 0.72

0.012*** (0.001)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Number of Counties R2

0.103*** (0.019) -0.037** (0.016)

2.3 2,035 0.63

2.0 2,035 0.64

1.5 2,035 0.66

1.3 1,960 0.67

Panel B: Share of Girls with Unique Names frontier experience

0.194*** (0.023)

population density 1890

0.188*** (0.023) -0.043* (0.023)

population density 2010

0.163*** (0.023) 0.017** (0.008) -0.058*** (0.010)

first year connected to rail

0.148*** (0.023) -0.029* (0.016)

-0.422** (0.177) -1.869*** (0.600)

0.135*** (0.021) 0.020*** (0.006) -0.044*** (0.010) 0.012*** (0.002) -0.274* (0.151) -0.498 (0.554)

3.1 1,910 0.49

2.0 1,838 0.54

0.015*** (0.002)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Number of Counties R2

0.174*** (0.022) -0.034* (0.017)

5.1 2,035 0.44

4.6 2,035 0.45

3.3 2,035 0.47

2.8 1,960 0.49

Notes: This table subjects the column 4 specification in Table 3 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

67

Table A.15: Robustness of TFE and Opposition to Redistribution (ANES) (1) frontier experience

0.010** (0.004)

population density 1890

Dependent Variable: Prefers Cutting Spending on Poor (2) (3) (4) (5) 0.010** (0.004) -0.002 (0.001)

population density 2010

0.011*** (0.004) -0.002*** (0.001) 0.001 (0.001)

first year connected to rail

0.011*** (0.004) -0.002* (0.001)

-0.060 (0.048) 0.033 (0.069)

0.012* (0.007) -0.003*** (0.001) 0.001 (0.001) -0.000 (0.001) -0.078 (0.065) -0.060 (0.072)

4.10 2,209 89 0.04

1.96 2,188 87 0.04

-0.000 (0.001)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Number of Individuals Number of Counties R2

0.011*** (0.004) -0.002* (0.001)

5.56 2,322 95 0.04

5.70 2,322 95 0.04

7.05 2,322 95 0.04

6.96 2,301 93 0.04

(6)

Notes: This table subjects the column 1 result in Table 4 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table A.16: Robustness of TFE and Opposition to Redistribution (CCES) (1) frontier experience

0.011*** (0.004)

population density 1890

Dependent Variable: Prefers State Decrease Welfare Spending (2) (3) (4) (5) 0.010*** (0.003) -0.004*** (0.002)

population density 2010

0.005 (0.003) -0.000 (0.001) -0.003*** (0.000)

first year connected to rail

0.008** (0.003) -0.004** (0.002)

-0.025 (0.034) -0.297*** (0.046)

0.001 (0.003) -0.002** (0.001) -0.001** (0.001) 0.000 (0.000) -0.013 (0.033) -0.233*** (0.058)

0.4 0.40 26,795 1,661 0.05

0.1 0.40 26,577 1,606 0.05

0.001* (0.000)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Mean of Dependent Variable Number of Individuals Number of Counties R2

0.002 (0.003) -0.003*** (0.001)

6.5 0.40 27,749 1,732 0.04

8.0 0.40 27,749 1,732 0.04

1.1 0.40 27,749 1,732 0.04

3.2 0.40 27,531 1,677 0.04

(6)

Notes: This table subjects the column 2 result in Table 4 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

68

Table A.17: Robustness of TFE and Opposition to Redistribution (GSS) Dependent Variable: Believes Gov’t Should Reduce Income Differences (1) (2) (3) (4) (5) (6) frontier experience, decadal

-0.022 (0.012)*

population density 1890

-0.024 (0.012)** -0.003 (0.002)*

population density 2010

-0.025 (0.012)** -0.002 (0.002) -0.000 (0.001)

first year connected to rail

-0.026 (0.011)** -0.003 (0.002)

0.163 (0.110) 0.038 (0.148)

-0.019 (0.013) -0.000 (0.001) -0.002 (0.001) 0.000 (0.001) 0.162 (0.115) 0.154 (0.175)

1.61 8,538 243.00 0.06

1.96 8,466 242.00 0.07

0.000 (0.001)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Number of Individuals Number of Counties R2

-0.016 (0.013) -0.003 (0.002)

6.29 9,085 255.00 0.06

7.70 9,085 255.00 0.06

6.94 9,085 255.00 0.06

9.18 9,013 254.00 0.06

Notes: This table subjects the column 3 result in Table 4 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table A.18: Robustness of TFE and Opposition to Government Intervention (CCES) (1) frontier experience

Dependent Variable: Prefers State Balance Budget by Cutting Spending (2) (3) (4) (5) (6)

0.013*** (0.002)

population density 1890

0.011*** (0.003) -0.005*** (0.002)

population density 2010

0.006*** (0.002) -0.001 (0.001) -0.003*** (0.000)

first year connected to rail

0.010*** (0.002) -0.005*** (0.002)

-0.081*** (0.023) -0.236*** (0.049)

0.003 (0.002) -0.001*** (0.000) -0.002*** (0.000) 0.001*** (0.000) -0.070*** (0.022) -0.133*** (0.042)

0.6 0.43 163,376 1,887 0.04

0.3 0.43 162,020 1,819 0.04

0.001*** (0.000)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Mean of Dependent Variable Number of Individuals Number of Counties R2

0.005** (0.002) -0.004*** (0.001)

2.4 0.43 169,536 2,000 0.04

2.6 0.43 169,536 2,000 0.04

0.8 0.43 169,536 2,000 0.04

1.5 0.43 168,176 1,931 0.04

Notes: This table subjects the column 4 result in Table 4 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

69

Table A.19: Robustness of TFE and Opposition to Government Intervention (GSS) (1) frontier experience, decadal

0.034 (0.010)***

population density 1890

Dependent Variable: Index of Preferences For Spending Cuts (2) (3) (4) (5) 0.036 (0.011)*** 0.002 (0.002)

population density 2010

0.030 (0.011)*** 0.005 (0.002)** -0.002 (0.001)*

first year connected to rail

0.035 (0.011)*** 0.003 (0.002)*

-0.254 (0.109)** -0.165 (0.150)

0.025 (0.013)* 0.003 (0.002) 0.000 (0.002) 0.002 (0.001)** -0.273 (0.109)** -0.070 (0.214)

0.87 5,420 241.00 0.07

0.95 5,382 240.00 0.07

0.002 (0.001)*

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Number of Individuals Number of Counties R2

0.023 (0.013)* 0.003 (0.002)*

2.60 5,739 253.00 0.07

2.70 5,739 253.00 0.07

1.85 5,739 253.00 0.07

2.62 5,701 252.00 0.07

(6)

Notes: This table subjects the column 5 result in Table 4 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table A.20: Robustness of TFE and Reported Property Tax Rates in 2010 frontier experience

(1)

(2)

(3)

(4)

(5)

(6)

-0.054*** (0.009)

-0.053*** (0.009) 0.011 (0.009)

-0.040*** (0.009) -0.020*** (0.004) 0.029*** (0.004)

-0.045*** (0.009) 0.008 (0.007)

-0.031*** (0.008) 0.006 (0.006)

0.252*** (0.055) 1.713*** (0.228)

-0.024*** (0.008) -0.015*** (0.003) 0.019*** (0.005) -0.003*** (0.000) 0.230*** (0.053) 1.152*** (0.229)

1.0 1.02 1,908 0.70

0.7 1.03 1,836 0.72

population density 1890 population density 2010 first year connected to rail

-0.004*** (0.001)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Mean of Dependent Variable Number of Counties R2

2.4 1.02 2,028 0.65

2.3 1.02 2,028 0.65

1.5 1.02 2,028 0.68

1.6 1.03 1,953 0.67

Notes: This table subjects the column 6 result in Table 4 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

70

Table A.21: Robustness of TFE and Average Republican Vote Share 2000–2016 frontier experience

(1)

(2)

(3)

(4)

(5)

(6)

2.047*** (0.320)

1.895*** (0.319) -1.039*** (0.384)

1.524*** (0.318) -0.125 (0.129) -0.877*** (0.122)

1.499*** (0.343) -0.946*** (0.332)

1.457*** (0.325) -0.851*** (0.259)

-7.050*** (2.317) -48.529*** (6.864)

1.148*** (0.334) -0.306** (0.136) -0.515*** (0.131) 0.082*** (0.022) -6.245*** (2.309) -29.706*** (7.147)

3.3 59.87 1,910 0.41

2.2 59.57 1,838 0.42

population density 1890 population density 2010 first year connected to rail

0.127*** (0.023)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Mean of Dependent Variable Number of Counties R2

12.3 60.04 2,035 0.34

8.4 60.04 2,035 0.36

4.3 60.04 2,035 0.40

4.5 59.77 1,960 0.36

Notes: This table subjects the column 1 result in Table 5 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table A.22: Robustness of TFE and ∆ Republican Vote Share 2000 to 2016 frontier experience

(1)

(2)

(3)

(4)

(5)

(6)

2.080*** (0.254)

1.989*** (0.252) -0.631* (0.324)

1.494*** (0.224) 0.586*** (0.136) -1.169*** (0.142)

1.582*** (0.229) -0.509** (0.256)

1.348*** (0.245) -0.432** (0.187)

-3.048* (1.807) -72.281*** (6.978)

0.972*** (0.203) 0.354*** (0.118) -0.730*** (0.149) 0.113*** (0.017) -0.572 (1.689) -48.893*** (6.065)

5.9 9.19 1,910 0.48

2.6 9.00 1,838 0.55

population density 1890 population density 2010 first year connected to rail

0.158*** (0.017)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Mean of Dependent Variable Number of Counties R2

54.1 8.87 2,035 0.36

22.6 8.87 2,035 0.37

5.6 8.87 2,035 0.49

5.6 8.69 1,960 0.40

Notes: This table subjects the column 1 result in Table 5 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

71

Table A.23: Robustness of TFE and Opposition to the Affordable Care Act (1) frontier experience

0.023*** (0.004)

population density 1890

Dependent Variable: Supports Repealing Affordable Care Act (2) (3) (4) (5) 0.021*** (0.004) -0.003** (0.002)

population density 2010

0.017*** (0.004) 0.001 (0.001) -0.003*** (0.001)

first year connected to rail

0.020*** (0.004) -0.002* (0.001)

-0.015 (0.036) -0.300*** (0.077)

0.012*** (0.004) 0.000 (0.001) -0.002** (0.001) 0.001*** (0.000) 0.004 (0.034) -0.195** (0.084)

1.4 0.53 28,370 1,644 0.06

1.1 0.53 28,151 1,590 0.06

0.001*** (0.000)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Mean of Dependent Variable Number of Individuals Number of Counties R2

0.015*** (0.004) -0.002* (0.001)

3.0 0.53 29,432 1,727 0.06

3.7 0.53 29,432 1,727 0.05

1.6 0.53 29,432 1,727 0.06

2.3 0.53 29,212 1,672 0.06

(6)

Notes: This table subjects the column 1 result in Table 6 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table A.24: Robustness of TFE and Opposition to Minimum Wage Increases (1) frontier experience

-0.023*** (0.007)

population density 1890

Dependent Variable: Supports an Increase in the Minimum Wage (2) (3) (4) (5) -0.020*** (0.007) 0.009*** (0.003)

population density 2010

-0.019** (0.008) 0.003 (0.004) 0.001 (0.001)

first year connected to rail

-0.021*** (0.008) 0.006* (0.003)

-0.039 (0.066) 0.128 (0.086)

-0.020** (0.008) 0.006 (0.004) -0.000 (0.001) -0.000 (0.001) -0.036 (0.066) 0.118 (0.110)

2.0 0.69 4,959 1,030 0.06

2.1 0.69 4,905 1,004 0.06

-0.000 (0.001)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Mean of Dependent Variable Number of Individuals Number of Counties R2

-0.019** (0.008) 0.006 (0.004)

3.2 0.69 5,134 1,066 0.06

4.0 0.69 5,134 1,066 0.05

2.1 0.69 5,134 1,066 0.06

2.7 0.69 5,080 1,040 0.06

(6)

Notes: This table subjects the column 2 result in Table 6 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

72

Table A.25: Robustness of TFE and Opposition to Banning Assault Rifles (1) frontier experience

-0.017*** (0.004)

population density 1890

Dependent Variable: Supports Banning Assault Rifles (2) (3) (4) (5) -0.014*** (0.004) 0.005** (0.002)

population density 2010

-0.009** (0.004) 0.000 (0.001) 0.003*** (0.001)

first year connected to rail

-0.012*** (0.004) 0.004** (0.002)

0.042 (0.033) 0.289*** (0.074)

-0.005 (0.004) 0.001 (0.001) 0.002** (0.001) -0.001*** (0.000) 0.017 (0.033) 0.188** (0.079)

0.7 0.63 28,331 1,641 0.09

0.4 0.63 28,114 1,588 0.09

-0.002*** (0.000)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Mean of Dependent Variable Number of Individuals Number of Counties R2

-0.007** (0.004) 0.003*** (0.001)

2.7 0.63 29,390 1,722 0.09

2.9 0.63 29,390 1,722 0.09

1.1 0.63 29,390 1,722 0.09

1.7 0.63 29,173 1,669 0.09

(6)

Notes: This table subjects the column 3 result in Table 6 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

Table A.26: Robustness of TFE and Opposition to EPA Regulation of Pollution (1) frontier experience

0.016*** (0.004)

population density 1890

Dependent Variable: Opposes EPA Regulation of CO2 (2) (3) (4) (5) 0.015*** (0.004) -0.003*** (0.001)

population density 2010

0.010*** (0.004) 0.001 (0.001) -0.002*** (0.001)

first year connected to rail

0.012*** (0.003) -0.002*** (0.001)

-0.041 (0.029) -0.205*** (0.058)

0.007** (0.003) 0.000 (0.001) -0.001** (0.001) 0.001*** (0.000) -0.023 (0.029) -0.111* (0.062)

0.9 0.32 28,146 1,634 0.08

0.6 0.31 27,929 1,581 0.08

0.001*** (0.000)

birthplace fractionalization, 1870 birthplace fractionalization, 2000

Oster δ for β = 0 Mean of Dependent Variable Number of Individuals Number of Counties R2

0.010*** (0.003) -0.002** (0.001)

2.2 0.32 29,203 1,717 0.08

2.9 0.32 29,203 1,717 0.08

1.1 0.32 29,203 1,717 0.08

1.5 0.32 28,985 1,663 0.08

(6)

Notes: This table subjects the column 4 result in Table 6 to additional, non-predetermined controls for population density, birthplace diversity, and the first year of connection to the railroad as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

73

Table A.27: TFE and Opposition to Redistribution Beyond Party and Class Lines Dependent Variable:

frontier experience

Prefers Cutting Public Spending on Welfare (1) (2) (3) 0.011*** (0.004)

0.008** (0.003) 0.302*** (0.009)

0.008** (0.004) 0.304*** (0.010) 0.101*** (0.008) -0.001 (0.007)

0.013*** (0.002)

0.009*** (0.002) 0.390*** (0.006)

0.009*** (0.002) 0.379*** (0.006) 0.048*** (0.005) -0.006 (0.005)

6.5 0.40 27,749 1,732 0.04

2.8 0.40 27,724 1,731 0.09

4.1 0.39 24,955 1,693 0.10

2.4 0.43 169,536 2,000 0.04

1.7 0.40 94,485 1,949 0.11

1.8 0.38 80,113 1,929 0.11

identifies as very strong Republican family income > USD 50,000 education > high school

Oster δ for β = 0 Mean of Dependent Variable Number of Individuals Number of Counties R2

Prefers Balancing Budget By Cutting Spending (4) (5) (6)

Notes: This table subjects the columns 2 and 4 results in Table 4 to additional, non-predetermined controls for education and family income as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

74

Table A.28: TFE and Opposition to Regulation Beyond Party and Class Lines Dependent Variable:

frontier experience

Prefers Repealing Affordable Care Act (1) (2) (3) 0.023*** (0.004)

identifies as very strong Republican

0.021*** (0.003) 0.409*** (0.008)

family income > USD 50,000 education > high school

Oster δ for β = 0 Mean of Dependent Variable Number of Individuals Number of Counties R2

3.0 0.53 29,432 1,727 0.06

2.8 0.53 29,373 1,727 0.14

Supports Increasing Minimum Wage (4) (5) (6)

0.021*** (0.003) 0.415*** (0.010) -0.019*** (0.007) -0.080*** (0.006)

-0.023*** (0.007)

2.3 0.53 26,119 1,697 0.14

3.2 0.69 5,134 1,066 0.06

-0.015** (0.007) -0.455*** (0.017)

-0.017** (0.007) -0.457*** (0.017)

Supports Banning Assault Rifles (7) (8) (9) -0.017*** (0.004)

-0.016*** (0.004) -0.277*** (0.008)

-0.017*** (0.004) -0.283*** (0.008) 0.004 (0.005) -0.015** (0.007)

0.016*** (0.004)

0.014*** (0.003) 0.338*** (0.008)

0.016*** (0.003) 0.338*** (0.009) 0.019*** (0.006) 0.018*** (0.006)

2.7 0.63 29,390 1,722 0.09

2.4 0.63 29,328 1,722 0.13

2.4 0.64 26,081 1,694 0.13

2.2 0.32 29,203 1,717 0.08

1.9 0.32 29,144 1,717 0.14

2.1 0.31 25,928 1,685 0.14

-0.077*** (0.014) 1.9 0.70 4,618 1,007 0.21

2.3 0.70 4,618 1,007 0.21

Opposes EPA Regulation of CO2 (10) (11) (12)

75

Notes: This table subjects the results in Table 6 to additional, non-predetermined controls for education and family income as described in Section 6.5. Standard errors are clustered based on the arbitrary grid-cell approach of Bester et al. (2011) as detailed in Section 6.1. Significance levels: ∗ : 10% ∗∗ : 5% ∗ ∗ ∗ : 1%.

B B.1

Data Sources and Construction Definitions and Sources

Construction of Frontier Lines Population density. Population/area. Digitized U.S. Census data on population for every decade in 1790– 2010, from Minnesota Population Center (2016). The county level population data along with other pre-2010 data are harmonized to the 2010 county boundaries using the procedure discussed in Section 3.1. The data on area is calculated using the 2010 county shapefiles from NHGIS (Minnesota Population Center, 2011) using GIS software. Section B.2 provides a detailed step-by-step discussion of this process. Outcome Variables : Demographic Characteristics of the Frontier Sex Ratio. Whites males/white females. The data is available for every decade in 1790-1860 and 1890. Data source: (Minnesota Population Center, 2011). Prime Age Adult Share. Whites aged 15–49/all whites. The data used is consistently available for every decade in 1830-1860. Data source: (Minnesota Population Center, 2011). Fertility. Our proxy for fertility is the number of whites aged 0–9 over the number of white females aged 15-39. The variable used is consistently available for every decade in 1830-1850. Data source: (Minnesota Population Center, 2011). Illiteracy. Illiterate whites aged above 20/whites aged over 20. The variable is available consistently for 1830-1860. Data source: (Minnesota Population Center, 2011). Foreign Born Share. Foreign born/population. The variable used is available for every decade in 1820-1890 (excluding 1840). Data source: (Minnesota Population Center, 2011). Out of State Born Share. Out-of-state born/population. The variable is consistently available for every decade in 1850-1880. Data source: (Minnesota Population Center, 2011). Land inequality. Gini index using distribution of farm sizes, based on county level data on the number of farms of sizes 0–10, 10–19, 20–49, 50–99, 100–499, 500–1000, and above 1000 acres. Available for every decade in 1860-1890. (Minnesota Population Center, 2011). Unique Children Names. White Children Aged 0–10 with Non-Top 10 First Names in Division/White Children Aged 0-10. We also construct similar variables further restricting to children aged 0–10 with native parents, and native grandparents. In addition, for the same sample, we construct additional variables by calculating the popularity of names at the national level instead of the Census division. We use the following procedure to generate the name shares: start by restricting the sample as desired (e.g. white children aged 0-10 with native parents), then calculate the number of children in the county for each given name, then using that value identify the top 10 given names within the census division (or nationally), and then accordingly count the number of children in that county with the identified top 10 names in their corresponding census division. The variables restricting to white children aged 0-10 is available for every decade in 1850–1940 (excluding 1890), with further native-parent restriction for 1850 and 1880-1940 (excluding 1890), and with grandparent restriction for 1880–1940 (excluding 1890). To give some examples, in 1850 the top 10 boy names nationally in descending order of popularity were John, William, James, George, Charles, Henry, Thomas, Joseph, Samuel and David. Meanwhile, a 76

random sample of less common names (outside the top 25) includes ones like Alfred, Nathan, Patrick, Reuben, Herbert, Matthew, Thaddeus and Luke. For girls, the top 10 include Mary, Sarah, Elizabeth, Martha, Margaret, Nancy, Ann, Susan, Jane, and Catherine while less common names (outside the top 25) include ones like Rachel, Susannah, Nina, Olive, Charlotte, Lucinda, and Roxanna. By 1880, the rankings shifted only slightly for boys with Samuel falling outside the top 10 and Harry entering. For girls, the changes were a bit more dramatic with the new top 10 list being Mary, Sarah, Emma, Ida, Minnie, Anna, Annie, Martha, Cora, and Alice. Data source: The NAPP full count census data for 1850 and the Ancestry data collected by NBER for 1860–1940. Economic Status. We measure economics status using either the socioeconomic index (seius) or the occupational score (ocscorus) measures provided by the North Atlantic Population Project: Complete Count Microdata. Both measures range from 0 to 100, and capture the income returns associated with specific occupations in the 1950 Census while the seius measure additionally captures notions of prestige as well as educational attainment. Data source: (Minnesota Population Center, 2011). Outcome Variables : Long-Run Outcomes Prefers Cutting Public Spending On Poor. The Prefers Cutting Public Spending On Poor is an indicator variable based on the following survey question: ”Should federal spending be increased, decreased, or kept about the same on poor people?” The variable takes a value of 1 if the respondent answered ”decreased” and 0 otherwise, and it is available for 1992 and 1996. Data source: The American National Election Studies Time Series (The American National Election Studies, n.d.) Cumulative Data (2012). The ANES is a large, nationally-representative survey of the American electorate in the United States taken during the presidential and midterm election years. See Appendix Figure B.1 for the map of the maximum survey coverage in the final sample of ANES data merged with the frontier related data. Prefers State Decrease Welfare Spending. This is an indicator variable based on the following survey question: “State legislatures must make choices when making spending decisions on important state programs. Would you like your legislature to increase or decrease spending on Welfare? 1. Greatly Increase 2. Slightly Increase 3. Maintain 4. Slightly Decrease 5. Greatly Decrease.” Prefers Cut Public Spending on Welfare takes a value of 1 if the respondent answered ”Slightly Decrease” or ”Greatly Decrease” and 0 otherwise. The data is available in the 2014 and 2016 waves. Data source: Cooperative Congressional Election Study (Ansolabehere, 2017) Common Content surveys. The CCES was formed in 2006, through the cooperation of several academic institutions, to study how congressional elections, representation and voters’ behavior and experiences vary with political geography and social context using very large scale national surveys. The 2014 and 2016 CCES surveys were conducted over the Internet by YouGov using a matched random sample methodology. The Common Content portion of the survey, which contains our variables of interest, surveyed 56,200 adults in 2014 and 64,600 adults in 2016. See Appendix Figure B.2 for the map of the maximum survey coverage in the final sample of CCES data merged with frontier related data. Believes Government Should Redistribute. Based on the following survey question: ”Some people think that the government in Washington ought to reduce the income differences between the rich and the poor, perhaps by raising the taxes of wealthy families or by giving income assistance to the poor. Others think that the government should not concern itself with reducing this income difference between the rich and the poor. Here is a card with a scale from 1 to 7.” We have recoded the variable so that it is increasing in preference for redistribution, where a score of 1 means that the government should not concern itself with reducing income differences and a score of 7 means the government ought to reduce the income differences between rich and poor. The Believes Government Should Redistribute is a normalized version of the above variable, and it is available in our sample for 1993 and all even years between 1994-2016. Data 77

source: The General Social Survey (Smith et al., 2015). The GSS is a repeated cross-sectional survey of a nationally representative sample of non-institutionalized adults who speak either English or Spanish. The surveys has been conducted since 1972, almost every year between 1972-1993 and biennial since 1994. While the sample size for the annual surveys was 1500, since 1994 the GSS administers the surveys to two samples in even-numbered years, each with a target sample size of 1500. The surveys provide detailed questionnaires on issues such as national spending priorities, intergroup relations, and confidence in institutions. See Appendix Figure B.3 for the map of the maximum survey coverage in the final sample of CCES merged with frontier related data.

Prefers Reducing Debt by Cutting Spending. The variable is based on the CCES survey question: “The federal budget deficit is approximately [$ year specific amount] this year. If the Congress were to balance the budget it would have to consider cutting defense spending, cutting domestic spending (such as Medicare and Social Security), or raising taxes to cover the deficit. Please rank the options below from what would you most prefer that Congress do to what you would least prefer they do: Cut Defense Spending; Cut Domestic Spending; Raise Taxes.”. While this question varies slightly from year to year, the underlying theme is the same. The Prefers Reducing Debt by Cutting Spending variable takes a value of 1 if the respondent chose ”Cut Domestic Spending” as a first priority. The data is available for 2006-2014 (excluding 2013). Data source: (Ansolabehere, 2017). Index of Preferences for Spending Cuts. The index is the principal component of nine dummy variables that take the value of 1 if the respondents answers ”too much” to the following questions: “We are faced with many problems in this country, none of which can be solved easily or inexpensively. I’m going to name some of these problems, and for each one I’d like you to name some of these problems, and for each one I’d like you to tell me whether you think we’re spending too much money on it, too little money, or about the right amount. First (READ ITEM A) . . . are we spending too much, too little, or about the right amount on (ITEM)?”. The items considered are improving and protecting the environment, improving healthcare, solving big city problems, halting increasing crimes, dealing with drug addictions, improving the education system, improving conditions for blacks, military spending, foreign aid, welfare, and roads. The variable is available in our sample for 1993 and all even years between 1994-2016. Data source: (Smith et al., 2015). County Property Tax Rate. The average effective property tax rates per $100 of value, calculated at the county level as the ratio of the average real estate tax over the average house value. Data source: The data is obtained from the National Association of Home Builders, which calculated the average effective property tax rates based on the 2010-2014 American Community Survey (ACS) data from the Census Bureau. Republican Vote Share in Presidential Elections. Votes for a GOP candidate/total votes, at the county level. For simplicity, we only consider the five presidential elections since 2000. Data source: Dave Leip’s Atlas of U.S. Presidential Elections (2017). Prefers Repealing Affordable Care Act. Based on the CCES survey question: “The Affordable Health Care Act was passed into law in 2010. It does the following: Requires all Americans to obtain health insurance, Prevents insurance companies from denying coverage for pre-existing condition, Allows people to keep current health insurance and care provider, and Sets up national health insurance option for those without coverage, but allows states the option to implement their own insurance system. Would you have voted for the Affordable Care Act if you were in Congress in 2010?” The Prefers Repealing Affordable Care Act variable takes a value of 1 if the respondent answers ”Yes” and 0 if the answer is ”No”. The data is available for 2014. Data source: (Ansolabehere, 2017).

78

Supports Increasing Minimum Wage. Based on the survey question:“As you may know, the federal minimum wage is currently $5.15 an hour. Do you favor or oppose raising the minimum wage to $7.25 an hour over the next two years, or not?”. The variable Supports Increasing Minimum Wage takes a value of 1 if the respondent choses ”favor” and 0 otherwise. Available in 2007. Data source: (Ansolabehere, 2017). Supports Banning Assault Rifles. Based on the CCES survey question: “On the issue of gun regulation, are you for or against for each of the following proposal? proposal: banning assault rifles”. Supports Banning Assault Rifles takes value 1 if the respondent supports banning assault rifles and 0 otherwise. Available for 2014. Data source: (Ansolabehere, 2017). Opposes EPA Regulations of CO2 Emissions. Based on the CCES survey question “Do you support or oppose each of the following proposals? proposal: Environmental Protection Agency regulating Carbon Dioxide emissions.” The Opposes EPA Regulations of CO2 Emissions takes one if the respondent supports the proposal and 0 the respondent opposes. Available for 2014. Data source: (Ansolabehere, 2017). Cooperation vs. Self-Reliance. Based on the survey question: “I am going to ask you to choose which of two statements I read comes closer to your own opinion. You might agree to some extent with both, but we want to know which one is closer to your views: ONE, it is more important to be a cooperative person who works well with others; or TWO, it is more important to be a self-reliant person able to take care of oneself”. The Cooperation vs. Self-Reliance variable takes a value of 1 if the respondent chooses ”cooperative” and 0 otherwise. Available in 1990. Data source: (The American National Election Studies, n.d.) Identifies As A Strong Republican. An indicator variable that takes 1 if the respondent identifies as a ”Strong Republican.” Available for 2007, 2012, 2014 and 2016. Data source: (Ansolabehere, 2017). Geographic and Agroclimatic Controls Land productivity measures. Average of attainable yields for alfalfa, barley, buckwheat, cane sugar, carrot, cabbage, cotton, ax, maize, oats, onion, pasture grasses, pasture legumes, potato, pulses, rice, rye, sorghum, sweet potato, tobacco, tomato, and wheat. We normalize each product’s values dividing it by the maximum value for that product in the sample. Measures of attainable yields were constructed by the FAO’s Global Agro-Ecological Zones project v3.0 (IIASA/FAO, 2012) using climatic data, including precipitation, temperature, wind speed, sunshine hours and relative humidity (based on which they determine thermal and moisture regimes), together with crop-specific measures of cycle length (i.e. days from sowing to harvest), thermal suitability, water requirements, and growth and development parameters (harvest index, maximum leaf area index, maximum rate of photosynthesis, etc). Combining these data, the GAEZ model determines the maxi- mum attainable yield (measured in tons per hectare per year) for each crop in each grid cell of 0.083×0.083 degrees. We use FAO’s measures of agroclimatic yields (based solely on climate, not on soil conditions) for intermediate levels of inputs/technology and rain-fed conditions. Area. The log of surface area in square miles, calculated using the 2010 county shapefiles from NHGIS (Minnesota Population Center, 2011) using GIS software. Temperature. County-level mean annual temperature measured in Celsius degrees. (IIASA/FAO, 2012).

Data source:

Rainfall. County-level average annual precipitation measured in mm. Data source: (IIASA/FAO, 2012). Elevation. County-level average terrain elevation in km. Data source: (IIASA/FAO, 2012). 79

Latitude. Absolute latitudinal distance from the equator in decimal degrees, calculated from the centroid of each county using GIS software and county shapefiles from NHGIS (Minnesota Population Center, 2011). Longitude. Absolute longitudinal distance from the Greenwich Meridian in decimal degrees, calculated from the centroid of each county using GIS software and county shapefiles from NHGIS (Minnesota Population Center, 2011). Distance to the coastline, rivers, and lakes. Minimum distance to a point in the coastline, rivers, and lakes in km, calculated from the centroid of each county using GIS software and county shapefiles from NHGIS. Data source: (Minnesota Population Center, 2011) Additional Variables Annual Migration Inflow. Total number of migrants entering the United States every year. The data is only available for 1820-1940. Due to lack of data for earlier periods, we impute an annual migration inflow equal to 7500 for every year from 1790-1819. Data source: Migration Policy Institute (2016). First Year Connected to Rail. The first year when at least one railroad intersects a county. Data source: Atack et al. (2010). P Birthplace Fractionalization. We take 1 − o (birthplaceoc /populationc )2 , which is simply 1 minus the Herfindahl concentration index for origin o birthplace diversity in county c. Birthplaces are defined as either a given state within the U.S. or a given country or country grouping abroad. Data used is for 1870 and 2000. Data source: The 1870 data uses the NAPP full count census data while the 2000 data based on county level census data from the NHGIS (Minnesota Population Center, 2011).

B.2

Construction of the Frontier Lines

This section provides a step-by-step description of how we construct the frontier lines for each year between 1790–1890. 1. Calculate county level population density per square mile for each year in 1790–1890 using the 2010 county boundaries. First, we harmonize the county-level population data from each year to the 2010 county boundaries using the procedure discussed in Section 3. For intercensal years, we interpolate county-level population by assuming a constant annual population growth rate that matches the decadal growth rate (replacing initial zeros with 0.01 to avoid infinite growth rates). Then, using the 2010 county boundaries shapefile, we calculate the county-level population density as the ratio of population over county area in square miles. 2. Using GIS software and the 2010 county boundaries, draw a contour line at population density equal to 2 people per square mile for each year. We use tools available in ArcGIS. First, convert the polygon containing the county level population density data into a raster file usingPolygonToRaster tool and set population density as the ”value field” for the conversion. Then, using the ContourList tool, select the raster file created in the preceding step as an input and set the ”contour value” to “2” to create contour lines at population density equal to 2. The resulting lines delineate the counties that have a population density below 2 people per square mile from those counties that have a population density above 2.

80

3. Clean the above contour lines to retain only the significant frontier lines. The contour line produced in the above step some creates “frontier lines” that are isolated pockets of dense populations beyond the main frontier line and pockets of sparse population inside that line. Neither of these types of pockets closely correspond to the historical notion of the frontier as “margins of civilization” (see Section 3). Furthermore, there are frontier lines that emerge from the West Coast during the latter part of the 19th century. Since the process of settlement on the West Coast was marked by the Gold Rush and distinct historical forces that are beyond the scope of the original Frontier thesis, we discard all the lines emerging from the west. In order to retain the lines that correspond to the main frontier lines, for each year, we take the contour lines produced in Step 2 and keep only those that are at least 500 km long and that are east of the main 1890 frontier line. In ArcGIS, the above can accomplished using SelectLayerByAttribute tool to select the line segments based on length and location (e.g., X centroid) and applying CopyFeatures to keep only the selected lines. We consider alternative cutoffs as well as a robustness check discussed in Section 6.5 that only considers total frontier experience defined over the longest, westernmost frontier line.

B.3

Maps of Survey Coverage In the Final Sample for ANES, CCES, and GSS Surveys

Figure B.1: Data Availability for the ANES

Notes: This figure provides the geographical distribution of the maximum number of counties available in our final sample matched with the ANES data. Note that not all the counties in the above map are included in every the regression using the ANES data.

81

Figure B.2: Data Availability for the CCES

Notes: This figure provides the geographical distribution of the maximum number of counties available in our final sample matched with the CCES data. Note that not all the counties in the above map are included in every regression using the CCES data.

Figure B.3: Data Availability for the GSS

Notes: This figure provides the geographical distribution of the maximum number of counties available of the GSS data after matching with county identifiers. Note that not all the counties in the above map are included in every the regression using the GSS data.

82

Frontier Culture

business, and to build strong social programs that provide assistance to the most .... The process of frontier settlement advanced continuously throughout the 1800s. .... of historical frontier experience at the local level, we calculate the number.

6MB Sizes 3 Downloads 222 Views

Recommend Documents

The Land Frontier and the Sea Frontier
As events turned out, this tiny naval force never made a real dent in Britain's thunderous fleets. Its chief contribution was in destroying British merchant shipping and thus carrying the war into the waters around the. British Isles. More numerous a

Frontier Culture: The Roots and Persistence of “Rugged ...
Rapid westward expansion marked the early history of the United States. ... We measure total frontier experience as the time spent on the frontier between.

Frontier experiments: Tough science- Five ... - Academic Journals
Oct 25, 2013 - electron, which causes ripples in the surrounding electric and magnetic ... same way that the meter is now defined in terms of the speed of light ...

EFF etc amicus brief.pdf - Electronic Frontier Foundation
Small businesses, individual entrepreneurs ..... and Other Computer Program-Related. Inventions .... technological methods such as those in the business and.

Northeast Frontier Railway Recruitment [email protected] ...
Page 1 of 3. DATE OF INTERVIEW. TIME. VENUE. NUMBER OF POST. NAME OF THE STATIO. NORTHEAST FRONTIER RAILWAY. PRESS NOTIFICATION. WALK- IN-INTERVIEW FOR THE POST OF MEDICAL PRACTITIONERS ON FULL TIME CONTRACT BASIS IN. LUMDING DIVISION OF NORTHEAST FR

EFF etc amicus brief.pdf - Electronic Frontier Foundation
(800) 274-3321 • (800) 359-6859. BERNARD L. ... Small businesses, individual entrepreneurs, and ...... increase in the number of business and service method.

Frontier One-Pager 2016.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Frontier ...

Frontier-Road-Ethan-Payne-Novels.pdf
eBook PDF Frontier Road (Ethan Payne Novels) ... and times of a young man coming of age in a harsh world of unlimited possibility and swift, savage death.

alaska the last frontier:.pdf
Loading… Page 1. Whoops! There was a problem loading more pages. alaska the last frontier:.pdf. alaska the last frontier:.pdf. Open. Extract. Open with. Sign In.

man-50\nissan-frontier-service-manual-98.pdf
man-50\nissan-frontier-service-manual-98.pdf. man-50\nissan-frontier-service-manual-98.pdf. Open. Extract. Open with. Sign In. Main menu.

Watch North West Frontier (1959) Full Movie Online Free ...
Watch North West Frontier (1959) Full Movie Online Free .MP4__________.pdf. Watch North West Frontier (1959) Full Movie Online Free .MP4__________.pdf.