A Note on Heterogeneity and Aggregation Using Data on Agricultural Productivity∗ Samuel Bazzi† University of California, San Diego October 2012

Abstract: Using data from Indonesia, I show that both household- and village-level rice output per hectare follow a skew log Laplace (also, double Pareto) distribution. Exploiting this fact, I derive an estimate of the Cobb-Douglas production function parameter for land, which is similar to those in existing literature based on richer models with detailed input data. I show that the close correspondence between household- and village-level estimates is a systematic feature of the power law properties of landholdings and productivity.

Suppose that rice paddy output Yiv for household i in village v of Indonesia is given by θ Yiv = F (Kiv , Riv ) = Kiv (ξiv Riv )β ,

(1)

where Kiv is a composite of other inputs including public capital, rainfall, and labor; ξiv is landaugmenting technology with an unknown distribution; and Riv is household landholdings in hectares. Suppose also that constant returns to scale hold and that landholdings Riv above R = 0.1 Ha follow a Pareto distribution with shape parameter λv (see Bazzi, 2012). Suppose additionally that household landholdings Riv and output Yiv are unobservable but that data on the average product of P P land, AP Rv = ( i Yiv / i Riv ) ≡ (Yv /Rv ), are available for every village (i.e., we observe total village output Yv and total area planted Rv ). Several distributional properties of AP Rv can be established in relation to unobservable householdlevel output per hectare Yiv /Riv . First, note that under certain conditions, Yiv follows a Pareto disθ (ξ R)β . This presumes that tribution with shape parameter ρv = λv /β and lower bound Y = Kiv iv

although ξiv (and Kiv ) may also obey a power law distribution such as the Pareto, its shape param-

∗ Some of the results in this brief note are referenced in my job market paper, “Wealth Heterogeneity, Income Shocks, and International Migration: Theory and Evidence from Indonesia.” Their applicability beyond the topics addressed therein warranted the present, very brief draft (rather than solely an addendum in the online appendix to the job market paper). I acknowledge financial support from the Center on Emerging and Pacific Economies at UC, San Diego. Any errors are of course my own. † Dept. of Economics, UCSD, 9500 Gilman Dr. # 0508, San Diego, CA 92093-0508; [email protected]

1

eter ωv is larger than λv (i.e., exhibits less dispersion than landholdings).1 Since β < 1, it follows that ρv > λv . Moreover, so long as Yiv is i.i.d. across households within v, then the dispersion of P P Yv ≡ i Yiv also equals ρv , and the same holds true for Rv = i Riv , which has shape parameter λv . Gabaix (2009) refers to these relationships as inheritance properties of power law distributions. As the ratio of two Pareto random variables, AP Rv (and AP Riv ) follows a skew log Laplace ´ distribution (also known as double Pareto, see, Kozubowski and Podgorski, 2003; Reed, 2001)  f (AP Rv ) = (R/Y )

ρλ ρ+λ



 [(R/Y ) AP R ]λ−1 , v × [(R/Y ) AP R ]−ρ−1 , v

0 < AP Rv < Y /R AP Rv ≥ Y /R.

This distribution fits the empirical data quite well as depicted in Figure 2. The top figure plots the distribution of AP Riv using nationally representative household-level data from Susenas 2004. The bottom figure plots the distribution of AP Rv using the village-level data from the Village Potential Survey (or Podes) in 2002. The inheritance property noted above explains why in Figure 2 the interhousehold distribution of (log) AP Riv for rice paddy output based on Susenas looks so similar to the inter-village distribution of (log) AP Rv . In the tails of the empirical AP R distributions, the skew log Laplace distribution provides a better fit than the symmetric log Laplace, which can be attributed to (i) the Pareto being more appropriate than the exponential distribution for Yiv and Riv , and (ii) the shape parameter λ 6= ρ. Using AP Rv data from Podes 2002, I obtain maximum likelihood estimates b = 1.35 and ρb = 2.04. Meanwhile, using AP Riv data for a sample of households in Susenas 2004, I λ b = 1.35 and ρb = 2.44 across all households. These estimates of landholdings dispersion λ are find λ e = 1.33 that I obtain when applying the Gabaix and remarkably close to an independent estimate λ Ibragimov (2011) OLS estimator to data on land area devoted to rice cultivation in 2002 for every household in Indonesia from the 2003 Agricultural Census (see Bazzi, 2012). The close correspondence between estimates of λ at various levels of aggregation arises in part because the local variation in productivity stems primarily from land size rather than other components of productivity. As I demonstrate in Table 1, the unconditional variation in household productivity is twice as large between villages than among households within a single village. The analysis compares household-level and village-level variation in agricultural productivity based on the households in the Susenas data. The predominance of the between village variation can be explained by (i) differences in soil quality and climate across large regions comprising many villages (Geertz, 1963), and (ii) the fact that irrigation is largely a local public good.2 Within-village variation in soil qualities and access to fertilizers cannot explain the patterns observed in Table 1.

1

That output does not increase one for one with area planted in Indonesian data (see Figure 1) suggests this condition should hold in practice. Matters would be less straightforward in the even that ξiv (and Kiv ) follow some non-power law distribution that does not mix well with the Pareto. However, my focus here is on observable data and what we can learn about other parameters of interest in the absence of richer information on agricultural inputs. 2 In Podes 2008, for example, over 85% of villages report either universal or no technical irrigation; only 15% of villages have some intermediate level.

2

Since ρ = λ/β under the assumed production function in equation (1), the estimates based on aggregate AP Rv data imply that βb ≈ 0.66 while estimates based on household-level AP Riv imply that βb ≈ 0.55, both of which fall between the Cobb-Douglas production function parameters of 0.34 and 0.69 for all cropland estimated, respectively, by Fuglie (2010b) and Mundlak et al. (2004) using detailed data on agricultural input and output at the national level from 1961-1998/2000.3 The landholdings distribution is a key input to understanding the relationship between spatial and idiosyncratic variation in agricultural productivity. These relationships have strong theoretical foundations in the power law properties of landholdings and agricultural output. I shed new light on these foundations by documenting the similarity between estimates of λ (and β) at different levels of aggregation, for distinct samples, and under different assumptions on the production function. Overall, the results in this brief note suggest that in many agricultural contexts one need not observe individual plot-level output or technology in order to identify important moments in the distribution of agricultural income.

References Bazzi, S., “Wealth Heterogeneity, Income Shocks, and International Migration: Theory and Evidence from Indonesia,” Unpublished manuscript, 2012. Fuglie, K. O., “Productivity growth in Indonesian agriculture, 1961-2000,” Bulletin of Indonesian Economic Studies, 2010b, 40, 209–225. Gabaix, X., “Power Laws in Economics and Finance,” Annual Review of Economics, 2009, 1, 255–293. and R. Ibragimov, “Rank-1/2: A Simple Way to Improve the Estimation of Tail Exponents,” Journal of Business and Economic Statistics, 2011, 29, 24–39. Geertz, C., Agricultural Involution: The Process of Ecological Change in Indonesia, University of California Press, 1963. Kozubowski, T.J. and K. Podgorski, ´ “Log-Laplace distributions,” International Mathematics Journal, 2003, 3, 467–495. Mundlak, Y., D. Larson, and R. Butzer, “Agricultural dynamics in Thailand, Indonesia and the Philippines,” Australian Journal of Agricultural and Resource Economics, 2004, 48, 95–126. Reed, W. J., “The Pareto, Zipf and other power laws,” Economics Letters, 2001, 74, 15–19.

3

Fuglie (2010b) discusses reasons for the large difference in the estimated elasticities across studies. Of course, my indirect estimates of βb could also be due to the composite of other unobservable production inputs having a Pareto distribution with parameter χ < λ in which case ρ = χ/β by another power law inheritance rule highlighted in Gabaix (2009).

3

Figures

10

Figure 1: Output does not increase one-for-one with area planted

5

6

log output (kg) 7 8

9

elasticity = 1 nonparametric elasticity parametric elasticity = 0.75

−4

−3

−2 −1 log harvested area (Ha)

0

1

Notes: The estimates are based on total rice output and area harvested recorded in Susenas data from 2004. The nonparametric estimate is based on a local linear regression with bandwidth of 0.075, an Epanechnikov kernel, and the top and bottom 1 percent of households trimmed.

4

Figure 2: Distribution of productivity per hectare of paddy area harvested Household-Level Data, AP Riv .8

empirical distribution symmetric log Laplace, µ=1.484, β=.587

0

.2

kernel density .4

.6

skew log Laplace, δ=1.707, ρ=2.439, λ=1.352

−5

0

5 log(paddy output per Ha)

10

15

.8

Village-Level Data, AP Rv empirical distribution symmetric log Laplace, µ=1.386, β=.643

0

.2

kernel density .4

.6

skew log Laplace, δ=1.386, ρ=2.042, λ=1.345

−5

0 5 log(paddy output per Ha)

10

Notes: Top Figure—household-level data from Susenas 2004. Bottom Figure—village-level aggregates from Podes 2002. The empirical density is estimated using an Epanechnikov kernel and a bandwidth of 0.2. The dashed lines are the densities based on fitting symmetric and skew log Laplace distributions to the data using  maximum likelihood 

procedures. The symmetric log Laplace distribution of AP R is given by [1/(2β AP R)]×exp − | ln APβR−µ| , and the ´ skew log Laplace is as given in the text, where δ = Y /R. Kozubowski and Podgorski (2003) details the estimation procedure in the latter case, while the MLE for the symmetric log Laplace distribution can be found in most standard texts. 5

Tables Table 1: Variation in Rice Productivity Village-Level Variation Between Within N

n ¯

Household-Level Variation Between Within N

n ¯

log productivity per hectare Overall Island Groups Province District Subdistrict Village

0.99 0.37 0.60 0.64 0.85 —

0.99 0.94 0.90 0.86 0.65 —

— 5 29 346 4563 —

— 9410 1623 136 10 —

0.87 0.10 0.46 0.69 0.79 0.84

0.87 0.87 0.81 0.76 0.57 0.46

— 5 29 339 3509 7724

— 11908 2053 176 17 8

749.8 749.7 749.3 746.8 547.7 494.0

— 5 29 339 3509 7758

— 11957 2061 176 17 8

productivity per hectare Overall Island Groups Province District Subdistrict Village

20.7 1.1 2.4 3.2 7.9 —

20.7 20.6 20.6 20.5 19.3 —

— 5 29 346 4563 —

— 9410 1623 136 10 —

749.8 7.8 32.3 97.6 1052.4 948.0

Notes: Data are obtained from Susenas 2004. N is the number of geographic units (island groups, province, district, subdistrict, village). n ¯ is the average number of villages or households within each of the N units.

6

A Note on Heterogeneity and Aggregation Using Data ...

Abstract: Using data from Indonesia, I show that both household- and ... (i) the Pareto being more appropriate than the exponential distribution for Yiv and Riv, ...

268KB Sizes 1 Downloads 292 Views

Recommend Documents

LABORMARKET HETEROGENEITY, AGGREGATION, AND POLICY ...
Abstract. Data from a heterogeneous-agents economy with incomplete asset markets and indivisible labor supply are simulated under various fiscal policy ...

Heterogeneity and Aggregation: Implications for Labor ...
Dec 2, 2007 - is high (i.e., when the commodity is cheap rela- tive to leisure). ... between the marginal rate of substitution and ...... age hours of 0.2 and an interest rate of 1 percent ... account for the wedge in the data. .... In order for indi

labormarket heterogeneity, aggregation, and policy (in)variance of ...
According to a variance decomposition computed based on our estimated representative-agent model, the measured preference shocks explain jointly between ...

Power Saving Data Aggregation using FR Approach in WSN
Abstract. This Paper Presents the power of wireless sensor networks lies in the ability to deploy large numbers of tiny nodes that assemble and configure themselves. Usage scenarios for these devices range from real time tracking, to monitoring of en

Power Saving Data Aggregation using FR Approach ...
from real time tracking, to monitoring of environmental conditions, to ubiquitous ... Keywords- wireless sensor networks; Energy; Power; Data Aggregation.

On the aggregation of eurozone data
T Tel.: +31 30 2539801. E-mail address: [email protected]. Economics Letters 90 ... t ј. P i xi t/ei. T , is that the corresponding growth rate series is a weighted ...

10x10: Using Extreme Heterogeneity to Build a General-Purpose ...
†Large-scale systems group, Department of Computer Science, University of ... WHAT IS THE PROGRAMMING LEVEL FOR ACHIEVING EFFICIENCY?

10x10: Using Extreme Heterogeneity to Build a General-Purpose ...
What is the programming level for achieving ... High performance at the cost of programming effort and portability ... design automation techniques enable.

Note on Drafting a speech.pdf
Page 1 of 1. Speech is supposed to be an oral presentation. But ,since you have speech as a discourse ,it is desirable. that we must learn the techniques of writing a speech.While presenting a speech on a stage, the speaker has a. lot of advantages .

A note on Kandori-Matsushima
Sep 16, 2004 - Social Science Center, London, ON, N6A 5C2, Tel: 519-661-2111 ext. ... equilibria, where private information is revealed every T-periods, as δ ...

A Note on Quasi-Presuppositions and Focus
Jan 31, 2011 - If John came late, the party was a disaster implies: ..... The above data seem to show that indeed the inference triggered by modifiers seems.

A Note on Quality Disclosure and Competition
I analyze how a change in the degree of horizontal product differentiation affects the incentives of duopolists to disclose quality information. If disclosure is costly, ...

A note on Minimal Unanimity and Ordinally Bayesian ...
E-mail address: [email protected]. 0165-4896/$ - see front matter © 2006 Elsevier B.V. All rights reserved. doi:10.1016/j.mathsocsci.2006.12.

A Note on -Permutations
We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. Mathematical Association of America is collaborating with JSTOR to digitize,

A note on for-phrases and derived scales
phrase set to restrict its domain; Bale (2008) takes a gradable adjective to determine ... by a New Initiatives Funding grant from Centre for Research on Language, Mind and Brain in ..... (30) Mia: “Buy me a hat that costs more than $5!” (31) Mia

A Note on Strong Duality and Complementary Slackness
Aug 27, 2015 - where x, c ∈ H1, b ∈ H2, d ∈ H3, A is a linear map from H1 to H2, B is a .... Convex Analysis, Princeton Landmarks in Mathematics, Princeton.

A Note on Discrete- and Continuous-time Optimal ...
i.e. taking into account that next period's policy-maker will choose policy in the same way as today's policy-maker: gt+1 = G(kt+1), kt+2 = K(kt+1), ct+1 = f(kt+1,1) ...

A Survey on Brain Tumour Detection Using Data Mining Algorithm
Abstract — MRI image segmentation is one of the fundamental issues of digital image, in this paper, we shall discuss various techniques for brain tumor detection and shall elaborate and compare all of them. There will be some mathematical morpholog

A note on the upward and downward intruder ... - Springer Link
From the analytic solution of the segregation velocity we can analyze the transition from the upward to downward intruder's movement. The understanding of the ...

A note on extremality and completeness in financial ...
σ(X, Y ), and we apply it to the space L∞ with the topology σ(L∞,Lp) for p ≥ 1. .... Now, we want to apply Theorem 3 to the special case X = L∞(µ) equipped with ...

A Note on Discrete Convexity and Local Optimality
... (81)-45-339-3531. Fax: (81)-45-339-3574. ..... It is easy to check that a separable convex function satisfies the above condition and thus it is semistrictly quasi ...

briefing note on - Services
systems. In the light of these conclusions, a series of meetings in Africa, including the Foresight. Africa workshop in Entebbe, the AU meeting of Directors for Livestock Development in. Kigali 2004, the Congress ... OIE meeting of SADC Chief Veterin

A note on juncture homomorphisms.pdf - Steve Borgatti
A full network homomorphism f: N -+ N' is a regular network homomorphism if for each R E [w fi( a) f2( R) fi( b) * 3 c, d E P such that fi(u) = fi( c), fi( b) = fi( d), cRb and uRd for all a, b E P. In a network N the bundle of relations B,, from a t