Publication Bias and the Cross-Section of Stock Returns Andrew Y. Chen1 1 Federal

Tom Zimmermann2 Reserve Board

2 Quantco,

Inc

AFA: 2018

1 / 14

Disclaimer: The views expressed herein are those of the author and do not necessarily reflect the position of the Board of Governors of the Federal Reserve or the Federal Reserve System

1 / 14

“The Lord of the p-value”

1 / 14

“The Lord of the p-value”

1 / 14

“The Lord of the p-value”

1 / 14

“The Lord of the p-value”

1 / 14

“The Lord of the p-value”

The Cross-Sectional Asset Pricing Lit

1 / 14

“The Lord of the p-value”

p-hacking • data-mining, data-snooping • suspicion and ambition • collective re-use of data The Cross-Sectional Asset Pricing Lit

1 / 14

“The Lord of the p-value”

p-hacking • data-mining, data-snooping • suspicion and ambition • collective re-use of data

Journal Review • robustness tests • theoretical motivations • supporting results • a scientific, ethical culture

The Cross-Sectional Asset Pricing Lit

1 / 14

“The Lord of the p-value”

p-hacking • data-mining, data-snooping • suspicion and ambition • collective re-use of data

Journal Review • robustness tests • theoretical motivations • supporting results • a scientific, ethical culture

The Cross-Sectional Asset Pricing Lit

Our Question: Which Side is Winning? 1 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning

2 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning (1) Focus: replications of 172 published cross-sectional predictors – Excludes non-predictive and aggregate factors in Harvey, Liu, Zhu 2016 – Excludes un-published predictors in Chordia, Goyal, Saretto 2017

2 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning (1) Focus: replications of 172 published cross-sectional predictors – Excludes non-predictive and aggregate factors in Harvey, Liu, Zhu 2016 – Excludes un-published predictors in Chordia, Goyal, Saretto 2017

(2) Structure: estimated model of biased publication – Allows for p-hacking effects and journal review – Unlike Hou, Xue, Zhang’s 2017 informal approach

2 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning (1) Focus: replications of 172 published cross-sectional predictors – Excludes non-predictive and aggregate factors in Harvey, Liu, Zhu 2016 – Excludes un-published predictors in Chordia, Goyal, Saretto 2017

(2) Structure: estimated model of biased publication – Allows for p-hacking effects and journal review – Unlike Hou, Xue, Zhang’s 2017 informal approach

Result: I Journal review dominates. Nearly all predictors were real!! – Consistent w/ McLean-Pontiff 2016, Jacobs-M¨ uller 2016, Yan-Zheng 2017

2 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning Nearly all predictors were real!!

3 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning Nearly all predictors were real!! How can this be true??

3 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning Nearly all predictors were real!! How can this be true?? I

Standard logic (Bonferroni, Benjamini-Hochberg 1995) – After looking at 172+ predictors, many in-sample returns will be large by pure chance

3 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning Nearly all predictors were real!! How can this be true?? I

Standard logic (Bonferroni, Benjamini-Hochberg 1995) – After looking at 172+ predictors, many in-sample returns will be large by pure chance ⇒ many predictors were fairy tales

3 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning Nearly all predictors were real!! How can this be true?? I

Standard logic (Bonferroni, Benjamini-Hochberg 1995) – After looking at 172+ predictors, many in-sample returns will be large by pure chance ⇒ many predictors were fairy tales

I

Our more structured logic (James-Stein 1961, Efron-Morris 1973)

3 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning Nearly all predictors were real!! How can this be true?? I

Standard logic (Bonferroni, Benjamini-Hochberg 1995) – After looking at 172+ predictors, many in-sample returns will be large by pure chance ⇒ many predictors were fairy tales

I

Our more structured logic (James-Stein 1961, Efron-Morris 1973) – 172 predictors tell us about the nature of the publication process

3 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning Nearly all predictors were real!! How can this be true?? I

Standard logic (Bonferroni, Benjamini-Hochberg 1995) – After looking at 172+ predictors, many in-sample returns will be large by pure chance ⇒ many predictors were fairy tales

I

Our more structured logic (James-Stein 1961, Efron-Morris 1973) – 172 predictors tell us about the nature of the publication process – They tell us that journal review dominates p-hacking

3 / 14

This Paper: A Focused, Structured Estimate of Who’s Winning Nearly all predictors were real!! How can this be true?? I

Standard logic (Bonferroni, Benjamini-Hochberg 1995) – After looking at 172+ predictors, many in-sample returns will be large by pure chance ⇒ many predictors were fairy tales

I

Our more structured logic (James-Stein 1961, Efron-Morris 1973) – 172 predictors tell us about the nature of the publication process – They tell us that journal review dominates p-hacking ⇒ nearly all predictors were real.

3 / 14

Replications of 172 Published Predictors

3 / 14

Data: Replications of 172 Published Predictors

(1) Replicate McLean and Pontiff’s (2016) 97 published cross-sectional predictors (2) Replicate 75 additional variables that were – shown to predict cross-sectional returns – published in “top-tier” journals

Data available at sites.google.com/site/chenandrewy/

4 / 14

Distribution of Replicated t-stats 0.14 0.12

Frequency

0.1 0.08 0.06 0.04 0.02 0 0

2

4

6

8

10

12

14

t-stat

I

Sharp left shoulder ⇒ strongly suggestive of p-hacking

I

But what explains the long right tail? 5 / 14

Distribution of Replicated t-stats 0.14 0.12

Frequency

0.1 0.08 0.06 0.04 0.02 0 0

2

4

6

8

10

12

14

t-stat

I

Sharp left shoulder ⇒ strongly suggestive of p-hacking

I

But what explains the long right tail? ⇒ need model 5 / 14

Model and Estimation

5 / 14

A Statistical Model of Publication 1/2 Motivating Story: 1. Anything that might be published is submitted to journals – Allows for p-hacking

2. Only portfolios with “narratives” are considered for publication – Allows for journal review: robustness tests, supporting results, ...

3. Only narratives with high t-stats are published – Another p-hacking effect

6 / 14

A Statistical Model of Publication 1/2 Motivating Story: 1. Anything that might be published is submitted to journals – Allows for p-hacking

2. Only portfolios with “narratives” are considered for publication – Allows for journal review: robustness tests, supporting results, ...

3. Only narratives with high t-stats are published – Another p-hacking effect

⇒ statistical model of publication similar to Harvey, Liu, and Zhu’s (2016) model with correlations

6 / 14

A Statistical Model of Publication 2/2

Key equations I

If portfolio i has a narrative, true return µi ∼ scaled student’s t with σµ , νµ

I

dispersion of true returns σµ measures power of journal review – large σµ ⇒ narratives find variation in true returns

7 / 14

A Statistical Model of Publication 2/2

Key equations I

If portfolio i has a narrative, true return µi ∼ scaled student’s t with σµ , νµ

I

dispersion of true returns σµ measures power of journal review – large σµ ⇒ narratives find variation in true returns

I

In-sample returns are noisy and biased signals of µi ri = µi + i

7 / 14

Maximum Likelihood Estimation I

Choose 7 parameters to maximize likelihood of replicated data – 172 in-sample returns and standard errors

8 / 14

Maximum Likelihood Estimation I

Choose 7 parameters to maximize likelihood of replicated data – 172 in-sample returns and standard errors

I

Identification of σµ comes from dispersion of t-stats

8 / 14

Maximum Likelihood Estimation I

Choose 7 parameters to maximize likelihood of replicated data – 172 in-sample returns and standard errors

Identification of σµ comes from dispersion of t-stats σµ = 0.10 0.5

1

1

0.9

0.9

0.8

0.8

0.7

0.7

0.6

0.6

0.5

0.5

0.4

0.4

0.3

0.3

0.2

0.2

0.1

0.1

Data

0.4

Frequency

I

0.3

0.2

0.1

0

0

0

5

10

15

0 0

0.2

0.4

0.6

0.8

1

0

0.2

0.4

0.6

0.8

1

t-stat

8 / 14

Maximum Likelihood Estimation I

Choose 7 parameters to maximize likelihood of replicated data – 172 in-sample returns and standard errors

Identification of σµ comes from dispersion of t-stats σµ = 0.10 0.5 Data Model 0.4

Frequency

I

Log Like = -371.90

0.3

0.2

0.1

0

1

1

0.9

0.9

0.8

0.8

0.7

0.7

0.6

0.6

0.5

0.5

0.4

0.4

0.3

0.3

0.2

0.2

0.1

0.1

0

0

5

10

15

0 0

0.2

0.4

0.6

0.8

1

0

0.2

0.4

0.6

0.8

1

t-stat

8 / 14

Maximum Likelihood Estimation I

Choose 7 parameters to maximize likelihood of replicated data – 172 in-sample returns and standard errors

Identification of σµ comes from dispersion of t-stats σµ = 0.10

σµ = 0.20

0.5

0.5

1

Data Model

0.9

0.4

0.4

0.8 0.7

Log Like = -371.90

0.3

Frequency

Frequency

I

0.2

Log Like = -250.19

0.3

0.6 0.5

0.2

0.4 0.3

0.1

0.1

0

0

0.2 0.1

0

5

10

t-stat

15

0

0

5

10

15

0

0.2

0.4

0.6

0.8

1

t-stat

8 / 14

Maximum Likelihood Estimation I

Choose 7 parameters to maximize likelihood of replicated data – 172 in-sample returns and standard errors

Identification of σµ comes from dispersion of t-stats σµ = 0.10

σµ = 0.20

0.5

Estimated: σ ˆµ = 0.45

0.5

0.5

0.4

0.4

Data Model

Frequency

Log Like = -371.90

0.3

0.2

0.1

Log Like = -250.19

0.3

Frequency

0.4

Frequency

I

0.2

0.1

0 5

10

t-stat

15

0.2

0.1

0 0

Log Like = -197.69

0.3

0 0

5

10

t-stat

15

0

5

10

15

t-stat

8 / 14

Bias Adjustment and Shrinkage I

We focus on Shrinkage defined by [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i – 100% Shrinkage ⇒ p-hacking dominates, bias-adjusted return = 0 – 0% Shrinkage ⇒ journal review works, bias-adjusted = in-sample

9 / 14

Bias Adjustment and Shrinkage I

We focus on Shrinkage defined by [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i – 100% Shrinkage ⇒ p-hacking dominates, bias-adjusted return = 0 – 0% Shrinkage ⇒ journal review works, bias-adjusted = in-sample

I

Bayesian logic gives a shrinkage formula (Dawid 1994, Senn 2008, Efron 2011, 2012) Shrinkagei ≈

[Standard Error]2i σ ˆµ2 + [Standard Error]2i

σ ˆµ2 = Estimated Dispersion of True Returns

9 / 14

Results

9 / 14

Main Result 1/2: Bias Adjustments are Modest

10 / 14

Main Result 1/2: Bias Adjustments are Modest [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i

10 / 14

Main Result 1/2: Bias Adjustments are Modest [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i 50

45

40

35

Count

30

25

20

15

10

5

0

AbnAccr BPEBM ChForeca ChInv ChInvIA ChNAnaly ChNCOA ChNWC ChPM ChTax Composit ConvDebt DebtIssu DelBread DelCOA DelCOL DelFINL DelLTI DivOmit DownFore EBM EarnCons EarnSurp EntMult ExclExp FirmAge GrAdExp GrLTNOA GrSaleTo Herf IndRetBi Investme KZ Mom1m NOA NetDebtF NumEarnI PriceDel Profitab RevenueS ShareRep UpForeca grcapx hire invest realesta roaq

0 to 5

<-- 47 predictors (out of 172) have tiny shrinkage

5 to 10

10 to 15

15 to 10

20 to 25

Shrinkage (%)

25 to 30

30 to 35

35 to 40

>40

10 / 14

Main Result 1/2: Bias Adjustments are Modest [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i 50

45

40

35

Count

30

25

20

15

10

5

0

AbnAccr BPEBM ChForeca ChInv ChInvIA ChNAnaly ChNCOA ChNWC ChPM ChTax Composit ConvDebt DebtIssu DelBread DelCOA DelCOL DelFINL DelLTI DivOmit DownFore EBM EarnCons EarnSurp EntMult ExclExp FirmAge GrAdExp GrLTNOA GrSaleTo Herf IndRetBi Investme KZ Mom1m NOA NetDebtF NumEarnI PriceDel Profitab RevenueS ShareRep UpForeca grcapx hire invest realesta roaq

0 to 5

AOP Accruals AdExp AnalystV AssetGro BetaTail ChAssetT ChEQ ChangeIn CompEquI Coskewne DelEqu DivInd EarnIncr EarnSupB FR FailureP GP GrGMToGr GrSaleTo IndMom Intrinsi LTLevera MS MeanRank MomRev MomSeas OPLevera OperProf OrderBac PctAcc PctTotAc RD REV6 RIO_Idio RoE ShareIs1 ShareIs5 ShortInt Skew1 VolSD XFIN pchdepr pchgm_pc retCongl sgr sinAlgo

5 to 10

10 to 15

15 to 10

20 to 25

Shrinkage (%)

25 to 30

30 to 35

35 to 40

>40

10 / 14

Main Result 1/2: Bias Adjustments are Modest [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i 50

45

40

35

Count

30

25

20

15

10

5

0

AbnAccr BPEBM ChForeca ChInv ChInvIA ChNAnaly ChNCOA ChNWC ChPM ChTax Composit ConvDebt DebtIssu DelBread DelCOA DelCOL DelFINL DelLTI DivOmit DownFore EBM EarnCons EarnSurp EntMult ExclExp FirmAge GrAdExp GrLTNOA GrSaleTo Herf IndRetBi Investme KZ Mom1m NOA NetDebtF NumEarnI PriceDel Profitab RevenueS ShareRep UpForeca grcapx hire invest realesta roaq

0 to 5

AOP Accruals AdExp AnalystV AssetGro BetaTail ChAssetT ChEQ ChangeIn CompEquI Coskewne DelEqu DivInd EarnIncr EarnSupB FR FailureP GP GrGMToGr GrSaleTo IndMom Intrinsi LTLevera MS MeanRank MomRev MomSeas OPLevera OperProf OrderBac PctAcc PctTotAc RD REV6 RIO_Idio RoE ShareIs1 ShareIs5 ShortInt Skew1 VolSD XFIN pchdepr pchgm_pc retCongl sgr sinAlgo

5 to 10

<-- 94 predictors (out of 172) have small shrinkage

10 to 15

15 to 10

20 to 25

Shrinkage (%)

25 to 30

30 to 35

35 to 40

>40

10 / 14

Main Result 1/2: Bias Adjustments are Modest [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i 50

45

40

35

Count

30

25

20

15

10

5

0

AbnAccr BPEBM ChForeca ChInv ChInvIA ChNAnaly ChNCOA ChNWC ChPM ChTax Composit ConvDebt DebtIssu DelBread DelCOA DelCOL DelFINL DelLTI DivOmit DownFore EBM EarnCons EarnSurp EntMult ExclExp FirmAge GrAdExp GrLTNOA GrSaleTo Herf IndRetBi Investme KZ Mom1m NOA NetDebtF NumEarnI PriceDel Profitab RevenueS ShareRep UpForeca grcapx hire invest realesta roaq

0 to 5

AOP Accruals AdExp AnalystV AssetGro BetaTail ChAssetT ChEQ ChangeIn CompEquI Coskewne DelEqu DivInd EarnIncr EarnSupB FR FailureP GP GrGMToGr GrSaleTo IndMom Intrinsi LTLevera MS MeanRank MomRev MomSeas OPLevera OperProf OrderBac PctAcc PctTotAc RD REV6 RIO_Idio RoE ShareIs1 ShareIs5 ShortInt Skew1 VolSD XFIN pchdepr pchgm_pc retCongl sgr sinAlgo

5 to 10

AM AssetTur BM CF DivInit DivYield ExchSwit FirmAgeM G GHZlev Illiquid IntMom IntanCFP IntanEP Mom12m Mom6m NetDebtP NetEquit OScore Predicte Surprise Tax cfp sfe std_dolv zerotrad

10 to 15

Announce CBOperPr Cash ConsReco Forecast Frontier IntanBM Mom1813 Mom36m PayoutYi RDIPO SEO SP SmileSlo Spinoff VarCF VolumeTr std_turn

15 to 10

DolVol High52 IntanSP MaxRet MomVol OrgCap PS RDS RIO_Disp ShareVol tang

20 to 25

Shrinkage (%)

Accruals DelDRC EP IO_Short Mom6mJun PM RIO_BM RIO_Turn Size ZScore

25 to 30

BidAskSp IdioRisk OptVol1 VolMkt

30 to 35

Beta BetaSqua NetPayou OptVol2 Price fgr5yrLa

35 to 40

AgeIPO CredRatDG IndIPO

>40

10 / 14

Main Result 1/2: Bias Adjustments are Modest [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i 50

45

40

35

Count

30

25

20

15

10

5

0

AbnAccr BPEBM ChForeca ChInv ChInvIA ChNAnaly ChNCOA ChNWC ChPM ChTax Composit ConvDebt DebtIssu DelBread DelCOA DelCOL DelFINL DelLTI DivOmit DownFore EBM EarnCons EarnSurp EntMult ExclExp FirmAge GrAdExp GrLTNOA GrSaleTo Herf IndRetBi Investme KZ Mom1m NOA NetDebtF NumEarnI PriceDel Profitab RevenueS ShareRep UpForeca grcapx hire invest realesta roaq

0 to 5

AOP Accruals AdExp AnalystV AssetGro BetaTail ChAssetT ChEQ ChangeIn CompEquI Coskewne DelEqu DivInd EarnIncr EarnSupB FR FailureP GP GrGMToGr GrSaleTo IndMom Intrinsi LTLevera MS MeanRank MomRev MomSeas OPLevera OperProf OrderBac PctAcc PctTotAc RD REV6 RIO_Idio RoE ShareIs1 ShareIs5 ShortInt Skew1 VolSD XFIN pchdepr pchgm_pc retCongl sgr sinAlgo

5 to 10

The other half are skewed right, but nearly all are < 40%

AM AssetTur BM CF DivInit DivYield ExchSwit FirmAgeM G GHZlev Illiquid IntMom IntanCFP IntanEP Mom12m Mom6m NetDebtP NetEquit OScore Predicte Surprise Tax cfp sfe std_dolv zerotrad

10 to 15

Announce CBOperPr Cash ConsReco Forecast Frontier IntanBM Mom1813 Mom36m PayoutYi RDIPO SEO SP SmileSlo Spinoff VarCF VolumeTr std_turn

15 to 10

DolVol High52 IntanSP MaxRet MomVol OrgCap PS RDS RIO_Disp ShareVol tang

20 to 25

Shrinkage (%)

Accruals DelDRC EP IO_Short Mom6mJun PM RIO_BM RIO_Turn Size ZScore

25 to 30

BidAskSp IdioRisk OptVol1 VolMkt

30 to 35

Beta BetaSqua NetPayou OptVol2 Price fgr5yrLa

35 to 40

AgeIPO CredRatDG IndIPO

>40

10 / 14

Main Result 1/2: Bias Adjustments are Modest [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i 50

45

40

35

Count

30

25

20

15

10

5

0

AbnAccr BPEBM ChForeca ChInv ChInvIA ChNAnaly ChNCOA ChNWC ChPM ChTax Composit ConvDebt DebtIssu DelBread DelCOA DelCOL DelFINL DelLTI DivOmit DownFore EBM EarnCons EarnSurp EntMult ExclExp FirmAge GrAdExp GrLTNOA GrSaleTo Herf IndRetBi Investme KZ Mom1m NOA NetDebtF NumEarnI PriceDel Profitab RevenueS ShareRep UpForeca grcapx hire invest realesta roaq

0 to 5

AOP Accruals AdExp AnalystV AssetGro BetaTail ChAssetT ChEQ ChangeIn CompEquI Coskewne DelEqu DivInd EarnIncr EarnSupB FR FailureP GP GrGMToGr GrSaleTo IndMom Intrinsi LTLevera MS MeanRank MomRev MomSeas OPLevera OperProf OrderBac PctAcc PctTotAc RD REV6 RIO_Idio RoE ShareIs1 ShareIs5 ShortInt Skew1 VolSD XFIN pchdepr pchgm_pc retCongl sgr sinAlgo

5 to 10

top quartile return volatility

AM AssetTur BM CF DivInit DivYield ExchSwit FirmAgeM G GHZlev Illiquid IntMom IntanCFP IntanEP Mom12m Mom6m NetDebtP NetEquit OScore Predicte Surprise Tax cfp sfe std_dolv zerotrad

10 to 15

Announce CBOperPr Cash ConsReco Forecast Frontier IntanBM Mom1813 Mom36m PayoutYi RDIPO SEO SP SmileSlo Spinoff VarCF VolumeTr std_turn

15 to 10

DolVol High52 IntanSP MaxRet MomVol OrgCap PS RDS RIO_Disp ShareVol tang

20 to 25

Shrinkage (%)

Accruals DelDRC EP IO_Short Mom6mJun PM RIO_BM RIO_Turn Size ZScore

25 to 30

BidAskSp IdioRisk OptVol1 VolMkt

30 to 35

Beta BetaSqua NetPayou OptVol2 Price fgr5yrLa

35 to 40

AgeIPO CredRatDG IndIPO

>40

10 / 14

Main Result 1/2: Bias Adjustments are Modest [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i 50

45

40

35

Count

30

25

20

15

10

5

0

AbnAccr BPEBM ChForeca ChInv ChInvIA ChNAnaly ChNCOA ChNWC ChPM ChTax Composit ConvDebt DebtIssu DelBread DelCOA DelCOL DelFINL DelLTI DivOmit DownFore EBM EarnCons EarnSurp EntMult ExclExp FirmAge GrAdExp GrLTNOA GrSaleTo Herf IndRetBi Investme KZ Mom1m NOA NetDebtF NumEarnI PriceDel Profitab RevenueS ShareRep UpForeca grcapx hire invest realesta roaq

0 to 5

AOP Accruals AdExp AnalystV AssetGro BetaTail ChAssetT ChEQ ChangeIn CompEquI Coskewne DelEqu DivInd EarnIncr EarnSupB FR FailureP GP GrGMToGr GrSaleTo IndMom Intrinsi LTLevera MS MeanRank MomRev MomSeas OPLevera OperProf OrderBac PctAcc PctTotAc RD REV6 RIO_Idio RoE ShareIs1 ShareIs5 ShortInt Skew1 VolSD XFIN pchdepr pchgm_pc retCongl sgr sinAlgo

5 to 10

top quartile return volatility

High volatility => high shrinkage More noise => higher chance of p-hacking

AM AssetTur BM CF DivInit DivYield ExchSwit FirmAgeM G GHZlev Illiquid IntMom IntanCFP IntanEP Mom12m Mom6m NetDebtP NetEquit OScore Predicte Surprise Tax cfp sfe std_dolv zerotrad

10 to 15

Announce CBOperPr Cash ConsReco Forecast Frontier IntanBM Mom1813 Mom36m PayoutYi RDIPO SEO SP SmileSlo Spinoff VarCF VolumeTr std_turn

15 to 10

DolVol High52 IntanSP MaxRet MomVol OrgCap PS RDS RIO_Disp ShareVol tang

20 to 25

Shrinkage (%)

Accruals DelDRC EP IO_Short Mom6mJun PM RIO_BM RIO_Turn Size ZScore

25 to 30

BidAskSp IdioRisk OptVol1 VolMkt

30 to 35

Beta BetaSqua NetPayou OptVol2 Price fgr5yrLa

35 to 40

AgeIPO CredRatDG IndIPO

>40

10 / 14

Main Result 1/2: Bias Adjustments are Modest [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i 50

45

40

35

Count

30

25

20

15

10

5

0

AbnAccr BPEBM ChForeca ChInv ChInvIA ChNAnaly ChNCOA ChNWC ChPM ChTax Composit ConvDebt DebtIssu DelBread DelCOA DelCOL DelFINL DelLTI DivOmit DownFore EBM EarnCons EarnSurp EntMult ExclExp FirmAge GrAdExp GrLTNOA GrSaleTo Herf IndRetBi Investme KZ Mom1m NOA NetDebtF NumEarnI PriceDel Profitab RevenueS ShareRep UpForeca grcapx hire invest realesta roaq

0 to 5

AOP Accruals AdExp AnalystV AssetGro BetaTail ChAssetT ChEQ ChangeIn CompEquI Coskewne DelEqu DivInd EarnIncr EarnSupB FR FailureP GP GrGMToGr GrSaleTo IndMom Intrinsi LTLevera MS MeanRank MomRev MomSeas OPLevera OperProf OrderBac PctAcc PctTotAc RD REV6 RIO_Idio RoE ShareIs1 ShareIs5 ShortInt Skew1 VolSD XFIN pchdepr pchgm_pc retCongl sgr sinAlgo

5 to 10

top quartile return volatility

But even IndIPO (48% shrinkage) has a good bias-adjusted return bias-adjusted return = 1.04*(1-0.48)=0.54% monthly

AM AssetTur BM CF DivInit DivYield ExchSwit FirmAgeM G GHZlev Illiquid IntMom IntanCFP IntanEP Mom12m Mom6m NetDebtP NetEquit OScore Predicte Surprise Tax cfp sfe std_dolv zerotrad

10 to 15

Announce CBOperPr Cash ConsReco Forecast Frontier IntanBM Mom1813 Mom36m PayoutYi RDIPO SEO SP SmileSlo Spinoff VarCF VolumeTr std_turn

15 to 10

DolVol High52 IntanSP MaxRet MomVol OrgCap PS RDS RIO_Disp ShareVol tang

20 to 25

Shrinkage (%)

Accruals DelDRC EP IO_Short Mom6mJun PM RIO_BM RIO_Turn Size ZScore

25 to 30

BidAskSp IdioRisk OptVol1 VolMkt

30 to 35

Beta BetaSqua NetPayou OptVol2 Price fgr5yrLa

35 to 40

AgeIPO CredRatDG IndIPO

>40

10 / 14

Main Result 1/2: Bias Adjustments are Modest [Bias-Adjusted Return]i = (1 − Shrinkagei )[In-Sample Return]i 50

45

40

35

Count

30

25

20

15

10

5

0

AbnAccr BPEBM ChForeca ChInv ChInvIA ChNAnaly ChNCOA ChNWC ChPM ChTax Composit ConvDebt DebtIssu DelBread DelCOA DelCOL DelFINL DelLTI DivOmit DownFore EBM EarnCons EarnSurp EntMult ExclExp FirmAge GrAdExp GrLTNOA GrSaleTo Herf IndRetBi Investme KZ Mom1m NOA NetDebtF NumEarnI PriceDel Profitab RevenueS ShareRep UpForeca grcapx hire invest realesta roaq

0 to 5

AOP Accruals AdExp AnalystV AssetGro BetaTail ChAssetT ChEQ ChangeIn CompEquI Coskewne DelEqu DivInd EarnIncr EarnSupB FR FailureP GP GrGMToGr GrSaleTo IndMom Intrinsi LTLevera MS MeanRank MomRev MomSeas OPLevera OperProf OrderBac PctAcc PctTotAc RD REV6 RIO_Idio RoE ShareIs1 ShareIs5 ShortInt Skew1 VolSD XFIN pchdepr pchgm_pc retCongl sgr sinAlgo

5 to 10

top quartile return volatility

Summary: shrinkage is modest, journal review dominates Consistent with McLean-Pontiff 2016 AM AssetTur BM CF DivInit DivYield ExchSwit FirmAgeM G GHZlev Illiquid IntMom IntanCFP IntanEP Mom12m Mom6m NetDebtP NetEquit OScore Predicte Surprise Tax cfp sfe std_dolv zerotrad

10 to 15

Announce CBOperPr Cash ConsReco Forecast Frontier IntanBM Mom1813 Mom36m PayoutYi RDIPO SEO SP SmileSlo Spinoff VarCF VolumeTr std_turn

15 to 10

DolVol High52 IntanSP MaxRet MomVol OrgCap PS RDS RIO_Disp ShareVol tang

20 to 25

Shrinkage (%)

Accruals DelDRC EP IO_Short Mom6mJun PM RIO_BM RIO_Turn Size ZScore

25 to 30

BidAskSp IdioRisk OptVol1 VolMkt

30 to 35

Beta BetaSqua NetPayou OptVol2 Price fgr5yrLa

35 to 40

AgeIPO CredRatDG IndIPO

>40

10 / 14

Main Result 2/2: Nearly All Anomalies were Real

11 / 14

Main Result 2/2: Nearly All Anomalies were Real We can estimate the false discovery rate (FDR) (`a la HLZ 2016) 2

True Return

1.5 1 0.5 0 -0.5 -1 0

1

2

3

4

5

6

7

8

9

10

In-Sample t-stat

I

Simulate true returns and t-stats using estimated parameters

11 / 14

Main Result 2/2: Nearly All Anomalies were Real We can estimate the false discovery rate (FDR) (`a la HLZ 2016) 2

True Return

1.5 1 0.5 0 -0.5

false discovery -1 0

1

2

3

4

5

6

7

8

9

10

In-Sample t-stat

I

Define false discoveries: true returns ≤ 0 (equivalent to HLZ)

11 / 14

Main Result 2/2: Nearly All Anomalies were Real We can estimate the false discovery rate (FDR) (`a la HLZ 2016) 2

True Return

1.5 1 0.5 0

false discovery naive hurdle: false discovery rate = 0.6%

-0.5 -1 0

1

2

3

4

5

6

7

8

9

10

In-Sample t-stat

I

Calculate false discovery rate (FDR) for a given t-stat hurdle

I

Naive hurdle (1.96) implies a tiny FDR of 0.6%

11 / 14

Main Result 2/2: Nearly All Anomalies were Real We can estimate the false discovery rate (FDR) (`a la HLZ 2016) 2

True Return

1.5 1 0.5 0

false discovery naive hurdle: false discovery rate = 0.6%

-0.5 -1 0

1

2

3

4

5

6

7

8

9

10

In-Sample t-stat

I

Calculate false discovery rate (FDR) for a given t-stat hurdle

I

Naive hurdle (1.96) implies a tiny FDR of 0.6%

I

Nearly all anomalies were real (in-sample) 11 / 14

Main Result 2/2: Nearly All Anomalies were Real We can estimate the false discovery rate (FDR) (`a la HLZ 2016) 2

True Return

1.5 1 0.5 0

false discovery naive hurdle: false discovery rate = 0.6% hurdle for false discovery rate = 5%

-0.5 -1 0

1

2

3

4

5

6

7

8

9

10

In-Sample t-stat

I

Can calculate hurdles for other FDRs

11 / 14

Main Result 2/2: Nearly All Anomalies were Real We can estimate the false discovery rate (FDR) (`a la HLZ 2016) 2

True Return

1.5 1 0.5 0

false discovery naive hurdle: false discovery rate = 0.6% hurdle for false discovery rate = 5%

-0.5 -1 0

1

2

3

4

5

6

7

8

9

10

In-Sample t-stat

I

Can calculate hurdles for other FDRs

I

Standard t-stat hurdles can actually be lowered!!!

11 / 14

Main Result 2/2: Nearly All Anomalies were Real Standard t-stat hurdles can actually be lowered!!! How can this be true???

12 / 14

Main Result 2/2: Nearly All Anomalies were Real Standard t-stat hurdles can actually be lowered!!! How can this be true??? I Standard multiple-testing logic (Bonferroni, Benjamini-Hochberg 1995) – After running 172+ tests, the null will be rejected by pure chance ⇒ t-stat hurdles should be raised

12 / 14

Main Result 2/2: Nearly All Anomalies were Real Standard t-stat hurdles can actually be lowered!!! How can this be true??? I Standard multiple-testing logic (Bonferroni, Benjamini-Hochberg 1995) – After running 172+ tests, the null will be rejected by pure chance ⇒ t-stat hurdles should be raised I

Our more structured logic (James-Stein 1961, Efron-Morris 1973) – 172 tests tell us about the nature of the publication process

12 / 14

Main Result 2/2: Nearly All Anomalies were Real Standard t-stat hurdles can actually be lowered!!! How can this be true??? I Standard multiple-testing logic (Bonferroni, Benjamini-Hochberg 1995) – After running 172+ tests, the null will be rejected by pure chance ⇒ t-stat hurdles should be raised I

Our more structured logic (James-Stein 1961, Efron-Morris 1973) – 172 tests tell us about the nature of the publication process – The publication process produces dispersed true returns

12 / 14

Main Result 2/2: Nearly All Anomalies were Real Standard t-stat hurdles can actually be lowered!!! How can this be true??? I Standard multiple-testing logic (Bonferroni, Benjamini-Hochberg 1995) – After running 172+ tests, the null will be rejected by pure chance ⇒ t-stat hurdles should be raised I

Our more structured logic (James-Stein 1961, Efron-Morris 1973) – 172 tests tell us about the nature of the publication process – The publication process produces dispersed true returns ⇒ t-stats are informative about true returns

12 / 14

Main Result 2/2: Nearly All Anomalies were Real Standard t-stat hurdles can actually be lowered!!! How can this be true??? I Standard multiple-testing logic (Bonferroni, Benjamini-Hochberg 1995) – After running 172+ tests, the null will be rejected by pure chance ⇒ t-stat hurdles should be raised I

Our more structured logic (James-Stein 1961, Efron-Morris 1973) – 172 tests tell us about the nature of the publication process – The publication process produces dispersed true returns ⇒ t-stats are informative about true returns ⇒ t-stat hurdles can be lowered!

12 / 14

Main Result 2/2: Nearly All Anomalies were Real I

Other multiple testing studies find most results are false – Harvey, Liu, Zhu (2016); Chordia, Goyal, Saretto (2017)

I

Difference: focus on cross-sectional predictors in top-tier journals

13 / 14

Main Result 2/2: Nearly All Anomalies were Real I

Other multiple testing studies find most results are false – Harvey, Liu, Zhu (2016); Chordia, Goyal, Saretto (2017)

I

Difference: focus on cross-sectional predictors in top-tier journals Variable Counts HarveyLiu-Zhu

Chordia-GoyalSaretto

Our Paper

Aggregate Risk Factor X-Sectional Predictor X-Sectional & Top Tier Pub

113 202 146

0 2,100,000 <500

0 172 151

Total

315

2,100,000

172

13 / 14

Main Result 2/2: Nearly All Anomalies were Real I

Other multiple testing studies find most results are false – Harvey, Liu, Zhu (2016); Chordia, Goyal, Saretto (2017)

I

Difference: focus on cross-sectional predictors in top-tier journals Variable Counts

I

HarveyLiu-Zhu

Chordia-GoyalSaretto

Our Paper

Aggregate Risk Factor X-Sectional Predictor X-Sectional & Top Tier Pub

113 202 146

0 2,100,000 <500

0 172 151

Total

315

2,100,000

172

Suggests p-hacking much worse among aggregate risk factors and outside top journals 13 / 14

Conclusion

13 / 14

Conclusion

I

A structured, focused estimation finds – Journal review has triumphed over p-hacking∗ ∗

in top-tier pubs predicting cross-sectional stock returns, for now

Consistent w/ McLean-Pontiff 2016, Jacobs-M¨ uller 2016, Yan-Zheng 2017

14 / 14

Conclusion

I

A structured, focused estimation finds – Journal review has triumphed over p-hacking∗ ∗

in top-tier pubs predicting cross-sectional stock returns, for now

Consistent w/ McLean-Pontiff 2016, Jacobs-M¨ uller 2016, Yan-Zheng 2017 I

Suggests a complete accounting for the typical anomaly return – 13% publication bias (this paper) – 35% mispricing that can be traded away (McLean and Pontiff 2016) – 52% trading costs (Chen and Velikov 2017) 14 / 14

Publication Bias and the Cross-Section of Stock Returns

“The Lord of the p-value” p-hacking. • data-mining, data-snooping. • suspicion and ambition. • collective re-use of data. Journal Review. • robustness tests. • theoretical motivations. • supporting results. • a scientific, ethical culture. The Cross-Sectional Asset Pricing Lit. 1 / 14 ...

3MB Sizes 4 Downloads 157 Views

Recommend Documents

Modelling and Forecasting Volatility of Returns on the Ghana Stock ...
1Department of Finance and Accounting, KNUST School of Business,. Kwame Nkrumah University of .... large changes and small changes by small changes [8] leading to contiguous ..... both EViews 5.1 and PcGive programs. RESULTS AND ...

Firm Migration and Stock Returns
Firm Migration and Stock Returns ... Value stocks are exposed to cash-flow risk ... S′′. 2 Which generates the value premium? Convergence or drift in M/B.

Organization Capital and the CrossSection of Expected ...
Frydman and Erik Loualiche for sharing their data, and Tyler Muir and Omair ..... or other form of venture financing, depending on how the purchase of physical.

Correlated beliefs, returns, and stock market volatility
Available online 1 December 2015. JEL classification: G15. G12. G17. D8 ..... 5 It is worth noting that other studies obtain somewhat different findings: for example,. Crawford et al. (2012) show that ...... from −0.02 to a max- imum of only 0.03.

Prospect Theory and Stock Returns: An Empirical Test
Jul 20, 2016 - cross-section of stock returns, of a simple prospect theory-based model and test these predictions in both U.S. and ... model makes a simple prediction which we test in our empirical work: that stocks with high prospect theory ...... p

Days to Cover and Stock Returns
the over-pricing of the stock from the point of view of the arbitrageurs. ...... Furthermore, since f does not enter the optimization problem of the optimists, we still.

Investor Sentiment and Sectoral Stock Returns
Mar 31, 2016 - An increasing number of empirical works suggests that investor sentiment has ... 1 Tel.: +49 69798 33882. 2 Tel.: +44 2392 844635. 3 Tel.

PUBLICATION ETHICS AND PUBLICATION MALPRACTICE ...
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. PUBLICATION ETHICS AND PUBLICATION MALPRACTICE STATEMENT-ACTA GEOBALCANICA.pdf. PUBLICATION ETHICS AND PUBLI

Common Nonlinearity in Long-horizon Stock Returns
Employing annual returns generated from overlapping monthly price indexes for the G-7 ... techniques to the stock price-dividend relation of major stock markets.

Common Nonlinearity in Long-horizon Stock Returns
Sy Syms School of Business, Yeshiva University, New York, NY 10033, USA .... returns for most of national stock markets is too small to be used for reliable statistical ..... While lending support to the notion that the non-linear dynamics of G-7 ...

Topical interests and the mitigation of search engine bias
Aug 22, 2006 - Search engines have become key media for our scientific, eco- nomic, and social activities by enabling people to access informa- tion on the web despite its ... not support this conclusion; popular sites receive far less traffic than p

Burn-in, bias, and the rationality of anchoring - Stanford University
The model's quantitative predictions match published data on anchoring in numer- ... In cognitive science, a recent analysis concluded that time costs make.

neural networks and the bias variance dilemma.pdf
can be learned? ' Nonparametric inference has matured in the past 10 years. There ... Also in Section 4, we will briefly discuss the technical issue of consistency, which .... m classes, 3 e {1,2.....rr1}, and an input, or feature, vector x. Based o

Momentum in stock market returns: Implications for risk ...
)82.3(. 18.0. 0.09. P3. )41.0(. 24.0. −. −. )43.5(. 19.0. 0.11. P4. )33.0(. 57.0. )33.4(. 15.0. 0.07. P5. )33.1(. 39.2. )58.3(. 17.0. 0.09. P6. )65.0(. 05.1. )13.5(. 15.0. 0.08 ...

Absolute-Returns-The-Risk-And-Opportunities-Of-Hedge-Fund ...
Whoops! There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Absolute-Returns-The-Risk-And-Opportunities-Of-Hedge-Fund-Investing.pdf. Absolute-Returns-The-Risk-A

The estimation of present bias and time preferences ...
card and the average credit is only about 200 Euros per card. 5See also .... result with 0.6 in order to take into account that land with property has less value than ...

Learning Bias, Cultural Evolution of Language, and the ...
E-mail: [email protected]. ... However, a recent trend has been to revise these strong conclusions about ...... technology I: An experimental simulation.

Learning Bias, Cultural Evolution of Language, and the Biological ...
Sciences, University of Edinburgh, Dugald Stewart Building, 3 Charles Street, ... to be relatively straightforward: all normally developing individuals acquire the ...... be outweighed by the costs of social learning, resulting in selection in favor

Electronic Publication and the Narrowing of Science ...
Jul 17, 2008 - including high-resolution figures, can be found in the online. Updated information and .... any Internet-connected digital device to search all of human .... the speed of the shift toward commercial and free electronic provision of ...

Stock of the town
Feb 14, 2018 - Disclaimer: เอกสาร/รายงานฉบับนี้จัดท าขึ้นบนพื้นฐานข้อมูลที่เปิดเผยต่อสาธารณชน ซึ่งพิจารà

éBIAS/
Nov 13, 1995 - output signal of the photo-detector increases with an increas. U'S' P ...... digital measurement signal produced at the output 110 of the counter.

Bias Neglect
community is that there should be less aggression between ants that share a nest ... condition. Blind studies were much more likely to report aggression between.

The politics of publication
Mar 20, 2003 - some key data and oversimplify the conclu- sions — her postdoc needs this journal on ..... topics, which can lead to unnecessary duplication.