SlideShare a Scribd company logo
1 of 43
Download to read offline
Bayesian Bias Correction: Critically evaluating sets of
studies in the presence of publication bias
Alexander Etz
UC Irvine
Alexander Etz (UC Irvine) Bayesian Bias Correction 1 / 43
Introduction
When we read a published paper, we have two broadly related goals
Goal 1: Evaluate the evidence in favor of a claim
Goal 2: Estimate how big the reported effect could reasonably be
Alexander Etz (UC Irvine) Bayesian Bias Correction 2 / 43
Introduction
Solution (in principle): Compute a Bayes factor (goal 1), or compute
the posterior distribution (goal 2)
Alexander Etz (UC Irvine) Bayesian Bias Correction 3 / 43
Challenge
Challenge: Literature is biased
You only see a subset of all studies performed, because only the studies
that “worked” make it past the gatekeepers (editors, reviewers)
Alexander Etz (UC Irvine) Bayesian Bias Correction 4 / 43
Publication bias
Statistical significance filter: Only studies with p < .05 get published
Censored data, based on the stochastic outcome of the study
Alexander Etz (UC Irvine) Bayesian Bias Correction 5 / 43
Publication bias
Publication bias ruins our ability to evaluate the claims in a paper
Alexander Etz (UC Irvine) Bayesian Bias Correction 6 / 43
Example
Imagine we estimate a group mean and perform a one-sample test
We find observed mean of 17, with a standard error of 8
Alexander Etz (UC Irvine) Bayesian Bias Correction 7 / 43
Example
If the null hypothesis is true, this would be what we predict:
Alexander Etz (UC Irvine) Bayesian Bias Correction 8 / 43
Example
If the effect is small and positive, this is what we predict:
Alexander Etz (UC Irvine) Bayesian Bias Correction 9 / 43
Implications
What gets published if we only acknowledge significant results? Only
studies with observed mean greater than 16
If there is no effect, any published result is a type-1 error (i.e., false
positive)
If there is non-null effect, our effect size estimate will certainly be
inflated
Publication bias is clearly a problem for evaluating single studies, and
its effects can only compound if we meta-analyze many studies at once
Alexander Etz (UC Irvine) Bayesian Bias Correction 10 / 43
Modeling the bias
We are modelers, so naturally we try to model this publication filtering
process
Try to patch up our inferences using Bayesian bias correction
Alexander Etz (UC Irvine) Bayesian Bias Correction 11 / 43
Modeling the bias
Key idea: Modify the sampling distribution to more accurately describe
the results that will be making it to the literature
Alexander Etz (UC Irvine) Bayesian Bias Correction 12 / 43
Modeling the bias
Guan and Vandekerckhove (2016) suggested the following plausible
censoring processes...
(Note added: The x-axis is the p-value and y-axis is relative probability of
publication. The x-axes are truncated at .5 merely for aesthetics)
Alexander Etz (UC Irvine) Bayesian Bias Correction 13 / 43
Modeling the bias
... which make the following predictions about the t-values that make it to
the literature
−5 0 5
0
0.2
0.4
No bias
−5 0 5
0
0.2
0.4
−5 0 5
0
0.5
1
Extreme bias
−5 0 5
0
0.5
1
1.5
−5 0 5
0
0.2
0.4
Constant bias
−5 0 5
0
0.5
1
−5 0 5
0
0.2
0.4
0.6
0.8
Exponential bias
−5 0 5
0
0.5
1
Alexander Etz (UC Irvine) Bayesian Bias Correction 14 / 43
Modeling the bias
(Note added: The first model corresponds to the typical central and
non-central t distributions, where all results are published. The second says
n.s. results are never published, so all mass from the middle of the
distribution is pushed to the tails (hence the name “extreme”). The third
says all n.s. results are published at some rate π that does not depend on
the p-value (hence the name, “constant”). The fourth says n.s. results with
p-values close to .05 have higher publication rate than higher p-values.)
Alexander Etz (UC Irvine) Bayesian Bias Correction 15 / 43
Estimating and testing
Let’s consider a single study that finds t = 2.2 with n = 25.
Start with a prior on δ of p(δ|H1) = N(0, 1)
Goal 1: Calculate Bayes factor in favor of nonzero effect size
Goal 2: Compute posterior distribution for δ, p(δ|data, H1)
Alexander Etz (UC Irvine) Bayesian Bias Correction 16 / 43
Estimating and testing
We can compute the posterior distribution1 for each biasing model:
1
Note: Displayed probs do not sum to 1 since we do not show the probabilities of the
respective null models. A reference line is added at δ = .5 to aid comparisons.
Alexander Etz (UC Irvine) Bayesian Bias Correction 17 / 43
Estimating and testing
We can synthesize the posteriors according to their posterior model
probabilities using Bayesian model averaging (the ratio of the heights of the
black dots gives the model-averaged Bayes factor)
Alexander Etz (UC Irvine) Bayesian Bias Correction 18 / 43
Estimating and testing
Compare to results from the naive posterior, assuming no bias (blue)
Alexander Etz (UC Irvine) Bayesian Bias Correction 19 / 43
Estimating and testing
To summarize, if we observe t = 2.2 with n = 25:
The posterior distribution is shifted to smaller values
The mitigated Bayes factor is BF01 = 1.4, where the original Bayes
factor BF10 = 2.5 slightly favored the alternative. But neither are too
conclusive
Alexander Etz (UC Irvine) Bayesian Bias Correction 20 / 43
Estimating and testing
Consider: The authors publish a followup study, with two new (direct)
replications of the effect: In total we have t = (2.2, 2.3, 2.1) and
n = (25, 20, 35)
Now we want to do a fixed-effects meta-analysis to synthesize the
three sets of observations
Alexander Etz (UC Irvine) Bayesian Bias Correction 21 / 43
Estimating and testing
Alexander Etz (UC Irvine) Bayesian Bias Correction 22 / 43
Estimating and testing
Alexander Etz (UC Irvine) Bayesian Bias Correction 23 / 43
Estimating and testing
To summarize, for t = (2.2, 2.3, 2.1) and n = (25, 20, 35):
The cumulative posterior distribution is largely shifted to smaller values
The mitigated Bayes factor is BF01 = 1.2, again indecisive, whereas
the original Bayes factor BF10 = 17 largely favored the alternative
Alexander Etz (UC Irvine) Bayesian Bias Correction 24 / 43
Extending to random effects
Previously, the method could only apply to single studies or fixed
effects meta-analysis
We have extended it to apply to random effects meta-analysis, by
introducing study-specific effect sizes drawn from a higher level
population
Allows for there to be heterogeneity in true effects across studies (e.g.,
conceptual replications with new measures, replications on different
populations, etc.)
We now demonstrate this extension on some recently published data
Alexander Etz (UC Irvine) Bayesian Bias Correction 25 / 43
The studies
Alexander Etz (UC Irvine) Bayesian Bias Correction 26 / 43
The studies
“Recently, it has been claimed that risk-taking and spending behavior
may be triggered in part by evolutionarily driven motives”
“Sexual cues in advertising product categories such as casinos, fashion,
jewelry, cosmetic surgery, cars, cigarettes, and alcohol, and the
evidence for their effectiveness, suggests that controlling such primes in
real-world settings might constitute a valuable intervention.”
The authors conduct a meta-analysis of the published literature, and
subsequently perform various replications of selected studies
Alexander Etz (UC Irvine) Bayesian Bias Correction 27 / 43
The studies
Alexander Etz (UC Irvine) Bayesian Bias Correction 28 / 43
The studies
Various designs, manipulations, outcomes, all call for a random effects
meta-analysis that allows each study its own effect size:
Can evaluate each study’s individual effect size
Can evaluate overall population mean
Can evaluate variability across studies
Alexander Etz (UC Irvine) Bayesian Bias Correction 29 / 43
The published literature
Alexander Etz (UC Irvine) Bayesian Bias Correction 30 / 43
The replications
Alexander Etz (UC Irvine) Bayesian Bias Correction 31 / 43
Synthesis?
How can we synthesize these two sets of results?
Option: Throw them all together in a joint meta-analysis?
What about publication bias?
Option: Throw out the published literature, and only look at the
replications?
Isn’t this wasteful? Presumably there is at least some signal in there
we can extract
We attempt to jointly analyze all studies using our Bayesian bias correction
method
Alexander Etz (UC Irvine) Bayesian Bias Correction 32 / 43
The model
Random effects meta-analysis can be instantiated as a Bayesian hierarchical
model, but we would like to modify it to account for bias in the published
literature
Alexander Etz (UC Irvine) Bayesian Bias Correction 33 / 43
The model
We can estimate the effect sizes using model 3, of which 1 and 2 are special
cases...
Alexander Etz (UC Irvine) Bayesian Bias Correction 34 / 43
The model
...with an added hierarchical structure:
µ ∼ N(0, 1)
σ ∼ G(4, 4)
π ∼ B(5, 95)
δi | µ, σ ∼ N(µ, σ2
)
ti | δi , Ui = 0 ∼ biased-tν,π(ncp = δi
√
νi /2)
ti | δi , Ui = 1 ∼ tν(ncp = δi
√
νi /2)
where δi is the study-specific effect size for study i, Ui indicates whether
study i has not been through a publication filter. These priors, like always,
are up for debate, but we think they are reasonable
Alexander Etz (UC Irvine) Bayesian Bias Correction 35 / 43
The data, sorted by ascending ES
Alexander Etz (UC Irvine) Bayesian Bias Correction 36 / 43
Naive hierarchical model
Alexander Etz (UC Irvine) Bayesian Bias Correction 37 / 43
Debiasing hierarchical model
Alexander Etz (UC Irvine) Bayesian Bias Correction 38 / 43
Comparing the model estimates
Alexander Etz (UC Irvine) Bayesian Bias Correction 39 / 43
Summary
In both models we see the shrinkage typical of Bayesian methods
Large effects are pulled down, small effects are pulled up, both
“shrinking” toward the overall mean
The main difference between the naive hierarchical model and our
debiasing model: Extra shrinkage for published effects!
Alexander Etz (UC Irvine) Bayesian Bias Correction 40 / 43
More plots
We summarize our results as follows, showing our best estimates of the
mean and sd for the distribution from which we draw each study’s δ
Alexander Etz (UC Irvine) Bayesian Bias Correction 41 / 43
Model predictions
We can generate predictions from the models (i.e., posterior predictives) for
what we should expect for a new romance priming study with N=50
For the naive model, the predictive has a mean of .43 and sd of .42. The
bias correction model predictive has mean of .30 and sd of .38.
Alexander Etz (UC Irvine) Bayesian Bias Correction 42 / 43
Acknowledgements
Funding for this project is provided by the NSF Graduate Research
Fellowship Program, as well as a grant from the NSF’s Methods,
Measurements, and Statistics panel.
Alexander Etz (UC Irvine) Bayesian Bias Correction 43 / 43

More Related Content

What's hot

Topic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingTopic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingRyan Herzog
 
What is your question
What is your questionWhat is your question
What is your questionStephenSenn2
 
First in man tokyo
First in man tokyoFirst in man tokyo
First in man tokyoStephen Senn
 
Senn repligate
Senn repligateSenn repligate
Senn repligatejemille6
 
Thinking statistically v3
Thinking statistically v3Thinking statistically v3
Thinking statistically v3Stephen Senn
 
Why I hate minimisation
Why I hate minimisationWhy I hate minimisation
Why I hate minimisationStephen Senn
 
Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively jemille6
 
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."jemille6
 
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”jemille6
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningBill Liu
 
P values and the art of herding cats
P values  and the art of herding catsP values  and the art of herding cats
P values and the art of herding catsStephen Senn
 
Absence of a gold standard in diagnostic test accuracy research
Absence of a gold standard in diagnostic test accuracy researchAbsence of a gold standard in diagnostic test accuracy research
Absence of a gold standard in diagnostic test accuracy researchMaarten van Smeden
 
The revenge of RA Fisher
The revenge of RA FisherThe revenge of RA Fisher
The revenge of RA FisherStephen Senn
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy jemille6
 
Exploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory ResearchExploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory Researchjemille6
 
Real world modified
Real world modifiedReal world modified
Real world modifiedStephen Senn
 
P values and replication
P values and replicationP values and replication
P values and replicationStephen Senn
 
Final mayo's aps_talk
Final mayo's aps_talkFinal mayo's aps_talk
Final mayo's aps_talkjemille6
 

What's hot (20)

Topic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingTopic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis Testing
 
What is your question
What is your questionWhat is your question
What is your question
 
First in man tokyo
First in man tokyoFirst in man tokyo
First in man tokyo
 
Senn repligate
Senn repligateSenn repligate
Senn repligate
 
Thinking statistically v3
Thinking statistically v3Thinking statistically v3
Thinking statistically v3
 
Why I hate minimisation
Why I hate minimisationWhy I hate minimisation
Why I hate minimisation
 
Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively Statistical skepticism: How to use significance tests effectively
Statistical skepticism: How to use significance tests effectively
 
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."
Yoav Benjamini, "In the world beyond p<.05: When & How to use P<.0499..."
 
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”"The Statistical Replication Crisis: Paradoxes and Scapegoats”
"The Statistical Replication Crisis: Paradoxes and Scapegoats”
 
Causal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine LearningCausal Inference in Data Science and Machine Learning
Causal Inference in Data Science and Machine Learning
 
P values and the art of herding cats
P values  and the art of herding catsP values  and the art of herding cats
P values and the art of herding cats
 
Absence of a gold standard in diagnostic test accuracy research
Absence of a gold standard in diagnostic test accuracy researchAbsence of a gold standard in diagnostic test accuracy research
Absence of a gold standard in diagnostic test accuracy research
 
On being Bayesian
On being BayesianOn being Bayesian
On being Bayesian
 
The revenge of RA Fisher
The revenge of RA FisherThe revenge of RA Fisher
The revenge of RA Fisher
 
D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy D. Mayo: Replication Research Under an Error Statistical Philosophy
D. Mayo: Replication Research Under an Error Statistical Philosophy
 
Exploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory ResearchExploratory Research is More Reliable Than Confirmatory Research
Exploratory Research is More Reliable Than Confirmatory Research
 
Real world modified
Real world modifiedReal world modified
Real world modified
 
P values and replication
P values and replicationP values and replication
P values and replication
 
Final mayo's aps_talk
Final mayo's aps_talkFinal mayo's aps_talk
Final mayo's aps_talk
 
Spss session 1 and 2
Spss session 1 and 2Spss session 1 and 2
Spss session 1 and 2
 

Similar to Bayesian Bias Correction: Critically evaluating sets of studies in the presence of publication bias

1) The path length from A to B in the following graph is .docx
1) The path length from A to B in the following graph is .docx1) The path length from A to B in the following graph is .docx
1) The path length from A to B in the following graph is .docxmonicafrancis71118
 
Quality Engineering material
Quality Engineering materialQuality Engineering material
Quality Engineering materialTeluguSudhakar3
 
Power study: 1-way anova vs kruskall wallis
Power study: 1-way anova vs kruskall wallisPower study: 1-way anova vs kruskall wallis
Power study: 1-way anova vs kruskall wallisDon Cua
 
What's Significant? Hypothesis Testing, Effect Size, Confidence Intervals, & ...
What's Significant? Hypothesis Testing, Effect Size, Confidence Intervals, & ...What's Significant? Hypothesis Testing, Effect Size, Confidence Intervals, & ...
What's Significant? Hypothesis Testing, Effect Size, Confidence Intervals, & ...Pat Barlow
 
One way repeated measure anova
One way repeated measure anovaOne way repeated measure anova
One way repeated measure anovaAamna Haneef
 
A Study of Some Tests of Uniformity and Their Performances
A Study of Some Tests of Uniformity and Their PerformancesA Study of Some Tests of Uniformity and Their Performances
A Study of Some Tests of Uniformity and Their PerformancesIOSRJM
 
Day 11 t test for independent samples
Day 11 t test for independent samplesDay 11 t test for independent samples
Day 11 t test for independent samplesElih Sutisna Yanto
 
Lec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptx
Lec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptxLec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptx
Lec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptxAkhtaruzzamanlimon1
 
D. Mayo: Putting the brakes on the breakthrough: An informal look at the argu...
D. Mayo: Putting the brakes on the breakthrough: An informal look at the argu...D. Mayo: Putting the brakes on the breakthrough: An informal look at the argu...
D. Mayo: Putting the brakes on the breakthrough: An informal look at the argu...jemille6
 
Multivariate and Conditional Distribution
Multivariate and Conditional DistributionMultivariate and Conditional Distribution
Multivariate and Conditional Distributionssusered887b
 
Parametric & non parametric
Parametric & non parametricParametric & non parametric
Parametric & non parametricANCYBS
 
Week 5 Lecture 14 The Chi Square TestQuite often, patterns of .docx
Week 5 Lecture 14 The Chi Square TestQuite often, patterns of .docxWeek 5 Lecture 14 The Chi Square TestQuite often, patterns of .docx
Week 5 Lecture 14 The Chi Square TestQuite often, patterns of .docxcockekeshia
 
Anova n metaanalysis
Anova n metaanalysisAnova n metaanalysis
Anova n metaanalysisutpal sharma
 
Assessment 4 ContextRecall that null hypothesis tests are of.docx
Assessment 4 ContextRecall that null hypothesis tests are of.docxAssessment 4 ContextRecall that null hypothesis tests are of.docx
Assessment 4 ContextRecall that null hypothesis tests are of.docxgalerussel59292
 

Similar to Bayesian Bias Correction: Critically evaluating sets of studies in the presence of publication bias (20)

1) The path length from A to B in the following graph is .docx
1) The path length from A to B in the following graph is .docx1) The path length from A to B in the following graph is .docx
1) The path length from A to B in the following graph is .docx
 
Quality Engineering material
Quality Engineering materialQuality Engineering material
Quality Engineering material
 
Power study: 1-way anova vs kruskall wallis
Power study: 1-way anova vs kruskall wallisPower study: 1-way anova vs kruskall wallis
Power study: 1-way anova vs kruskall wallis
 
What's Significant? Hypothesis Testing, Effect Size, Confidence Intervals, & ...
What's Significant? Hypothesis Testing, Effect Size, Confidence Intervals, & ...What's Significant? Hypothesis Testing, Effect Size, Confidence Intervals, & ...
What's Significant? Hypothesis Testing, Effect Size, Confidence Intervals, & ...
 
non para.doc
non para.docnon para.doc
non para.doc
 
One way repeated measure anova
One way repeated measure anovaOne way repeated measure anova
One way repeated measure anova
 
A Study of Some Tests of Uniformity and Their Performances
A Study of Some Tests of Uniformity and Their PerformancesA Study of Some Tests of Uniformity and Their Performances
A Study of Some Tests of Uniformity and Their Performances
 
ECONOMETRICS I ASA
ECONOMETRICS I ASAECONOMETRICS I ASA
ECONOMETRICS I ASA
 
Anova copy
Anova   copyAnova   copy
Anova copy
 
Austin Statistics
Austin StatisticsAustin Statistics
Austin Statistics
 
Day 11 t test for independent samples
Day 11 t test for independent samplesDay 11 t test for independent samples
Day 11 t test for independent samples
 
Lec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptx
Lec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptxLec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptx
Lec1_Methods-for-Dummies-T-tests-anovas-and-regression.pptx
 
D. Mayo: Putting the brakes on the breakthrough: An informal look at the argu...
D. Mayo: Putting the brakes on the breakthrough: An informal look at the argu...D. Mayo: Putting the brakes on the breakthrough: An informal look at the argu...
D. Mayo: Putting the brakes on the breakthrough: An informal look at the argu...
 
Multivariate and Conditional Distribution
Multivariate and Conditional DistributionMultivariate and Conditional Distribution
Multivariate and Conditional Distribution
 
Analysis of variance
Analysis of varianceAnalysis of variance
Analysis of variance
 
Parametric & non parametric
Parametric & non parametricParametric & non parametric
Parametric & non parametric
 
Week 5 Lecture 14 The Chi Square TestQuite often, patterns of .docx
Week 5 Lecture 14 The Chi Square TestQuite often, patterns of .docxWeek 5 Lecture 14 The Chi Square TestQuite often, patterns of .docx
Week 5 Lecture 14 The Chi Square TestQuite often, patterns of .docx
 
Basic statistics
Basic statisticsBasic statistics
Basic statistics
 
Anova n metaanalysis
Anova n metaanalysisAnova n metaanalysis
Anova n metaanalysis
 
Assessment 4 ContextRecall that null hypothesis tests are of.docx
Assessment 4 ContextRecall that null hypothesis tests are of.docxAssessment 4 ContextRecall that null hypothesis tests are of.docx
Assessment 4 ContextRecall that null hypothesis tests are of.docx
 

Recently uploaded

The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxEran Akiva Sinbar
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
Good agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptxGood agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptxSimeonChristian
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationColumbia Weather Systems
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx023NiWayanAnggiSriWa
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 

Recently uploaded (20)

The dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptxThe dark energy paradox leads to a new structure of spacetime.pptx
The dark energy paradox leads to a new structure of spacetime.pptx
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
Good agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptxGood agricultural practices 3rd year bpharm. herbal drug technology .pptx
Good agricultural practices 3rd year bpharm. herbal drug technology .pptx
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather Station
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 

Bayesian Bias Correction: Critically evaluating sets of studies in the presence of publication bias

  • 1. Bayesian Bias Correction: Critically evaluating sets of studies in the presence of publication bias Alexander Etz UC Irvine Alexander Etz (UC Irvine) Bayesian Bias Correction 1 / 43
  • 2. Introduction When we read a published paper, we have two broadly related goals Goal 1: Evaluate the evidence in favor of a claim Goal 2: Estimate how big the reported effect could reasonably be Alexander Etz (UC Irvine) Bayesian Bias Correction 2 / 43
  • 3. Introduction Solution (in principle): Compute a Bayes factor (goal 1), or compute the posterior distribution (goal 2) Alexander Etz (UC Irvine) Bayesian Bias Correction 3 / 43
  • 4. Challenge Challenge: Literature is biased You only see a subset of all studies performed, because only the studies that “worked” make it past the gatekeepers (editors, reviewers) Alexander Etz (UC Irvine) Bayesian Bias Correction 4 / 43
  • 5. Publication bias Statistical significance filter: Only studies with p < .05 get published Censored data, based on the stochastic outcome of the study Alexander Etz (UC Irvine) Bayesian Bias Correction 5 / 43
  • 6. Publication bias Publication bias ruins our ability to evaluate the claims in a paper Alexander Etz (UC Irvine) Bayesian Bias Correction 6 / 43
  • 7. Example Imagine we estimate a group mean and perform a one-sample test We find observed mean of 17, with a standard error of 8 Alexander Etz (UC Irvine) Bayesian Bias Correction 7 / 43
  • 8. Example If the null hypothesis is true, this would be what we predict: Alexander Etz (UC Irvine) Bayesian Bias Correction 8 / 43
  • 9. Example If the effect is small and positive, this is what we predict: Alexander Etz (UC Irvine) Bayesian Bias Correction 9 / 43
  • 10. Implications What gets published if we only acknowledge significant results? Only studies with observed mean greater than 16 If there is no effect, any published result is a type-1 error (i.e., false positive) If there is non-null effect, our effect size estimate will certainly be inflated Publication bias is clearly a problem for evaluating single studies, and its effects can only compound if we meta-analyze many studies at once Alexander Etz (UC Irvine) Bayesian Bias Correction 10 / 43
  • 11. Modeling the bias We are modelers, so naturally we try to model this publication filtering process Try to patch up our inferences using Bayesian bias correction Alexander Etz (UC Irvine) Bayesian Bias Correction 11 / 43
  • 12. Modeling the bias Key idea: Modify the sampling distribution to more accurately describe the results that will be making it to the literature Alexander Etz (UC Irvine) Bayesian Bias Correction 12 / 43
  • 13. Modeling the bias Guan and Vandekerckhove (2016) suggested the following plausible censoring processes... (Note added: The x-axis is the p-value and y-axis is relative probability of publication. The x-axes are truncated at .5 merely for aesthetics) Alexander Etz (UC Irvine) Bayesian Bias Correction 13 / 43
  • 14. Modeling the bias ... which make the following predictions about the t-values that make it to the literature −5 0 5 0 0.2 0.4 No bias −5 0 5 0 0.2 0.4 −5 0 5 0 0.5 1 Extreme bias −5 0 5 0 0.5 1 1.5 −5 0 5 0 0.2 0.4 Constant bias −5 0 5 0 0.5 1 −5 0 5 0 0.2 0.4 0.6 0.8 Exponential bias −5 0 5 0 0.5 1 Alexander Etz (UC Irvine) Bayesian Bias Correction 14 / 43
  • 15. Modeling the bias (Note added: The first model corresponds to the typical central and non-central t distributions, where all results are published. The second says n.s. results are never published, so all mass from the middle of the distribution is pushed to the tails (hence the name “extreme”). The third says all n.s. results are published at some rate π that does not depend on the p-value (hence the name, “constant”). The fourth says n.s. results with p-values close to .05 have higher publication rate than higher p-values.) Alexander Etz (UC Irvine) Bayesian Bias Correction 15 / 43
  • 16. Estimating and testing Let’s consider a single study that finds t = 2.2 with n = 25. Start with a prior on δ of p(δ|H1) = N(0, 1) Goal 1: Calculate Bayes factor in favor of nonzero effect size Goal 2: Compute posterior distribution for δ, p(δ|data, H1) Alexander Etz (UC Irvine) Bayesian Bias Correction 16 / 43
  • 17. Estimating and testing We can compute the posterior distribution1 for each biasing model: 1 Note: Displayed probs do not sum to 1 since we do not show the probabilities of the respective null models. A reference line is added at δ = .5 to aid comparisons. Alexander Etz (UC Irvine) Bayesian Bias Correction 17 / 43
  • 18. Estimating and testing We can synthesize the posteriors according to their posterior model probabilities using Bayesian model averaging (the ratio of the heights of the black dots gives the model-averaged Bayes factor) Alexander Etz (UC Irvine) Bayesian Bias Correction 18 / 43
  • 19. Estimating and testing Compare to results from the naive posterior, assuming no bias (blue) Alexander Etz (UC Irvine) Bayesian Bias Correction 19 / 43
  • 20. Estimating and testing To summarize, if we observe t = 2.2 with n = 25: The posterior distribution is shifted to smaller values The mitigated Bayes factor is BF01 = 1.4, where the original Bayes factor BF10 = 2.5 slightly favored the alternative. But neither are too conclusive Alexander Etz (UC Irvine) Bayesian Bias Correction 20 / 43
  • 21. Estimating and testing Consider: The authors publish a followup study, with two new (direct) replications of the effect: In total we have t = (2.2, 2.3, 2.1) and n = (25, 20, 35) Now we want to do a fixed-effects meta-analysis to synthesize the three sets of observations Alexander Etz (UC Irvine) Bayesian Bias Correction 21 / 43
  • 22. Estimating and testing Alexander Etz (UC Irvine) Bayesian Bias Correction 22 / 43
  • 23. Estimating and testing Alexander Etz (UC Irvine) Bayesian Bias Correction 23 / 43
  • 24. Estimating and testing To summarize, for t = (2.2, 2.3, 2.1) and n = (25, 20, 35): The cumulative posterior distribution is largely shifted to smaller values The mitigated Bayes factor is BF01 = 1.2, again indecisive, whereas the original Bayes factor BF10 = 17 largely favored the alternative Alexander Etz (UC Irvine) Bayesian Bias Correction 24 / 43
  • 25. Extending to random effects Previously, the method could only apply to single studies or fixed effects meta-analysis We have extended it to apply to random effects meta-analysis, by introducing study-specific effect sizes drawn from a higher level population Allows for there to be heterogeneity in true effects across studies (e.g., conceptual replications with new measures, replications on different populations, etc.) We now demonstrate this extension on some recently published data Alexander Etz (UC Irvine) Bayesian Bias Correction 25 / 43
  • 26. The studies Alexander Etz (UC Irvine) Bayesian Bias Correction 26 / 43
  • 27. The studies “Recently, it has been claimed that risk-taking and spending behavior may be triggered in part by evolutionarily driven motives” “Sexual cues in advertising product categories such as casinos, fashion, jewelry, cosmetic surgery, cars, cigarettes, and alcohol, and the evidence for their effectiveness, suggests that controlling such primes in real-world settings might constitute a valuable intervention.” The authors conduct a meta-analysis of the published literature, and subsequently perform various replications of selected studies Alexander Etz (UC Irvine) Bayesian Bias Correction 27 / 43
  • 28. The studies Alexander Etz (UC Irvine) Bayesian Bias Correction 28 / 43
  • 29. The studies Various designs, manipulations, outcomes, all call for a random effects meta-analysis that allows each study its own effect size: Can evaluate each study’s individual effect size Can evaluate overall population mean Can evaluate variability across studies Alexander Etz (UC Irvine) Bayesian Bias Correction 29 / 43
  • 30. The published literature Alexander Etz (UC Irvine) Bayesian Bias Correction 30 / 43
  • 31. The replications Alexander Etz (UC Irvine) Bayesian Bias Correction 31 / 43
  • 32. Synthesis? How can we synthesize these two sets of results? Option: Throw them all together in a joint meta-analysis? What about publication bias? Option: Throw out the published literature, and only look at the replications? Isn’t this wasteful? Presumably there is at least some signal in there we can extract We attempt to jointly analyze all studies using our Bayesian bias correction method Alexander Etz (UC Irvine) Bayesian Bias Correction 32 / 43
  • 33. The model Random effects meta-analysis can be instantiated as a Bayesian hierarchical model, but we would like to modify it to account for bias in the published literature Alexander Etz (UC Irvine) Bayesian Bias Correction 33 / 43
  • 34. The model We can estimate the effect sizes using model 3, of which 1 and 2 are special cases... Alexander Etz (UC Irvine) Bayesian Bias Correction 34 / 43
  • 35. The model ...with an added hierarchical structure: µ ∼ N(0, 1) σ ∼ G(4, 4) π ∼ B(5, 95) δi | µ, σ ∼ N(µ, σ2 ) ti | δi , Ui = 0 ∼ biased-tν,π(ncp = δi √ νi /2) ti | δi , Ui = 1 ∼ tν(ncp = δi √ νi /2) where δi is the study-specific effect size for study i, Ui indicates whether study i has not been through a publication filter. These priors, like always, are up for debate, but we think they are reasonable Alexander Etz (UC Irvine) Bayesian Bias Correction 35 / 43
  • 36. The data, sorted by ascending ES Alexander Etz (UC Irvine) Bayesian Bias Correction 36 / 43
  • 37. Naive hierarchical model Alexander Etz (UC Irvine) Bayesian Bias Correction 37 / 43
  • 38. Debiasing hierarchical model Alexander Etz (UC Irvine) Bayesian Bias Correction 38 / 43
  • 39. Comparing the model estimates Alexander Etz (UC Irvine) Bayesian Bias Correction 39 / 43
  • 40. Summary In both models we see the shrinkage typical of Bayesian methods Large effects are pulled down, small effects are pulled up, both “shrinking” toward the overall mean The main difference between the naive hierarchical model and our debiasing model: Extra shrinkage for published effects! Alexander Etz (UC Irvine) Bayesian Bias Correction 40 / 43
  • 41. More plots We summarize our results as follows, showing our best estimates of the mean and sd for the distribution from which we draw each study’s δ Alexander Etz (UC Irvine) Bayesian Bias Correction 41 / 43
  • 42. Model predictions We can generate predictions from the models (i.e., posterior predictives) for what we should expect for a new romance priming study with N=50 For the naive model, the predictive has a mean of .43 and sd of .42. The bias correction model predictive has mean of .30 and sd of .38. Alexander Etz (UC Irvine) Bayesian Bias Correction 42 / 43
  • 43. Acknowledgements Funding for this project is provided by the NSF Graduate Research Fellowship Program, as well as a grant from the NSF’s Methods, Measurements, and Statistics panel. Alexander Etz (UC Irvine) Bayesian Bias Correction 43 / 43