The seven habits of highly
effective statisticians
Stephen Senn
Consultant Statistician, Edinburgh, UK
© Stephen Senn 2020 1
A Question to Keep You Amused
Consider a ‘coin of ignorance’
$$P(H) = \theta, \quad P(T) = 1 - \theta, \qquad f(\theta) = 1, \quad 0 \le \theta \le 1$$
The coin is tossed 100 times. If X is the number of heads,
which of these two is more likely?
$P(X = 50)$: $100!/(50!\,50!) \approx 10^{29}$ sequences
$P(X = 100)$: one sequence
Here $\theta$ is the probability of a head, and every value of $\theta$ is equally likely.
Of course, this is an ironic title
• Any statistician knows that you should think in terms of the three Cs:
• Causation
• Control
• Comparison
• To which a fourth might be added
• Counterfactuals
• The question of interest is
• What habits have a beneficial effect on your probability of being an effective
statistician?
• Many effective statisticians will be in the habit of taking breakfast. This
doesn’t make taking breakfast a cause of being an effective statistician.
(Counterfactual: that which would have happened had you acted differently.)
And my advice is hypocritical
• I earn my living as a statistician promoting, using and evaluating
numerical evidence
• Based on studies with
• Control
• Randomisation
• Replication
• I am proposing instead to give you advice based on one uncontrolled
example
• Me
The magnificent seven
• Read: include some classics in your reading
• Listen (& see): fit the answer to the problem, not vice versa
• Understand: requires some subject-matter comprehension
• Think: it’s not just a matter of mathematics (but it also is)
• Do: the devil is in the detail and doing discovers it
• Calculate: use calculations to increase understanding, not as a substitute for it
• Communicate: think hard about what the simplest honest way to communicate the message is
I am not going to go through this list in detail
• Instead I shall illustrate some of these points by a few examples I shall
present
• Invalid inversion
• Regression to the mean
• Some statistical ‘howlers’
• These will illustrate between them the value of
• Understand
• Communicate
• Think
• Do
• Read
• Calculate
What happened to Listen?
That’s where you come in!
A Simple Example of ‘Invalid Inversion’
• Most women do not suffer from breast cancer
• It would be a mistake to conclude, however, that most breast cancer
victims are not women
• To do so would be to transpose the conditionals
• This is an example of invalid inversion
• Why is this important?
• People regularly confuse the probability of the data given the
hypothesis with the probability of the hypothesis given the data
• Misinterpretation of P-values is linked to this
Some Plausible Figures for the UK
Probability of breast cancer given female = 550/31,418 = 0.018
Probability of female given breast cancer = 550/553 = 0.995
The numerator is the same; the difference is in the denominator.
Invalid inversion is an error caused by mistaking the relevant marginal class
550/31418 or 550/553
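The two calculations can be checked in a few lines. This is only a sketch using the three counts quoted on the slides; the variable names are illustrative, and the units of the counts are not stated in the source.

```python
# Counts quoted on the slides (units are not stated there).
females = 31_418          # marginal total: female
breast_cancer = 553       # marginal total: breast cancer
female_and_cancer = 550   # joint cell, shared by both calculations


def conditional(joint: float, marginal: float) -> float:
    """Return P(event | condition) as joint count / marginal count."""
    return joint / marginal


p_cancer_given_female = conditional(female_and_cancer, females)
p_female_given_cancer = conditional(female_and_cancer, breast_cancer)

# Same numerator, different denominators.
print(f"P(breast cancer | female) = {p_cancer_given_female:.3f}")
print(f"P(female | breast cancer) = {p_female_given_cancer:.3f}")
```

Swapping the denominator is all it takes to move from about 2% to more than 99%.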
A Little Maths
$$P(A \mid B) = \frac{P(A \cap B)}{P(B)}, \qquad P(B \mid A) = \frac{P(A \cap B)}{P(A)}$$
Unless $P(B) = P(A)$, $P(A \mid B) \ne P(B \mid A)$.
So invalid inversion is equivalent to a confusion of the marginal probabilities. The
same joint probability is involved in the two conditional probabilities but different
marginal probabilities are involved
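Dividing one conditional probability by the other makes this explicit. The step below is a short worked sketch that uses only the definition of conditional probability and assumes the joint probability is non-zero:

```latex
\[
\frac{P(A \mid B)}{P(B \mid A)}
  = \frac{P(A \cap B)/P(B)}{P(A \cap B)/P(A)}
  = \frac{P(A)}{P(B)}
\]
```

With A as breast cancer and B as female, the figures quoted earlier give a ratio of 553/31,418, roughly 0.018. The joint count of 550 cancels out entirely.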
The Regression Analogue
Predicting Y from X is not the same as predicting X from Y.
$$\beta_{Y \mid X} = \frac{\sigma_{XY}}{\sigma_X^2}, \qquad \beta_{X \mid Y} = \frac{\sigma_{XY}}{\sigma_Y^2}$$
Note the similarity with the probability case.
The numerator (the covariance) is a statistic of joint variation.
The denominators (the variances) are statistics of marginal variation. These
marginal statistics are not the same.
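A small numerical sketch of the two slopes follows. The data are hypothetical, chosen only to keep the arithmetic simple, and the helper name `sample_cov` is illustrative rather than anything from the talk.

```python
from statistics import mean

# Hypothetical paired observations (illustration only).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 1.0, 4.0, 3.0, 6.0]


def sample_cov(u, v):
    """Sample covariance of two equal-length sequences (divisor n - 1)."""
    mu, mv = mean(u), mean(v)
    return sum((a - mu) * (b - mv) for a, b in zip(u, v)) / (len(u) - 1)


cov_xy = sample_cov(x, y)  # joint variation: the shared numerator
var_x = sample_cov(x, x)   # marginal variation of X
var_y = sample_cov(y, y)   # marginal variation of Y

slope_y_on_x = cov_xy / var_x  # predicts Y from X
slope_x_on_y = cov_xy / var_y  # predicts X from Y
r_squared = cov_xy ** 2 / (var_x * var_y)

print(f"slope of Y on X: {slope_y_on_x:.4f}")
print(f"slope of X on Y: {slope_x_on_y:.4f}")
print(f"product of slopes: {slope_y_on_x * slope_x_on_y:.4f}")
print(f"squared correlation: {r_squared:.4f}")
```

The product of the two slopes equals the squared correlation, so one slope is the reciprocal of the other only when the correlation is perfect.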
The difference is in the denominator
The numerator is the same
Dimensional analysis
• Consider the example of regressing weight on height and vice versa
• Suppose you put height in cm into your ‘black box’ to predict weight in kg
• The input is in cm
• The output is in kg
• You must multiply the cm by a regression coefficient that is in kg/cm
• The covariance is in units of kg × cm and you divide by a variance that is in cm² to get kg/cm
• Suppose you put weight in kg into your black box to predict height in cm
• You must multiply the kg by a coefficient that is in cm/kg
• The numerator is the covariance in both cases
• A different variance is used for the denominator
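One way to see the units at work is to change them. In the sketch below the heights and weights are hypothetical, and `slope` is an illustrative helper. Re-expressing height in metres multiplies the kg-per-unit-height coefficient by 100, as the dimensional argument implies.

```python
from statistics import mean

# Hypothetical heights and weights for five people (illustration only).
heights_cm = [150.0, 160.0, 170.0, 180.0, 190.0]
weights_kg = [52.0, 58.0, 69.0, 77.0, 88.0]


def slope(predictor, response):
    """Least-squares slope: covariance(predictor, response) / variance(predictor)."""
    mp, mr = mean(predictor), mean(response)
    sxy = sum((p - mp) * (r - mr) for p, r in zip(predictor, response))
    sxx = sum((p - mp) ** 2 for p in predictor)
    return sxy / sxx  # the (n - 1) divisors of covariance and variance cancel


kg_per_cm = slope(heights_cm, weights_kg)  # units: (kg x cm) / cm^2 = kg/cm
cm_per_kg = slope(weights_kg, heights_cm)  # units: (kg x cm) / kg^2 = cm/kg

# Same people, height now in metres: the coefficient must be in kg/m.
heights_m = [h / 100.0 for h in heights_cm]
kg_per_m = slope(heights_m, weights_kg)

print(f"weight on height: {kg_per_cm:.3f} kg/cm = {kg_per_m:.1f} kg/m")
print(f"height on weight: {cm_per_kg:.3f} cm/kg")
```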
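Just to make that perfectly clear
$$\text{weight (kg)} = \text{constant (kg)} + \frac{\text{cov (cm} \times \text{kg)}}{\text{var (cm} \times \text{cm)}} \times \text{height (cm)}$$
$$\text{height (cm)} = \text{constant (cm)} + \frac{\text{cov (cm} \times \text{kg)}}{\text{var (kg} \times \text{kg)}} \times \text{weight (kg)}$$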
Morals
• Think carefully about basic and fundamental concepts in probability and
statistics
• Seek an understanding that is not just mathematical but that reveals why
things have to be the way they are
• Make parallels
• Regression is similar to conditional probability in some way
• Dimensional analysis (a tool used by physicists and engineers) is very valuable
• Find the simplest way to communicate important points
• Proofs are good but not for this
• Examples are excellent
• Read widely and seek different explanations of the same thing
Regression to the Mean
A Simulated Example
• Diastolic blood pressure (DBP)
• Mean 90 mmHg
• Between-patient variance 50 mmHg²
• Within-patient variance 15 mmHg²
• Boundary for hypertensive 95 mmHg
• Simulation of 1000 patients whose DBP at baseline
and outcome are shown
• Blue consistent normotensive
• Red Consistent hypertensive
• Orange hypertensive/normotensive or vice versa
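A simulation along these lines might look as follows. This is a sketch under assumptions the slides do not state: normally distributed between- and within-patient components, no treatment effect, ‘hypertensive’ meaning a reading of at least 95 mmHg, and an arbitrary random seed.

```python
import random
from statistics import mean

MEAN_DBP = 90.0     # mmHg, population mean
BETWEEN_VAR = 50.0  # mmHg^2, between-patient variance
WITHIN_VAR = 15.0   # mmHg^2, within-patient variance
CUTOFF = 95.0       # mmHg, boundary for hypertensive (assumed: reading >= cutoff)


def simulate(n_patients=1000, seed=2020):
    """Return (baseline, outcome) DBP pairs with no treatment effect at all."""
    rng = random.Random(seed)  # seed is arbitrary, fixed for reproducibility
    pairs = []
    for _ in range(n_patients):
        true_level = rng.gauss(MEAN_DBP, BETWEEN_VAR ** 0.5)
        baseline = true_level + rng.gauss(0.0, WITHIN_VAR ** 0.5)
        outcome = true_level + rng.gauss(0.0, WITHIN_VAR ** 0.5)
        pairs.append((baseline, outcome))
    return pairs


pairs = simulate()
# Follow up only those who were hypertensive at baseline.
selected = [(b, o) for b, o in pairs if b >= CUTOFF]

print("all patients:   baseline mean %.1f, outcome mean %.1f"
      % (mean(b for b, _ in pairs), mean(o for _, o in pairs)))
print("selected only:  baseline mean %.1f, outcome mean %.1f"
      % (mean(b for b, _ in selected), mean(o for _, o in selected)))
print("selected patients below cutoff at outcome: %d of %d"
      % (sum(o < CUTOFF for _, o in selected), len(selected)))
```

Nothing was done to any patient, yet the selected group appears to improve. The selection on a noisy baseline is what produces the drop.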
[Figure: what you will see if all patients are followed up. Mean at baseline and outcome are the same.]
[Figure: what you will see if only hypertensive patients are followed up. All patients are hypertensive at baseline; many are not at outcome. Mean at outcome is lower than at baseline.]
Probably not the best way to explain this
Who wrote this?
Senn, S. J. (1988). How much of the placebo 'effect' is really statistical
regression? [letter]. Statistics in Medicine, 7(11), 1203
Doing and calculating avoids stupid mistakes
• Stupid mistake: proposing allocation ratios of 7:5:3 for a three-armed trial.
  Cure: calculate the minimum block size. Hint: it’s 105.
• Stupid mistake: proposing some software for cross-over trials that could adjust the treatments to which patients are allocated depending on results in earlier periods.
  Cure: try to do this in real time. Hint: this may help you learn that patients do not arrive simultaneously in a clinical trial.
• Stupid mistake: claiming that the use of placebos in clinical trials is unethical if there is an effective treatment.
  Cure: run a clinical trial in a serious disease where there is a partially effective treatment. Hint: how do you avoid withdrawing the partially effective treatment from some patients?
Advice on Understanding, Thinking, Reading
etc.
• Mathematics is important
• But it’s not enough
• Statistics is not a branch of mathematics although probability theory is
• Applications are important
• Loving your data
• Getting to know the application area
• Biology!
• Pharmacology!
• Reading the classics is good for you
• Especially Fisher
That problem
The two events are equally likely. In fact,
$$P(X = k) = \frac{1}{n + 1}, \quad k = 0, 1, \ldots, n, \quad \text{for every } n.$$
Proof could involve some or all of the following:
marginal, conditional and joint probabilities
calculus
Bayes theorem
posterior probability
predictive distribution
proof by induction
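The result can also be checked exactly by computer. The sketch below assumes the uniform prior on θ described earlier. It relies on the standard Beta integral identity, ∫ θ^k (1 − θ)^(n − k) dθ = k!(n − k)!/(n + 1)! over [0, 1], which is not spelled out on the slides; the function name `prob_heads` is illustrative.

```python
from fractions import Fraction
from math import comb, factorial


def prob_heads(k: int, n: int) -> Fraction:
    """Exact P(X = k) in n tosses when theta is uniform on [0, 1].

    Integrates the binomial probability over theta using
    integral of theta^k (1 - theta)^(n - k) = k! (n - k)! / (n + 1)!.
    """
    beta_integral = Fraction(factorial(k) * factorial(n - k), factorial(n + 1))
    return comb(n, k) * beta_integral


n = 100
print(prob_heads(50, n))   # 1/101
print(prob_heads(100, n))  # 1/101

# Every possible number of heads has the same probability.
assert all(prob_heads(k, n) == Fraction(1, n + 1) for k in range(n + 1))
```

The roughly 10^29 sequences that give 50 heads are each far less probable, once averaged over θ, than the single all-heads sequence, and the two effects cancel exactly.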
Intuition
Imagine one billion tosses.
Your posterior probability would have to be very close to the
observed relative frequency, which would be close to the
‘true’ value.
But your prior probability says every true value is equally
likely.
Therefore, every observable ratio is equally likely.
But the result is also trivially true for n = 1. It is hardly
surprising, therefore, if the result is true for every value of n
between 1 and 1 billion.
Moral
• It is important to think about your assumptions carefully
• If you do this you can understand what they imply
• Trying simple cases is helpful
• If you do this you can often see what the solution must be
• Extreme cases (one billion tosses) can also be helpful
• The mathematical solution is valuable but it is not a substitute for this
• Statistics is more than just mathematics
• It is also science and philosophy
[Diagram labelled ‘Mathematics’ and ‘Statistics’: two routes from a real problem to a solution and its application, one passing through an idealised problem and the other through an operational problem.]
In the mathematical formulation of any problem it is necessary
to base oneself on some appropriate idealizations and
simplification… One loses sight of the original nature of the
problem, falls in love with the idealization, and then blames
reality for not conforming to it.
de Finetti (1975)
It seems a pity that while we statisticians have an opportunity
to rate as first-class scientists we should settle for the rather
dreary role of second-class mathematicians.
George Box (1990)
Statistics is a subject where everything has to
be understood three times
• In terms of mathematics
• In terms of philosophy
• In terms of application
• Finally, I would like to leave you with this question
• Did you know there are only 120 days to Christmas?
Traditional Polish present: Piernik
Alternative suggestion: 3rd edition out soon

More Related Content

What's hot

Prognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient healthPrognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient healthMaarten van Smeden
 
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Maarten van Smeden
 
Introduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IIntroduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IMaarten van Smeden
 
Clinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptxClinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptxStephenSenn2
 
Has modelling killed randomisation inference frankfurt
Has modelling killed randomisation inference frankfurtHas modelling killed randomisation inference frankfurt
Has modelling killed randomisation inference frankfurtStephen Senn
 
Development and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutionsDevelopment and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutionsMaarten van Smeden
 
Introduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part IIIntroduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part IIMaarten van Smeden
 
Measurement error in medical research
Measurement error in medical researchMeasurement error in medical research
Measurement error in medical researchMaarten van Smeden
 
Is it causal, is it prediction or is it neither?
Is it causal, is it prediction or is it neither?Is it causal, is it prediction or is it neither?
Is it causal, is it prediction or is it neither?Maarten van Smeden
 
Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Maarten van Smeden
 
QUANTIFYING THE IMPACT OF DIFFERENT APPROACHES FOR HANDLING CONTINUOUS PREDIC...
QUANTIFYING THE IMPACT OF DIFFERENT APPROACHES FOR HANDLING CONTINUOUS PREDIC...QUANTIFYING THE IMPACT OF DIFFERENT APPROACHES FOR HANDLING CONTINUOUS PREDIC...
QUANTIFYING THE IMPACT OF DIFFERENT APPROACHES FOR HANDLING CONTINUOUS PREDIC...GaryCollins74
 
Five questions about artificial intelligence
Five questions about artificial intelligenceFive questions about artificial intelligence
Five questions about artificial intelligenceMaarten van Smeden
 
Statistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptxStatistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptxEwout Steyerberg
 
Take it to the Limit: quantitation, likelihood, modelling and other matters
Take it to the Limit: quantitation, likelihood, modelling and other mattersTake it to the Limit: quantitation, likelihood, modelling and other matters
Take it to the Limit: quantitation, likelihood, modelling and other mattersStephen Senn
 
Minimally important differences v2
Minimally important differences v2Minimally important differences v2
Minimally important differences v2Stephen Senn
 
Personalised medicine a sceptical view
Personalised medicine a sceptical viewPersonalised medicine a sceptical view
Personalised medicine a sceptical viewStephen Senn
 
Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianLaure Wynants
 

What's hot (20)

Prognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient healthPrognosis-based medicine: merits and pitfalls of forecasting patient health
Prognosis-based medicine: merits and pitfalls of forecasting patient health
 
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
Shrinkage in medical prediction: the poor man’s solution for an inadequate sa...
 
Introduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IIntroduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part I
 
Clinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptxClinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptx
 
Has modelling killed randomisation inference frankfurt
Has modelling killed randomisation inference frankfurtHas modelling killed randomisation inference frankfurt
Has modelling killed randomisation inference frankfurt
 
Development and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutionsDevelopment and evaluation of prediction models: pitfalls and solutions
Development and evaluation of prediction models: pitfalls and solutions
 
Predictimands
PredictimandsPredictimands
Predictimands
 
Introduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part IIIntroduction to prediction modelling - Berlin 2018 - Part II
Introduction to prediction modelling - Berlin 2018 - Part II
 
P-values in crisis
P-values in crisisP-values in crisis
P-values in crisis
 
P value wars
P value warsP value wars
P value wars
 
Measurement error in medical research
Measurement error in medical researchMeasurement error in medical research
Measurement error in medical research
 
Is it causal, is it prediction or is it neither?
Is it causal, is it prediction or is it neither?Is it causal, is it prediction or is it neither?
Is it causal, is it prediction or is it neither?
 
Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...Guideline for high-quality diagnostic and prognostic applications of AI in he...
Guideline for high-quality diagnostic and prognostic applications of AI in he...
 
QUANTIFYING THE IMPACT OF DIFFERENT APPROACHES FOR HANDLING CONTINUOUS PREDIC...
QUANTIFYING THE IMPACT OF DIFFERENT APPROACHES FOR HANDLING CONTINUOUS PREDIC...QUANTIFYING THE IMPACT OF DIFFERENT APPROACHES FOR HANDLING CONTINUOUS PREDIC...
QUANTIFYING THE IMPACT OF DIFFERENT APPROACHES FOR HANDLING CONTINUOUS PREDIC...
 
Five questions about artificial intelligence
Five questions about artificial intelligenceFive questions about artificial intelligence
Five questions about artificial intelligence
 
Statistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptxStatistics and ML 21Oct22 sel.pptx
Statistics and ML 21Oct22 sel.pptx
 
Take it to the Limit: quantitation, likelihood, modelling and other matters
Take it to the Limit: quantitation, likelihood, modelling and other mattersTake it to the Limit: quantitation, likelihood, modelling and other matters
Take it to the Limit: quantitation, likelihood, modelling and other matters
 
Minimally important differences v2
Minimally important differences v2Minimally important differences v2
Minimally important differences v2
 
Personalised medicine a sceptical view
Personalised medicine a sceptical viewPersonalised medicine a sceptical view
Personalised medicine a sceptical view
 
Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatistician
 

Similar to The 7 habits of highly effective statisticians

What should we expect from reproducibiliry
What should we expect from reproducibiliryWhat should we expect from reproducibiliry
What should we expect from reproducibiliryStephen Senn
 
Thinking statistically v3
Thinking statistically v3Thinking statistically v3
Thinking statistically v3Stephen Senn
 
Topic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingTopic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingRyan Herzog
 
Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error. Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error. EPINOR
 
Senn repligate
Senn repligateSenn repligate
Senn repligatejemille6
 
Seven myths of randomisation
Seven myths of randomisation Seven myths of randomisation
Seven myths of randomisation Stephen Senn
 
In search of the lost loss function
In search of the lost loss function In search of the lost loss function
In search of the lost loss function Stephen Senn
 
Statistics in clinical and translational research common pitfalls
Statistics in clinical and translational research  common pitfallsStatistics in clinical and translational research  common pitfalls
Statistics in clinical and translational research common pitfallsPavlos Msaouel, MD, PhD
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...StephenSenn2
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...jemille6
 
1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notescartlidge
 
The challenge of small data
The challenge of small dataThe challenge of small data
The challenge of small dataStephen Senn
 
Why I hate minimisation
Why I hate minimisationWhy I hate minimisation
Why I hate minimisationStephen Senn
 
Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)jasondeveau
 
P values and replication
P values and replicationP values and replication
P values and replicationStephen Senn
 
probability.pptx
probability.pptxprobability.pptx
probability.pptxbisan3
 
Insights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibilityInsights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibilityDorothy Bishop
 
Clinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptxClinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptxStephenSenn3
 

Similar to The 7 habits of highly effective statisticians (20)

What should we expect from reproducibiliry
What should we expect from reproducibiliryWhat should we expect from reproducibiliry
What should we expect from reproducibiliry
 
Thinking statistically v3
Thinking statistically v3Thinking statistically v3
Thinking statistically v3
 
Topic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingTopic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis Testing
 
Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error. Lecture by Professor Imre Janszky about random error.
Lecture by Professor Imre Janszky about random error.
 
How to do the maths
How to do the mathsHow to do the maths
How to do the maths
 
Senn repligate
Senn repligateSenn repligate
Senn repligate
 
Seven myths of randomisation
Seven myths of randomisation Seven myths of randomisation
Seven myths of randomisation
 
In search of the lost loss function
In search of the lost loss function In search of the lost loss function
In search of the lost loss function
 
Statistics in clinical and translational research common pitfalls
Statistics in clinical and translational research  common pitfallsStatistics in clinical and translational research  common pitfalls
Statistics in clinical and translational research common pitfalls
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...
 
The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...The replication crisis: are P-values the problem and are Bayes factors the so...
The replication crisis: are P-values the problem and are Bayes factors the so...
 
1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notes
 
The challenge of small data
The challenge of small dataThe challenge of small data
The challenge of small data
 
Why I hate minimisation
Why I hate minimisationWhy I hate minimisation
Why I hate minimisation
 
Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)
 
On being Bayesian
On being BayesianOn being Bayesian
On being Bayesian
 
P values and replication
P values and replicationP values and replication
P values and replication
 
probability.pptx
probability.pptxprobability.pptx
probability.pptx
 
Insights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibilityInsights from psychology on lack of reproducibility
Insights from psychology on lack of reproducibility
 
Clinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptxClinical trials are about comparability not generalisability V2.pptx
Clinical trials are about comparability not generalisability V2.pptx
 

More from Stephen Senn

What is your question
What is your questionWhat is your question
What is your questionStephen Senn
 
Vaccine trials in the age of COVID-19
Vaccine trials in the age of COVID-19Vaccine trials in the age of COVID-19
Vaccine trials in the age of COVID-19Stephen Senn
 
To infinity and beyond v2
To infinity and beyond v2To infinity and beyond v2
To infinity and beyond v2Stephen Senn
 
Approximate ANCOVA
Approximate ANCOVAApproximate ANCOVA
Approximate ANCOVAStephen Senn
 
Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?Stephen Senn
 
A century of t tests
A century of t testsA century of t tests
A century of t testsStephen Senn
 
Is ignorance bliss
Is ignorance blissIs ignorance bliss
Is ignorance blissStephen Senn
 
To infinity and beyond
To infinity and beyond To infinity and beyond
To infinity and beyond Stephen Senn
 
De Finetti meets Popper
De Finetti meets PopperDe Finetti meets Popper
De Finetti meets PopperStephen Senn
 
Understanding randomisation
Understanding randomisationUnderstanding randomisation
Understanding randomisationStephen Senn
 
In Search of Lost Infinities: What is the “n” in big data?
In Search of Lost Infinities: What is the “n” in big data?In Search of Lost Infinities: What is the “n” in big data?
In Search of Lost Infinities: What is the “n” in big data?Stephen Senn
 
Seventy years of RCTs
Seventy years of RCTsSeventy years of RCTs
Seventy years of RCTsStephen Senn
 
The Rothamsted school meets Lord's paradox
The Rothamsted school meets Lord's paradoxThe Rothamsted school meets Lord's paradox
The Rothamsted school meets Lord's paradoxStephen Senn
 
The revenge of RA Fisher
The revenge of RA Fisher The revenge of RA Fisher
The revenge of RA Fisher Stephen Senn
 
The story of MTA/02
The story of MTA/02The story of MTA/02
The story of MTA/02Stephen Senn
 
Confounding, politics, frustration and knavish tricks
Confounding, politics, frustration and knavish tricksConfounding, politics, frustration and knavish tricks
Confounding, politics, frustration and knavish tricksStephen Senn
 
And thereby hangs a tail
And thereby hangs a tailAnd thereby hangs a tail
And thereby hangs a tailStephen Senn
 
The revenge of RA Fisher
The revenge of RA FisherThe revenge of RA Fisher
The revenge of RA FisherStephen Senn
 
Minimally important differences
Minimally important differencesMinimally important differences
Minimally important differencesStephen Senn
 

More from Stephen Senn (19)

What is your question
What is your questionWhat is your question
What is your question
 
Vaccine trials in the age of COVID-19
Vaccine trials in the age of COVID-19Vaccine trials in the age of COVID-19
Vaccine trials in the age of COVID-19
 
To infinity and beyond v2
To infinity and beyond v2To infinity and beyond v2
To infinity and beyond v2
 
Approximate ANCOVA
Approximate ANCOVAApproximate ANCOVA
Approximate ANCOVA
 
Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?Clinical trials: quo vadis in the age of covid?
Clinical trials: quo vadis in the age of covid?
 
A century of t tests
A century of t testsA century of t tests
A century of t tests
 
Is ignorance bliss
Is ignorance blissIs ignorance bliss
Is ignorance bliss
 
To infinity and beyond
To infinity and beyond To infinity and beyond
To infinity and beyond
 
De Finetti meets Popper
De Finetti meets PopperDe Finetti meets Popper
De Finetti meets Popper
 
Understanding randomisation
Understanding randomisationUnderstanding randomisation
Understanding randomisation
 
In Search of Lost Infinities: What is the “n” in big data?
In Search of Lost Infinities: What is the “n” in big data?In Search of Lost Infinities: What is the “n” in big data?
In Search of Lost Infinities: What is the “n” in big data?
 
Seventy years of RCTs
Seventy years of RCTsSeventy years of RCTs
Seventy years of RCTs
 
The Rothamsted school meets Lord's paradox
The Rothamsted school meets Lord's paradoxThe Rothamsted school meets Lord's paradox
The Rothamsted school meets Lord's paradox
 
The revenge of RA Fisher
The revenge of RA Fisher The revenge of RA Fisher
The revenge of RA Fisher
 
The story of MTA/02
The story of MTA/02The story of MTA/02
The story of MTA/02
 
Confounding, politics, frustration and knavish tricks
Confounding, politics, frustration and knavish tricksConfounding, politics, frustration and knavish tricks
Confounding, politics, frustration and knavish tricks
 
And thereby hangs a tail
And thereby hangs a tailAnd thereby hangs a tail
And thereby hangs a tail
 
The revenge of RA Fisher
The revenge of RA FisherThe revenge of RA Fisher
The revenge of RA Fisher
 
Minimally important differences
Minimally important differencesMinimally important differences
Minimally important differences
 

The 7 habits of highly effective statisticians

  • 1. The seven habits of highly effective statisticians Stephen Senn Consultant Statistician, Edinburgh, UK © Stephen Senn 2020 1
  • 2. A Question to Keep You Amused Consider a ‘coin of ignorance’: $P(H) = \theta$, $P(T) = 1 - \theta$, $f(\theta) = 1$, $0 \le \theta \le 1$. Here $\theta$ is the probability of a head, and every value of $\theta$ is equally likely. The coin is tossed 100 times. If X is the number of heads, which of these two is more likely: $P(X = 50)$ (with $100!/(50!\,50!) \approx 10^{29}$ sequences) or $P(X = 100)$ (one sequence)? © Stephen Senn 2020 2
  • 3. Of course, this is an ironic title • Any statistician knows that you should think in terms of the three Cs: • Causation • Control • Comparison • To which a fourth might be added • Counterfactuals • The question of interest is • What habits have a beneficial effect on your probability of being an effective statistician? • Many effective statisticians will be in the habit of taking breakfast. This doesn’t make taking breakfast a cause of being an effective statistician. © Stephen Senn 2020 3 That which would have happened had you acted differently
  • 4. And my advice is hypocritical • I earn my living as a statistician promoting, using and evaluating numerical evidence • Based on studies with • Control • Randomisation • Replication • I am proposing instead to give you advice based on one uncontrolled example • Me © Stephen Senn 2020 4
  • 5. The magnificent seven • Read • Listen ( & see) • Understand • Think • Do • Calculate • Communicate • Include some classics in your reading • Fit the answer to the problem not vice versa • Requires some subject matter comprehension • It’s not just a matter of mathematics (but it also is) • The devil is the detail and doing discovers it • Use calculations to increase, not instead of understanding • Think hard about what the simplest honest way is to communicate the message © Stephen Senn 2020 5
  • 6. I am not going to go through this list in detail • Instead I shall illustrate some of these points by a few examples I shall present • Invalid inversion • Regression to the mean • Some statistical ‘howlers’ • These will illustrate between them the value of • Understand • Communicate • Think • Do • Read • Calculate © Stephen Senn 2020 6 What happened to Listen? That’s where you come in!
  • 7. A Simple Example of ‘Invalid Inversion’ • Most women do not suffer from breast cancer • It would be a mistake to conclude, however, that most breast cancer victims are not women • To do so would be to transpose the conditionals • This is an example of invalid inversion • Why is this important? • People regularly confuse the probability of the data given the hypothesis with the probability of the hypothesis given the data • Misinterpretation of P-values is linked to this 7(c) Stephen Senn
  • 8. Some Plausible Figures for the UK 8(c) Stephen Senn
  • 9. Some Plausible Figures for the UK Probability breast cancer given female = 550/31,418=0.018 9(c) Stephen Senn
  • 10. Some Plausible Figures for the UK Probability female given breast cancer =550/553=0.995 10(c) Stephen Senn
  • 11. The difference is in the denominator The numerator is the same 11(c) Stephen Senn Invalid inversion is an error caused by mistaking the relevant marginal class 550/31418 or 550/553
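The two conditional probabilities on slides 9 to 11 can be checked with a few lines of Python. This is only a sketch using the three counts quoted on the slides (the transcript does not state their units); the variable names are illustrative and not the author's.

```python
# Counts as quoted on the slides; names are illustrative.
female_with_bc = 550   # joint frequency: female AND breast cancer
all_female = 31_418    # marginal frequency: female
all_with_bc = 553      # marginal frequency: breast cancer

# Same numerator (the joint frequency), different denominators (the marginals).
p_bc_given_female = female_with_bc / all_female
p_female_given_bc = female_with_bc / all_with_bc

print(f"P(breast cancer | female) = {p_bc_given_female:.3f}")
print(f"P(female | breast cancer) = {p_female_given_bc:.3f}")
```

Both results share the numerator 550; only the marginal total in the denominator changes, which is the point of the slide.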
  • 12. A Little Maths $P(A \mid B) = \dfrac{P(A \cap B)}{P(B)}$, $P(B \mid A) = \dfrac{P(A \cap B)}{P(A)}$. Unless $P(B) = P(A)$, $P(A \mid B) \neq P(B \mid A)$. So invalid inversion is equivalent to a confusion of the marginal probabilities. The same joint probability is involved in the two conditional probabilities but different marginal probabilities are involved. 12(c) Stephen Senn
  • 13. The Regression Analogue Predicting Y from X is not the same as predicting X from Y: $\beta_{Y \mid X} = \dfrac{\sigma_{XY}}{\sigma_X^2}$, $\beta_{X \mid Y} = \dfrac{\sigma_{XY}}{\sigma_Y^2}$. Note the similarity with the probability case. The numerator (the covariance) is a statistic of joint variation. The denominators (the variances) are statistics of marginal variation. These marginal statistics are not the same. The difference is in the denominator. The numerator is the same. 13(c) Stephen Senn
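A small numerical sketch of the regression analogue follows. The data are hypothetical toy values, not taken from the talk, and the helper `covariance` is written out by hand so that the shared numerator and the two different denominators are visible.

```python
from statistics import mean

# Hypothetical toy data (an assumption for illustration, not from the talk).
x = [150.0, 160.0, 165.0, 172.0, 180.0, 188.0]
y = [52.0, 58.0, 66.0, 70.0, 77.0, 90.0]


def covariance(a, b):
    """Sample covariance of two equal-length sequences."""
    ma, mb = mean(a), mean(b)
    return sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b)) / (len(a) - 1)


s_xy = covariance(x, y)  # joint variation: the common numerator
s_xx = covariance(x, x)  # marginal variation of X
s_yy = covariance(y, y)  # marginal variation of Y

slope_y_on_x = s_xy / s_xx  # predicting Y from X
slope_x_on_y = s_xy / s_yy  # predicting X from Y
r_squared = s_xy ** 2 / (s_xx * s_yy)

print(slope_y_on_x, 1 / slope_x_on_y, r_squared)
```

The product of the two slopes equals the squared correlation, so inverting one slope recovers the other only when the correlation is perfect.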
  • 14. Dimensional analysis • Consider the example of regressing weight on height and vice versa • Suppose you put height in cm into your ‘black box’ to predict weight in kg • The input is in cm • The output is in kg • You must multiply the cm by a regression coefficient that is in kg/cm • The covariance is in units of kg × cm and you divide by a variance that is in cm² to get kg/cm • Suppose you put weight in kg into your black box to predict height in cm • You must multiply the kg by a coefficient that is in cm/kg • The numerator is the covariance in both cases • A different variance is used for the denominator © Stephen Senn 2020 14
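  • 15. Just to make that perfectly clear: $\text{weight (kg)} = \text{constant (kg)} + \dfrac{\text{cov (cm} \times \text{kg)}}{\text{var (cm} \times \text{cm)}} \times \text{height (cm)}$ and $\text{height (cm)} = \text{constant (cm)} + \dfrac{\text{cov (cm} \times \text{kg)}}{\text{var (kg} \times \text{kg)}} \times \text{weight (kg)}$ © Stephen Senn 2020 15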
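The unit argument can be checked numerically. The sketch below uses hypothetical heights and weights (an assumption, not data from the talk) and a hand-written `slope` helper. It re-expresses height in metres and confirms that each coefficient rescales exactly as its units (kg/cm or cm/kg) say it should.

```python
import math
from statistics import mean


def slope(predictor, response):
    """Least-squares slope: cov(predictor, response) / var(predictor)."""
    mp, mr = mean(predictor), mean(response)
    cov = sum((p - mp) * (r - mr) for p, r in zip(predictor, response))
    var = sum((p - mp) ** 2 for p in predictor)
    return cov / var  # the (n - 1) divisors cancel


# Hypothetical toy data, for illustration only.
height_cm = [150.0, 160.0, 165.0, 172.0, 180.0, 188.0]
weight_kg = [52.0, 58.0, 66.0, 70.0, 77.0, 90.0]
height_m = [h / 100 for h in height_cm]

kg_per_cm = slope(height_cm, weight_kg)  # units: kg x cm / cm^2 = kg/cm
kg_per_m = slope(height_m, weight_kg)    # units: kg/m
cm_per_kg = slope(weight_kg, height_cm)  # units: kg x cm / kg^2 = cm/kg
m_per_kg = slope(weight_kg, height_m)    # units: m/kg

print(kg_per_cm, kg_per_m, cm_per_kg, m_per_kg)
```

Changing the height unit multiplies one coefficient by 100 and divides the other by 100, which is what the kg/cm and cm/kg labels predict.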
  • 16. Morals • Think carefully about basic and fundamental concepts in probability and statistics • Seek an understanding that is not just mathematical but that reveals why things have to be the way they are • Make parallels • Regression is similar to conditional probability in some way • Dimensional analysis (a tool used by physicists and engineers) is very valuable • Find the simplest way to communicate important points • Proofs are good but not for this • Examples are excellent • Read widely and seek different explanations of the same thing © Stephen Senn 2020 16
  • 17. Regression to the Mean: A Simulated Example • Diastolic blood pressure (DBP) • Mean 90 mmHg • Between-patient variance 50 mmHg² • Within-patient variance 15 mmHg² • Boundary for hypertensive 95 mmHg • Simulation of 1000 patients whose DBP at baseline and outcome are shown • Blue: consistent normotensive • Red: consistent hypertensive • Orange: hypertensive/normotensive or vice versa 17(c) Stephen Senn
  • 18. 18(c) Stephen Senn What you will see if all patients are followed up
  • 19. 19(c) Stephen Senn What you will see if hypertensive patients are followed up
  • 20. (c) Stephen Senn 20 Mean at baseline and outcome are the same Mean at outcome is lower than at baseline All patients are hypertensive at baseline Many are not at outcome
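The pattern described on slides 17 to 20 can be reproduced with a short simulation. This is a sketch in Python rather than the author's GenStat program. It uses the parameter values stated on slide 17; the seed and variable names are arbitrary assumptions, so the number of patients selected will not match the figure in the original paper.

```python
import random
from statistics import mean

# Parameters as stated on slide 17.
MEAN_DBP = 90.0     # mmHg
BETWEEN_VAR = 50.0  # between-patient variance, mmHg^2
WITHIN_VAR = 15.0   # within-patient variance, mmHg^2
CUTOFF = 95.0       # boundary for hypertensive, mmHg
N_PATIENTS = 1000

rng = random.Random(2020)  # arbitrary seed, an assumption for repeatability

# Each patient has a stable true DBP plus independent measurement noise
# at baseline and at outcome. No treatment effect is simulated.
true_dbp = [rng.gauss(MEAN_DBP, BETWEEN_VAR ** 0.5) for _ in range(N_PATIENTS)]
baseline = [t + rng.gauss(0.0, WITHIN_VAR ** 0.5) for t in true_dbp]
outcome = [t + rng.gauss(0.0, WITHIN_VAR ** 0.5) for t in true_dbp]

# Follow up only those who were hypertensive at baseline.
selected = [(b, o) for b, o in zip(baseline, outcome) if b >= CUTOFF]
sel_baseline = mean(b for b, _ in selected)
sel_outcome = mean(o for _, o in selected)
still_hypertensive = sum(o >= CUTOFF for _, o in selected)

print(f"all patients:  baseline {mean(baseline):.1f}, outcome {mean(outcome):.1f}")
print(f"selected ({len(selected)}): baseline {sel_baseline:.1f}, outcome {sel_outcome:.1f}")
print(f"still hypertensive at outcome: {still_hypertensive} of {len(selected)}")
```

Under these assumptions the mean of the selected group falls at outcome even though nothing was done to the patients, while the means for all patients stay close to each other.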
  • 21. Probably not the best way to explain this © Stephen Senn 2020 21 Who wrote this? Senn, S. J. (1988). How much of the placebo 'effect' is really statistical regression? [letter]. Statistics in Medicine, 7(11), 1203
  • 22. Doing and calculating avoids stupid mistakes
      • Stupid mistake: Proposing allocation ratios of 7:5:3 for a three-armed trial. Cure: Calculate the minimum block size. Hint: It’s 105.
      • Stupid mistake: Proposing some software for cross-over trials that could adjust the treatments to which patients are allocated depending on results in earlier periods. Cure: Try to do this in real time. Hint: This may help you learn that patients do not arrive simultaneously in a clinical trial.
      • Stupid mistake: Claiming that the use of placebos in clinical trials is unethical if there is an effective treatment. Cure: Run a clinical trial in a serious disease where there is a partially effective treatment. Hint: How do you avoid withdrawing the partially effective treatment from some patients?
    © Stephen Senn 2020 22
  • 23. Advice on Understanding, Thinking, Reading etc. • Mathematics is important • But it’s not enough • Statistics is not a branch of mathematics although probability theory is • Applications are important • Loving your data • Getting to know the application area • Biology! • Pharmacology! • Reading the classics is good for you • Especially Fisher © Stephen Senn 2020 23
  • 24. That problem The two events are equally likely. In fact, $P(X = k) = \dfrac{1}{n+1}$, $k = 0, 1, \ldots, n$. Proof could involve some or all of the following: marginal, conditional and joint probabilities; calculus; Bayes theorem; posterior probability; predictive distribution; proof by induction. © Stephen Senn 2020 24
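The result can be verified exactly. The sketch below takes one possible route, the calculus one: it integrates the binomial probability over the uniform prior using the standard Beta integral, with exact rational arithmetic. The function name `prob_heads` is illustrative, not the author's.

```python
from fractions import Fraction
from math import comb, factorial


def prob_heads(k, n):
    """P(X = k) when theta ~ Uniform(0, 1) and X | theta ~ Binomial(n, theta).

    Uses the Beta integral:
    integral over [0, 1] of theta^k (1 - theta)^(n - k) = k! (n - k)! / (n + 1)!
    """
    beta_integral = Fraction(factorial(k) * factorial(n - k), factorial(n + 1))
    return comb(n, k) * beta_integral


n = 100
p50 = prob_heads(50, n)
p100 = prob_heads(100, n)

print("sequences with exactly 50 heads:", comb(n, 50))
print("P(X = 50)  =", p50)
print("P(X = 100) =", p100)
```

The very large number of sequences with 50 heads is exactly offset by how small the averaged probability of any single such sequence is, so every value of k from 0 to n comes out at 1/(n + 1).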
  • 25. Intuition Imagine one billion tosses. Your posterior probability would have to be very close to the observed relative frequency, which would be close to the ‘true’ value. But your prior probability says every true value is equally likely. Therefore, every observable ratio is equally likely. But the result is also trivially true for n = 1. It is hardly surprising, therefore, if the result is true for every value of n between 1 and 1 billion. © Stephen Senn 2020 25
  • 26. Moral • It is important to think about your assumptions carefully • If you do this you can understand what they imply • Trying simple cases is helpful • If you do this you can often see what the solution must be • Extreme cases (one billion tosses) can also be helpful • The mathematical solution is valuable but it is not a substitute for this • Statistics is more than just mathematics • It is also science and philosophy © Stephen Senn 2020 26
  • 27. [Diagram contrasting Mathematics and Statistics. Labels: real problem, idealised problem, operational problem, solution, application.] © Stephen Senn 2020 27
  • 28. In the mathematical formulation of any problem it is necessary to base oneself on some appropriate idealizations and simplification…... One loses sight of the original nature of the problem, falls in love with the idealization, and then blames reality for not conforming to it. de Finetti, (1975). It seems a pity that while we statisticians have an opportunity to rate as first-class scientists we should settle for the rather dreary role of second-class mathematicians. George Box (1990) © Stephen Senn 2020 28
  • 29. Statistics is a subject where everything has to be understood three times •In terms of mathematics •In terms of philosophy •In terms of application © Stephen Senn 2020 29
  • 30. •Finally, I would like to leave you with this question •Did you know there are only 120 days to Christmas? Traditional Polish Present Piernik Alternative suggestion © Stephen Senn 2020 30 3rd edition out soon

Editor's Notes

  1. If you know why the title of this talk is extremely stupid, then you clearly know something about control, data and reasoning: in short, you have most of what it takes to be a statistician. If you have studied statistics then you will also know that a large amount of anything, and this includes successful careers, is luck. In this talk I shall try to share some of my experiences of being a statistician in the hope that it will help you make the most of whatever luck life throws you. In so doing, I shall try my best to overcome the distorting influence of that easiest of sciences, hindsight. Without giving too much away, I shall be recommending that you read, listen, think, calculate, understand, communicate, and do. I shall give you some examples of what I think works and what I think doesn’t. In all of this you should never forget the power of negativity and also the joy of being able to wake up every day and say to yourself ‘I love the smell of data in the morning’. 30 minutes presentation plus 5 minutes questions
  2. This example is covered in chapter 4 of Senn, S. J. (2003). Dicing with Death. Cambridge: Cambridge University Press.
  3. See Senn, S. J. (2013). Invalid inversion. Significance, 10(2), 40-42
  4. Since we are calculating the probability of having breast cancer given that someone is female, we condition on being ‘female’. We thus strike out the column ‘male’ as being irrelevant. The probability we require is the joint frequency of ‘breast cancer’ and ‘female’ divided by the relevant marginal frequency, ‘female’.
  5. Since we are calculating the probability of being female given that someone is suffering from breast cancer, we condition on ‘suffering from breast cancer’. We thus strike out the column ‘not suffering from breast cancer’ as being irrelevant. The probability we require is the joint frequency of ‘breast cancer’ and ‘female’ divided by the relevant marginal frequency, ‘suffering from breast cancer’.
  6. Extract of GenStat program
     "To simulate regression to the mean"
     "This version used to try and reproduce the numbers selected (285)in original version of Significance paper"
     "Set parameters"
     SCALAR NSIM,mean,betvar,withvar,cut,lower,upper;VALUE=1000,90,50,15,95,60,120
     TEXT xlabel,ylabel,title; VALUES='DBP at Baseline (mmHg)','DBP at Outcome (mmHg)','Diastolic blood pressure'
     "Begin simulation"
     FOR [NTIMES=1000]
       GRANDOM [DISTRIBUTION=Normal; NVALUES=NSIM; SEED=0; MEAN=mean; VARIANCE=betvar] True
       GRANDOM [DISTRIBUTION=Normal; NVALUES=NSIM; SEED=0; MEAN=0; VARIANCE=withvar] E1
       CALCULATE X=True+E1
       CALCULATE HBase=X>=cut
       CALCULATE Check=SUM(HBase)
       IF Check.EQ.285
         PRINT Check; DECIMALS=0
         EXIT [CONTROL=for]
       ENDIF
     ENDFOR
     VARIATE [NVALUES=2]Xline1,Xline2,Xline3,Yline1,Yline2,Yline3
     CALCULATE Xline1=cut
     CALCULATE Yline1$[1],Yline1$[2]=lower,upper
     CALCULATE Xline2$[1],Xline2$[2]=lower,upper
     CALCULATE Yline2=cut
     CALCULATE Xline3$[1],Xline3$[2]=lower, upper
     CALCULATE Yline3$[1],Yline3$[2]=lower, upper
  7. See Senn, S. J. (2009). Three things every medical writer should know about statistics. The Write Stuff, 18(3), 159-162
  8. These are prime numbers. The minimum block size is thus the product of them all and that is 105. By the time the last patient has completed period two (say) many of the patients will have completed the whole trial. The way to run such a trial is as an add-on trial. All patients receive the current therapy as standard and they receive either placebo or the new treatment in addition. Trials of HIV infection were often of this sort and (correctly) described as placebo controlled. Senn, S. J. (2001). The Misunderstood Placebo. Applied Clinical Trials, 10(5), 40-46
  9. See Senn, S. J. (1998). Mathematics: governess or handmaiden? Journal of the Royal Statistical Society Series D-The Statistician, 47(2), 251-259