SlideShare a Scribd company logo
1 of 23
Download to read offline
Copyright © 2013, SAS Institute Inc. All rights reserved.
A PRIMER IN STATISTICAL DISCOVERY
ECSITE ANNUAL CONFERENCE 2015
Dr. Ian Cox, ian.cox@jmp.com
Copyright © 2013, SAS Institute Inc. All rights reserved.
ABSTRACT
• This presentation introduces ’Statistical Discovery’, a process that allows you
to work with data to discover new, useful, insights that drive cycles of
learning.
• After a brief overview to introduce the concept, an example involving property
prices in the US will be used to demonstrate the how the process works in
practice.
• Through this example we also exemplify the skills and aptitudes required to
exercise the process successfully.
Copyright © 2013, SAS Institute Inc. All rights reserved.
AGENDA
• Why?
• What?
• How?
• Who?
Copyright © 2013, SAS Institute Inc. All rights reserved.
AGENDA
• Why?
• What?
• How?
• Who?
Copyright © 2013, SAS Institute Inc. All rights reserved.
ECSITE VALUES
• We believe that science is part of culture and that curious minds make for
better societies.
• We are convinced that science needs fair, open-minded, democratic societies
and active citizenship to thrive.
• We know that citizens and scientists working together with other
stakeholders can make research more relevant.
• We find science is fascinating and we are dedicated to making it a source of
inspiration for everyone.
Copyright © 2013, SAS Institute Inc. All rights reserved.
PREMISE OF THIS
PRESENTATION
So, in the areas of:
• Science…
• Evidence-based public policy…
• The more utilitarian world of providing products and services…
it’s good to have an appreciation of how new knowledge is acquired.
Better yet is to be able to communicate how this process works, and perhaps
even to encourage it’s more widespread use.
Especially as ‘big data’ takes hold . . .
Copyright © 2013, SAS Institute Inc. All rights reserved.
SCIENTIFIC
METHOD TIMELINE
Copyright © 2013, SAS Institute Inc. All rights reserved.
A COLLOQUIAL
VIEW
http://idea.ucr.edu/documents/flash/scientific_method/story.htm
Copyright © 2013, SAS Institute Inc. All rights reserved.
‘MAKE
OBSERVATIONS’
PRODUCES DATA,
AND …
• All data is contextual.
• Any analysis that does not understand the data context is pointless, or worse.
• Making new discoveries is not ‘algorithmic’, and never could be.
• Only people understand ‘context’, ‘relevance’ and ‘utility’.
Copyright © 2013, SAS Institute Inc. All rights reserved.
AGENDA
• Why?
• What?
• How?
• Who?
Copyright © 2013, SAS Institute Inc. All rights reserved.
A ‘QUALITY’
PERSPECTIVE
All value-adding activity occurs in a system of interconnected processes,
each of which has suppliers, owner(s) and customers.
Variation exists in all processes.
Causes of process variation can be segregated into ‘common’ and ‘special’
causes.
Understanding the unique nature of common and special causes is the key to
reducing process variation,
and
Reducing process variation is the key to improving product and service
quality, productivity and profitability.
Copyright © 2013, SAS Institute Inc. All rights reserved.
UNIVERSAL MODEL
OF A PROCESS
System of InterestCauses We
Understand
Causes We Don’t Understand,
Know About, or Care About
Measured Effects
or Outcomes of
Interest
X1
X2
X3
X4 X5 X6
Y1
Y2
Y1 = Signal Function1(X1, X2, X3) + Nuisance Function1(X4, X5, X6)
Y2 = Signal Function2(X1, X2, X3) + Nuisance Function2(X4, X5, X6)
The ‘Nuisance Functions’ are what give rise to the Variation
in the process outputs.
‘Statistics’ can be seen as the study of variation.
Copyright © 2013, SAS Institute Inc. All rights reserved.
‘STATISTICS’ IS . . . LEARNING IN THE PRESENCE OF VARIATION
(Adapted from Box,
Hunter and Hunter).
What we think
is happening
What is really
happening
Situation
Appraisal
Measurement and
Data Collection
Situation
Appraisal
Situation
Appraisal
Measurement and
Data Collection
Statistics
Able to Consistently Meet Customer Requirements
Real
World
Model
Unable to Consistently Meet Customer Requirements
There is no ‘free lunch’ – The cost of gaining new knowledge must be
offset by its perceived or expected value
Copyright © 2013, SAS Institute Inc. All rights reserved.
STATISTICAL DISCOVERY WITH JMP
Copyright © 2013, SAS Institute Inc. All rights reserved.
CDA EDA
Copyright © 2013, SAS Institute Inc. All rights reserved.
STATISTICAL
DISCOVERY
Frame Problem
or Opportunity Collect Data
Uncover
Relationships
Model
Relationships
Utilize
Knowledge
Revise
Knowledge
“Statistics as
Detective” or EDA
“Statistics as
Lawyer” or CDA
Copyright © 2013, SAS Institute Inc. All rights reserved.
STATISTICAL
DISCOVERY
1. Data visualization, done properly, is very powerful and effective.
2. Statistical analysis, done properly, and defined broadly to include things
like experimental design and predictive modeling, is also very powerful
and effective, but in a different way.
3. Tightly integrating the two creates a synergy that is much more
powerful and effective than either one alone.
Copyright © 2013, SAS Institute Inc. All rights reserved.
AGENDA
• Why?
• What?
• How?
• Who?
Copyright © 2013, SAS Institute Inc. All rights reserved.
Copyright © 2013, SAS Institute Inc. All rights reserved.
AGENDA
• Why?
• What?
• How?
• Who?
Copyright © 2013, SAS Institute Inc. All rights reserved.
JMP …
• Aims to provide ‘Statistical Discovery’ on
the desktop of Researchers, Engineers
and Scientists.
• Is characterized by interactivity and agility
in usage.
• Is a SAS product which has been in
development since 1989.
• Aspires to support users who:
• Have specific research questions, or are
simply curious.
• Have access to, or can generate, somewhat
relevant data.
• Do not necessarily have specialist statistical
training.
Copyright © 2013, SAS Institute Inc. All rights reserved.
Copyright © 2013, SAS Institute Inc. All rights reserved. www.SAS.com

More Related Content

What's hot

205420 crystal ball case studies
205420 crystal ball case studies205420 crystal ball case studies
205420 crystal ball case studiesp6academy
 
What are the odds of making that number risk analysis with crystal ball - O...
What are the odds of making that number   risk analysis with crystal ball - O...What are the odds of making that number   risk analysis with crystal ball - O...
What are the odds of making that number risk analysis with crystal ball - O...p6academy
 
1555 track 2 ning_using our laptop
1555 track 2 ning_using our laptop1555 track 2 ning_using our laptop
1555 track 2 ning_using our laptopRising Media, Inc.
 
Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualizationDr. Hamdan Al-Sabri
 
Week08 slides
Week08 slidesWeek08 slides
Week08 slideshenry KKK
 
On the Measurement of Test Collection Reliability
On the Measurement of Test Collection ReliabilityOn the Measurement of Test Collection Reliability
On the Measurement of Test Collection ReliabilityJulián Urbano
 
MLPA for health care presentation smc
MLPA for health care presentation   smcMLPA for health care presentation   smc
MLPA for health care presentation smcShaun Comfort
 
Risk Management in Data Analysis
Risk Management in Data AnalysisRisk Management in Data Analysis
Risk Management in Data AnalysisDavid Lee
 
1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptop1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptopRising Media, Inc.
 
To Explain Or To Predict?
To Explain Or To Predict?To Explain Or To Predict?
To Explain Or To Predict?Galit Shmueli
 
Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceAmit Sharma
 
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...Edureka!
 
Statistical Modeling in 3D: Describing, Explaining and Predicting
Statistical Modeling in 3D: Describing, Explaining and PredictingStatistical Modeling in 3D: Describing, Explaining and Predicting
Statistical Modeling in 3D: Describing, Explaining and PredictingGalit Shmueli
 
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...gregoryg
 

What's hot (20)

205420 crystal ball case studies
205420 crystal ball case studies205420 crystal ball case studies
205420 crystal ball case studies
 
What are the odds of making that number risk analysis with crystal ball - O...
What are the odds of making that number   risk analysis with crystal ball - O...What are the odds of making that number   risk analysis with crystal ball - O...
What are the odds of making that number risk analysis with crystal ball - O...
 
Introduction to regression
Introduction to regressionIntroduction to regression
Introduction to regression
 
1555 track 2 ning_using our laptop
1555 track 2 ning_using our laptop1555 track 2 ning_using our laptop
1555 track 2 ning_using our laptop
 
Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualization
 
Machine Learning For Stock Broking
Machine Learning For Stock BrokingMachine Learning For Stock Broking
Machine Learning For Stock Broking
 
Week08 slides
Week08 slidesWeek08 slides
Week08 slides
 
On the Measurement of Test Collection Reliability
On the Measurement of Test Collection ReliabilityOn the Measurement of Test Collection Reliability
On the Measurement of Test Collection Reliability
 
Text Analytics for Legal work
Text Analytics for Legal workText Analytics for Legal work
Text Analytics for Legal work
 
MLPA for health care presentation smc
MLPA for health care presentation   smcMLPA for health care presentation   smc
MLPA for health care presentation smc
 
Image Analytics In Healthcare
Image Analytics In HealthcareImage Analytics In Healthcare
Image Analytics In Healthcare
 
Risk Management in Data Analysis
Risk Management in Data AnalysisRisk Management in Data Analysis
Risk Management in Data Analysis
 
Data Visualization: Sales forecasting
Data Visualization: Sales forecastingData Visualization: Sales forecasting
Data Visualization: Sales forecasting
 
1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptop1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptop
 
To Explain Or To Predict?
To Explain Or To Predict?To Explain Or To Predict?
To Explain Or To Predict?
 
Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inference
 
AlogoAnalytics Company Presentation
AlogoAnalytics Company PresentationAlogoAnalytics Company Presentation
AlogoAnalytics Company Presentation
 
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
Random Forest Tutorial | Random Forest in R | Machine Learning | Data Science...
 
Statistical Modeling in 3D: Describing, Explaining and Predicting
Statistical Modeling in 3D: Describing, Explaining and PredictingStatistical Modeling in 3D: Describing, Explaining and Predicting
Statistical Modeling in 3D: Describing, Explaining and Predicting
 
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
Multi-criteria Decision Analysis for Customization of Estimation by Analogy M...
 

Similar to A Primer in Statistical Discovery

What is the Value of SAS Analytics?
What is the Value of SAS Analytics?What is the Value of SAS Analytics?
What is the Value of SAS Analytics?SAS Canada
 
SAS an open ecosystem for Artifical Intelligence - Dean Zouari
SAS an open ecosystem for Artifical Intelligence - Dean ZouariSAS an open ecosystem for Artifical Intelligence - Dean Zouari
SAS an open ecosystem for Artifical Intelligence - Dean ZouariInstitute of Contemporary Sciences
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Denodo
 
Analyze Your Data, Transform Your Business
Analyze Your Data, Transform Your BusinessAnalyze Your Data, Transform Your Business
Analyze Your Data, Transform Your BusinessDATAVERSITY
 
User-Centric Design: How to Leverage Use Cases and User Scenarios to Design S...
User-Centric Design: How to Leverage Use Cases and User Scenarios to Design S...User-Centric Design: How to Leverage Use Cases and User Scenarios to Design S...
User-Centric Design: How to Leverage Use Cases and User Scenarios to Design S...SPTechCon
 
Too much data and not enough analytics!
Too much data and not enough analytics!Too much data and not enough analytics!
Too much data and not enough analytics!Emma Kelly
 
Marketing, Consumers and The Rise of Big Data Analytics
Marketing, Consumers and The Rise of Big Data AnalyticsMarketing, Consumers and The Rise of Big Data Analytics
Marketing, Consumers and The Rise of Big Data AnalyticsLouis Fernandes
 
Data and AI in education
Data and AI in educationData and AI in education
Data and AI in educationJisc
 
Enabling fact based governance with analytics external
Enabling fact based governance with analytics externalEnabling fact based governance with analytics external
Enabling fact based governance with analytics externalSarabjeet Singh
 
NoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive AnalyticsNoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive AnalyticsInfiniteGraph
 
Digicrome Student Hand Book
Digicrome Student Hand BookDigicrome Student Hand Book
Digicrome Student Hand BookAayushdigichrome
 
Data science and ethics in fundraising
Data science and ethics in fundraisingData science and ethics in fundraising
Data science and ethics in fundraisingJames Orton
 
Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...
Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...
Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...Earley Information Science
 
Jumpstart a Lucrative Career in Data Science
Jumpstart a Lucrative Career in Data ScienceJumpstart a Lucrative Career in Data Science
Jumpstart a Lucrative Career in Data ScienceSharala Axryd
 
How to sustain analytics capabilities in an organization
How to sustain analytics capabilities in an organizationHow to sustain analytics capabilities in an organization
How to sustain analytics capabilities in an organizationSAS Canada
 

Similar to A Primer in Statistical Discovery (20)

What is the Value of SAS Analytics?
What is the Value of SAS Analytics?What is the Value of SAS Analytics?
What is the Value of SAS Analytics?
 
SAS an open ecosystem for Artifical Intelligence - Dean Zouari
SAS an open ecosystem for Artifical Intelligence - Dean ZouariSAS an open ecosystem for Artifical Intelligence - Dean Zouari
SAS an open ecosystem for Artifical Intelligence - Dean Zouari
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
 
Analyze Your Data, Transform Your Business
Analyze Your Data, Transform Your BusinessAnalyze Your Data, Transform Your Business
Analyze Your Data, Transform Your Business
 
EIS-Webinar-data.world-collab-2023-02-15.pptx
EIS-Webinar-data.world-collab-2023-02-15.pptxEIS-Webinar-data.world-collab-2023-02-15.pptx
EIS-Webinar-data.world-collab-2023-02-15.pptx
 
User-Centric Design: How to Leverage Use Cases and User Scenarios to Design S...
User-Centric Design: How to Leverage Use Cases and User Scenarios to Design S...User-Centric Design: How to Leverage Use Cases and User Scenarios to Design S...
User-Centric Design: How to Leverage Use Cases and User Scenarios to Design S...
 
Too much data and not enough analytics!
Too much data and not enough analytics!Too much data and not enough analytics!
Too much data and not enough analytics!
 
Marketing, Consumers and The Rise of Big Data Analytics
Marketing, Consumers and The Rise of Big Data AnalyticsMarketing, Consumers and The Rise of Big Data Analytics
Marketing, Consumers and The Rise of Big Data Analytics
 
Data and AI in education
Data and AI in educationData and AI in education
Data and AI in education
 
Sukhdeep
SukhdeepSukhdeep
Sukhdeep
 
Data driven-business
Data driven-businessData driven-business
Data driven-business
 
Enabling fact based governance with analytics external
Enabling fact based governance with analytics externalEnabling fact based governance with analytics external
Enabling fact based governance with analytics external
 
NoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive AnalyticsNoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive Analytics
 
Digicrome Student Hand Book
Digicrome Student Hand BookDigicrome Student Hand Book
Digicrome Student Hand Book
 
SAS Visual Analytics Overview
SAS Visual Analytics OverviewSAS Visual Analytics Overview
SAS Visual Analytics Overview
 
Data science and ethics in fundraising
Data science and ethics in fundraisingData science and ethics in fundraising
Data science and ethics in fundraising
 
Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...
Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...
Earley Executive Roundtable on Data Analytics - Session 1 - The Business Pote...
 
Data is not the new snake oil
Data is not the new snake oilData is not the new snake oil
Data is not the new snake oil
 
Jumpstart a Lucrative Career in Data Science
Jumpstart a Lucrative Career in Data ScienceJumpstart a Lucrative Career in Data Science
Jumpstart a Lucrative Career in Data Science
 
How to sustain analytics capabilities in an organization
How to sustain analytics capabilities in an organizationHow to sustain analytics capabilities in an organization
How to sustain analytics capabilities in an organization
 

More from JMP software from SAS

Exploring Best Practises in Design of Experiments: A Data Driven Approach to ...
Exploring Best Practises in Design of Experiments: A Data Driven Approach to ...Exploring Best Practises in Design of Experiments: A Data Driven Approach to ...
Exploring Best Practises in Design of Experiments: A Data Driven Approach to ...JMP software from SAS
 
Exploring Best Practises in Design of Experiments
Exploring Best Practises in Design of ExperimentsExploring Best Practises in Design of Experiments
Exploring Best Practises in Design of ExperimentsJMP software from SAS
 
Evaluating & Monitoring Your Process Using MSA & SPC
Evaluating & Monitoring Your Process Using MSA & SPCEvaluating & Monitoring Your Process Using MSA & SPC
Evaluating & Monitoring Your Process Using MSA & SPCJMP software from SAS
 
Everything You Wanted to Know About Definitive Screening Designs
Everything You Wanted to Know About Definitive Screening DesignsEverything You Wanted to Know About Definitive Screening Designs
Everything You Wanted to Know About Definitive Screening DesignsJMP software from SAS
 
Basic Design of Experiments Using the Custom DOE Platform
Basic Design of Experiments Using the Custom DOE PlatformBasic Design of Experiments Using the Custom DOE Platform
Basic Design of Experiments Using the Custom DOE PlatformJMP software from SAS
 
Advanced Use Cases of the Bootstrap Method in JMP Pro
Advanced Use Cases of the Bootstrap Method in JMP ProAdvanced Use Cases of the Bootstrap Method in JMP Pro
Advanced Use Cases of the Bootstrap Method in JMP ProJMP software from SAS
 
Correcting Misconceptions About Optimal Design
Correcting Misconceptions About Optimal DesignCorrecting Misconceptions About Optimal Design
Correcting Misconceptions About Optimal DesignJMP software from SAS
 
Visual Analytic Approaches for the Analysis of Spontaneously Reported Adverse...
Visual Analytic Approaches for the Analysis of Spontaneously Reported Adverse...Visual Analytic Approaches for the Analysis of Spontaneously Reported Adverse...
Visual Analytic Approaches for the Analysis of Spontaneously Reported Adverse...JMP software from SAS
 
Building Models for Complex Design of Experiments
Building Models for Complex Design of ExperimentsBuilding Models for Complex Design of Experiments
Building Models for Complex Design of ExperimentsJMP software from SAS
 
New Design of Experiments Features in JMP 11
New Design of Experiments Features in JMP 11New Design of Experiments Features in JMP 11
New Design of Experiments Features in JMP 11JMP software from SAS
 
When a Linear Model Just Won't Do: Fitting Nonlinear Models in JMP
When a Linear Model Just Won't Do: Fitting Nonlinear Models in JMPWhen a Linear Model Just Won't Do: Fitting Nonlinear Models in JMP
When a Linear Model Just Won't Do: Fitting Nonlinear Models in JMPJMP software from SAS
 
The Bootstrap and Beyond: Using JSL for Resampling
The Bootstrap and Beyond: Using JSL for ResamplingThe Bootstrap and Beyond: Using JSL for Resampling
The Bootstrap and Beyond: Using JSL for ResamplingJMP software from SAS
 
Exploring Variable Clustering and Importance in JMP
Exploring Variable Clustering and Importance in JMPExploring Variable Clustering and Importance in JMP
Exploring Variable Clustering and Importance in JMPJMP software from SAS
 

More from JMP software from SAS (15)

Grafische Analyse Ihrer Excel Daten
Grafische Analyse  Ihrer Excel DatenGrafische Analyse  Ihrer Excel Daten
Grafische Analyse Ihrer Excel Daten
 
JMP for Ethanol Producers
JMP for Ethanol ProducersJMP for Ethanol Producers
JMP for Ethanol Producers
 
Exploring Best Practises in Design of Experiments: A Data Driven Approach to ...
Exploring Best Practises in Design of Experiments: A Data Driven Approach to ...Exploring Best Practises in Design of Experiments: A Data Driven Approach to ...
Exploring Best Practises in Design of Experiments: A Data Driven Approach to ...
 
Exploring Best Practises in Design of Experiments
Exploring Best Practises in Design of ExperimentsExploring Best Practises in Design of Experiments
Exploring Best Practises in Design of Experiments
 
Evaluating & Monitoring Your Process Using MSA & SPC
Evaluating & Monitoring Your Process Using MSA & SPCEvaluating & Monitoring Your Process Using MSA & SPC
Evaluating & Monitoring Your Process Using MSA & SPC
 
Everything You Wanted to Know About Definitive Screening Designs
Everything You Wanted to Know About Definitive Screening DesignsEverything You Wanted to Know About Definitive Screening Designs
Everything You Wanted to Know About Definitive Screening Designs
 
Basic Design of Experiments Using the Custom DOE Platform
Basic Design of Experiments Using the Custom DOE PlatformBasic Design of Experiments Using the Custom DOE Platform
Basic Design of Experiments Using the Custom DOE Platform
 
Advanced Use Cases of the Bootstrap Method in JMP Pro
Advanced Use Cases of the Bootstrap Method in JMP ProAdvanced Use Cases of the Bootstrap Method in JMP Pro
Advanced Use Cases of the Bootstrap Method in JMP Pro
 
Correcting Misconceptions About Optimal Design
Correcting Misconceptions About Optimal DesignCorrecting Misconceptions About Optimal Design
Correcting Misconceptions About Optimal Design
 
Visual Analytic Approaches for the Analysis of Spontaneously Reported Adverse...
Visual Analytic Approaches for the Analysis of Spontaneously Reported Adverse...Visual Analytic Approaches for the Analysis of Spontaneously Reported Adverse...
Visual Analytic Approaches for the Analysis of Spontaneously Reported Adverse...
 
Building Models for Complex Design of Experiments
Building Models for Complex Design of ExperimentsBuilding Models for Complex Design of Experiments
Building Models for Complex Design of Experiments
 
New Design of Experiments Features in JMP 11
New Design of Experiments Features in JMP 11New Design of Experiments Features in JMP 11
New Design of Experiments Features in JMP 11
 
When a Linear Model Just Won't Do: Fitting Nonlinear Models in JMP
When a Linear Model Just Won't Do: Fitting Nonlinear Models in JMPWhen a Linear Model Just Won't Do: Fitting Nonlinear Models in JMP
When a Linear Model Just Won't Do: Fitting Nonlinear Models in JMP
 
The Bootstrap and Beyond: Using JSL for Resampling
The Bootstrap and Beyond: Using JSL for ResamplingThe Bootstrap and Beyond: Using JSL for Resampling
The Bootstrap and Beyond: Using JSL for Resampling
 
Exploring Variable Clustering and Importance in JMP
Exploring Variable Clustering and Importance in JMPExploring Variable Clustering and Importance in JMP
Exploring Variable Clustering and Importance in JMP
 

Recently uploaded

SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 
cpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptcpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptrcbcrtm
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalLionel Briand
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineeringssuserb3a23b
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentationvaddepallysandeep122
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odishasmiwainfosol
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 

Recently uploaded (20)

SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
cpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.pptcpct NetworkING BASICS AND NETWORK TOOL.ppt
cpct NetworkING BASICS AND NETWORK TOOL.ppt
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Advantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your BusinessAdvantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your Business
 
Odoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting ServiceOdoo Development Company in India | Devintelle Consulting Service
Odoo Development Company in India | Devintelle Consulting Service
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive Goal
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineering
 
2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentation
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 

A Primer in Statistical Discovery

  • 1. Copyright © 2013, SAS Institute Inc. All rights reserved. A PRIMER IN STATISTICAL DISCOVERY ECSITE ANNUAL CONFERENCE 2015 Dr. Ian Cox, ian.cox@jmp.com
  • 2. Copyright © 2013, SAS Institute Inc. All rights reserved. ABSTRACT • This presentation introduces ’Statistical Discovery’, a process that allows you to work with data to discover new, useful, insights that drive cycles of learning. • After a brief overview to introduce the concept, an example involving property prices in the US will be used to demonstrate the how the process works in practice. • Through this example we also exemplify the skills and aptitudes required to exercise the process successfully.
  • 3. Copyright © 2013, SAS Institute Inc. All rights reserved. AGENDA • Why? • What? • How? • Who?
  • 4. Copyright © 2013, SAS Institute Inc. All rights reserved. AGENDA • Why? • What? • How? • Who?
  • 5. Copyright © 2013, SAS Institute Inc. All rights reserved. ECSITE VALUES • We believe that science is part of culture and that curious minds make for better societies. • We are convinced that science needs fair, open-minded, democratic societies and active citizenship to thrive. • We know that citizens and scientists working together with other stakeholders can make research more relevant. • We find science is fascinating and we are dedicated to making it a source of inspiration for everyone.
  • 6. Copyright © 2013, SAS Institute Inc. All rights reserved. PREMISE OF THIS PRESENTATION So, in the areas of: • Science… • Evidence-based public policy… • The more utilitarian world of providing products and services… it’s good to have an appreciation of how new knowledge is acquired. Better yet is to be able to communicate how this process works, and perhaps even to encourage it’s more widespread use. Especially as ‘big data’ takes hold . . .
  • 7. Copyright © 2013, SAS Institute Inc. All rights reserved. SCIENTIFIC METHOD TIMELINE
  • 8. Copyright © 2013, SAS Institute Inc. All rights reserved. A COLLOQUIAL VIEW http://idea.ucr.edu/documents/flash/scientific_method/story.htm
  • 9. Copyright © 2013, SAS Institute Inc. All rights reserved. ‘MAKE OBSERVATIONS’ PRODUCES DATA, AND … • All data is contextual. • Any analysis that does not understand the data context is pointless, or worse. • Making new discoveries is not ‘algorithmic’, and never could be. • Only people understand ‘context’, ‘relevance’ and ‘utility’.
  • 10. Copyright © 2013, SAS Institute Inc. All rights reserved. AGENDA • Why? • What? • How? • Who?
  • 11. Copyright © 2013, SAS Institute Inc. All rights reserved. A ‘QUALITY’ PERSPECTIVE All value-adding activity occurs in a system of interconnected processes, each of which has suppliers, owner(s) and customers. Variation exists in all processes. Causes of process variation can be segregated into ‘common’ and ‘special’ causes. Understanding the unique nature of common and special causes is the key to reducing process variation, and Reducing process variation is the key to improving product and service quality, productivity and profitability.
  • 12. Copyright © 2013, SAS Institute Inc. All rights reserved. UNIVERSAL MODEL OF A PROCESS System of InterestCauses We Understand Causes We Don’t Understand, Know About, or Care About Measured Effects or Outcomes of Interest X1 X2 X3 X4 X5 X6 Y1 Y2 Y1 = Signal Function1(X1, X2, X3) + Nuisance Function1(X4, X5, X6) Y2 = Signal Function2(X1, X2, X3) + Nuisance Function2(X4, X5, X6) The ‘Nuisance Functions’ are what give rise to the Variation in the process outputs. ‘Statistics’ can be seen as the study of variation.
  • 13. Copyright © 2013, SAS Institute Inc. All rights reserved. ‘STATISTICS’ IS . . . LEARNING IN THE PRESENCE OF VARIATION (Adapted from Box, Hunter and Hunter). What we think is happening What is really happening Situation Appraisal Measurement and Data Collection Situation Appraisal Situation Appraisal Measurement and Data Collection Statistics Able to Consistently Meet Customer Requirements Real World Model Unable to Consistently Meet Customer Requirements There is no ‘free lunch’ – The cost of gaining new knowledge must be offset by its perceived or expected value
  • 14. Copyright © 2013, SAS Institute Inc. All rights reserved. STATISTICAL DISCOVERY WITH JMP
  • 15. Copyright © 2013, SAS Institute Inc. All rights reserved. CDA EDA
  • 16. Copyright © 2013, SAS Institute Inc. All rights reserved. STATISTICAL DISCOVERY Frame Problem or Opportunity Collect Data Uncover Relationships Model Relationships Utilize Knowledge Revise Knowledge “Statistics as Detective” or EDA “Statistics as Lawyer” or CDA
  • 17. Copyright © 2013, SAS Institute Inc. All rights reserved. STATISTICAL DISCOVERY 1. Data visualization, done properly, is very powerful and effective. 2. Statistical analysis, done properly, and defined broadly to include things like experimental design and predictive modeling, is also very powerful and effective, but in a different way. 3. Tightly integrating the two creates a synergy that is much more powerful and effective than either one alone.
  • 18. Copyright © 2013, SAS Institute Inc. All rights reserved. AGENDA • Why? • What? • How? • Who?
  • 19. Copyright © 2013, SAS Institute Inc. All rights reserved.
  • 20. Copyright © 2013, SAS Institute Inc. All rights reserved. AGENDA • Why? • What? • How? • Who?
  • 21. Copyright © 2013, SAS Institute Inc. All rights reserved. JMP … • Aims to provide ‘Statistical Discovery’ on the desktop of Researchers, Engineers and Scientists. • Is characterized by interactivity and agility in usage. • Is a SAS product which has been in development since 1989. • Aspires to support users who: • Have specific research questions, or are simply curious. • Have access to, or can generate, somewhat relevant data. • Do not necessarily have specialist statistical training.
  • 22. Copyright © 2013, SAS Institute Inc. All rights reserved.
  • 23. Copyright © 2013, SAS Institute Inc. All rights reserved. www.SAS.com