Watch the talk ➟ http://bit.ly/1NJGRcb
2008 was a historic year in many ways, perhaps most prominently for the election of the first African American president. But 2008 also saw an unlikely hero emerge from the record-setting presidential race: Nate Silver, with his astonishingly accurate prediction of its results. More important than Nate's remarkable result, however, was the attention it drew to the potential of data and to the importance of uncertainty (through Bayesian statistics). And it was in that moment, with Nate's (now famous) 538 blog, that our modern incarnation of data journalism was born (though ironically the field dates back to an attempt to predict the 1952 presidential election).
In this talk I will walk through the approach that made Nate so successful in 2008, test its efficacy in predicting the early 2016 primary results, and show how these (relatively) simple concepts can be applied in novel ways to tangential fields to great effect (for fun and profit), by estimating the time to failure of industrial machines in our connected world of the IoT.
13. THE THEORY BEHIND THE MAGIC
Courtesy of 538 and Drew Linzer (Votamatic)
Jonathan Dinu // April 13th, 2016 // @clearspandex
14. CHALLENGES
> Historical Predictions susceptible to Uncertainty
> Sparse pre-election Poll Data
> Sampling Error and House Effects Bias Polls
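To make the sampling-error challenge concrete, here is a minimal sketch of the standard margin-of-error calculation for a single poll proportion (the function name and the example numbers are illustrative, not from the talk). Even before any house-effect bias, a typical poll carries several points of pure sampling noise:

```python
import math

def margin_of_error(p, n, z=1.96):
    """Approximate 95% sampling margin of error for a poll proportion p
    from a simple random sample of n respondents."""
    return z * math.sqrt(p * (1 - p) / n)

# A 600-person poll showing a candidate at 52% carries roughly +/-4 points
# of sampling error alone -- before any house-effect bias is considered.
moe = margin_of_error(0.52, 600)
print(round(moe * 100, 1))  # -> 4.0 (percentage points)
```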
15. WHAT DREW (AND NATE) DID DIFFERENTLY
> State level vs. National Polls
> Online Updates as more data become available
> Not All Polls are Created Equal (weights/averages)
> (Probabilistic) Forecasting in addition to Estimation
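Two of these ideas, online updating and unequal poll weights, can be sketched together with a Beta-Binomial update (a standard conjugate model; the specific weights and poll numbers below are made up for illustration and are not from 538's or Linzer's actual models):

```python
# Hypothetical sketch: updating an estimate of a candidate's support as new
# state polls arrive, down-weighting lower-quality polls.
alpha, beta = 1.0, 1.0  # uniform Beta prior over true support

polls = [  # (respondents favoring candidate, total respondents, weight)
    (310, 600, 1.0),   # high-quality live-interview poll, full weight
    (540, 1000, 0.5),  # online panel, down-weighted
]

for favor, total, weight in polls:
    # Online update: each poll shifts the posterior in proportion to its weight.
    alpha += weight * favor
    beta += weight * (total - favor)

mean = alpha / (alpha + beta)  # posterior mean estimate of support
print(round(mean, 3))
```

Because the posterior after each poll becomes the prior for the next, the same code works whether the polls arrive all at once or one at a time, which is the "online" property the slide refers to.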
21. STATES + TIME + TRANSITIONS
22. GRAPHICAL MODELS
> Assess Risk (uncertainty) as Probability of Failure
> Unobservable (hidden) Failure States
> Proactive/Early Prediction
> Interpretable Latent Properties
> Online Algorithm (iteratively improve)
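The bullets above can be illustrated with the simplest graphical model of this kind, a two-state hidden Markov model, where "degraded" is the unobservable failure state and we only see noisy sensor readings. This is a minimal sketch with made-up probabilities, not the model from the talk:

```python
states = ("healthy", "degraded")
transition = {                      # P(next state | current state)
    "healthy": {"healthy": 0.95, "degraded": 0.05},
    "degraded": {"healthy": 0.00, "degraded": 1.00},  # failure is absorbing
}
emission = {                        # P(observation | state)
    "healthy": {"ok": 0.9, "alarm": 0.1},
    "degraded": {"ok": 0.3, "alarm": 0.7},
}
belief = {"healthy": 0.99, "degraded": 0.01}  # prior over the hidden state

def update(belief, obs):
    """One online forward-filtering step: predict via the transition model,
    then correct using the new observation."""
    predicted = {
        s: sum(belief[prev] * transition[prev][s] for prev in states)
        for s in states
    }
    unnorm = {s: predicted[s] * emission[s][obs] for s in states}
    z = sum(unnorm.values())
    return {s: unnorm[s] / z for s in states}

for obs in ["ok", "alarm", "alarm"]:
    belief = update(belief, obs)
print(round(belief["degraded"], 2))  # P(hidden failure state | observations)
```

Each call to `update` is one step of the online algorithm: the belief is revised as each reading arrives, and the hidden-state probability serves directly as the interpretable risk estimate.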
23. KEY IDEAS
> Uncertainty
> Point vs. Distribution (or confidence intervals)
> Bayesian vs. Frequentist methods
> Temporal variability
All models are wrong, but some models are useful... or something
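The "point vs. distribution" idea can be sketched with one poll (illustrative numbers): a point estimate is a single number, while the Bayesian posterior over true support yields a whole distribution, from which a credible interval follows by simulation:

```python
import random

random.seed(0)

favor, total = 310, 600            # hypothetical poll result
point_estimate = favor / total     # a single number hides the uncertainty

# Posterior over true support under a uniform prior: Beta(favor+1, total-favor+1).
draws = sorted(random.betavariate(favor + 1, total - favor + 1)
               for _ in range(10_000))
lo, hi = draws[250], draws[9_750]  # central 95% credible interval

print(round(point_estimate, 3))    # the point estimate, ~0.517
print(round(lo, 3), round(hi, 3))  # the interval, roughly 0.48 to 0.56
```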
25. IOT IMPACT: DETECTING MACHINE FAILURES
> Structural Predictions susceptible to Uncertainty (Supervised Learning)
> Sparse Sensor Data (costly to measure)
> Sampling Error Biases Inspections (prediction in the absence of data)
> Online Updates as more data become available
> Not All Sensors are Created Equal (weights/averages)
> (Probabilistic) Forecasting in addition to Estimation
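The "not all sensors are created equal" bullet can be sketched with inverse-variance weighting, a standard way to fuse readings so noisier sensors contribute less (sensor types and numbers below are illustrative, not from the talk):

```python
readings = [  # (measured temperature, sensor noise variance)
    (71.0, 0.5),   # calibrated thermocouple, low noise
    (74.0, 4.0),   # cheap ambient sensor, high noise
]

# Weight each reading by the inverse of its noise variance.
weights = [1.0 / var for _, var in readings]
fused = sum(w * x for (x, _), w in zip(readings, weights)) / sum(weights)
print(round(fused, 2))  # fused estimate sits much closer to the reliable sensor
```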
33. REFERENCES
> The Signal and the Noise (Nate Silver)
> The Data Journalism Handbook
> Dynamic Bayesian Forecasting of Presidential Elections in the States (Drew A. Linzer)
> Time for Change model (Alan Abramowitz)
> Bayesian Data Analysis (Andrew Gelman)
> Causality (Judea Pearl)
> 538: How We Are Forecasting the 2016 Primaries
> Predicting Time-to-Failure of Industrial Machines with Temporal Data Mining