Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017

•

1 like•458 views

Codifying Data Science Intuition: Using Decision Theory to Automate Time Series Model Selection: While models generated from cross-sectional data can utilize cross-validation for model selection, most time series models cannot be cross-validated due to the temporal structure of the data used to create them. It is possible to employ a rolling cross-validation technique, however this process is computationally expensive and provides no indication of the long-term forecast accuracies of the models. The purpose of this talk is to elaborate how decision theory can be used to automate time series model selection in order to streamline the manual process of validation and testing. By creating consecutive, temporally independent holdout sets, performance metrics for each model’s prediction on each holdout set are fed into a decision function to select an unbiased model. The decision function helps minimize the poorest performance of each model across all holdout sets in order to counteract the possibility of choosing a model that overfits or underfits the holdout sets. Not only does this process improve forecast accuracy, but it also reduces computation time by only requiring the creation of a fixed number of proposed forecasting models.

Technology

1CONFIDENTIAL INFORMATION OF NEXOSIS.
Automating Time Series Model
Selection with Decision Theory
Ryan West
MLconf Atlanta 2017

2CONFIDENTIAL INFORMATION OF NEXOSIS.
K-Folds
Cross-Validation
Rolling Time Series
Cross-Validation
https://en.wikipedia.org/wiki/Cross-validation_(statistics)
https://robjhyndman.com/hyndsight/tscv/

3CONFIDENTIAL INFORMATION OF NEXOSIS.
Problem Visualized Total Test
Set
Model 1
Forecast
Model 2
Forecast
Model M
Forecast
Test Set
Subset 1
Test Set
Subset
N
Test Set
Subset 1
Test Set
Subset
N
Test Set
Subset 1
Test Set
Subset
N
.......
....... ..............
Minimize:
Maximize:

4CONFIDENTIAL INFORMATION OF NEXOSIS.
Formulation
o minimax(x1, x2, …, xM)
o xi = a variable of N possible values
o error metric calculated with N test sets and forecast of model i
min s
subject to:
s ≥ x1
s ≥ x2
…
s ≥ xM
Equivalent to:

5CONFIDENTIAL INFORMATION OF NEXOSIS.
Alternative Problem Total Test
Set
Test Set
Subset 1
Test Set
Subset 2
Test Set
Subset N
Model 1
Forecast
Model M
Forecast
Model 1
Forecast
Model M
Forecast
Model 1
Forecast
Model M
Forecast
.......
.....................
Maximize:
Minimize:

6CONFIDENTIAL INFORMATION OF NEXOSIS.
Alternative Formulation
o maximin(x1, x2, …, xN)
o xi = a variable of M possible values
o error metric calculated with the forecasts of M models and test set i
Equivalent to: max s
subject to:
s ≤ x1
s ≤ x2
…
s ≤ xN

7CONFIDENTIAL INFORMATION OF NEXOSIS.
Experiment
o 856 time series of daily retail sales data
o 7 exogenous variables per time series
o e.g. promotions, holidays, indicator variables of store open or closed
o 38 possible models
o Testing forecast accuracy of different model selection techniques

8CONFIDENTIAL INFORMATION OF NEXOSIS.
Model Selection Techniques
o Selection using ensembling
o Single test set for model selection
o Additional holdout set
o Selection based on maximin of error metric
o Multiple test sets for model selection
o Additional holdout set
o Selection based on minimizing error metric
o Single test set for model selection
o Additional holdout set

9CONFIDENTIAL INFORMATION OF NEXOSIS.
Partial Autocorrelation
o Strongly seasonal time series

10CONFIDENTIAL INFORMATION OF NEXOSIS.
Error Metric Visualization (MAE)

11CONFIDENTIAL INFORMATION OF NEXOSIS.
Error Metric Visualization (RMSE)

12CONFIDENTIAL INFORMATION OF NEXOSIS.
Error Metric Visualization (RMSPE)

13CONFIDENTIAL INFORMATION OF NEXOSIS.
Error Metric Visualization (sMAPE)

14CONFIDENTIAL INFORMATION OF NEXOSIS.
Forecast Accuracy
Average RMSPE*
on Holdout Set
Model Selection Technique Feature
Engineering
0.382 Minimizing RMSPE on test set No
0.223 Naïve median weekly seasonal
predictions
No
0.215 Maximin of RMSPE on test set subsets Yes
0.204 Minimizing RMSPE on test set Yes
0.191 Ensemble Averaging Yes
*RMSPE = Root Mean Squared Percentage Error

15CONFIDENTIAL INFORMATION OF NEXOSIS.
Thank You!

Viewers also liked

Matei zaharia, spark presentation m lconf 2013MLconf

Jonas Schneider, Head of Engineering for Robotics, OpenAIMLconf

Jessica Rudd, PhD Student, Analytics and Data Science, Kennesaw State Univers...MLconf

Daniel Shank, Data Scientist, Talla at MLconf SF 2017MLconf

Ashfaq Munshi, ML7 Fellow, PepperdataMLconf

Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017MLconf

Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017MLconf

LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...MLconf

Alexandra Johnson, Software Engineer, SigOpt at MLconf ATL 2017MLconf

Xavier Amatriain, Cofounder & CTO, Curai at MLconf SF 2017MLconf

Dr. Steve Liu, Chief Scientist, Tinder at MLconf SF 2017MLconf

Doug Eck, Research Scientist, Google Magenta, at MLconf SF 2017MLconf

Viewers also liked (12)

Matei zaharia, spark presentation m lconf 2013

Jonas Schneider, Head of Engineering for Robotics, OpenAI

Jessica Rudd, PhD Student, Analytics and Data Science, Kennesaw State Univers...

Daniel Shank, Data Scientist, Talla at MLconf SF 2017

Ashfaq Munshi, ML7 Fellow, Pepperdata

Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017

Matineh Shaker, Artificial Intelligence Scientist, Bonsai at MLconf SF 2017

LN Renganarayana, Architect, ML Platform and Services and Madhura Dudhgaonkar...

Alexandra Johnson, Software Engineer, SigOpt at MLconf ATL 2017

Xavier Amatriain, Cofounder & CTO, Curai at MLconf SF 2017

Dr. Steve Liu, Chief Scientist, Tinder at MLconf SF 2017

Doug Eck, Research Scientist, Google Magenta, at MLconf SF 2017

Similar to Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017

Machine learning for sanctions screeningEnigma

P1121133727Ashraf Aboshosha

DSUS_MAO_2012_JieMDO_Lab

Boston housing data analysisPreethi Jayaram Jayaraman

Cross-validation aggregation for forecastingDevon Barrow

Network Intrusion Detection System Using Machine Learning and Deep Learning F...Leaving A Legacy

CMU Trecvid sed11Lu Jiang

DSUS_SDM2012_JieMDO_Lab

Impact of novel ms ms all acquisition and processing techniques on forensic t...SCIEX

Impact of novel ms ms all acquisition and processing techniques on forensic t...Sara Feltesse

German credit data analysisPreethi Jayaram Jayaraman

Introduction of Feature HashingWush Wu

General pipeline of transcriptomics analysisSanty Marques-Ladeira

Risk_Management_Final_ReportRohan Sanas

Jogging While Driving, and Other Software Engineering Research Problems (invi...David Rosenblum

Anomaly detection, part 1David Khosid

680report finalRajesh M

Ludwig: A code-free deep learning toolbox | Piero Molino, Uber AIData Science Milan

Variable Selection Methodsjoycemi_la

Similar to Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017 (20)

Machine learning for sanctions screening

P1121133727

DSUS_MAO_2012_Jie

Boston housing data analysis

Cross-validation aggregation for forecasting

Network Intrusion Detection System Using Machine Learning and Deep Learning F...

CMU Trecvid sed11

DSUS_SDM2012_Jie

Impact of novel ms ms all acquisition and processing techniques on forensic t...

German credit data analysis

Introduction of Feature Hashing

General pipeline of transcriptomics analysis

Risk_Management_Final_Report

Jogging While Driving, and Other Software Engineering Research Problems (invi...

Anomaly detection, part 1

680report final

Ludwig: A code-free deep learning toolbox | Piero Molino, Uber AI

Variable Selection Methods

Recently uploaded

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

Automating Google Workspace (GWS) & more with Apps Scriptwesley chun

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

DBX First Quarter 2024 Investor PresentationDropbox

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

FWD Group - Insurer Innovation Award 2024The Digital Insurer

A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz

Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays

Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer

Real Time Object Detection Using Open CVKhem

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot

MS Copilot expands with MS Graph connectorsNanddeep Nachan

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz

Architecting Cloud Native ApplicationsWSO2

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10

Recently uploaded (20)

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

Strategies for Landing an Oracle DBA Job as a Fresher

Automating Google Workspace (GWS) & more with Apps Script

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

DBX First Quarter 2024 Investor Presentation

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

FWD Group - Insurer Innovation Award 2024

A Beginners Guide to Building a RAG App Using Open Source Milvus

Artificial Intelligence Chap.5 : Uncertainty

How to Troubleshoot Apps for the Modern Connected Worker

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu

Axa Assurance Maroc - Insurer Innovation Award 2024

Real Time Object Detection Using Open CV

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER

MS Copilot expands with MS Graph connectors

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...

Architecting Cloud Native Applications

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...

Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017

1. 1CONFIDENTIAL INFORMATION OF NEXOSIS. Automating Time Series Model Selection with Decision Theory Ryan West MLconf Atlanta 2017

2. 2CONFIDENTIAL INFORMATION OF NEXOSIS. K-Folds Cross-Validation Rolling Time Series Cross-Validation https://en.wikipedia.org/wiki/Cross-validation_(statistics) https://robjhyndman.com/hyndsight/tscv/

3. 3CONFIDENTIAL INFORMATION OF NEXOSIS. Problem Visualized Total Test Set Model 1 Forecast Model 2 Forecast Model M Forecast Test Set Subset 1 Test Set Subset N Test Set Subset 1 Test Set Subset N Test Set Subset 1 Test Set Subset N ....... ....... .............. Minimize: Maximize:

4. 4CONFIDENTIAL INFORMATION OF NEXOSIS. Formulation o minimax(x1, x2, …, xM) o xi = a variable of N possible values o error metric calculated with N test sets and forecast of model i min s subject to: s ≥ x1 s ≥ x2 … s ≥ xM Equivalent to:

5. 5CONFIDENTIAL INFORMATION OF NEXOSIS. Alternative Problem Total Test Set Test Set Subset 1 Test Set Subset 2 Test Set Subset N Model 1 Forecast Model M Forecast Model 1 Forecast Model M Forecast Model 1 Forecast Model M Forecast ....... ..................... Maximize: Minimize:

6. 6CONFIDENTIAL INFORMATION OF NEXOSIS. Alternative Formulation o maximin(x1, x2, …, xN) o xi = a variable of M possible values o error metric calculated with the forecasts of M models and test set i Equivalent to: max s subject to: s ≤ x1 s ≤ x2 … s ≤ xN

7. 7CONFIDENTIAL INFORMATION OF NEXOSIS. Experiment o 856 time series of daily retail sales data o 7 exogenous variables per time series o e.g. promotions, holidays, indicator variables of store open or closed o 38 possible models o Testing forecast accuracy of different model selection techniques

8. 8CONFIDENTIAL INFORMATION OF NEXOSIS. Model Selection Techniques o Selection using ensembling o Single test set for model selection o Additional holdout set o Selection based on maximin of error metric o Multiple test sets for model selection o Additional holdout set o Selection based on minimizing error metric o Single test set for model selection o Additional holdout set

9. 9CONFIDENTIAL INFORMATION OF NEXOSIS. Partial Autocorrelation o Strongly seasonal time series

10. 10CONFIDENTIAL INFORMATION OF NEXOSIS. Error Metric Visualization (MAE)

11. 11CONFIDENTIAL INFORMATION OF NEXOSIS. Error Metric Visualization (RMSE)

12. 12CONFIDENTIAL INFORMATION OF NEXOSIS. Error Metric Visualization (RMSPE)

13. 13CONFIDENTIAL INFORMATION OF NEXOSIS. Error Metric Visualization (sMAPE)

14. 14CONFIDENTIAL INFORMATION OF NEXOSIS. Forecast Accuracy Average RMSPE* on Holdout Set Model Selection Technique Feature Engineering 0.382 Minimizing RMSPE on test set No 0.223 Naïve median weekly seasonal predictions No 0.215 Maximin of RMSPE on test set subsets Yes 0.204 Minimizing RMSPE on test set Yes 0.191 Ensemble Averaging Yes *RMSPE = Root Mean Squared Percentage Error

15. 15CONFIDENTIAL INFORMATION OF NEXOSIS. Thank You!

Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (12)

Similar to Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017

Similar to Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017 (20)

More from MLconf

More from MLconf (20)

Recently uploaded

Recently uploaded (20)

Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017