SlideShare a Scribd company logo
1 of 24
Download to read offline
Asset Management for
Machine Learning
Georg Hildebrand
DATA AI
$$$
???
The Big Picture:
ML
=
Knowledge Workers Productivity
x 50 ??
Peter Drucker source
Increase of
Performance
Acceleration through
automatisation The area of knowledge workers
… the story of a model that was sent
as email attachment ...
… the story of features that were not
available anymore ...
*D. Sculley et al., “Hidden technical debt in machine learning
systems,” in Advances in Neural Information Processing
Systems, 2015, pp. 2503–2511.
Pick a complexity for your ML product:
source
System of ML silos
Productivity …
Hidden depth of
machine learning
How to reach
50 x productivity
?
… make models and features human
and machine readable!
Store context
- How was the input data
chosen?
- Who trained the model? Is
there a project paper?
- How to run the model?
- Store log output of model
training
- Framework used to build the
model.
- Chain model and feature
preprocessing!
Version and
test
- Snapshot input data /features
- Version the model (eg. into S3
or a backend database)
- Link the container artifact.
- Provide input and output
tests
- Store performance metrics
- …
Put it into practice
-
There is no one stop shop solution :-/
Manage ML Deployment pipeline:
Example: SeldonIO, serving and integrating ML
Purpose: End to end deployment for serving ML and more, very advanced.
https://github.com/SeldonIO/
Trys use the
modularised way!
May require data to
move around.
Manage Models + Data and serve low latency:
Example: Vespa, storing model together with the data
Purpose: eg. Low latency serving of ML
https://github.com/vespa-engine/vespa
Container node
Query
Application
Package
Admin &
Config
Content node
Deploy
- Configuration
- Components
- ML models
Scatter-gather
Core
sharding
models models models
Moves models to
data!
Manage Project, Code, Data and more:
Purpose: Git for data scientists - manage your code and data together
Example DVC (Data Version Control):
https://github.com/iterative/dvc
Input and output are
tracked --> DAG !!!
$ dvc run python code/xml_to_tsv.py
data/Posts.xml data/Posts.tsv python
* source, why luigi and airflow might not be
enough...
Model tracking, snapshotting and reproducibility:
Purpose: Git for data scientists - turn any repository into a trackable task record with reusable
environments and metrics logging.
Example Datmo:
https://github.com/datmo/datmo
OS, but tight to paid
service!!!
PipelineAI:
https://github.com/PipelineAI/pipeline
Reproducible Model Pipelines + serving:
Purpose: Consistent, Immutable, Reproducible Model Runtimes
Example PipelineAI:
https://github.com/PipelineAI/pipeline
Models are stored in
docker containers!!!
Open Source solutions
for model / workflow
management
Open Source
Project
K8s 1st class
support
# Active
Maintainers
# Commits 1st Commit Comp
Seldon.IO Yes + Kubeflow 3 1131 2017-Feb #6
H2o driverless-ai Yes + Kubeflow 14 22967 2014-Mar #12
PipelineAI Yes. No Kubeflow 1 4846 2015-Jul #273
Polyaxon Yes. No Kubeflow 1 2732 2017-May #88
Vespa-engine No, but possilbe 14 18290 2016-Jun #5854
IBM/FfDL Yes. No Kubeflow 4 bulk-init hidden #77
RiseML Yes. No Kubeflow 3 95 2017-Oct #9
databricks/mlflow No. No Kubeflow 5 bulk-init hidden #58
datmo No, possible 4 bulk-init 2018-04 #213
Data / Feature
Management
Open Source
Project
Note # Active
Maintainers
# Commits 1st Commit Comp
Data Version Control
(DVC)
project and
data
management
2 1364 2017-Mar #readme
pachyderm Data Pipeline,
management
6 11652 2014-09 tbd
quiltdata/quilt Data and
package
management
8 1177 2017-01 tbd
Questions?
Georg Hildebrand, zalando SE
https://github.com/georghildebrand
https://www.linkedin.com/in/georghildebrand/
Example Components
ML Journey
Idea
Explore
Combine and
Extract Data
Analyse and
Preprocess
Model and
train
Test and Monitor
Share and
Improve
There is even more hidden depth:
● Models erode boundaries of modularisation:
context and preprocessing dependencies
● Data Dependencies Cost More than Code Dependencies (unstable data
dependencies, data quality monitoring etc.)
● Complex hardware requirements (GPUs on kubernetes??)
● Feedback Loops (model influences its feature data)
● ML-System Anti-Patterns (glue code, pipeline jungles …)
* Own experience
** D. Sculley et al., “Hidden technical debt in machine learning
systems,” in Advances in Neural Information Processing Systems,
2015, pp. 2503–2511.
*** D. Sculley et al., “Machine Learning: The High-Interest Credit Card
of Technical Debt,” p. 9.

More Related Content

What's hot (15)

Models of SDLC (Software Development Life Cycle / Program Development Life Cy...
Models of SDLC (Software Development Life Cycle / Program Development Life Cy...Models of SDLC (Software Development Life Cycle / Program Development Life Cy...
Models of SDLC (Software Development Life Cycle / Program Development Life Cy...
 
SDLC Models
SDLC ModelsSDLC Models
SDLC Models
 
Agile model
Agile modelAgile model
Agile model
 
DSDM
DSDMDSDM
DSDM
 
INTRODUCTION TO INDUSTRIAL REVOLUTION 4.0
INTRODUCTION TO INDUSTRIAL REVOLUTION 4.0 INTRODUCTION TO INDUSTRIAL REVOLUTION 4.0
INTRODUCTION TO INDUSTRIAL REVOLUTION 4.0
 
Serving ML easily with FastAPI
Serving ML easily with FastAPIServing ML easily with FastAPI
Serving ML easily with FastAPI
 
Lambda kappa architecture - the jury are still out
Lambda   kappa architecture - the jury are still outLambda   kappa architecture - the jury are still out
Lambda kappa architecture - the jury are still out
 
Map reduce vs spark
Map reduce vs sparkMap reduce vs spark
Map reduce vs spark
 
facebook architecture for 600M users
facebook architecture for 600M usersfacebook architecture for 600M users
facebook architecture for 600M users
 
GCP Data Engineer cheatsheet
GCP Data Engineer cheatsheetGCP Data Engineer cheatsheet
GCP Data Engineer cheatsheet
 
How to Make a Good Presentation (academical) -Professorillama ep1
How to Make a Good Presentation (academical) -Professorillama ep1How to Make a Good Presentation (academical) -Professorillama ep1
How to Make a Good Presentation (academical) -Professorillama ep1
 
MLflow with Databricks
MLflow with DatabricksMLflow with Databricks
MLflow with Databricks
 
Apache Spark MLlib
Apache Spark MLlib Apache Spark MLlib
Apache Spark MLlib
 
Ohje Desuconin ohjelmanpitäjälle
Ohje Desuconin ohjelmanpitäjälleOhje Desuconin ohjelmanpitäjälle
Ohje Desuconin ohjelmanpitäjälle
 
Introduction to PySpark
Introduction to PySparkIntroduction to PySpark
Introduction to PySpark
 

Similar to 2018 data engineering for ml asset management for features and models

Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.Luciano Resende
 
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...ScyllaDB
 
PyData Meetup - Feature Store for Hopsworks and ML Pipelines
PyData Meetup - Feature Store for Hopsworks and ML PipelinesPyData Meetup - Feature Store for Hopsworks and ML Pipelines
PyData Meetup - Feature Store for Hopsworks and ML PipelinesJim Dowling
 
Machine learning at scale challenges and solutions
Machine learning at scale challenges and solutionsMachine learning at scale challenges and solutions
Machine learning at scale challenges and solutionsStavros Kontopoulos
 
Machine Learning Models in Production
Machine Learning Models in ProductionMachine Learning Models in Production
Machine Learning Models in ProductionDataWorks Summit
 
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...Sotrender
 
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...Henry Saputra
 
Scaling AI/ML with Containers and Kubernetes
Scaling AI/ML with Containers and Kubernetes Scaling AI/ML with Containers and Kubernetes
Scaling AI/ML with Containers and Kubernetes Tushar Katarki
 
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...James Anderson
 
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...Databricks
 
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...Abhinav Joshi
 
Tech leaders guide to effective building of machine learning products
Tech leaders guide to effective building of machine learning productsTech leaders guide to effective building of machine learning products
Tech leaders guide to effective building of machine learning productsGianmario Spacagna
 
ML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning Infrastructure
ML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning InfrastructureML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning Infrastructure
ML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning InfrastructureFei Chen
 
Infrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload DeploymentInfrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload DeploymentDatabricks
 
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...Akash Tandon
 
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning ModelsApache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning ModelsAnyscale
 
Vertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflowsVertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflowsMárton Kodok
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaData Science Milan
 

Similar to 2018 data engineering for ml asset management for features and models (20)

Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
Elyra - a set of AI-centric extensions to JupyterLab Notebooks.
 
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
 
PyData Meetup - Feature Store for Hopsworks and ML Pipelines
PyData Meetup - Feature Store for Hopsworks and ML PipelinesPyData Meetup - Feature Store for Hopsworks and ML Pipelines
PyData Meetup - Feature Store for Hopsworks and ML Pipelines
 
Machine learning at scale challenges and solutions
Machine learning at scale challenges and solutionsMachine learning at scale challenges and solutions
Machine learning at scale challenges and solutions
 
Machine Learning Models in Production
Machine Learning Models in ProductionMachine Learning Models in Production
Machine Learning Models in Production
 
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
 
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...
 
Scaling AI/ML with Containers and Kubernetes
Scaling AI/ML with Containers and Kubernetes Scaling AI/ML with Containers and Kubernetes
Scaling AI/ML with Containers and Kubernetes
 
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
 
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
 
03_aiops-1.pptx
03_aiops-1.pptx03_aiops-1.pptx
03_aiops-1.pptx
 
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
 
Tech leaders guide to effective building of machine learning products
Tech leaders guide to effective building of machine learning productsTech leaders guide to effective building of machine learning products
Tech leaders guide to effective building of machine learning products
 
DevOps for DataScience
DevOps for DataScienceDevOps for DataScience
DevOps for DataScience
 
ML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning Infrastructure
ML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning InfrastructureML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning Infrastructure
ML Platform Q1 Meetup: Airbnb's End-to-End Machine Learning Infrastructure
 
Infrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload DeploymentInfrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload Deployment
 
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
 
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning ModelsApache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
 
Vertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflowsVertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflows
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at Helixa
 

Recently uploaded

Cooling Tower SERD pH drop issue (11 April 2024) .pptx
Cooling Tower SERD pH drop issue (11 April 2024) .pptxCooling Tower SERD pH drop issue (11 April 2024) .pptx
Cooling Tower SERD pH drop issue (11 April 2024) .pptxmamansuratman0253
 
Indian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptIndian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptMadan Karki
 
List of Accredited Concrete Batching Plant.pdf
List of Accredited Concrete Batching Plant.pdfList of Accredited Concrete Batching Plant.pdf
List of Accredited Concrete Batching Plant.pdfisabel213075
 
DM Pillar Training Manual.ppt will be useful in deploying TPM in project
DM Pillar Training Manual.ppt will be useful in deploying TPM in projectDM Pillar Training Manual.ppt will be useful in deploying TPM in project
DM Pillar Training Manual.ppt will be useful in deploying TPM in projectssuserb6619e
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm Systemirfanmechengr
 
Earthing details of Electrical Substation
Earthing details of Electrical SubstationEarthing details of Electrical Substation
Earthing details of Electrical Substationstephanwindworld
 
11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdfHafizMudaserAhmad
 
National Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfNational Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfRajuKanojiya4
 
"Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ..."Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ...Erbil Polytechnic University
 
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Erbil Polytechnic University
 
Katarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School CourseKatarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School Coursebim.edu.pl
 
multiple access in wireless communication
multiple access in wireless communicationmultiple access in wireless communication
multiple access in wireless communicationpanditadesh123
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catcherssdickerson1
 
Crushers to screens in aggregate production
Crushers to screens in aggregate productionCrushers to screens in aggregate production
Crushers to screens in aggregate productionChinnuNinan
 
System Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingSystem Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingBootNeck1
 
Python Programming for basic beginners.pptx
Python Programming for basic beginners.pptxPython Programming for basic beginners.pptx
Python Programming for basic beginners.pptxmohitesoham12
 
Autonomous emergency braking system (aeb) ppt.ppt
Autonomous emergency braking system (aeb) ppt.pptAutonomous emergency braking system (aeb) ppt.ppt
Autonomous emergency braking system (aeb) ppt.pptbibisarnayak0
 
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONjhunlian
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 

Recently uploaded (20)

Cooling Tower SERD pH drop issue (11 April 2024) .pptx
Cooling Tower SERD pH drop issue (11 April 2024) .pptxCooling Tower SERD pH drop issue (11 April 2024) .pptx
Cooling Tower SERD pH drop issue (11 April 2024) .pptx
 
Indian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptIndian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.ppt
 
List of Accredited Concrete Batching Plant.pdf
List of Accredited Concrete Batching Plant.pdfList of Accredited Concrete Batching Plant.pdf
List of Accredited Concrete Batching Plant.pdf
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
DM Pillar Training Manual.ppt will be useful in deploying TPM in project
DM Pillar Training Manual.ppt will be useful in deploying TPM in projectDM Pillar Training Manual.ppt will be useful in deploying TPM in project
DM Pillar Training Manual.ppt will be useful in deploying TPM in project
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm System
 
Earthing details of Electrical Substation
Earthing details of Electrical SubstationEarthing details of Electrical Substation
Earthing details of Electrical Substation
 
11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf
 
National Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfNational Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdf
 
"Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ..."Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ...
 
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
 
Katarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School CourseKatarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School Course
 
multiple access in wireless communication
multiple access in wireless communicationmultiple access in wireless communication
multiple access in wireless communication
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
 
Crushers to screens in aggregate production
Crushers to screens in aggregate productionCrushers to screens in aggregate production
Crushers to screens in aggregate production
 
System Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingSystem Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event Scheduling
 
Python Programming for basic beginners.pptx
Python Programming for basic beginners.pptxPython Programming for basic beginners.pptx
Python Programming for basic beginners.pptx
 
Autonomous emergency braking system (aeb) ppt.ppt
Autonomous emergency braking system (aeb) ppt.pptAutonomous emergency braking system (aeb) ppt.ppt
Autonomous emergency braking system (aeb) ppt.ppt
 
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 

2018 data engineering for ml asset management for features and models

  • 1. Asset Management for Machine Learning Georg Hildebrand
  • 3. The Big Picture: ML = Knowledge Workers Productivity x 50 ?? Peter Drucker source
  • 5. … the story of a model that was sent as email attachment ...
  • 6. … the story of features that were not available anymore ...
  • 7. *D. Sculley et al., “Hidden technical debt in machine learning systems,” in Advances in Neural Information Processing Systems, 2015, pp. 2503–2511. Pick a complexity for your ML product:
  • 8. source System of ML silos Productivity … Hidden depth of machine learning
  • 9. How to reach 50 x productivity ?
  • 10. … make models and features human and machine readable!
  • 11. Store context - How was the input data chosen? - Who trained the model? Is there a project paper? - How to run the model? - Store log output of model training - Framework used to build the model. - Chain model and feature preprocessing!
  • 12. Version and test - Snapshot input data /features - Version the model (eg. into S3 or a backend database) - Link the container artifact. - Provide input and output tests - Store performance metrics - …
  • 13. Put it into practice - There is no one stop shop solution :-/
  • 14. Manage ML Deployment pipeline: Example: SeldonIO, serving and integrating ML Purpose: End to end deployment for serving ML and more, very advanced. https://github.com/SeldonIO/ Trys use the modularised way! May require data to move around.
  • 15. Manage Models + Data and serve low latency: Example: Vespa, storing model together with the data Purpose: eg. Low latency serving of ML https://github.com/vespa-engine/vespa Container node Query Application Package Admin & Config Content node Deploy - Configuration - Components - ML models Scatter-gather Core sharding models models models Moves models to data!
  • 16. Manage Project, Code, Data and more: Purpose: Git for data scientists - manage your code and data together Example DVC (Data Version Control): https://github.com/iterative/dvc Input and output are tracked --> DAG !!! $ dvc run python code/xml_to_tsv.py data/Posts.xml data/Posts.tsv python * source, why luigi and airflow might not be enough...
  • 17. Model tracking, snapshotting and reproducibility: Purpose: Git for data scientists - turn any repository into a trackable task record with reusable environments and metrics logging. Example Datmo: https://github.com/datmo/datmo OS, but tight to paid service!!!
  • 18. PipelineAI: https://github.com/PipelineAI/pipeline Reproducible Model Pipelines + serving: Purpose: Consistent, Immutable, Reproducible Model Runtimes Example PipelineAI: https://github.com/PipelineAI/pipeline Models are stored in docker containers!!!
  • 19. Open Source solutions for model / workflow management Open Source Project K8s 1st class support # Active Maintainers # Commits 1st Commit Comp Seldon.IO Yes + Kubeflow 3 1131 2017-Feb #6 H2o driverless-ai Yes + Kubeflow 14 22967 2014-Mar #12 PipelineAI Yes. No Kubeflow 1 4846 2015-Jul #273 Polyaxon Yes. No Kubeflow 1 2732 2017-May #88 Vespa-engine No, but possilbe 14 18290 2016-Jun #5854 IBM/FfDL Yes. No Kubeflow 4 bulk-init hidden #77 RiseML Yes. No Kubeflow 3 95 2017-Oct #9 databricks/mlflow No. No Kubeflow 5 bulk-init hidden #58 datmo No, possible 4 bulk-init 2018-04 #213
  • 20. Data / Feature Management Open Source Project Note # Active Maintainers # Commits 1st Commit Comp Data Version Control (DVC) project and data management 2 1364 2017-Mar #readme pachyderm Data Pipeline, management 6 11652 2014-09 tbd quiltdata/quilt Data and package management 8 1177 2017-01 tbd
  • 21. Questions? Georg Hildebrand, zalando SE https://github.com/georghildebrand https://www.linkedin.com/in/georghildebrand/
  • 23. ML Journey Idea Explore Combine and Extract Data Analyse and Preprocess Model and train Test and Monitor Share and Improve
  • 24. There is even more hidden depth: ● Models erode boundaries of modularisation: context and preprocessing dependencies ● Data Dependencies Cost More than Code Dependencies (unstable data dependencies, data quality monitoring etc.) ● Complex hardware requirements (GPUs on kubernetes??) ● Feedback Loops (model influences its feature data) ● ML-System Anti-Patterns (glue code, pipeline jungles …) * Own experience ** D. Sculley et al., “Hidden technical debt in machine learning systems,” in Advances in Neural Information Processing Systems, 2015, pp. 2503–2511. *** D. Sculley et al., “Machine Learning: The High-Interest Credit Card of Technical Debt,” p. 9.