Unleash Data Science
Carlos Guestrin
CEO
GraphLab Create™
Last year…
Machine Learning
Today: Introducing GraphLab Create
Usability for ML from Inspiration to Production
Data-driven, predictive data apps
are making our world amazing…
Historical data
Sensor & interaction data
Real-time predictions & decisions
Recommenders Industrial Apps
Forecasters
Social
Human Sensing
Fraud & Anomaly Detection
Sentiment Analysis & other Text Apps
Personalized Medicine
Action 1: simple predictive app
Aren’t you tired of those spam txt messages???
Why are predictive apps the future?
Spectrum of data products
Continuously get to value faster!
time to line-of-business value
Building a predictive app
Was using 217 business rules
hoping world doesn’t change
Have an inspiring idea to
reinvent their business
Key pains:
Hiring Talent
Shortfall in data-savvy workers needed to make sense out of big data by 2018: 35% [McKinsey 2011]
Noisy Space of Tools
“Data scientists use a variety of tools, across different programming languages… require a lot of context-switching… affects productivity and impedes reproducibility.”
- Ben Lorica, Data Analysis: Just one component of the Data Science workflow
Crossing the Big Data Chasm
speed of iteration
scale of data
Get a Hadoop cluster!?!?
single machine memory
production data
big data chasm
GraphLab Create:
Unleashing data science
from inspiration to production
Data scientist: inspiration to production
Analyze big data on one machine
graphs, tables, text, images
in Python
doesn’t have to fit in memory
Distribute in production
with same code
on EC2, Yarn,…
Use my laptop
Variety of data
Not toy data scales
Language I love
Iterate quickly
Prototype Monitor Production
data pipeline predictive service
GraphLab Canvas: Monitor & visualize
from prototype to production
GraphLab Create
What folks are saying about
GraphLab Create
“The ease of use and scalable performance, which is not limited by the memory of the
machine, are allowing us to innovate and advance at an astonishing pace.”
- Andrew Bruce, Senior Director, Data Science, Zillow
“Graphlab Create provides us with an end-to-end efficient framework … both tabular and
graph data generated by the activity of our users.”
- Baldo Faieta, Social Computing Lead, Adobe Systems
“…during my time as Zynga's lead architect for big data, I found my way to GraphLab. I was
astounded at the dramatic savings, on the order of 500x…”
- Mohan Reddy, Chief Architect, The Hive LLC.
Action 2: end-to-end predictive app
Key use cases
• Retail: recommenders, pricing,…
• Financial: fraud detection,…
• Marketing & Advertising: targeting, sentiment,…
• Telecom: churn prediction,…
• Social network analysis: communities, friends,…
• Medical: med records, drug design,…
Detecting fraud in the bitcoin network
Poorly regulated cryptocurrency market with open transactions
Task:
Investigate suspicious transactional behavior
Check out at graphlab.com…
Track stolen bitcoins
allinvain thief stole 25,000 BTC ($14M USD) from a user & laundered tainted money
theft laundering
Productive on one
machine
Python
Tables, graphs, text & images
Scale beyond memory
Integrated visualization
Productive at scale
Deploy, monitor, improve on the Cloud/Cluster
Integrates with Hadoop/EC2
Deploy data pipelines & prediction services
Built for Machine Learning
in Production
Scalable, robust algorithms
End-to-end support
Get to value fast
Powered by
GraphLab Engine
Fastest analytics system
GraphLab Create
Getting started is as easy as…
1. pip install graphlab-create
2. be creative with your data
Hands-on training tomorrow
Online: Learning ML in Practice with GraphLab Notebook
http://graphlab.com/learn
ML tutorials, tips, tricks
End-to-end GraphLab Notebooks
Quick how-tos
Deep Dive: key components of GraphLab Create
Scalable data structures:
The SFrame & The SGraph
More details in Yucheng’s talk at 1:40pm
Scalable data structures
ease those scaling pains
Pain | SFrame/SGraph
Running out of memory | Graceful degradation; optimized out-of-core computation
Integrating data | Unified tables, graphs, text, images
Missing values | Missing value support from get-go
Strong/weak types | Strong types only when strength helps
SFrame: Scalable tabular data
Never run out of memory
Sharded, compressed, out-of-core, columnar
Arbitrary lambda transformations, joins,… from Python
Group-by aggregate: 10M rows/s on your desktop
Same Python code
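To make "out-of-core" concrete, here is a minimal sketch in plain standard-library Python, not the SFrame API. It shows a group-by aggregate that streams rows from disk and keeps only one running total per key in memory. The file layout, the column names (user, rating) and all helper names are assumptions for illustration. The slides do not show how SFrame handles sharding, compression or columnar storage, and none of that is modeled here.

```python
import csv
import os
import tempfile
from collections import defaultdict


def stream_rows(path):
    """Yield one row (as a dict) at a time so the file never has to fit in memory."""
    with open(path, newline="") as f:
        for row in csv.DictReader(f):
            yield row


def groupby_sum(rows, key, value):
    """Group-by aggregate that holds only one running total per distinct key."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += float(row[value])
    return dict(totals)


def write_demo_file():
    """Write a tiny hypothetical ratings file; a real one would be far larger."""
    fd, path = tempfile.mkstemp(suffix=".csv")
    with os.fdopen(fd, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["user", "rating"])
        writer.writerows([["ann", 4], ["bob", 2], ["ann", 5], ["bob", 3]])
    return path


def demo():
    """Aggregate the demo file and clean it up afterwards."""
    path = write_demo_file()
    try:
        return groupby_sum(stream_rows(path), key="user", value="rating")
    finally:
        os.remove(path)


if __name__ == "__main__":
    print(demo())
```

Memory use here grows with the number of distinct keys, not with the number of rows, which is the property that lets an aggregate run on data larger than RAM.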
SGraph: Scalable graph data
Easily and efficiently express entire pipelines
PageRank at 10M edges/s on your desktop
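For readers who have not seen PageRank written out, a minimal sketch of the textbook power iteration in plain Python follows. It is not the SGraph implementation, which the slides do not show. The damping factor of 0.85, the even redistribution of rank from nodes with no out-links, and the toy edge list are assumptions chosen for illustration.

```python
def pagerank(edges, damping=0.85, iterations=10):
    """Power-iteration PageRank over a list of (src, dst) edges."""
    nodes = sorted({n for edge in edges for n in edge})
    out_degree = {n: 0 for n in nodes}
    for src, _ in edges:
        out_degree[src] += 1

    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iterations):
        # Rank held by nodes without out-links is spread evenly over all nodes.
        dangling = sum(rank[n] for n in nodes if out_degree[n] == 0)
        base = (1.0 - damping + damping * dangling) / len(nodes)
        new_rank = {n: base for n in nodes}
        # One pass over the edge list per iteration.
        for src, dst in edges:
            new_rank[dst] += damping * rank[src] / out_degree[src]
        rank = new_rank
    return rank


if __name__ == "__main__":
    # Hypothetical toy graph: a cycle a -> b -> c -> a, plus d pointing at a.
    toy_edges = [("a", "b"), ("b", "c"), ("c", "a"), ("d", "a")]
    for node, score in sorted(pagerank(toy_edges, iterations=100).items()):
        print(node, round(score, 4))
```

Each iteration is a single sequential pass over the edges, which is why throughput for this kind of workload is naturally quoted in edges per second.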
Action 3: data science on a terabyte of data on my laptop
Scalable, robust machine learning
More details in Alice’s talk at 11am
Most ML toolkits don’t focus on the
real challenges
Tools out there | Real needs
Bag of algorithms | Task-oriented, e.g., recommender
Brittle | Robust to data problems
Lots of parameters to tune | Automatic; tuning is a bonus
“State-of-the-art” methods in “research” mode | State-of-the-art accuracy, performance (& methods)
GraphLab Create: Robust ML & graph analytics
state-of-the-art scaling and accuracy
focused on solving tasks, automatically
Scalable machine learning
• Recommend Products
• Target & segment
• Detect Fraud
• Analyze Text
• Regress
• Classify
• …
Recommender: 3.5B ratings in 1 hour on your desktop
ML with GraphLab Create
Tables, graphs, text, images
State-of-the-art accuracy & scaling
Classification, regression, clustering, nearest neighbors,…
Want topic models? Super-fast LDA
Want winning method for 50% of Kaggle competitions? Boosted decision trees
Want hot, hot, hot? Deep learning
Action 4: deep learning
Deploying GraphLab Create
Data pipelines
Predictive services
More details in Rajat’s talk at 3:20pm
Sample Prototype
MESSY NOT MODULAR
FILE PATHS NOT PORTABLE
Reusable components
Runs on Hadoop
CDH5 now; Pivotal, Spark coming…
Runs on Cloud
EC2 now; Azure, Google coming…
Data pipelines & predictive services
GraphLab
Data Pipeline
Beyond batch & stream processing
Predictive applications
require real-time service
Deployed directly from
data pipeline
GraphLab
Predictive Service
Monitor from GraphLab Canvas
Architecture and integration
Same code, many environments
Local
HDFS
S3
SQL/noSQL
GraphDB
GraphLab
Canvas
data pipelines predictive services
<Python>
SGraph
Fastest graph analytics
GraphLab Engine
SFrame
Scales out-of-core
Machine Learning
Robust, scalable, auto-tuning, task-oriented
Graphs, tables, text, images
End-to-end
visualization
monitoring
management
Same code, many environments
Benchmarking
Tables: Recommender, 3.5B ratings in 1 hour on your desktop
Graphs: PageRank at 10M edges/s on your desktop
Text: LDA at 1.4M tokens/s on your desktop
GraphLab Create is fast, really fast
Most importantly, it’s fast
enough for production now!
On graph data
Finding influencers in the
Live-Journal graph
Chart: runtime in seconds, PageRank for 10 iterations
GraphLab Create (1 machine): 135
Mahout (16 machines): 1340
GraphLab on 1 machine is 10x faster than Mahout on 16 machines
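The 10x figure can be checked from the two runtimes on the chart. The short calculation below assumes that 135 seconds is the GraphLab run on 1 machine and 1340 seconds is the Mahout run on 16 machines. That pairing is inferred from the 10x claim, since the extracted chart labels do not state it. The machine-seconds comparison is an added illustration, not a number from the slides.

```python
def speedup(baseline_seconds, candidate_seconds):
    """How many times faster the candidate is than the baseline."""
    return baseline_seconds / candidate_seconds


# Runtimes read off the chart (assumed pairing, see the note above).
mahout_seconds, mahout_machines = 1340, 16
graphlab_seconds, graphlab_machines = 135, 1

# Wall-clock comparison: the basis of the "10x" headline.
wall_clock = speedup(mahout_seconds, graphlab_seconds)

# Machine-seconds comparison: also accounts for the 16x difference in hardware.
machine_seconds = speedup(
    mahout_seconds * mahout_machines,
    graphlab_seconds * graphlab_machines,
)

if __name__ == "__main__":
    print(f"wall-clock speedup: {wall_clock:.1f}x")
    print(f"machine-seconds ratio: {machine_seconds:.0f}x")
```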
On tabular data
Logistic regression benchmark
Orders of magnitude faster
KDD Cup data: predict student performance on math problems based on interactions with tutoring system
8.4M data points, 20M features, 2.4GB compressed
Chart: time in seconds on desktop, scikit-learn vs. GraphLab Create
Scaling up
Recommender
using matrix factorization
Chart: time in seconds on desktop vs. number of ratings
3.5B ratings in ~1 hour
Amazon ratings data: 35M ratings, 6.6M users, 2.5M products
Replicated synthetically with respect to users to evaluate scaling
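As a reminder of what "recommender using matrix factorization" means, here is a minimal sketch in plain Python. It fits one small factor vector per user and per item so that the global mean plus their dot product approximates each known rating. It uses plain stochastic gradient descent on a made-up 4x4 dataset. The solver GraphLab Create actually uses is not described in the slides, and every name and hyperparameter below is an assumption for illustration.

```python
import random


def train_mf(ratings, n_factors=2, epochs=200, lr=0.02, reg=0.02, seed=0):
    """Fit user and item factors with SGD on (user, item, rating) triples."""
    rng = random.Random(seed)
    mean = sum(r for _, _, r in ratings) / len(ratings)
    users = sorted({u for u, _, _ in ratings})
    items = sorted({i for _, i, _ in ratings})
    # Small random starting vectors for every user and item.
    p = {u: [rng.uniform(-0.1, 0.1) for _ in range(n_factors)] for u in users}
    q = {i: [rng.uniform(-0.1, 0.1) for _ in range(n_factors)] for i in items}
    order = list(ratings)
    for _ in range(epochs):
        rng.shuffle(order)
        for u, i, r in order:
            pu, qi = p[u], q[i]
            err = r - (mean + sum(a * b for a, b in zip(pu, qi)))
            for f in range(n_factors):
                pu_f, qi_f = pu[f], qi[f]
                # Gradient step on squared error with L2 regularization.
                pu[f] += lr * (err * qi_f - reg * pu_f)
                qi[f] += lr * (err * pu_f - reg * qi_f)
    return mean, p, q


def predict(model, user, item):
    """Predicted rating: global mean plus the user-item dot product."""
    mean, p, q = model
    return mean + sum(a * b for a, b in zip(p[user], q[item]))


def rmse(model, ratings):
    """Root-mean-square error of the model on the given triples."""
    total = sum((r - predict(model, u, i)) ** 2 for u, i, r in ratings)
    return (total / len(ratings)) ** 0.5


def demo_ratings():
    """Hypothetical data: users a, b like items x, y; users c, d like z, w."""
    likes = {"a": "xy", "b": "xy", "c": "zw", "d": "zw"}
    return [(u, i, 5.0 if i in liked else 1.0)
            for u, liked in likes.items() for i in "xyzw"]


if __name__ == "__main__":
    data = demo_ratings()
    print("RMSE before training:", round(rmse(train_mf(data, epochs=0), data), 3))
    print("RMSE after training: ", round(rmse(train_mf(data), data), 3))
```

Each epoch touches every rating once, so training cost grows roughly linearly with the number of ratings. That is the relationship the scaling chart above plots.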
GraphLab.com
GraphLab Create Roadmap
March 2014 July 21st 2014 October 2014
Scalable data structures
Tables, graphs, text
Robust ML algorithms
GraphLab Canvas
Data pipelines
New ML algorithms
More data types
Predictive services
Monitoring in production
100+ companies participated in beta program
Already used in production
Extremely positive feedback
Every feature since March in response to customer requests
Please keep them coming!
Commitment to open-source
• We have been committed to open-source for 6 years
- PowerGraph, GraphChi,…
- Our focus now is on GraphLab Create
• We are inspired by companies like MongoDB & ElasticSearch
- Open-source core
- Provide value-add tools, such as monitoring & management
Our users can be successful by just using the open-source version
GraphLab Create:
Unleashing data science
from inspiration to production
pip install graphlab-create
jobs@graphlab.com
@graphlabteam
Inspiration Production

GraphLab Conference 2014 Keynote - Carlos Guestrin

10-Hot-Data-Analytics-Tre-8904178.ppsx
10-Hot-Data-Analytics-Tre-8904178.ppsx10-Hot-Data-Analytics-Tre-8904178.ppsx
10-Hot-Data-Analytics-Tre-8904178.ppsxSangeetaTripathi8
 
Big Graph Analytics on Neo4j with Apache Spark
Big Graph Analytics on Neo4j with Apache SparkBig Graph Analytics on Neo4j with Apache Spark
Big Graph Analytics on Neo4j with Apache SparkKenny Bastani
 
Machine Learning - Challenges, Learnings & Opportunities
Machine Learning - Challenges, Learnings & OpportunitiesMachine Learning - Challenges, Learnings & Opportunities
Machine Learning - Challenges, Learnings & OpportunitiesCodePolitan
 
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Shirshanka Das
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Yael Garten
 
Lambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big dataLambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big dataTrieu Nguyen
 
Aginity "Big Data" Research Lab
Aginity "Big Data" Research LabAginity "Big Data" Research Lab
Aginity "Big Data" Research Labkevinflorian
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Sciencesarith divakar
 
How to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdfHow to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdfCareervira
 
The Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the MassesThe Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the MassesAlice Zheng
 
Danny Bickson - Python based predictive analytics with GraphLab Create
Danny Bickson - Python based predictive analytics with GraphLab Create Danny Bickson - Python based predictive analytics with GraphLab Create
Danny Bickson - Python based predictive analytics with GraphLab Create PyData
 
Big Data
Big DataBig Data
Big DataNGDATA
 
IBM Analytics at Scale: Because Business Outcomes Matter
IBM Analytics at Scale: Because Business Outcomes MatterIBM Analytics at Scale: Because Business Outcomes Matter
IBM Analytics at Scale: Because Business Outcomes MatterChristine O'Connor
 
OSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine LearningOSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine LearningPaco Nathan
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...phdAssistance1
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?Samet KILICTAS
 

Similar to GraphLab Conference 2014 Keynote - Carlos Guestrin (20)

10-Hot-Data-Analytics-Tre-8904178.ppsx
10-Hot-Data-Analytics-Tre-8904178.ppsx10-Hot-Data-Analytics-Tre-8904178.ppsx
10-Hot-Data-Analytics-Tre-8904178.ppsx
 
AI meets Big Data
AI meets Big DataAI meets Big Data
AI meets Big Data
 
Big Graph Analytics on Neo4j with Apache Spark
Big Graph Analytics on Neo4j with Apache SparkBig Graph Analytics on Neo4j with Apache Spark
Big Graph Analytics on Neo4j with Apache Spark
 
A Big Data Concept
A Big Data ConceptA Big Data Concept
A Big Data Concept
 
Machine Learning - Challenges, Learnings & Opportunities
Machine Learning - Challenges, Learnings & OpportunitiesMachine Learning - Challenges, Learnings & Opportunities
Machine Learning - Challenges, Learnings & Opportunities
 
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
 
Lambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big dataLambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big data
 
Aginity "Big Data" Research Lab
Aginity "Big Data" Research LabAginity "Big Data" Research Lab
Aginity "Big Data" Research Lab
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
 
How to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdfHow to Become a Big Data Professional.pdf
How to Become a Big Data Professional.pdf
 
The Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the MassesThe Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the Masses
 
Danny Bickson - Python based predictive analytics with GraphLab Create
Danny Bickson - Python based predictive analytics with GraphLab Create Danny Bickson - Python based predictive analytics with GraphLab Create
Danny Bickson - Python based predictive analytics with GraphLab Create
 
Big Data
Big DataBig Data
Big Data
 
IBM Analytics at Scale: Because Business Outcomes Matter
IBM Analytics at Scale: Because Business Outcomes MatterIBM Analytics at Scale: Because Business Outcomes Matter
IBM Analytics at Scale: Because Business Outcomes Matter
 
OSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine LearningOSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine Learning
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
 
BigData
BigDataBigData
BigData
 
Dato Keynote
Dato KeynoteDato Keynote
Dato Keynote
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?
 


GraphLab Conference 2014 Keynote - Carlos Guestrin

  • 1. Unleash Data Science Carlos Guestrin CEO Create™
  • 2. Last year… Machine Learning Today: Introducing GraphLab Create Usability for ML from Inspiration to Production
  • 3. Data-driven, predictive data apps are making our world amazing… Historical data Sensor & interaction data Real-time predictions & decisions Recommenders Industrial Apps Forecasters Social Human Sensing Fraud & Anomaly Detection Sentiment Analysis & other Text Apps Pers. Medicine
  • 4. Action 1: simple predictive app Aren’t you tired of those spam txt messages???
  • 5. Why predictive apps are the future? Spectrum of data products
  • 6. Continuously get to value faster! time-to-line-of-business value
  • 7. Building a predictive app Was using 217 business rules hoping world doesn’t change Have an inspiring idea to reinvent their business Key pains: Hiring Talent Shortfall in data-savvy workers needed to make sense out of big data by 2018 [McKinsey 2011] 35% Noisy Space of Tools Data scientists use a variety of tools, across different programming languages… require a lot of context-switching… affects productivity and impedes reproducibility. Ben Lorica, Data Analysis: Just one component of the Data Science workflow
  • 8. Crossing the Big Data Chasm speed of iteration scale of data Get a Hadoop cluster!?!? single machine memory production data
  • 9. Crossing the Big Data Chasm speed of iteration scale of data single machine memory production data big data chasm
  • 10. Crossing the Big Data Chasm speed of iteration scale of data single machine memory production data Create™ GraphLab Create: Unleashing data science from inspiration to production
  • 11. Data scientist: inspiration to production Analyze big data on one machine graphs, tables, text, images in Python doesn’t have to fit in memory Distribute in production with same code on EC2, Yarn,… Use my laptop Variety of data Not toy data scales Language I love Iterate quickly Prototype Monitor Production data pipeline predictive service GraphLab Canvas: Monitor & visualize from prototype to production GraphLab Create
  • 12. What folks are saying about GraphLab Create “The ease of use and scalable performance, which is not limited by the memory of the machine, are allowing us to innovate and advance at an astonishing pace.” - Andrew Bruce, Senior Director, Data Science, Zillow “Graphlab Create provides us with an end-to-end efficient framework … both tabular and graph data generated by the activity of our users.” - Baldo Faeita, Social Computing Lead, Adobe Systems “…during my time as Zynga's lead architect for big data, I found my way to GraphLab. I was astounded at the dramatic savings, on the order of 500x…” - Mohan Reddy, Chief Architect, The Hive LLC.
  • 13. Action 2: end-to-end predictive app
  • 14. Key use cases • Recommenders, pricing,…Retail • Fraud detection,…Financial • Targeting, sentiment,…Marketing & Advertising • Churn prediction,…Telecom • Communities, friends,…Social network analysis • Med records, drug design,…Medical
  • 15. Detecting fraud in the bitcoin network Poorly-regulated cryptocurrency market w/open transactions Task: Investigate suspicious transactional behavior
  • 16. Check out at graphlab.com…
  • 17. Track stolen bitcoins allinvain thief stole 25,000 BTC ($14M USD) from a user & laundered tainted money theft laundering
  • 18. Productive on one machine Python Tables, graphs, text & images Scale beyond memory Integrated visualization Productive at scale Deploy, monitor, improve on the Cloud/Cluster Integrates with Hadoop/EC2 Deploy data pipelines & prediction services Built for Machine Learning in Production Scalable, robust algorithms End-to-end support Get to value fast Powered by GraphLab Engine Fastest analytics system GraphLab Create
  • 19. Getting started is as easy as… 1. pip install graphlab-create 2. be creative with your data Hands-on training tomorrow Online: Learning ML in Practice with GraphLab Notebook http://graphlab.com/learn ML tutorials, tips, tricks End-to-end GraphLab Notebooks Quick how-tos
  • 20. Getting started is as easy as… 1. pip install graphlab-create 2. be creative with your data Hands-on training tomorrow Online: Learning ML in Practice with GraphLab Notebook http://graphlab.com/learn ML tutorials, tips, tricks End-to-end GraphLab Notebooks Quick how-tos
  • 21. Getting started is as easy as… 1. pip install graphlab-create 2. be creative with your data
  • 22. Deep Dive: key components of GraphLab Create
  • 24. Scalable data structures: The SFrame & The SGraph More details in Yucheng’s talk at 1:40pm
  • 25. Scalable data structures ease those scaling pains. Pain → SFrame/SGraph: Running out of memory → Graceful degradation, optimized out-of-core computation; Integrating data → Unified tables, graphs, text, images; Missing values → Missing value support from get-go; Strong/weak types → Strong types only when strength helps
  • 26. SFrame: Scalable tabular data Never run out of memory Sharded, compressed, out-of-core, columnar Arbitrary lambda transformations, joins,… from Python Group-by aggregate 10M rows/s on your desktop
  • 27. SFrame: Scalable tabular data Never run out of memory Sharded, compressed, out-of-core, columnar Arbitrary lambda transformations, joins,… from Python Group-by aggregate 10M rows/s on your desktop Same Python code
  • 28. SGraph: Scalable graph data Easily and efficiently express entire pipelines PageRank at 10M edges/s on your desktop
  • 29. Action 3: data science on a terabyte of data on my laptop
  • 31. Scalable, robust machine learning More details in Alice’s talk at 11am
  • 32. Most ML toolkits don’t focus on the real challenges. Tools out there → Real needs: Bag of algorithms → Task-oriented, e.g., recommender; Brittle → Robust to data problems; Lots of parameters to tune → Automatic; tuning is a bonus; “State-of-the-art” methods in “research” mode → State-of-the-art accuracy, performance (& methods). GraphLab Create: Robust ML & graph analytics, state-of-the-art scaling and accuracy, focused on solving tasks, automatically
  • 33. Scalable machine learning • Recommend Products • Target & segment • Detect Fraud • Analyze Text • Regress • Classify • … Recommender 3.5B ratings 1 hour on your desktop
  • 34. ML with GraphLab Create Tables, graphs, text, images State-of-the-art accuracy & scaling Classification, regression, clustering, nearest neighbors,… Want topic models? Super-fast LDA Want winning method for 50% of Kaggle competitions? Boosted decision trees Want hot, hot, hot? Deep learning
  • 35. Action 4: deep learning
  • 37. Deploying GraphLab Create Data pipelines Predictive services More details in Rajat’s talk at 3:20pm
  • 38. Sample Prototype MESSY NOT MODULAR FILE PATHS NOT PORTABLE
  • 39. Reusable components Runs on Hadoop CDH5 now; Pivotal, Spark coming… Runs on Cloud EC2 now; Azure, Google coming… Data pipelines & predictive services GraphLab Data Pipeline Beyond batch & stream processing Predictive applications require real-time service Deployed directly from data pipeline GraphLab Predictive Service Monitor from GraphLab Canvas
  • 41. Same code, many environments Local HDFS S3 SQL/noSQL GraphDB GraphLab Canvas data pipelines predictive services <Python> SGraph Fastest graph analytics GraphLab Engine SFrame Scales out-of-core Machine Learning Robust, scalable, auto-tuning, task-oriented Graphs, tables, text, images End-to-end visualization monitoring management Same code, many environments
  • 43. Recommender 3.5B ratings 1 hour on your desktop PageRank at 10M edges/s on your desktop LDA at 1.4M tokens/s on your desktop Tables Graphs Text GraphLab Create is fast, really fast Most importantly, it’s fast enough for production now!
  • 45. Finding influencers in the LiveJournal graph. Runtime (in seconds, PageRank for 10 iterations): GraphLab Create on 1 machine: 135; Mahout on 16 machines: 1340. GraphLab on 1 machine is 10x faster than Mahout on 16 machines
  • 47. Logistic regression benchmark: orders of magnitude faster. KDD Cup data: predict student performance on math problems based on interactions with tutoring system. 8.4M data points, 20M features, 2.4GB compressed. [Bar chart: time in seconds on desktop, Scikit Learn vs. GraphLab Create]
  • 49. Recommender using matrix factorization: 3.5B ratings ~ 1 hour. Amazon ratings data: 35M ratings, 6.6M users, 2.5M products. Replicated synthetically WRT users to evaluate scaling. [Plot: time in seconds on desktop vs. number of ratings]
  • 51. GraphLab Create Roadmap March 2014 July 21st 2014 October 2014 Scalable data structures Tables, graphs, text Robust ML algorithms GraphLab Canvas Data pipelines New ML algorithms More data types Predictive services Monitoring in production 100+ companies participated in beta program Already used in production Extremely positive feedback Every feature since March in response to customer requests Please keep them coming!
  • 52. Commitment to open-source • We have been committed to open-source for 6 years - PowerGraph, GraphChi,… - Our focus now is on GraphLab Create • We are inspired by companies like MongoDB & ElasticSearch - Open-source core - Provide value-add tools, such as monitoring & management Our users can be successful by just using open-source version
  • 53. GraphLab Create: Unleashing data science from inspiration to production
  • 55. pip install graphlab-create jobs@graphlab.com @graphlabteam
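
The SFrame slides (26 and 27) describe group-by aggregation over data that does not have to fit in memory. The sketch below illustrates only the general out-of-core idea in plain Python: rows are streamed once and only per-group running totals are kept. It is not the SFrame implementation and does not use the GraphLab Create API; the function name `streaming_group_mean` and the sample ratings are hypothetical.

```python
import csv
import io
from collections import defaultdict


def streaming_group_mean(rows, key, value):
    """Compute per-group means in one pass, holding only running totals."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    # `rows` can be any iterator of dicts, e.g. csv.DictReader over a large file,
    # so memory use grows with the number of groups, not the number of rows.
    for row in rows:
        totals[row[key]] += float(row[value])
        counts[row[key]] += 1
    return {group: totals[group] / counts[group] for group in totals}


# Hypothetical in-memory sample; a real file would be opened with open(path, newline="").
sample = io.StringIO("user,rating\na,4\nb,2\na,5\nb,3\na,3\n")
means = streaming_group_mean(csv.DictReader(sample), "user", "rating")
print(means)
```

Because the reader yields one row at a time, the same function works unchanged on a file far larger than RAM, which is the property the slides attribute to out-of-core, columnar storage.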

Editor's Notes

  1. Who cares if you are using SGD, ALS, L-BFGS, ...? You want performance, simplicity, accuracy.
  2. These slides are missing the punchline. So what if we can do "PageRank in 22 seconds"? Let's turn this around and say how fast these systems can find Central Users (and put a footnote stating the algorithm). Let's make a case for realtime response speeds.
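
To make the "find Central Users" framing in note 2 concrete, here is a minimal sketch of PageRank run for 10 iterations (the iteration count used in the slide 45 benchmark) on a tiny follower graph. It is a plain-Python illustration, not GraphLab Create's implementation; the damping factor of 0.85, the function name `pagerank`, and the four-user graph are assumptions made for the example.

```python
def pagerank(edges, iterations=10, damping=0.85):
    """Power-iteration PageRank over a list of (source, destination) edges."""
    nodes = sorted({node for edge in edges for node in edge})
    out_degree = {node: 0 for node in nodes}
    for src, _ in edges:
        out_degree[src] += 1

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes with no outgoing edges is spread evenly over all nodes.
        dangling = sum(rank[node] for node in nodes if out_degree[node] == 0)
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {node: base for node in nodes}
        for src, dst in edges:
            new_rank[dst] += damping * rank[src] / out_degree[src]
        rank = new_rank
    return rank


# Hypothetical graph: ann, bob and dan follow cat, and cat follows each of them back.
edges = [
    ("ann", "cat"), ("bob", "cat"), ("dan", "cat"),
    ("cat", "ann"), ("cat", "bob"), ("cat", "dan"),
]
ranks = pagerank(edges)
most_central = max(ranks, key=ranks.get)
print(most_central, ranks)
```

Reporting `most_central` rather than the raw runtime is the reframing the note asks for: the user-facing result is the central user, with the algorithm relegated to a footnote.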