Jianqiang (Jay) Wang
Stitch Fix / Twitter / HP Labs
July 26, 2015
Notes on Machine Learning and
Data-centric Startups
About me
B.S. degree in Management Science; Ph.D. in Statistics;
Data scientist at Stitch Fix (retail recommendation);
Data scientist at Twitter (computational ads algorithms);
HP Labs : Business optimization (pricing & portfolio
management, marketing)
Consulting:
SpotTrender (video-pretesting)
Brilent (data science training, recruiter products)
Data-centric businesses (advertising, retail,...).
Publication
Consulting
Patent
Statistical Demand Modeling @HP
Who do you communicate with for your analytics/ML? How?
Personalized recommendation in retail
Stitch Fix : how it works
Sources of data
sold flag, survey ratings
Unstructured : feedback, request note,
style image
How should humans interact with algorithms to
Recommend clothes
perform analytics
Medical diagnostics
Human-computer interaction
Data-centric startups
Jet.com : "Amazon killer"; subscription-based retail from Marc Lore (Diapers.com), $50/yr, 5-10% lower prices
Thumbtack : service provider referral (how to monetize?)
SpotTrender : pre-test video commercials
Sano : real-time news discovery from social networks (Twitter, Instagram, Weibo, VK, ...)
Common Crawl (non-profit) : open repository of web crawl data, billions of pages each month
ML applications
Search engines
Computational advertising
Recommender systems
Adaptive websites (learn user preferences, personalize webpages)
Medical diagnosis
Human-computer interaction
Computational finance/stock market analysis
Computer vision, object recognition
Speech and handwriting recognition
Machine translation
Fraud detection (internet, credit card)
Game playing
Information retrieval
Natural language processing
Building competitive moat with data
Pete Skomoroch : ex-Principal Data Scientist @LinkedIn
Ads on the Twitter platform
Ads serving pipeline
Dashboard
Automate R/Python jobs, send email with analysis reports
SaaS
Automatic data analysis
Product analytics @Twitter
Opportunity analysis
Diagnostic analysis
A/B testing
Advertiser campaigns
Supply (platform users) vs demand (advertisers)
Creating your own campaign
Tweet engagement
Followers
App install
Website visits
Lead generation
Targeting
Targeting criteria
Keywords (tweet or tweet engagement)
Interests
Followers : (similar) followers of a handle
Tailored audiences
How to match users to targeting criteria
Interest/age prediction: we don’t ask the users to explicitly indicate their
interests/age but infer them from who they follow and what they tweet about.
Algorithm & analytics
Interest (NLP), age (classification)
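The "infer interests from who they follow" idea can be illustrated with a very small sketch. It is not the method used in production: the handle-to-topic table, the handles themselves, and the `min_share` threshold are all hypothetical, and a real system would presumably combine this signal with NLP on tweet text.

```python
from collections import Counter

# Hypothetical lookup from followed handle to a topic label.
HANDLE_TOPICS = {
    "@hoops_daily": "sports",
    "@soccer_wire": "sports",
    "@gadget_blog": "technology",
    "@home_chef": "food",
}

def infer_interests(followed_handles, handle_topics=HANDLE_TOPICS, min_share=0.25):
    """Return topics that make up at least `min_share` of the labeled follows."""
    counts = Counter(
        handle_topics[h] for h in followed_handles if h in handle_topics
    )
    total = sum(counts.values())
    if total == 0:
        return []  # nothing known about any followed handle
    # most_common() orders topics from most to least frequent
    return [topic for topic, n in counts.most_common() if n / total >= min_share]

interests = infer_interests(["@hoops_daily", "@soccer_wire", "@gadget_blog", "@unknown"])
print(interests)
```

Handles missing from the table are simply ignored, so the share is computed over labeled follows only.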
Filtering ad candidates
Campaigns currently active with budget left
Same advertiser/tweet fatigue rules
How many times per week for the same user?
How to make such decisions?
Dismiss/block/spam filters
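The filtering rules above can be sketched as a single pass over the candidate campaigns. The `Campaign` fields, the weekly cap of 3, and the shape of the per-user inputs are assumptions made for illustration; the slide leaves the actual frequency cap as an open question.

```python
from dataclasses import dataclass

MAX_IMPRESSIONS_PER_WEEK = 3  # hypothetical fatigue cap per advertiser and user

@dataclass
class Campaign:
    campaign_id: str
    advertiser_id: str
    active: bool
    budget_left: float

def filter_candidates(campaigns, weekly_impressions, blocked_advertisers,
                      max_per_week=MAX_IMPRESSIONS_PER_WEEK):
    """Keep campaigns that may be shown to one user.

    weekly_impressions: advertiser_id -> times this user saw that advertiser this week
    blocked_advertisers: advertiser_ids this user dismissed, blocked, or flagged as spam
    """
    eligible = []
    for c in campaigns:
        if not c.active or c.budget_left <= 0:
            continue  # campaign must be active with budget left
        if c.advertiser_id in blocked_advertisers:
            continue  # dismiss/block/spam filter
        if weekly_impressions.get(c.advertiser_id, 0) >= max_per_week:
            continue  # same-advertiser fatigue rule
        eligible.append(c)
    return eligible
```

How large the cap should be is the kind of decision A/B testing is suited to answer.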
Click through rate (CTR) prediction
How likely is the user to ...
Click on the url
Expand the image
Download the app
Online machine learning with 10k+ features
User request and candidate features
Request : user geo, user type, login frequency, interest, ...
Ad : advertiser vertical, popularity, tweet content
Model fitting & diagnostics
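One common way to do online learning over many sparse categorical features is logistic regression with feature hashing, updated one impression at a time. The sketch below assumes that approach; the slides do not say which model was used, and the bucket count, learning rate, and feature names are hypothetical.

```python
import math
import zlib

NUM_BUCKETS = 2 ** 18  # hypothetical size of the hashed weight vector

def hash_features(request, ad, num_buckets=NUM_BUCKETS):
    """Map 'namespace.field=value' strings to weight indices (the hashing trick)."""
    indices = []
    for namespace, fields in (("request", request), ("ad", ad)):
        for field, value in fields.items():
            token = f"{namespace}.{field}={value}".encode("utf-8")
            indices.append(zlib.crc32(token) % num_buckets)
    return indices

class OnlineLogisticRegression:
    def __init__(self, num_buckets=NUM_BUCKETS, learning_rate=0.1):
        self.weights = [0.0] * num_buckets
        self.bias = 0.0
        self.learning_rate = learning_rate

    def predict(self, indices):
        """Predicted engagement probability (pCTR) for one impression."""
        score = self.bias + sum(self.weights[i] for i in indices)
        score = max(-30.0, min(30.0, score))  # clamp to keep exp() well-behaved
        return 1.0 / (1.0 + math.exp(-score))

    def update(self, indices, clicked):
        """One stochastic gradient step on the log loss for a single example."""
        error = self.predict(indices) - (1.0 if clicked else 0.0)
        self.bias -= self.learning_rate * error
        for i in indices:
            self.weights[i] -= self.learning_rate * error

model = OnlineLogisticRegression()
x = hash_features({"geo": "US", "user_type": "daily"}, {"vertical": "games"})
model.update(x, clicked=True)  # learn from one observed engagement
```

Each served impression yields one `update` call, so the model never needs the full history in memory.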
Ranking
Second-price auction on Expected Cost per Impression (ECPI)
Advertisers bid for engagement (Bid)
Predicted engagement rate (pCTR)
Naïve ranking function : ECPI = Bid * pCTR
Pricing
Minimum bid required to win auction
Winner has (bidCPE1, pCTR1), runner-up has (bidCPE2, pCTR2)
Winner pays paidCPE = bidCPE2 * pCTR2 / pCTR1
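The ranking and pricing rules above fit in a few lines. This is a minimal sketch of the naïve version only: the `Candidate` class and `run_auction` function are hypothetical names, it assumes at least two candidates with positive pCTR, and it ignores reserve prices.

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    ad_id: str      # hypothetical identifier
    bid_cpe: float  # advertiser's bid per engagement (Bid)
    p_ctr: float    # predicted engagement rate (pCTR), assumed > 0

def run_auction(candidates):
    """Rank by ECPI = Bid * pCTR and charge the minimum bid that still wins."""
    if len(candidates) < 2:
        raise ValueError("this sketch needs a winner and a runner-up")
    ranked = sorted(candidates, key=lambda c: c.bid_cpe * c.p_ctr, reverse=True)
    winner, runner_up = ranked[0], ranked[1]
    # paidCPE = bidCPE2 * pCTR2 / pCTR1
    paid_cpe = runner_up.bid_cpe * runner_up.p_ctr / winner.p_ctr
    return winner, paid_cpe

candidates = [
    Candidate("A", bid_cpe=2.0, p_ctr=0.05),  # ECPI 0.10
    Candidate("B", bid_cpe=3.0, p_ctr=0.02),  # ECPI 0.06
    Candidate("C", bid_cpe=1.0, p_ctr=0.04),  # ECPI 0.04
]
winner, paid_cpe = run_auction(candidates)
print(winner.ad_id, round(paid_cpe, 2))
```

Here A wins despite B's higher bid because A's ECPI is larger, and A pays about 1.2 per engagement: the lowest bid at which A's ECPI would still match B's.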