© 2015 Pivotal Software, Inc. All rights reserved.
Internet of Things:
Implications for the Enterprise
Rashmi Raghu, Ph.D.
Principal Data Scientist
Billions of Data Points
•  Gene Sequencing: cost to sequence one genome has fallen from $100M in 2001 to $10K in 2011 to $1K in 2014
•  Smart Grids: reading smart meters every 15 minutes is 3000x more data intensive
•  Stock Market
•  Social Media: Facebook uploads 250 million photos each day
•  Oil Exploration: oil rigs generate 25,000 data points per second
•  Video Surveillance
•  Medical Imaging
•  Mobile Sensors
Implications for the Enterprise
•  Organizational
–  Vision
–  Preparedness
–  Execution
•  Technical
–  Data quality & completeness
–  Heterogeneity of data sources
–  Technology architecture
Issues in any of these have implications for data science
approaches and their effectiveness
Case Studies
•  Oil Drilling: Predictive Maintenance
•  Telecommunications: Customer Micro-segmentation
Data: The New Oil
•  Oil & gas exploration and production activities generate large amounts of data from sensors
•  What opportunities exist for data-driven approaches to improve operations?
[Image: Drilling into the San Andreas Fault at Parkfield, California. Credit: Stephen H. Hickman, USGS]
* http://blog.pivotal.io/pivotal/case-studies-2/data-as-the-new-oil-producing-value-for-the-oil-gas-industry
Predictive maintenance
•  Predict equipment function and failure
•  Motivation: Failure costs estimated at
$150,000/incident (billions annually)*
•  Goals:
–  Early warning system
–  Insights into prominent features impacting
operation and failure
–  Reduction of non-productive drill time
–  Reduced incidents
Predictive Maintenance for Drilling Operations
Integrating & Cleansing → Feature Building → Modeling
Primary Data Sources
Operator Data (~ thousands of records)
•  Failure details
•  Component details
•  Drill Bit details
Drill Rig Sensor Data (~ billions of records)
•  Rate of Penetration (ROP)
•  RPM
•  Weight on Bit (WOB) …
→ Integrated Data
Primary Data Sources: Challenges
Challenges
•  Failure instances not clearly labeled
•  Labels may be embedded in reports or comments
Implications
•  Dependent variable generation also becomes a
machine learning exercise
•  Accuracy of failure prediction impacted by
accuracy of failure label derivation
Primary Data Sources: Challenges
Well ID   Depth   Comment                            Event flag
1         1000    equipment not responding           1
2         2000    TOOH to bit. rubber pieces seen    1
•  Dependent variable generation – a machine learning exercise
•  Text analytics pipeline needed to convert failure reports or comments to event flags
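As a rough illustration of turning free-text comments into event flags, here is a minimal sketch that uses a rule-based keyword matcher as a stand-in for the learned text analytics pipeline the slide calls for. The pattern list, the `event_flag` function name, and the third record are assumptions made for illustration; only the first two comments come from the table above.

```python
import re

# Hypothetical failure cues. A real list would come from domain experts
# or be replaced by a trained text classifier.
FAILURE_PATTERNS = [
    r"\bnot responding\b",
    r"\brubber pieces\b",
    r"\bfail(ed|ure)?\b",
    r"\btwist(ed)? off\b",
]
_FAILURE_RE = re.compile("|".join(FAILURE_PATTERNS), re.IGNORECASE)


def event_flag(comment):
    """Return 1 if the comment matches any failure cue, else 0."""
    return 1 if _FAILURE_RE.search(comment or "") else 0


records = [
    {"well_id": 1, "depth": 1000, "comment": "equipment not responding"},
    {"well_id": 2, "depth": 2000, "comment": "TOOH to bit. rubber pieces seen"},
    # Hypothetical non-failure comment, added only to show a 0 flag.
    {"well_id": 3, "depth": 3000, "comment": "drilling ahead, no issues"},
]

flags = [event_flag(r["comment"]) for r in records]
```

Rules like these can serve as weak labels to bootstrap a classifier; any error in them carries through to the failure model, which is the accuracy concern raised on the previous slide.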
Complex Feature Set Across Data Sources
•  A failure occurred at the
end of this run
•  Taking a window of time
prior to failure, what
features could we extract
(e.g. variance of RPM,
max bit position velocity)?
[Figure: time-series panels for bit position, RPM, ROP, and WOB over a drilling run]
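The windowing idea above can be sketched in a few lines of plain Python. This is only an illustration: the record layout (`t`, `rpm`, `bit_position`), the function names, and the toy readings are assumptions, and the velocity is a simple finite difference of bit position.

```python
from statistics import pvariance


def window_before(samples, failure_time, window_seconds):
    """Keep samples with failure_time - window_seconds <= t < failure_time."""
    start = failure_time - window_seconds
    return [s for s in samples if start <= s["t"] < failure_time]


def extract_features(window):
    """Compute two example features from a time-ordered window of samples."""
    if len(window) < 2:
        raise ValueError("need at least two samples to compute a velocity")
    rpm = [s["rpm"] for s in window]
    # Bit position velocity as the finite difference between consecutive samples.
    velocities = [
        (b["bit_position"] - a["bit_position"]) / (b["t"] - a["t"])
        for a, b in zip(window, window[1:])
    ]
    return {
        "rpm_variance": pvariance(rpm),
        "max_bit_velocity": max(velocities),
    }


# Toy readings, one per second (hypothetical values).
rpm_values = [100.0, 100.0, 102.0, 98.0, 104.0, 96.0]
positions = [0.0, 1.0, 2.0, 4.0, 5.0, 7.0]
samples = [
    {"t": t, "rpm": r, "bit_position": p}
    for t, (r, p) in enumerate(zip(rpm_values, positions))
]

# Failure assumed at t = 6; look at the 4 seconds before it.
features = extract_features(window_before(samples, failure_time=6, window_seconds=4))
```

The window length is itself a modeling choice: too short and the precursor signal may be missed, too long and it is diluted by normal operation.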
Complex Feature Set Across Data Sources
Drill Rig Sensor data
•  Depth
•  Rate of Penetration
•  Torque
•  Weight on Bit
•  RPM
•  …
Operator data
•  Drill Bit details
•  Component details etc.
•  Failure events
•  …
Features on Time Windows
•  Mean
•  Median
•  Standard Deviation
•  Range
•  Skewness
•  …
→ Final Set of Features on Time Windows
•  Leverage GPDB / HAWQ (+ MADlib, PL/X) for fast computation of hundreds of features over time windows within billions of rows (or more) of time-series data
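To make the per-window statistics concrete, here is a small single-machine sketch in plain Python. It is not the GPDB / HAWQ / MADlib implementation the slide refers to; the function names, the non-overlapping window scheme, and the toy RPM values are assumptions for illustration.

```python
from statistics import mean, median, pstdev


def skewness(values):
    """Population skewness: the mean of cubed standardized deviations."""
    m = mean(values)
    s = pstdev(values)
    if s == 0:
        return 0.0
    return sum(((v - m) / s) ** 3 for v in values) / len(values)


def window_features(values):
    """Summary statistics for one time window of a single sensor channel."""
    return {
        "mean": mean(values),
        "median": median(values),
        "std": pstdev(values),
        "range": max(values) - min(values),
        "skewness": skewness(values),
    }


def tumbling_windows(series, size):
    """Split a series into consecutive non-overlapping windows of `size` samples."""
    return [series[i:i + size] for i in range(0, len(series) - size + 1, size)]


# Toy RPM channel (hypothetical values), summarized over windows of 4 samples.
rpm = [1.0, 2.0, 3.0, 4.0, 2.0, 2.0, 2.0, 10.0]
features = [window_features(w) for w in tumbling_windows(rpm, 4)]
```

In a database setting the same aggregates would typically be expressed as grouped or windowed SQL over the sensor table, so that the computation runs where the billions of rows live.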
Predictive Maintenance App Pipeline
•  Ingest into the Data Lake
•  Initial data cleansing filters
•  Models
–  Elastic Net Regression
–  Cox Proportional Hazards Regression
–  Decision Trees
•  Wells with failure scores and early warning indicators
•  Business Levers
–  Early Warning System
–  Rig Operator Dashboard
•  Feedback loop for continuous model improvement
•  Domain Knowledge (Oil Rig Operator)
•  Technology: HAWQ, GPDB, PL/X, MADlib, R, Python, C, Java, Perl, Spark + MLlib
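Of the models listed, elastic net regression is commonly described as least squares with a blend of L1 and L2 penalties. The sketch below evaluates that objective in plain Python, assuming the widely used parameterization with `lam` as the overall penalty strength and `alpha` as the L1/L2 mix. The function name and toy data are illustrative, and this does not reproduce the API of MADlib or any other tool named in the deck.

```python
def elastic_net_objective(X, y, beta, lam, alpha):
    """Squared-error loss plus a mixed L1/L2 penalty on the coefficients.

    objective = (1 / 2n) * sum((y_i - x_i . beta)^2)
                + lam * (alpha * ||beta||_1 + (1 - alpha) / 2 * ||beta||_2^2)
    """
    n = len(y)
    residuals = [
        yi - sum(b * x for b, x in zip(beta, row))
        for row, yi in zip(X, y)
    ]
    loss = sum(r * r for r in residuals) / (2 * n)
    l1 = sum(abs(b) for b in beta)
    l2 = sum(b * b for b in beta)
    return loss + lam * (alpha * l1 + 0.5 * (1 - alpha) * l2)


# Toy design matrix and target (hypothetical values).
X = [[1.0, 0.0], [0.0, 1.0]]
y = [1.0, 2.0]

perfect_fit = elastic_net_objective(X, y, beta=[1.0, 2.0], lam=0.1, alpha=0.5)
null_model = elastic_net_objective(X, y, beta=[0.0, 0.0], lam=0.1, alpha=0.5)
```

The L1 part tends to zero out weak features, which fits the stated goal of surfacing the prominent features that affect operation and failure.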
Case Studies
•  Oil Drilling: Predictive Maintenance
•  Telecommunications: Customer Micro-segmentation
State of Data at Telco Company
Customer Segments
•  Multi-Gadget Families
•  Affluent Matures
•  Thrifty Families
•  High Tech Singles
•  Budget Singles
•  Seniors
New Data Sources
•  Internet Deep Packet Inspection
•  TV Consumption (Linear)
•  Video On Demand Consumption
Understanding Subscriber Behavior
•  Native Services (Video On Demand, TV, Internet): What is the level of engagement with client’s products (TV, VOD, Internet)?
•  Internet Devices: What are the patterns of device usage behavior?
•  OTT (Over The Top) Services: What is the level of OTT engagement, by segment, and by bandwidth?
Newly Identified Behavior-Based Segments
Subscribers
•  Moderates
•  OTT & Data Heavyweights
•  Portable OTT Entertainment Seekers
–  iPhone Heavy
–  Android Heavy
–  iPad Heavy
•  In-Home OTT Entertainment Seekers
•  In-Home Native Content Seekers
–  VOD Heavy
–  TV Heavy
Cross Behavior-based and Existing Segments
New Behavior-Based Segments
•  Moderates
•  OTT & Data Heavyweights
•  In-Home OTT Entertainment Seekers
•  Portable OTT Entertainment Seekers - iPhone Heavy
•  Portable OTT Entertainment Seekers - Android Heavy
•  Portable OTT Entertainment Seekers - iPad Heavy
•  In-Home Native Content Seekers - VOD Heavy
•  In-Home Native Content Seekers - TV Heavy
Existing Segments
•  Multi-Gadget Families
•  Affluent Matures
•  Thrifty Families
•  Budget Singles
•  High Tech Singles
•  Seniors
New Behavior-Based Segments × Existing Segments = Customized Micro-Segments!
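Crossing the two segmentations amounts to a Cartesian product. A minimal sketch using the segment names from this slide; the `"existing / behavior"` label format and the variable names are assumptions for illustration.

```python
from itertools import product

BEHAVIOR_SEGMENTS = [
    "Moderates",
    "OTT & Data Heavyweights",
    "In-Home OTT Entertainment Seekers",
    "Portable OTT Entertainment Seekers - iPhone Heavy",
    "Portable OTT Entertainment Seekers - Android Heavy",
    "Portable OTT Entertainment Seekers - iPad Heavy",
    "In-Home Native Content Seekers - VOD Heavy",
    "In-Home Native Content Seekers - TV Heavy",
]

EXISTING_SEGMENTS = [
    "Multi-Gadget Families",
    "Affluent Matures",
    "Thrifty Families",
    "Budget Singles",
    "High Tech Singles",
    "Seniors",
]

# Every pairing of an existing segment with a behavior-based segment.
micro_segments = [
    f"{existing} / {behavior}"
    for existing, behavior in product(EXISTING_SEGMENTS, BEHAVIOR_SEGMENTS)
]
```

With 6 existing and 8 behavior-based segments this yields 48 candidate micro-segments; in practice some cells may hold too few subscribers to act on.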
Heterogeneous Data Sources
•  Prevalence of new data sources was limited but increasing
–  Rich usage data available on a subset of the subscribers
–  Leads to limited applicability of micro-segments
•  Lack of data may be alleviated by expanding data science efforts
–  Leverage micro-segmentation model to score a different subset of subscribers (who we have limited data on)
New Data Sources
•  Internet Deep Packet Inspection
•  TV Consumption (Linear)
•  Video On Demand Consumption
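One simple way to picture scoring subscribers with limited data is nearest-centroid assignment restricted to the features that are actually available. This is a hypothetical stand-in, not the model used in the case study: the centroid values, feature names, and `score_subscriber` function are all invented for illustration.

```python
import math

# Hypothetical segment centroids, assumed to have been learned from the
# subscribers who do have rich usage data.
CENTROIDS = {
    "TV Heavy": {"tv_hours": 6.0, "vod_hours": 0.5, "ott_gb": 1.0},
    "VOD Heavy": {"tv_hours": 2.0, "vod_hours": 4.0, "ott_gb": 2.0},
    "OTT & Data Heavyweights": {"tv_hours": 1.0, "vod_hours": 0.5, "ott_gb": 50.0},
}


def score_subscriber(subscriber, centroids=CENTROIDS):
    """Return the nearest segment, using only features the subscriber has.

    Distance is the root mean squared difference over shared features,
    so subscribers with fewer features are not penalized for missing ones.
    """
    best_name, best_distance = None, math.inf
    for name, centroid in centroids.items():
        shared = [k for k in centroid if k in subscriber]
        if not shared:
            continue
        distance = math.sqrt(
            sum((subscriber[k] - centroid[k]) ** 2 for k in shared) / len(shared)
        )
        if distance < best_distance:
            best_name, best_distance = name, distance
    return best_name


# A subscriber with only linear TV data, and one with partial OTT data.
tv_only = score_subscriber({"tv_hours": 5.5})
partial_ott = score_subscriber({"tv_hours": 1.0, "ott_gb": 40.0})
```

Assignments made from fewer features are less certain, which is one reason the slide frames limited data as limiting the applicability of the micro-segments.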
Driving New Business Value
•  Upsell and Cross-Sell
•  New Product Offerings
•  Data Monetization
Implications for the Enterprise
•  Organizational
–  Vision
–  Preparedness
–  Execution
•  Technical / Data
–  Data quality & completeness
–  Heterogeneity of data sources
–  Technology architecture
•  Data quality & completeness:
–  Data capture mechanisms can have a lasting impact on the ability to solve a business problem
•  Heterogeneity of data sources:
–  Existence of legacy systems & devices may limit the applicability of new models unless that is taken into account ahead of time
–  Feedback to spur upgrading of equipment wherever possible
Implications for the Enterprise
•  Creating value from IoT requires organizational and technical alignment
•  Impacts of these considerations on data science efforts and outcomes are non-trivial
•  Specific impacts of data issues include:
–  Longer time to realization of value
–  Model accuracy issues
–  Limited applicability of results
–  And more …
For further information, check out …
•  Pivotal Blog @ http://blog.pivotal.io
•  Pivotal Data Science Blog @ http://blog.pivotal.io/data-science-pivotal
•  Pivotal Data Product Info, Docs and Downloads @ http://pivotal.io/big-data
•  Oil & Gas Use Case Webinar:
–  Video: https://www.youtube.com/watch?v=dhT-tjHCr9E
–  Slides: http://www.slideshare.net/Pivotal/data-as-thenewoil
•  Blogs:
–  Oil & Gas Use Case: http://blog.pivotal.io/pivotal/case-studies-2/data-as-the-new-oil-producing-value-for-the-oil-gas-industry
–  Time Series Analysis: http://blog.pivotal.io/tag/time-series-analysis
Data Science Case Studies: The Internet of Things: Implications for the Enterprise

More Related Content

What's hot

What's hot (20)

Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Business Analytics Overview
Business Analytics OverviewBusiness Analytics Overview
Business Analytics Overview
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
 
Data science
Data scienceData science
Data science
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An Overview
 
Predictive analytics
Predictive analytics Predictive analytics
Predictive analytics
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Data mining
Data miningData mining
Data mining
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Applications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesApplications of Big Data Analytics in Businesses
Applications of Big Data Analytics in Businesses
 
Introduction To Analytics
Introduction To AnalyticsIntroduction To Analytics
Introduction To Analytics
 
data mining
data mining data mining
data mining
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
 
Data analytics
Data analyticsData analytics
Data analytics
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptx
 
Data science
Data scienceData science
Data science
 
Big Data Analytics for Banking, a Point of View
Big Data Analytics for Banking, a Point of ViewBig Data Analytics for Banking, a Point of View
Big Data Analytics for Banking, a Point of View
 
Big Data ppt
Big Data pptBig Data ppt
Big Data ppt
 

Viewers also liked

Duties & responsibility
Duties & responsibilityDuties & responsibility
Duties & responsibility
Zaw Min
 

Viewers also liked (18)

Data as the New Oil: Producing Value in the Oil and Gas Industry
 Data as the New Oil: Producing Value in the Oil and Gas Industry Data as the New Oil: Producing Value in the Oil and Gas Industry
Data as the New Oil: Producing Value in the Oil and Gas Industry
 
Pipeline analytics concept for posting
Pipeline analytics concept for postingPipeline analytics concept for posting
Pipeline analytics concept for posting
 
Personal Healthcare IOT on PCF using Spring
Personal Healthcare IOT on PCF using SpringPersonal Healthcare IOT on PCF using Spring
Personal Healthcare IOT on PCF using Spring
 
Internet Of Things: How Data Science Driven Software is Eating the Connected ...
Internet Of Things: How Data Science Driven Software is Eating the Connected ...Internet Of Things: How Data Science Driven Software is Eating the Connected ...
Internet Of Things: How Data Science Driven Software is Eating the Connected ...
 
Data Science At Scale for IoT on the Pivotal Platform
Data Science At Scale for IoT on the Pivotal PlatformData Science At Scale for IoT on the Pivotal Platform
Data Science At Scale for IoT on the Pivotal Platform
 
SALESmanago - Internet of Things
SALESmanago - Internet of ThingsSALESmanago - Internet of Things
SALESmanago - Internet of Things
 
Dr. Denner opening keynote at Bosch Connected World
Dr. Denner opening keynote at Bosch Connected World Dr. Denner opening keynote at Bosch Connected World
Dr. Denner opening keynote at Bosch Connected World
 
Pivotal Big Data Roadshow
Pivotal Big Data Roadshow Pivotal Big Data Roadshow
Pivotal Big Data Roadshow
 
Duties & responsibility
Duties & responsibilityDuties & responsibility
Duties & responsibility
 
Global Oil and Gas Pipeline Leak Detection Market Forecast and Opportunities,...
Global Oil and Gas Pipeline Leak Detection Market Forecast and Opportunities,...Global Oil and Gas Pipeline Leak Detection Market Forecast and Opportunities,...
Global Oil and Gas Pipeline Leak Detection Market Forecast and Opportunities,...
 
Predictive Maintenance for Oil and Gas
Predictive Maintenance for Oil and Gas Predictive Maintenance for Oil and Gas
Predictive Maintenance for Oil and Gas
 
Oil and gas big data analytics data Visualization
Oil and gas big data analytics data VisualizationOil and gas big data analytics data Visualization
Oil and gas big data analytics data Visualization
 
Business Impact From IoT? Just Add Data Science
Business Impact From IoT? Just Add Data ScienceBusiness Impact From IoT? Just Add Data Science
Business Impact From IoT? Just Add Data Science
 
Managing Downhole Failures in a Rod Pumped Well
Managing Downhole Failures in a Rod Pumped Well Managing Downhole Failures in a Rod Pumped Well
Managing Downhole Failures in a Rod Pumped Well
 
Big Data in Oil and Gas
Big Data in Oil and GasBig Data in Oil and Gas
Big Data in Oil and Gas
 
“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...
“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...
“The Digital Oilfield” : Using IoT to reduce costs in an era of decreasing oi...
 
Predictive Analytics: Extending asset management framework for multi-industry...
Predictive Analytics: Extending asset management framework for multi-industry...Predictive Analytics: Extending asset management framework for multi-industry...
Predictive Analytics: Extending asset management framework for multi-industry...
 
Big Data Analytics in Energy & Utilities
Big Data Analytics in Energy & UtilitiesBig Data Analytics in Energy & Utilities
Big Data Analytics in Energy & Utilities
 

Similar to Data Science Case Studies: The Internet of Things: Implications for the Enterprise

Predictive Analytics and the Industrial Internet of Manufacturing Things with...
Predictive Analytics and the Industrial Internet of Manufacturing Things with...Predictive Analytics and the Industrial Internet of Manufacturing Things with...
Predictive Analytics and the Industrial Internet of Manufacturing Things with...
gogo6
 
Framework and Product Comparison for Big Data Log Analytics and ITOA
Framework and Product Comparison for Big Data Log Analytics and ITOA Framework and Product Comparison for Big Data Log Analytics and ITOA
Framework and Product Comparison for Big Data Log Analytics and ITOA
Kai Wähner
 

Similar to Data Science Case Studies: The Internet of Things: Implications for the Enterprise (20)

Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time Analytics
 
IoT Cloud Service & Partner IoT Solution
IoT Cloud Service & Partner IoT Solution IoT Cloud Service & Partner IoT Solution
IoT Cloud Service & Partner IoT Solution
 
There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?There are 250 Database products, are you running the right one?
There are 250 Database products, are you running the right one?
 
Virtualization to Improve Speed and Increase Quality
Virtualization to Improve Speed and Increase QualityVirtualization to Improve Speed and Increase Quality
Virtualization to Improve Speed and Increase Quality
 
Going Beyond the Device Heart Beat
Going Beyond the Device Heart BeatGoing Beyond the Device Heart Beat
Going Beyond the Device Heart Beat
 
You Sold Your First 1,000 Devices? Now What?
You Sold Your First 1,000 Devices? Now What?You Sold Your First 1,000 Devices? Now What?
You Sold Your First 1,000 Devices? Now What?
 
Pivotal Big Data Suite: A Technical Overview
Pivotal Big Data Suite: A Technical OverviewPivotal Big Data Suite: A Technical Overview
Pivotal Big Data Suite: A Technical Overview
 
Predictive Analytics and the Industrial Internet of Manufacturing Things with...
Predictive Analytics and the Industrial Internet of Manufacturing Things with...Predictive Analytics and the Industrial Internet of Manufacturing Things with...
Predictive Analytics and the Industrial Internet of Manufacturing Things with...
 
Data Day - Escuchando la red
Data Day - Escuchando la redData Day - Escuchando la red
Data Day - Escuchando la red
 
Streaming Analytics - Comparison of Open Source Frameworks and Products
Streaming Analytics - Comparison of Open Source Frameworks and ProductsStreaming Analytics - Comparison of Open Source Frameworks and Products
Streaming Analytics - Comparison of Open Source Frameworks and Products
 
Best Practices for Managing IaaS, PaaS, and Container-Based Deployments - App...
Best Practices for Managing IaaS, PaaS, and Container-Based Deployments - App...Best Practices for Managing IaaS, PaaS, and Container-Based Deployments - App...
Best Practices for Managing IaaS, PaaS, and Container-Based Deployments - App...
 
Validation
ValidationValidation
Validation
 
Splunk for ITOA Breakout Session
Splunk for ITOA Breakout SessionSplunk for ITOA Breakout Session
Splunk for ITOA Breakout Session
 
Sensor Data Management & Analytics: Advanced Process Control
Sensor Data Management & Analytics: Advanced Process ControlSensor Data Management & Analytics: Advanced Process Control
Sensor Data Management & Analytics: Advanced Process Control
 
Competing with Software: It Takes a Platform -- Devops @ EMC World
Competing with Software: It Takes a Platform -- Devops @ EMC WorldCompeting with Software: It Takes a Platform -- Devops @ EMC World
Competing with Software: It Takes a Platform -- Devops @ EMC World
 
Framework and Product Comparison for Big Data Log Analytics and ITOA
Framework and Product Comparison for Big Data Log Analytics and ITOA Framework and Product Comparison for Big Data Log Analytics and ITOA
Framework and Product Comparison for Big Data Log Analytics and ITOA
 
Hey IT, Meet OT with Hima Mukkamala
Hey IT, Meet OT with Hima MukkamalaHey IT, Meet OT with Hima Mukkamala
Hey IT, Meet OT with Hima Mukkamala
 
Steps to Scale Internet of Things (IoT)
Steps to Scale Internet of Things (IoT)Steps to Scale Internet of Things (IoT)
Steps to Scale Internet of Things (IoT)
 
Enabling the-Connected-Car-Java
Enabling the-Connected-Car-JavaEnabling the-Connected-Car-Java
Enabling the-Connected-Car-Java
 
Give ‘Em What They Want! Self-Service Middleware Monitoring in a Shared Servi...
Give ‘Em What They Want! Self-Service Middleware Monitoring in a Shared Servi...Give ‘Em What They Want! Self-Service Middleware Monitoring in a Shared Servi...
Give ‘Em What They Want! Self-Service Middleware Monitoring in a Shared Servi...
 

More from VMware Tanzu

More from VMware Tanzu (20)

What AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItWhat AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About It
 
Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023
 
Enhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleEnhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at Scale
 
Spring Update | July 2023
Spring Update | July 2023Spring Update | July 2023
Spring Update | July 2023
 
Platforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductPlatforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a Product
 
Building Cloud Ready Apps
Building Cloud Ready AppsBuilding Cloud Ready Apps
Building Cloud Ready Apps
 
Spring Boot 3 And Beyond
Spring Boot 3 And BeyondSpring Boot 3 And Beyond
Spring Boot 3 And Beyond
 
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfSpring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
 
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
 
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
 
tanzu_developer_connect.pptx
tanzu_developer_connect.pptxtanzu_developer_connect.pptx
tanzu_developer_connect.pptx
 
Tanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchTanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - French
 
Tanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishTanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - English
 
Virtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVirtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - English
 
Tanzu Developer Connect - French
Tanzu Developer Connect - FrenchTanzu Developer Connect - French
Tanzu Developer Connect - French
 
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
 
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootSpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
 
SpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerSpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software Engineer
 
SpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeSpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs Practice
 
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsSpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
 

Recently uploaded

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Recently uploaded (20)

Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 

Data Science Case Studies: The Internet of Things: Implications for the Enterprise

  • 1.
  • 2. Internet of Things: Implications for the Enterprise
      – Rashmi Raghu, Ph.D., Principal Data Scientist
      – © 2015 Pivotal Software, Inc. All rights reserved.
  • 3. Billions of Data Points
      – Gene Sequencing: cost to sequence one genome has fallen from $100M in 2001 to $10K in 2011 to $1K in 2014
      – Smart Grids: reading smart meters every 15 minutes is 3000x more data intensive
      – Social Media: Facebook uploads 250 million photos each day
      – Oil Exploration: oil rigs generate 25,000 data points per second
      – Other sources shown: Stock Market, Video Surveillance, Medical Imaging, Mobile Sensors
  • 4. Implications for the Enterprise
      – Organizational: Vision; Preparedness; Execution
      – Technical: Data quality & completeness; Heterogeneity of data sources; Technology architecture
  • 5. Implications for the Enterprise
      – Organizational: Vision; Preparedness; Execution
      – Technical: Data quality & completeness; Heterogeneity of data sources; Technology architecture
      – Issues in any of these have implications for data science approaches and their effectiveness
  • 6. Case Studies
      – Oil Drilling: Predictive Maintenance
      – Telecommunications: Customer Micro-segmentation
  • 7. Case Studies
      – Oil Drilling: Predictive Maintenance
      – Telecommunications: Customer Micro-segmentation
  • 8. Data: The New Oil
      – Oil & gas exploration and production activities generate large amounts of data from sensors
      – What opportunities exist for data-driven approaches to improve operations?
      – Image: Drilling into the San Andreas Fault at Parkfield, California. Credit: Stephen H. Hickman, USGS
      – * http://blog.pivotal.io/pivotal/case-studies-2/data-as-the-new-oil-producing-value-for-the-oil-gas-industry
  • 9. Data: The New Oil
      – Oil & gas exploration and production activities generate large amounts of data from sensors
      – What opportunities exist for data-driven approaches to improve operations?
      – Image: Drilling into the San Andreas Fault at Parkfield, California. Credit: Stephen H. Hickman, USGS
      – Predictive maintenance
          – Predict equipment function and failure
          – Motivation: Failure costs estimated at $150,000/incident (billions annually)*
          – Goals: Early warning system; Insights into prominent features impacting operation and failure; Reduction of non-productive drill time; Reduced incidents
      – * http://blog.pivotal.io/pivotal/case-studies-2/data-as-the-new-oil-producing-value-for-the-oil-gas-industry
  • 10. Predictive Maintenance for Drilling Operations
      – Stages: Integrating & Cleansing; Feature Building; Modeling
  • 11. Primary Data Sources
      – Operator Data (~ thousands of records): Failure details; Component details; Drill Bit details
      – Drill Rig Sensor Data (~ billions of records): Rate of Penetration (ROP); RPM; Weight on Bit (WOB); …
      – Output: Integrated Data
  • 12. Primary Data Sources: Challenges
      – Operator Data (~ thousands of records): Failure details; Component details; Drill Bit details
      – Drill Rig Sensor Data (~ billions of records): Rate of Penetration (ROP); RPM; Weight on Bit (WOB); …
      – Challenges
          – Failure instances not clearly labeled
          – Labels may be embedded in reports or comments
      – Implications
          – Dependent variable generation also becomes a machine learning exercise
          – Accuracy of failure prediction impacted by accuracy of failure label derivation
  • 13. Primary Data Sources: Challenges
      Well ID | Depth | Comment                         | Event flag
      1       | 1000  | equipment not responding        | 1
      2       | 2000  | TOOH to bit. rubber pieces seen | 1
      – Dependent variable generation: a machine learning exercise
      – Text analytics pipeline needed to convert failure reports or comments to event flags
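The slide does not say how the text analytics pipeline was built, so the following is only a minimal sketch of the idea of turning free-text comments into event flags. It swaps in plain keyword matching for whatever model was actually used. The pattern list, the function name `event_flag`, and the third record are assumptions made for illustration; the first two comments come from the slide's table.

```python
import re

# Hypothetical keyword patterns. A real pipeline would be derived with
# domain experts or learned from labeled reports.
FAILURE_PATTERNS = [
    r"not responding",
    r"rubber pieces",
    r"\bfail(ed|ure|s)?\b",
]


def event_flag(comment: str) -> int:
    """Return 1 if the comment matches any failure pattern, else 0."""
    text = comment.lower()
    return int(any(re.search(pattern, text) for pattern in FAILURE_PATTERNS))


records = [
    {"well_id": 1, "depth": 1000, "comment": "equipment not responding"},
    {"well_id": 2, "depth": 2000, "comment": "TOOH to bit. rubber pieces seen"},
    # Invented row, added only to show a comment that is not flagged.
    {"well_id": 3, "depth": 2500, "comment": "drilling ahead, no issues"},
]

flags = [event_flag(record["comment"]) for record in records]
```

Because these flags become the dependent variable, any error in the rules carries straight into the failure model, which is the implication the previous slide raises.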
  • 14. Complex Feature Set Across Data Sources
      – Plot: time series of Bit position, RPM, ROP and WOB for one run
      – A failure occurred at the end of this run
      – Taking a window of time prior to failure, what features could we extract (e.g. variance of RPM, max bit position velocity)?
  • 15. Complex Feature Set Across Data Sources
      – Drill Rig Sensor data: Depth; Rate of Penetration; Torque; Weight on Bit; RPM; …
      – Operator data: Drill Bit details; Component details etc.; Failure events; …
      – Features on Time Windows: Mean; Median; Standard Deviation; Range; Skewness; …
      – Final Set of Features on Time Windows
      – Leverage GPDB / HAWQ (+ MADlib, PL/X) for fast computation of hundreds of features over time windows within billions of rows (or more) of time-series data
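The time-window features listed on this slide can be sketched for a single sensor as follows. The slides compute them in GPDB / HAWQ with MADlib over billions of rows; this plain-Python version only illustrates the arithmetic. The function names, the window convention (a half-open interval ending at the event time), and the RPM readings are all assumptions invented for the example.

```python
import statistics


def window_features(values):
    """Summary statistics for one sensor over one time window."""
    n = len(values)
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)  # population standard deviation
    # Population skewness; taken as 0.0 when the window is constant.
    if std == 0:
        skew = 0.0
    else:
        skew = sum((v - mean) ** 3 for v in values) / (n * std ** 3)
    return {
        "mean": mean,
        "median": statistics.median(values),
        "std": std,
        "range": max(values) - min(values),
        "skewness": skew,
    }


def features_before(readings, event_time, window):
    """Features over the `window` time units that precede `event_time`.

    `readings` is a list of (timestamp, value) pairs for one sensor.
    Returns None when no reading falls inside the window.
    """
    in_window = [v for t, v in readings if event_time - window <= t < event_time]
    return window_features(in_window) if in_window else None


# Invented RPM readings as (seconds, value); a failure is assumed at t = 6.
rpm = [(0, 118.0), (1, 120.0), (2, 122.0), (3, 120.0), (4, 150.0), (5, 90.0)]
feats = features_before(rpm, event_time=6, window=4)
```

Repeating this per sensor and per window length is what produces the "hundreds of features" the slide mentions, which is why the computation is pushed into the database in the original work.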
  • 16. Predictive Maintenance App Pipeline
      – Components: Data Lake; Ingest; Initial data cleansing filters; Models; Business Levers; Early Warning System; Rig Operator Dashboard
      – Models: Elastic Net Regression; Cox Proportional Hazards Regression; Decision Trees
      – Output: Wells with failure scores and early warning indicators
      – Feedback loop for continuous model improvement
      – Domain Knowledge; Oil Rig Operator
      – Technologies: HAWQ; GPDB; PL/X; MADlib; R; Python; C; Java; Perl; Spark + MLlib
  • 17. Case Studies
      – Oil Drilling: Predictive Maintenance
      – Telecommunications: Customer Micro-segmentation
  • 18. State of Data at Telco Company
      – Customer Segments: Multi-Gadget Families; Affluent Matures; Thrifty Families; High Tech Singles; Budget Singles; Seniors
      – New Data Sources: Internet Deep Packet Inspection; TV Consumption (Linear); Video On Demand Consumption
  • 19. Understanding Subscriber Behavior
      – Native Services (TV, Video On Demand, Internet): What is the level of engagement with client’s products (TV, VOD, Internet)?
      – Internet Devices: What are the patterns of device usage behavior?
      – OTT (Over The Top) Services: What is the level of OTT engagement, by segment, and by bandwidth?
  • 20. Newly Identified Behavior-Based Segments
      – Subscribers
          – Moderates
          – OTT & Data Heavyweights
          – Portable OTT Entertainment Seekers: iPhone Heavy; Android Heavy; iPad Heavy
          – In-Home OTT Entertainment Seekers
          – In-Home Native Content Seekers: VOD Heavy; TV Heavy
  • 21. Cross Behavior-based and Existing Segments
      – New Behavior-Based Segments: Moderates; OTT & Data Heavyweights; In-Home OTT Entertainment Seekers; Portable OTT Entertainment Seekers - iPhone Heavy; Portable OTT Entertainment Seekers - Android Heavy; Portable OTT Entertainment Seekers - iPad Heavy; In-Home Native Content Seekers - VOD Heavy; In-Home Native Content Seekers - TV Heavy
      – Existing Segments: Multi-Gadget Families; Affluent Matures; Thrifty Families; Budget Singles; High Tech Singles; Seniors
      – Result: Customized Micro-Segments!
  • 22. Heterogeneous Data Sources
      – New Data Sources: Internet Deep Packet Inspection; TV Consumption (Linear); Video On Demand Consumption
      – Prevalence of new data sources was limited but increasing
          – Rich usage data available on a subset of the subscribers
          – Leads to limited applicability of micro-segments
      – Lack of data may be alleviated by expanding data science efforts
          – Leverage micro-segmentation model to score a different subset of subscribers (who we have limited data on)
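One way to picture scoring subscribers with limited data is nearest-centroid assignment restricted to the features that were actually observed. The slides do not state which model was used, so this is a stand-in sketch, not the presenter's method. The feature names, the centroid values, and the function `score_subscriber` are hypothetical; only the segment names come from the slides.

```python
import math

# Hypothetical segment centroids over three invented usage features:
# share of activity that is OTT, VOD, and linear TV respectively.
CENTROIDS = {
    "OTT & Data Heavyweights": {"ott_share": 0.8, "vod_share": 0.1, "tv_share": 0.1},
    "In-Home Native Content Seekers - VOD Heavy": {"ott_share": 0.1, "vod_share": 0.7, "tv_share": 0.2},
    "In-Home Native Content Seekers - TV Heavy": {"ott_share": 0.1, "vod_share": 0.1, "tv_share": 0.8},
}


def score_subscriber(features):
    """Assign the nearest segment using only the features that are present."""
    def distance(centroid):
        # Euclidean distance over the observed feature keys only.
        return math.sqrt(sum((features[k] - centroid[k]) ** 2 for k in features))

    return min(CENTROIDS, key=lambda name: distance(CENTROIDS[name]))


# A subscriber with rich usage data (all three features observed).
full = score_subscriber({"ott_share": 0.75, "vod_share": 0.15, "tv_share": 0.10})

# A subscriber with limited data: no deep packet inspection, so no OTT feature.
partial = score_subscriber({"vod_share": 0.65, "tv_share": 0.25})
```

Dropping a feature makes some segments harder to tell apart, which is one concrete form of the "limited applicability" the slide warns about.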
  • 23. Driving New Business Value
      – Upsell and Cross-Sell
      – New Product Offerings
      – Data Monetization
  • 24. Implications for the Enterprise
      – Organizational: Vision; Preparedness; Execution
      – Technical / Data: Data quality & completeness; Heterogeneity of data sources; Technology architecture
      – Data quality & completeness:
          – Data capture mechanisms can have a lasting impact on ability to solve a business problem
      – Heterogeneity of data sources:
          – Existence of legacy systems & devices may limit the applicability of new models unless that is taken into account ahead of time
          – Feedback to spur upgrading of equipment wherever possible
  • 25. Implications for the Enterprise
      – Creating value from IoT requires organizational and technical alignment
      – Impacts of these considerations on data science efforts and outcomes are non-trivial
      – Specific impacts of data issues include:
          – Longer time to realization of value
          – Model accuracy issues
          – Limited applicability of results
          – And more …
  • 26. For further information, check out …
      – Pivotal Blog @ http://blog.pivotal.io
      – Pivotal Data Science Blog @ http://blog.pivotal.io/data-science-pivotal
      – Pivotal Data Product Info, Docs and Downloads @ http://pivotal.io/big-data
      – Oil & Gas Use Case Webinar:
          – Video: https://www.youtube.com/watch?v=dhT-tjHCr9E
          – Slides: http://www.slideshare.net/Pivotal/data-as-thenewoil
      – Blogs:
          – Oil & Gas Use Case: http://blog.pivotal.io/pivotal/case-studies-2/data-as-the-new-oil-producing-value-for-the-oil-gas-industry
          – Time Series Analysis: http://blog.pivotal.io/tag/time-series-analysis