SlideShare a Scribd company logo
1 of 43
Download to read offline
11© 2017 MapR Technologies
Big Data in Healthcare
Carol McDonald
@caroljmcdonald
22© 2017 MapR Technologies
The Motivation for Big Data: Poor ROI
•  USA spends a lot more per
capita
•  US Health System ranks last
among eleven countries
(OECD)
–  healthy lives, access, quality,
efficiency
33© 2017 MapR Technologies
Who Knew Healthcare could be so complicated?
44© 2017 MapR Technologies
Value Based Care & Value Based Reimbursement
Incentives for Technology:
•  Improve coordination and
outcome
•  shifting from fee-for-service
•  to value based data driven incentives
55© 2017 MapR Technologies© 2016 MapR Technologies© 2016 MapR Technologies
The Data
66© 2017 MapR Technologies
Where is the Big Data Opportunity?
McKinsey Global Institute
77© 2017 MapR Technologies
Where is the Big Data Opportunity?
According to McKinsey Global Institute the big
data opportunity:
•  Claims
–  utilization of care
•  Pharmaceutical
–  clinical trials
•  Clinical Data
–  Electronic Medical Records
•  Patient Behavior and
Population Health
lab
EMR / EHR
Doctor’s notes
Claims
images
HL7
Social Media
88© 2017 MapR Technologies
Building a Healthcare Data Lake on MapR
Data
Lake
Claims
Clinical
Pharmacy
EMR
Logs and
Notes
3rd Party
Additional
Data
CB Header data, Social, ...
Historical procedures, co-morbidities (prof & inst.)
Lab results, vital signs, ...
Dr. Notes, Customer call logs, emails
Licensing, death master, …
Electronic Medical Records, images & text
Prescriptions, adherence
99© 2017 MapR Technologies© 2016 MapR Technologies© 2016 MapR Technologies
Big Data Use Cases
1010© 2017 MapR Technologies
Patient Data Management
Analyzed
Unstructured Data
Patient 360 View
Lab
EMR / EHR
Analysts
Doctor’s notes
Claims
Images
HL7
Social Media
Providers
MapR Converged Data
Platform
1111© 2017 MapR Technologies
Reducing Fraud Waste and Abuse with Big Data Analytics
•  Healthcare Fraud >$60 billion yr
•  UnitedHealthcare:
–  2200% ROI using MapR for
Fraud
•  Medicare/Medicaid prevented
>$210.7 million fraud 1 year
Machine Learning
Model
EDI Claim
Fraud
Score
1212© 2017 MapR Technologies
Predictive Analytics to Improve Outcomes
• Early Diagnosis of sepsis, CHF
• Predicting risk of readmission
• Matching treatments
Early Detection of Congestive Heart Failure
Sun, Jimeng, Large-scale Patient Similarity Learning for health analytics, Georgia Tech
1313© 2017 MapR Technologies
Predictive Analytics/ Machine Learning
•  Aetna Labs predict future risk of metabolic syndrome
–  https://www.healthcare-informatics.com/article/how-aetna-using-big-data-give-patients-
personalized-care
•  Optum Labs data from 150 million patient records gives insight about
what works best
–  http://www.modernhealthcare.com/article/20150926/MAGAZINE/309269979
1414© 2017 MapR Technologies
Real Time Monitoring and Alerts
Medical Devices
Stream
Stream
Stream Dashboards
Global Analytics &
Alerting
1515© 2017 MapR Technologies
Why combine IOT with Machine Learning?
•  Cheaper sensors and machine learning are making it possible for
doctors to rapidly apply smart medicine to their patients’ cases
–  https://www.wsj.com/articles/the-smart-medicine-solution-to-the-health-care-
crisis-1499443449
1616© 2017 MapR Technologies
Why combine IOT with Machine Learning?
•  A Stanford team has shown that a machine-learning model can
identify arrhythmias from an EKG better than an expert
–  https://www.technologyreview.com/s/608234/the-machines-are-getting-ready-to-play-
doctor/
1717© 2017 MapR Technologies
Applying Machine Learning to Live Patient Data
–  https://www.healthitoutcomes.com/doc/applying-machine-learning-to-live-
data-0001
1818© 2017 MapR Technologies
Real Time Monitoring Potential
•  CDC: chronic diseases—such as heart disease—are the major
causes of sickness and health care costs in the nation
•  McKinsey: Better management of congestive heart failure could
reduce treatment costs by a billion dollars annually
1919© 2017 MapR Technologies
Why combine IOT with Machine Learning?
•  Connected care ensuring quicker Sepsis treatment:
–  Blood pressure, pulse rates and oxygen levels from monitoring devices
combined with machine learning to provide alerts
–  http://www.computerweekly.com/news/450422258/Putting-sepsis-algorithms-into-
electronic-patient-records
2020© 2017 MapR Technologies© 2016 MapR Technologies© 2016 MapR Technologies
Solution Architecture
2121© 2017 MapR Technologies
Serve DataStore DataCollect Data
What Do We Need to Do ?
Process DataData Sources
images
? ? ? ?
2222© 2017 MapR Technologies
Collect the Data with NFS mounted on MapR-XD
•  Data Ingest:
–  File Based:
NFS with
MapR-FS
•  Move hot data
to $$ storage
•  Move cold
data to
cheaper MapR-
XD
Collect Data
MapR-FS
Data Sources
images
NFS
$$$ Storage
NFS
RDBMS
Data
Warehouse
NFS
Unlimited
Inexpensive
Storage
2323© 2017 MapR Technologies
Collect the Events with MapR Streams
Consumers
Consumers
Consumers
Producers
Producers
Producers
MapR-FS
Kafka API Kafka API
2424© 2017 MapR Technologies
Collect Data
Batch processing
MapR-FS
Process Data
•  Spark Parallel processing high
throughput fast
•  Hive, Pig, MapReduce slower but can
be simpler for batch file processing
2525© 2017 MapR Technologies
Apache Spark Distributed Datasets
Distributed Dataset
Node
Executor
P4
Node
Executor
P1 P3
Node
Executor
P2
partitioned
Partition 1
8213034705, 95,
2.927373,
jake7870, 0……
Partition 2
8213034705,
115, 2.943484,
Davidbresler2,
1….
Partition 3
8213034705,
100, 2.951285,
gladimacowgirl,
58…
Partition 4
8213034705,
117, 2.998947,
daysrus, 95….
•  Data read into Memory Cache
•  Partitioned across a cluster
•  Operated on in parallel
•  Cached in memory for iterations
2626© 2017 MapR Technologies
Streaming Data
Stream processing
Process Data
•  scalable, high-throughput, stream
processing of live data
raw
enriched
alerts
2727© 2017 MapR Technologies
Streaming Analytics
2828© 2017 MapR Technologies
Store the Data with MapR-DB
Key
Range
xxxx
xxxx
Key
Range
xxxx
xxxx
Key
Range
xxxx
xxxx
Key colB col
C
val val val
xxx val val
Key colB col
C
val val val
xxx val val
Key colB col
C
val val val
xxx val val
Fast Reads and Writes by Key! Data is automatically partitioned
by Key Range!
2929© 2017 MapR Technologies
Store Lots of Data with NoSQL MapR-DB
bottleneck
Storage ModelRDBMS MapR-DB
Normalized schema à Joins for
queries can cause bottleneck De-Normalized schema à Data that
is read together is stored together
Key colB colC
xxx val val
xxx val val
Key colB colC
xxx val val
xxx val val
Key colB colC
xxx val val
xxx val val
3030© 2017 MapR Technologies
What is Drill?
•  SQL engine on “everything”
•  Files: JSON, CSV, Parquet
•  Structured formats – Ex: parquet
•  Ecosystem components – Hbase, MapRDB, Hive
•  Schema optional
•  interactive response times
3131© 2017 MapR Technologies
Apache Drill Architecture
•  massively parallel processing execution engine
•  distributed query processing
3232© 2017 MapR Technologies
Serve DataStore DataCollect Data
What Do We Need to Do ?
MapR-FS
Process DataData Sources
MapR-FS
Stream
Topic
3333© 2017 MapR Technologies© 2016 MapR Technologies© 2016 MapR Technologies
Customer Data Lakes
3434© 2017 MapR Technologies
MapR Healthcare Customers
Delivers clinical intelligence
to healthcare providers
Sepsis control based on
real time patient data
Genomic data platform
Research grant analysis
80+ use cases; FWA, …
Genomics analysisRadiology analytics Customized solutions for
value-based care
MRI
manufacturer
Novartis
3535© 2017 MapR Technologies
MapR Healthcare Architecture
3636© 2017 MapR Technologies
Data Lake Architectures
Agile, self-
service data
exploration
ETL into operational
reporting formats (e.g.,
Parquet)
Multi-tenancy: job/
data placement
control, volumes
Access controls:
file, table, column,
column family, doc,
sub-doc levels
Sources
Labs
Claims pharmacy
EHR
Auditing:
compliance, analyze
user accesses
Snapshots:
track data lineage
and history
Table Replication:
global multi-master,
business continuity
MapR Converged Data Platform
Enterprise Storage Database Event Streaming
MapR-FS MapR-DB MapR Streams
MapR-DB: time
series, structured
data, JSON
MapR-XD:
unstructured data
NFS/ raw files
MapR Event Streams:
real-time event data
3737© 2017 MapR Technologies
Valence Health
Population Health SaaS for 85,000 doctors 135 hospitals
•  3,000 inbound data feeds
–  Labs, EHR, claims…
Business Problem:
•  ETL for 20 million lab records took 22 hours to process.
Solution with MapR:
•  With NFS 20 million lab records now take 20 minutes with less
hardware
•  https://www.cioreview.com/news/valence-health-cuts-down-processing-time-and-
drives-customer-satisfaction-with-mapr-nid-11084-cid-15.html
3838© 2017 MapR Technologies
UnitedHealthcare Optum
MapR Data Lake single platform to analyze claims, prescriptions..
•  NFS to ingest 1 million claims, 10 terabytes per day
•  2200% ROI machine learning for Payment Integrity
•  Machine learning for improving outcomes: Diabetes, reduce readmissions…
3939© 2017 MapR Technologies
Baptist Health South Florida
Problem:
•  Oracle too expensive for big data
•  Need a common data platform for patient history
Solution:
1.  MapR data lake
2.  Offload cold data from Oracle $$ NFS to MapR
3.  Integration with EMR
4.  Admission/Readmission prediction
5.  Early sepsis detection/notification
6.  real time monitoring
4040© 2017 MapR Technologies
Use Case: Streaming System of Record for Healthcare
•  Objective:
–  Build a flexible, secure
healthcare information
exchange
Challenges:
•  Many different data models
•  Security and privacy issues
•  HIPAA compliance
4141© 2017 MapR Technologies
Solution: Streaming System of Record for Healthcare
•  Solution:
–  Streaming system of record
•  secure
•  immutable
•  rewindable
Auditable
•  Materialized views continuously computed
•  Selective cross data center replication
Stream
Topic
Records
Applications
6 5 4 3 2 1
Search
Graph DB
JSON
HBase
Micro
Service
Micro
Service
Micro
Service
Micro
Service
Micro
Service
Micro
Service
A
P
I
Streaming System of Record
Materialized
Views
4242© 2017 MapR Technologies
Streaming System of Record for Healthcare
Case Study: Liaison Technologies
Raw
Data
workflow
Key/Value
MapR-DB
materialized
view
workflow
Search
Engine
materialized
view
CEP
k v v v v v
k v v v
k v v
k v v v v
k v v v
k v v v v v
Document Log
(MapR-FS)
log
API
App
pre-
processor
workflow
Graph DB
materialized
view
workflow
Time
Series DB
materialized
view
micro
service
micro
service
micro
service
micro
service
micro
service
micro
service
micro
service
micro
service
App AppApp
...
MapR-ES as Immutable Log
MapR Event Streams (MapR-ES)
•  Immutable log for all data
ingested or consumed.
•  Events become system of
record, processed by
consumers based on their
permissions.
MapR-ES powers compliance-
ready lineage:
•  Immutability. MapR-ES throws
no data away.
•  Auditing. Who wrote/read
events?
•  Rewind. What was status of
data two days ago?
•  Replay. Rebuild derivative data
stores.
Auditors want to see:
•  Data lineage. Where data came
from, how it got there.
•  Audit logging. Who wrote to,
updated, or read the data.
4343© 2017 MapR Technologies
Q&A
@mapr
https://www.mapr.com/blog/author/carol-mcdonald
Engage with us!
mapr-technologies

More Related Content

What's hot

Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...Carol McDonald
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionMapR Technologies
 
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...Carol McDonald
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Carol McDonald
 
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...Carol McDonald
 
Apache Spark Machine Learning Decision Trees
Apache Spark Machine Learning Decision TreesApache Spark Machine Learning Decision Trees
Apache Spark Machine Learning Decision TreesCarol McDonald
 
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Analysis of Popular Uber Locations using Apache APIs:  Spark Machine Learning...Analysis of Popular Uber Locations using Apache APIs:  Spark Machine Learning...
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...Carol McDonald
 
Streaming Patterns Revolutionary Architectures with the Kafka API
Streaming Patterns Revolutionary Architectures with the Kafka APIStreaming Patterns Revolutionary Architectures with the Kafka API
Streaming Patterns Revolutionary Architectures with the Kafka APICarol McDonald
 
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DBAnalyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DBCarol McDonald
 
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...MapR Technologies
 
When Streaming Becomes Strategic
When Streaming Becomes StrategicWhen Streaming Becomes Strategic
When Streaming Becomes StrategicMapR Technologies
 
NoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DBNoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DBMapR Technologies
 
Deep Learning vs. Cheap Learning
Deep Learning vs. Cheap LearningDeep Learning vs. Cheap Learning
Deep Learning vs. Cheap LearningMapR Technologies
 
MapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data PlatformMapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data PlatformMapR Technologies
 
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleSpark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleIan Downard
 
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...MapR Technologies
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...MapR Technologies
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageMapR Technologies
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationMapR Technologies
 
Introduction to machine learning with GPUs
Introduction to machine learning with GPUsIntroduction to machine learning with GPUs
Introduction to machine learning with GPUsCarol McDonald
 

What's hot (20)

Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
 
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
 
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
 
Apache Spark Machine Learning Decision Trees
Apache Spark Machine Learning Decision TreesApache Spark Machine Learning Decision Trees
Apache Spark Machine Learning Decision Trees
 
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Analysis of Popular Uber Locations using Apache APIs:  Spark Machine Learning...Analysis of Popular Uber Locations using Apache APIs:  Spark Machine Learning...
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
 
Streaming Patterns Revolutionary Architectures with the Kafka API
Streaming Patterns Revolutionary Architectures with the Kafka APIStreaming Patterns Revolutionary Architectures with the Kafka API
Streaming Patterns Revolutionary Architectures with the Kafka API
 
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DBAnalyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
 
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
Streaming Goes Mainstream: New Architecture & Emerging Technologies for Strea...
 
When Streaming Becomes Strategic
When Streaming Becomes StrategicWhen Streaming Becomes Strategic
When Streaming Becomes Strategic
 
NoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DBNoSQL Application Development with JSON and MapR-DB
NoSQL Application Development with JSON and MapR-DB
 
Deep Learning vs. Cheap Learning
Deep Learning vs. Cheap LearningDeep Learning vs. Cheap Learning
Deep Learning vs. Cheap Learning
 
MapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data PlatformMapR Streams and MapR Converged Data Platform
MapR Streams and MapR Converged Data Platform
 
Spark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating ExampleSpark and MapR Streams: A Motivating Example
Spark and MapR Streams: A Motivating Example
 
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
 
Introduction to machine learning with GPUs
Introduction to machine learning with GPUsIntroduction to machine learning with GPUs
Introduction to machine learning with GPUs
 

Similar to How Big Data is Reducing Costs and Improving Outcomes in Health Care

Genome Analysis Pipelines, Big Data Style
Genome Analysis Pipelines, Big Data StyleGenome Analysis Pipelines, Big Data Style
Genome Analysis Pipelines, Big Data StyleJulius Remigio, CBIP
 
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeDataWorks Summit
 
Hitachi high-performance-accelerates-life-sciences-research
Hitachi high-performance-accelerates-life-sciences-researchHitachi high-performance-accelerates-life-sciences-research
Hitachi high-performance-accelerates-life-sciences-researchHitachi Vantara
 
Baptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big DataBaptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big DataMapR Technologies
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action MapR Technologies
 
Data analytics - May 2016
Data analytics - May 2016Data analytics - May 2016
Data analytics - May 2016Mark Yunger
 
Dennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcareDennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcareInnovation Agency
 
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...Patrick Van Renterghem
 
Appistry WGDAS Presentation
Appistry WGDAS PresentationAppistry WGDAS Presentation
Appistry WGDAS Presentationelasticdave
 
20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design Patterns20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design PatternsAllen Day, PhD
 
How to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsHow to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsExtraHop Networks
 
Scientific Application Development and Early results on Summit
Scientific Application Development and Early results on SummitScientific Application Development and Early results on Summit
Scientific Application Development and Early results on SummitGanesan Narayanasamy
 
Innovating With Data and Analytics
Innovating With Data and AnalyticsInnovating With Data and Analytics
Innovating With Data and AnalyticsVMware Tanzu
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceeRic Choo
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareDATA360US
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeGeoffrey Fox
 
Chip ICT | Hgst storage brochure
Chip ICT | Hgst storage brochureChip ICT | Hgst storage brochure
Chip ICT | Hgst storage brochureMarco van der Hart
 
The challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DLaura Berry
 
Real Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from PivotalReal Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from PivotalVMware Tanzu Korea
 

Similar to How Big Data is Reducing Costs and Improving Outcomes in Health Care (20)

Genome Analysis Pipelines, Big Data Style
Genome Analysis Pipelines, Big Data StyleGenome Analysis Pipelines, Big Data Style
Genome Analysis Pipelines, Big Data Style
 
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
 
Hitachi high-performance-accelerates-life-sciences-research
Hitachi high-performance-accelerates-life-sciences-researchHitachi high-performance-accelerates-life-sciences-research
Hitachi high-performance-accelerates-life-sciences-research
 
Baptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big DataBaptist Health: Solving Healthcare Problems with Big Data
Baptist Health: Solving Healthcare Problems with Big Data
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
 
Data analytics - May 2016
Data analytics - May 2016Data analytics - May 2016
Data analytics - May 2016
 
Dennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcareDennis Kehoe - ECO 15: Digital connectivity in healthcare
Dennis Kehoe - ECO 15: Digital connectivity in healthcare
 
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...
Data Virtualization at UMC Utrecht: Don't Collect, Connect! by Erik Fransen (...
 
Appistry WGDAS Presentation
Appistry WGDAS PresentationAppistry WGDAS Presentation
Appistry WGDAS Presentation
 
20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design Patterns20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design Patterns
 
How to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsHow to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT Operations
 
AI Super computer update
AI Super computer update AI Super computer update
AI Super computer update
 
Scientific Application Development and Early results on Summit
Scientific Application Development and Early results on SummitScientific Application Development and Early results on Summit
Scientific Application Development and Early results on Summit
 
Innovating With Data and Analytics
Innovating With Data and AnalyticsInnovating With Data and Analytics
Innovating With Data and Analytics
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
 
Enterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for HealthcareEnterprise Analytics: Serving Big Data Projects for Healthcare
Enterprise Analytics: Serving Big Data Projects for Healthcare
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
 
Chip ICT | Hgst storage brochure
Chip ICT | Hgst storage brochureChip ICT | Hgst storage brochure
Chip ICT | Hgst storage brochure
 
The challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&D
 
Real Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from PivotalReal Time Business Platform by Ivan Novick from Pivotal
Real Time Business Platform by Ivan Novick from Pivotal
 

More from Carol McDonald

Spark machine learning predicting customer churn
Spark machine learning predicting customer churnSpark machine learning predicting customer churn
Spark machine learning predicting customer churnCarol McDonald
 
Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...
Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...
Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...Carol McDonald
 
Apache Spark Machine Learning
Apache Spark Machine LearningApache Spark Machine Learning
Apache Spark Machine LearningCarol McDonald
 
Build a Time Series Application with Apache Spark and Apache HBase
Build a Time Series Application with Apache Spark and Apache  HBaseBuild a Time Series Application with Apache Spark and Apache  HBase
Build a Time Series Application with Apache Spark and Apache HBaseCarol McDonald
 
Apache Spark streaming and HBase
Apache Spark streaming and HBaseApache Spark streaming and HBase
Apache Spark streaming and HBaseCarol McDonald
 
Machine Learning Recommendations with Spark
Machine Learning Recommendations with SparkMachine Learning Recommendations with Spark
Machine Learning Recommendations with SparkCarol McDonald
 
Getting started with HBase
Getting started with HBaseGetting started with HBase
Getting started with HBaseCarol McDonald
 
Introduction to Spark on Hadoop
Introduction to Spark on HadoopIntroduction to Spark on Hadoop
Introduction to Spark on HadoopCarol McDonald
 
NoSQL HBase schema design and SQL with Apache Drill
NoSQL HBase schema design and SQL with Apache Drill NoSQL HBase schema design and SQL with Apache Drill
NoSQL HBase schema design and SQL with Apache Drill Carol McDonald
 

More from Carol McDonald (13)

Spark graphx
Spark graphxSpark graphx
Spark graphx
 
Spark machine learning predicting customer churn
Spark machine learning predicting customer churnSpark machine learning predicting customer churn
Spark machine learning predicting customer churn
 
Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...
Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...
Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...
 
Apache Spark Machine Learning
Apache Spark Machine LearningApache Spark Machine Learning
Apache Spark Machine Learning
 
Build a Time Series Application with Apache Spark and Apache HBase
Build a Time Series Application with Apache Spark and Apache  HBaseBuild a Time Series Application with Apache Spark and Apache  HBase
Build a Time Series Application with Apache Spark and Apache HBase
 
Apache Spark streaming and HBase
Apache Spark streaming and HBaseApache Spark streaming and HBase
Apache Spark streaming and HBase
 
Machine Learning Recommendations with Spark
Machine Learning Recommendations with SparkMachine Learning Recommendations with Spark
Machine Learning Recommendations with Spark
 
Apache Spark Overview
Apache Spark OverviewApache Spark Overview
Apache Spark Overview
 
Introduction to Spark
Introduction to SparkIntroduction to Spark
Introduction to Spark
 
CU9411MW.DOC
CU9411MW.DOCCU9411MW.DOC
CU9411MW.DOC
 
Getting started with HBase
Getting started with HBaseGetting started with HBase
Getting started with HBase
 
Introduction to Spark on Hadoop
Introduction to Spark on HadoopIntroduction to Spark on Hadoop
Introduction to Spark on Hadoop
 
NoSQL HBase schema design and SQL with Apache Drill
NoSQL HBase schema design and SQL with Apache Drill NoSQL HBase schema design and SQL with Apache Drill
NoSQL HBase schema design and SQL with Apache Drill
 

Recently uploaded

20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...Akihiro Suda
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZABSYZ Inc
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsChristian Birchler
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfkalichargn70th171
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slidesvaideheekore1
 
SoftTeco - Software Development Company Profile
SoftTeco - Software Development Company ProfileSoftTeco - Software Development Company Profile
SoftTeco - Software Development Company Profileakrivarotava
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringHironori Washizaki
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalLionel Briand
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...Technogeeks
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonLeveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonApplitools
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
Patterns for automating API delivery. API conference
Patterns for automating API delivery. API conferencePatterns for automating API delivery. API conference
Patterns for automating API delivery. API conferencessuser9e7c64
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identityteam-WIBU
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfDrew Moseley
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Rob Geurden
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 

Recently uploaded (20)

20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZ
 
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving CarsSensoDat: Simulation-based Sensor Dataset of Self-driving Cars
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slides
 
SoftTeco - Software Development Company Profile
SoftTeco - Software Development Company ProfileSoftTeco - Software Development Company Profile
SoftTeco - Software Development Company Profile
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their Engineering
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive Goal
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonLeveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
Patterns for automating API delivery. API conference
Patterns for automating API delivery. API conferencePatterns for automating API delivery. API conference
Patterns for automating API delivery. API conference
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identity
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdf
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 

How Big Data is Reducing Costs and Improving Outcomes in Health Care

  • 1. 11© 2017 MapR Technologies Big Data in Healthcare Carol McDonald @caroljmcdonald
  • 2. 22© 2017 MapR Technologies The Motivation for Big Data: Poor ROI •  USA spends a lot more per capita •  US Health System ranks last among eleven countries (OECD) –  healthy lives, access, quality, efficiency
  • 3. 33© 2017 MapR Technologies Who Knew Healthcare could be so complicated?
  • 4. 44© 2017 MapR Technologies Value Based Care & Value Based Reimbursement Incentives for Technology: •  Improve coordination and outcome •  shifting from fee-for-service •  to value based data driven incentives
  • 5. 55© 2017 MapR Technologies© 2016 MapR Technologies© 2016 MapR Technologies The Data
  • 6. 66© 2017 MapR Technologies Where is the Big Data Opportunity? McKinsey Global Institute
  • 7. 77© 2017 MapR Technologies Where is the Big Data Opportunity? According to McKinsey Global Institute the big data opportunity: •  Claims –  utilization of care •  Pharmaceutical –  clinical trials •  Clinical Data –  Electronic Medical Records •  Patient Behavior and Population Health lab EMR / EHR Doctor’s notes Claims images HL7 Social Media
  • 8. 88© 2017 MapR Technologies Building a Healthcare Data Lake on MapR Data Lake Claims Clinical Pharmacy EMR Logs and Notes 3rd Party Additional Data CB Header data, Social, ... Historical procedures, co-morbidities (prof & inst.) Lab results, vital signs, ... Dr. Notes, Customer call logs, emails Licensing, death master, … Electronic Medical Records, images & text Prescriptions, adherence
  • 9. 99© 2017 MapR Technologies© 2016 MapR Technologies© 2016 MapR Technologies Big Data Use Cases
  • 10. 1010© 2017 MapR Technologies Patient Data Management Analyzed Unstructured Data Patient 360 View Lab EMR / EHR Analysts Doctor’s notes Claims Images HL7 Social Media Providers MapR Converged Data Platform
  • 11. 1111© 2017 MapR Technologies Reducing Fraud Waste and Abuse with Big Data Analytics •  Healthcare Fraud >$60 billion yr •  UnitedHealthcare: –  2200% ROI using MapR for Fraud •  Medicare/Medicaid prevented >$210.7 million fraud 1 year Machine Learning Model EDI Claim Fraud Score
  • 12. 1212© 2017 MapR Technologies Predictive Analytics to Improve Outcomes • Early Diagnosis of sepsis, CHF • Predicting risk of readmission • Matching treatments Early Detection of Congestive Heart Failure Sun, Jimeng, Large-scale Patient Similarity Learning for health analytics, Georgia Tech
  • 13. 1313© 2017 MapR Technologies Predictive Analytics/ Machine Learning •  Aetna Labs predict future risk of metabolic syndrome –  https://www.healthcare-informatics.com/article/how-aetna-using-big-data-give-patients- personalized-care •  Optum Labs data from 150 million patient records gives insight about what works best –  http://www.modernhealthcare.com/article/20150926/MAGAZINE/309269979
  • 14. 1414© 2017 MapR Technologies Real Time Monitoring and Alerts Medical Devices Stream Stream Stream Dashboards Global Analytics & Alerting
  • 15. 1515© 2017 MapR Technologies Why combine IOT with Machine Learning? •  Cheaper sensors and machine learning are making it possible for doctors to rapidly apply smart medicine to their patients’ cases –  https://www.wsj.com/articles/the-smart-medicine-solution-to-the-health-care- crisis-1499443449
  • 16. 1616© 2017 MapR Technologies Why combine IOT with Machine Learning? •  A Stanford team has shown that a machine-learning model can identify arrhythmias from an EKG better than an expert –  https://www.technologyreview.com/s/608234/the-machines-are-getting-ready-to-play- doctor/
  • 17. 1717© 2017 MapR Technologies Applying Machine Learning to Live Patient Data –  https://www.healthitoutcomes.com/doc/applying-machine-learning-to-live- data-0001
  • 18. 1818© 2017 MapR Technologies Real Time Monitoring Potential •  CDC: chronic diseases—such as heart disease—are the major causes of sickness and health care costs in the nation •  McKinsey: Better management of congestive heart failure could reduce treatment costs by a billion dollars annually
  • 19. 1919© 2017 MapR Technologies Why combine IOT with Machine Learning? •  Connected care ensuring quicker Sepsis treatment: –  Blood pressure, pulse rates and oxygen levels from monitoring devices combined with machine learning to provide alerts –  http://www.computerweekly.com/news/450422258/Putting-sepsis-algorithms-into- electronic-patient-records
  • 20. 2020© 2017 MapR Technologies© 2016 MapR Technologies© 2016 MapR Technologies Solution Architecture
  • 21. 2121© 2017 MapR Technologies Serve DataStore DataCollect Data What Do We Need to Do ? Process DataData Sources images ? ? ? ?
  • 22. 2222© 2017 MapR Technologies Collect the Data with NFS mounted on MapR-XD •  Data Ingest: –  File Based: NFS with MapR-FS •  Move hot data to $$ storage •  Move cold data to cheaper MapR- XD Collect Data MapR-FS Data Sources images NFS $$$ Storage NFS RDBMS Data Warehouse NFS Unlimited Inexpensive Storage
  • 23. 2323© 2017 MapR Technologies Collect the Events with MapR Streams Consumers Consumers Consumers Producers Producers Producers MapR-FS Kafka API Kafka API
  • 24. 2424© 2017 MapR Technologies Collect Data Batch processing MapR-FS Process Data •  Spark Parallel processing high throughput fast •  Hive, Pig, MapReduce slower but can be simpler for batch file processing
  • 25. 2525© 2017 MapR Technologies Apache Spark Distributed Datasets Distributed Dataset Node Executor P4 Node Executor P1 P3 Node Executor P2 partitioned Partition 1 8213034705, 95, 2.927373, jake7870, 0…… Partition 2 8213034705, 115, 2.943484, Davidbresler2, 1…. Partition 3 8213034705, 100, 2.951285, gladimacowgirl, 58… Partition 4 8213034705, 117, 2.998947, daysrus, 95…. •  Data read into Memory Cache •  Partitioned across a cluster •  Operated on in parallel •  Cached in memory for iterations
  • 26. 2626© 2017 MapR Technologies Streaming Data Stream processing Process Data •  scalable, high-throughput, stream processing of live data raw enriched alerts
  • 27. 2727© 2017 MapR Technologies Streaming Analytics
  • 28. 2828© 2017 MapR Technologies Store the Data with MapR-DB Key Range xxxx xxxx Key Range xxxx xxxx Key Range xxxx xxxx Key colB col C val val val xxx val val Key colB col C val val val xxx val val Key colB col C val val val xxx val val Fast Reads and Writes by Key! Data is automatically partitioned by Key Range!
  • 29. 2929© 2017 MapR Technologies Store Lots of Data with NoSQL MapR-DB bottleneck Storage ModelRDBMS MapR-DB Normalized schema à Joins for queries can cause bottleneck De-Normalized schema à Data that is read together is stored together Key colB colC xxx val val xxx val val Key colB colC xxx val val xxx val val Key colB colC xxx val val xxx val val
  • 30. 3030© 2017 MapR Technologies What is Drill? •  SQL engine on “everything” •  Files: JSON, CSV, Parquet •  Structured formats – Ex: parquet •  Ecosystem components – Hbase, MapRDB, Hive •  Schema optional •  interactive response times
  • 31. 3131© 2017 MapR Technologies Apache Drill Architecture •  massively parallel processing execution engine •  distributed query processing
  • 32. 3232© 2017 MapR Technologies Serve DataStore DataCollect Data What Do We Need to Do ? MapR-FS Process DataData Sources MapR-FS Stream Topic
  • 33. 3333© 2017 MapR Technologies© 2016 MapR Technologies© 2016 MapR Technologies Customer Data Lakes
  • 34. 3434© 2017 MapR Technologies MapR Healthcare Customers Delivers clinical intelligence to healthcare providers Sepsis control based on real time patient data Genomic data platform Research grant analysis 80+ use cases; FWA, … Genomics analysisRadiology analytics Customized solutions for value-based care MRI manufacturer Novartis
  • 35. 3535© 2017 MapR Technologies MapR Healthcare Architecture
  • 36. 3636© 2017 MapR Technologies Data Lake Architectures Agile, self- service data exploration ETL into operational reporting formats (e.g., Parquet) Multi-tenancy: job/ data placement control, volumes Access controls: file, table, column, column family, doc, sub-doc levels Sources Labs Claims pharmacy EHR Auditing: compliance, analyze user accesses Snapshots: track data lineage and history Table Replication: global multi-master, business continuity MapR Converged Data Platform Enterprise Storage Database Event Streaming MapR-FS MapR-DB MapR Streams MapR-DB: time series, structured data, JSON MapR-XD: unstructured data NFS/ raw files MapR Event Streams: real-time event data
  • 37. 3737© 2017 MapR Technologies Valence Health Population Health SaaS for 85,000 doctors 135 hospitals •  3,000 inbound data feeds –  Labs, EHR, claims… Business Problem: •  ETL for 20 million lab records took 22 hours to process. Solution with MapR: •  With NFS 20 million lab records now take 20 minutes with less hardware •  https://www.cioreview.com/news/valence-health-cuts-down-processing-time-and- drives-customer-satisfaction-with-mapr-nid-11084-cid-15.html
  • 38. 3838© 2017 MapR Technologies UnitedHealthcare Optum MapR Data Lake single platform to analyze claims, prescriptions.. •  NFS to ingest 1 million claims, 10 terabytes per day •  2200% ROI machine learning for Payment Integrity •  Machine learning for improving outcomes: Diabetes, reduce readmissions…
  • 39. 3939© 2017 MapR Technologies Baptist Health South Florida Problem: •  Oracle too expensive for big data •  Need a common data platform for patient history Solution: 1.  MapR data lake 2.  Offload cold data from Oracle $$ NFS to MapR 3.  Integration with EMR 4.  Admission/Readmission prediction 5.  Early sepsis detection/notification 6.  real time monitoring
  • 40. 4040© 2017 MapR Technologies Use Case: Streaming System of Record for Healthcare •  Objective: –  Build a flexible, secure healthcare information exchange Challenges: •  Many different data models •  Security and privacy issues •  HIPAA compliance
  • 41. 4141© 2017 MapR Technologies Solution: Streaming System of Record for Healthcare •  Solution: –  Streaming system of record •  secure •  immutable •  rewindable Auditable •  Materialized views continuously computed •  Selective cross data center replication Stream Topic Records Applications 6 5 4 3 2 1 Search Graph DB JSON HBase Micro Service Micro Service Micro Service Micro Service Micro Service Micro Service A P I Streaming System of Record Materialized Views
  • 42. 4242© 2017 MapR Technologies Streaming System of Record for Healthcare Case Study: Liaison Technologies Raw Data workflow Key/Value MapR-DB materialized view workflow Search Engine materialized view CEP k v v v v v k v v v k v v k v v v v k v v v k v v v v v Document Log (MapR-FS) log API App pre- processor workflow Graph DB materialized view workflow Time Series DB materialized view micro service micro service micro service micro service micro service micro service micro service micro service App AppApp ... MapR-ES as Immutable Log MapR Event Streams (MapR-ES) •  Immutable log for all data ingested or consumed. •  Events become system of record, processed by consumers based on their permissions. MapR-ES powers compliance- ready lineage: •  Immutability. MapR-ES throws no data away. •  Auditing. Who wrote/read events? •  Rewind. What was status of data two days ago? •  Replay. Rebuild derivative data stores. Auditors want to see: •  Data lineage. Where data came from, how it got there. •  Audit logging. Who wrote to, updated, or read the data.
  • 43. 4343© 2017 MapR Technologies Q&A @mapr https://www.mapr.com/blog/author/carol-mcdonald Engage with us! mapr-technologies