SlideShare a Scribd company logo
1 of 38
1© Cloudera, Inc. All rights reserved.
Amy O’Connor
Big Data Evangelist / Business Value Enablement
Big Value with Big Customer Data
2© Cloudera, Inc. All rights reserved.
Real World Impact
The great value of data
Top Cancer Research
Institutions
Working to Cure
Cancer
Rocket Science
Thorn
Destroying Human
Trafficking Networks
Manning the Orion spacecraft
as it orbits the earth
3© Cloudera, Inc. All rights reserved.
Cloudera Fast Facts: An Innovative Technology Company
2008
Founded by former employees of
2009
First commercial Hadoop product
1200+ Employees
2300+ Partners
~$1B Investment
4© Cloudera, Inc. All rights reserved.
Google 1999: Indexing the Web
5© Cloudera, Inc. All rights reserved.
The Original Inspirations for Hadoop
2003 2004
6© Cloudera, Inc. All rights reserved.
2006
Core Hadoop
(HDFS,
MapReduce)
The Beginning: Building Hadoop
7© Cloudera, Inc. All rights reserved.
2006 2008 2009 2010 2011 2012 2013
HBase
ZooKeeper
Solr
Pig
Core Hadoop
Hive
Mahout
HBase
ZooKeeper
Solr
Pig
Core Hadoop
Sqoop
Avro
Hive
Mahout
HBase
ZooKeeper
Solr
Pig
Core Hadoop
Flume
Bigtop
Oozie
HCatalog
Hue
Sqoop
Avro
Hive
Mahout
HBase
ZooKeeper
Solr
Pig
YARN
Core Hadoop
Spark
Tez
Impala
Kafka
Drill
Flume
Bigtop
Oozie
HCatalog
Hue
Sqoop
Avro
Hive
Mahout
HBase
ZooKeeper
Solr
Pig
YARN
Core Hadoop
Parquet
Sentry
Spark
Tez
Impala
Kafka
Drill
Flume
Bigtop
Oozie
HCatalog
Hue
Sqoop
Avro
Hive
Mahout
HBase
ZooKeeper
Solr
Pig
YARN
Core Hadoop
2007
Solr
Pig
Core Hadoop
Knox
Flink
Parquet
Sentry
Spark
Tez
Impala
Kafka
Drill
Flume
Bigtop
Oozie
HCatalog
Hue
Sqoop
Avro
Hive
Mahout
HBase
ZooKeeper
Solr
Pig
YARN
Core Hadoop
2014 2015
Kudu
RecordService
Ibis
Falcon
Knox
Flink
Parquet
Sentry
Spark
Tez
Impala
Kafka
Drill
Flume
Bigtop
Oozie
HCatalog
Hue
Sqoop
Avro
Hive
Mahout
HBase
ZooKeeper
Solr
Pig
YARN
Core Hadoop
Core Hadoop
(HDFS,
MapReduce)
A Decade of Hadoop
8© Cloudera, Inc. All rights reserved.
Our relationship
with data
is changing.
Hadoop Technology enables
new ways of working.
9© Cloudera, Inc. All rights reserved.
Requirements necessary to drive value from data
1. Economically feasible to store more data
2. Powered to predictably process large data sets
3. Ability to build your data asset at linear scale
4. Collect data in native format – enables agility
5. Build history of activity by collecting data prior to its use
6. You can have near real-time access to data, plus a view of history
7. Security at the data layer increases flexibility and ability to protect privacy
8. Create community data and use machine learning to drive innovation
Extreme performance
and efficiency
Analytic agility
10© Cloudera, Inc. All rights reserved.
Merging real-time &
archived data
Structured with
unstructured
External and internal
sources
Data stays where it’s born
Not all can be in the cloud
Partnerships with Amazon,
Microsoft & Google
Native Encryption
Access Control
Data Governance
Regulatory Compliance
Advanced Analytics Hybrid Cloud Data SecurityMulti-Workload
Batch Computation
Interactive SQL
Machine Learning
Stream Processing
Search
In-memory
The ground has shifted, from “Storage + Compute” to:
11© Cloudera, Inc. All rights reserved.
Our relationship
with data
is changing.
From balance to blend:
personal & professional lives.
Understanding the customer journey
12© Cloudera, Inc. All rights reserved.
How many here sleep with your smartphone?
• One in three Australians sleep with
their smartphones
Source: Deloitte: Mobile Consumer Survey 2015
– The Australian Cut
From balance to blend: Personal & Professional Lives
• More than half the population checks
their smartphone within 15 minutes of
waking
• More than 88% of Australians use their
smartphones when talking to friends
and 92% of Australians use their
smartphones at work (92%)
13© Cloudera, Inc. All rights reserved.
From balance to blend: Personal & Professional Lives
Source: September 10, 2014|by Tasha Keeney, ARK Analyst|Devices / Gateways
Tablet
Smartphone
Internet
TV
70+ Telco / Internet Providers
286PB Data
14© Cloudera, Inc. All rights reserved.
• New insights into customer
behavior, abandoned online
shopping cart behavior with
unified customer data
• Marketing spend
optimization through
channel attribution analysis
• Improve supply chain and
reduce inventory costs
• Improved ability to predict
returns
DRIVE CUSTOMER
INSIGHTS
The Customer
Journey
15© Cloudera, Inc. All rights reserved.
• Leveraging EDH and predictive
modeling, help clients optimize
market, channel and offer.
• With their solution, customers
can auto serve offers tailored
by consumer behavior,
preference and transaction
history.
• Digital Alchemy customers
include Virgin Mobil, Spark,
RACQ, ASB and Rabbit
Rewards.
DRIVE CUSTOMER
INSIGHTS
Right Time, Right
Offer, Right Channel
16© Cloudera, Inc. All rights reserved.
Data and the
Sharing Economy
“Everything that we do in
engineering is about creating
great matches between people”.
Machine Learning drove up
booking rates by 4% - with first
experiment.
17© Cloudera, Inc. All rights reserved.
Our relationship
with data
is changing.
From separate to converged
digital and physical worlds.
Building better products & services
• Internet of Things (IoT)
• Smart Cities
• Augmented Reality
• Virtual Reality
• Precision Medicine
• Precision Energy
• Automated Logistics
• 3D printing
18© Cloudera, Inc. All rights reserved.
Source- Ovum: Understanding the IoT
Opportunity: An Industry Perspective -
2015
Building better products with data from IoT
19© Cloudera, Inc. All rights reserved.
• Deeper analytics from
customer profile data
• 50% increase in customer
retention while 2x
increase in policies issued
• Ability to analyze 10s of
millions of quotes in
under a minute
• $5 million claim cost
reduction
through fraud prevention
Connected cars
20© Cloudera, Inc. All rights reserved.
Moving people
Discount people’s
previous experience but
put a heavy premium on
their ability to solve the
problems that your
business has.
Heavy emphaisis on the
combination of creative
and analytical skill sets.
21© Cloudera, Inc. All rights reserved.
• Virtuous cycle: Identify
features that facilitate sharing
of content that drive new
customers
• Real-time streaming and batch
data from product logs, web
analytics, channel data and
ERP
•Impala connects to third-party
data wrangling and BI tools for
fast reporting
Sharing entertainment
experiences
22© Cloudera, Inc. All rights reserved.
• Monitor the health of
180,000+ trucks in real-time :
• OnCommand Connection
collecting telematics and
geolocation data across
thousands of trucks
• Identify and correct engine
problems early, and increase
fleet uptime
• Reduced maintenance costs
to $.03 per mile from $.12-
$.15 per mile
Connected logistics
23© Cloudera, Inc. All rights reserved.
The IT world
is changing.
Disaggregated, distributed
On-prem, hybrid, cloud
Fast, easy, secure
24© Cloudera, Inc. All rights reserved.
Hadoop deployments in cloud are accelerating:
● Executive mandate: minimize on-prem datacenter
footprint
● Data provenance and data gravity
● Increased agility: end-user self-service
● Elasticity: optimize infrastructure usage
● Perceived lower overall TCO
What’s driving data to Cloud and Hybrid Cloud?
Enterprise customers using cloud for big data analytics
25© Cloudera, Inc. All rights reserved.
Hadoop Expertise
◆ Most committers
◆ World-class innovation
◆ Enterprise-class stack
◆ Granular data security +
governance
◆ Best support, services, training
Flexible Deployments
◆ No vendor lock-in
◆ Multi-cloud and on-prem
◆ Transient and long-lived
clusters
Superior security
Security separation from
infrastructure leads to greater
choice
Flexible Pricing
◆ Pay-as-you-go cloud usage
◆ Traditional node-based
licensing
Why Cloudera in the Cloud: fast, easy, secure
CDH is the most deployed distro in the cloud
26© Cloudera, Inc. All rights reserved.
• Redshift “General Purpose Schema” - less modeled schema for general-purpose usage
• Redshift “Fixed Reporting” – fixed-purpose schema tuned for this specific test workload
Exploratory BI can be
slow on Redshift
Impala 4-10x faster than Redshift General Purpose
Impala 42-90% faster than Redshift Fixed Reporting
More Performant: Impala SQL on both EBS & S3
Multi-user queries
27© Cloudera, Inc. All rights reserved.
• Redshift “General Purpose Schema” - schema for general-purpose usage
• Redshift “Fixed Reporting” – fixed-purpose schema tuned for this specific test workload
Impala >200% cheaper than Redshift General Purpose
Impala 8-28% cheaper than Redshift Fixed Reporting
Exploratory BI can be
expensive on Redshift
More cost effective: Impala SQL on both EBS & S3
ETL + Multi-user queries
28© Cloudera, Inc. All rights reserved.
During the Rio Summer Olympics,
delivered 2.7 million emails across
108 campaigns that were triggered by
audience behavior with 29% increase
in ave minutes streamed.
•Enables varied & complex data to be
stored for highly variable events
•Provides extreme flexibility:
Provisioned extra nodes a wk before
games
•Data was used in 7 Olympic App,
and to track content on 200
variables to run email campaigns
CUSTOMER 360
Audience Engagement in
the Cloud
29© Cloudera, Inc. All rights reserved.
Crunching 1,000+ Business Metrics
per Customer with Sub-Second
Responses
•Enables granular targeting of
customers
•50% reduction in marketing cost
execution at one
•Stores & processes 1000s of
critical events at scale & low cost
•Provides flexibility, agility to
support customer needs with
Cloudera on Amazon Web
Services and on premises
CUSTOMER 360
Customer 360° in the
Cloud
30© Cloudera, Inc. All rights reserved.
Preventative Maintenance
• To improve traveler
satisfaction and safety, a
European needed to reduce
downtime for critical
operational machines
• Cloudera Enterprise on Azure
captures and correlates
sensor data with
transactional data to
proactively assess the health
of its machines and deliver
necessary fixes to prevent
failure
Flying Safer
31© Cloudera, Inc. All rights reserved.
• Collect and analyze data from
from thousands of diverse
manufacturing systems in
real-time
• iTrak application using
Cloudera on Azure to monitor
the performance of
individual manufacturing
systems in real-time
• Predictive Maintenance -
Proactively identifying &
fixing issues before they
break
Industrial IoT
32© Cloudera, Inc. All rights reserved.
• 30 Billion events/data in
Market graph database
built on Cloudera & AWS
• Rapid, interactive access
to 2+ years’ data
• Operational efficiencies
resulting from the
platform’s scalability result
in net annual savings of
$20 million
Financial
Compliance
33© Cloudera, Inc. All rights reserved.
Best Practice to Successful
Hadoop Adoption
By 2017,
Gartner “Predicts 2015: Big Data Challenges Move From Technology to the
Organization” – November 2014
of big data projects will fail
to go beyond the pilot phase60%
34© Cloudera, Inc. All rights reserved.
Our Most Successful Customers do these Five Things
1. Build a Big Data Culture
Led by an enabled executive sponsor(s). Communication methodologies. Advocating change.
2. Assemble the right team
Tightly aligned team. Mix of seasoned experts and innovators
3. Become lean and iterative for data engineering, data science, analysis
Successful projects start small, fail often and iterate to success approach. Roadmaps:
Document expected direction, yet expect insights to create change
4. Efficiently operationalize insights
Analytics -> Reports, Big Data -> Actions. Create a bridge between Dev and Ops
5. Govern the Data
Rightsize and iteratively building towards maturity.
35© Cloudera, Inc. All rights reserved.
Get Data
Explore
and Analyze
Deploy
1. Get data you already have, or
create new data.
2. Explore and analyze, quickly.
3. Deploy your application.
…and repeat. Add:
More data, more users, more use cases,
more complex analytics; go real-time!
Think Big.
Start Small.
Iterate to Success
36© Cloudera, Inc. All rights reserved.
Product
Innovation
Open Source,
Open Standards
Training
Services
Customer
Success
Proactive,
Predictive
SupportPartner
Ecosystem
Cloudera is your global Big Data Partner
37© Cloudera, Inc. All rights reserved.
Getting started is easy. Then iterative to success.
① ②
Download or Deploy
in the Cloud
Signup for Training Contact us or a Partner
to Start a Pilot Project
③
38© Cloudera, Inc. All rights reserved.
Thank you
@ImAmyO
AmyO@cloudera.com

More Related Content

Viewers also liked

The Connected Consumer – Real-time Customer 360
The Connected Consumer – Real-time Customer 360The Connected Consumer – Real-time Customer 360
The Connected Consumer – Real-time Customer 360Capgemini
 
How to build an effective omni-channel CRM & Marketing Strategy & 360 custome...
How to build an effective omni-channel CRM & Marketing Strategy & 360 custome...How to build an effective omni-channel CRM & Marketing Strategy & 360 custome...
How to build an effective omni-channel CRM & Marketing Strategy & 360 custome...Comarch
 
Connected Banking Framework
Connected Banking FrameworkConnected Banking Framework
Connected Banking FrameworkKashif Akram
 
CMA Summit 2012
CMA  Summit 2012CMA  Summit 2012
CMA Summit 2012Delvinia
 
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)Guido Schmutz
 
Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017StampedeCon
 
Apache Kafka Scalable Message Processing and more!
Apache Kafka Scalable Message Processing and more! Apache Kafka Scalable Message Processing and more!
Apache Kafka Scalable Message Processing and more! Guido Schmutz
 
ANTS - 360 view of your customer - bigdata innovation summit 2016
ANTS - 360 view of your customer - bigdata innovation summit 2016ANTS - 360 view of your customer - bigdata innovation summit 2016
ANTS - 360 view of your customer - bigdata innovation summit 2016Dinh Le Dat (Kevin D.)
 
A Customer-Centric Banking Platform Powered by MongoDB
A Customer-Centric Banking Platform Powered by MongoDB A Customer-Centric Banking Platform Powered by MongoDB
A Customer-Centric Banking Platform Powered by MongoDB MongoDB
 
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE) Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE) Guido Schmutz
 
Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360Cloudera, Inc.
 
Data Driven-Toyota Customer 360 Insights on Apache Spark and MLlib-(Brian Kur...
Data Driven-Toyota Customer 360 Insights on Apache Spark and MLlib-(Brian Kur...Data Driven-Toyota Customer 360 Insights on Apache Spark and MLlib-(Brian Kur...
Data Driven-Toyota Customer 360 Insights on Apache Spark and MLlib-(Brian Kur...Spark Summit
 

Viewers also liked (13)

The Connected Consumer – Real-time Customer 360
The Connected Consumer – Real-time Customer 360The Connected Consumer – Real-time Customer 360
The Connected Consumer – Real-time Customer 360
 
How to build an effective omni-channel CRM & Marketing Strategy & 360 custome...
How to build an effective omni-channel CRM & Marketing Strategy & 360 custome...How to build an effective omni-channel CRM & Marketing Strategy & 360 custome...
How to build an effective omni-channel CRM & Marketing Strategy & 360 custome...
 
Connected Banking Framework
Connected Banking FrameworkConnected Banking Framework
Connected Banking Framework
 
CMA Summit 2012
CMA  Summit 2012CMA  Summit 2012
CMA Summit 2012
 
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
 
Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017
 
Apache Kafka Scalable Message Processing and more!
Apache Kafka Scalable Message Processing and more! Apache Kafka Scalable Message Processing and more!
Apache Kafka Scalable Message Processing and more!
 
ANTS - 360 view of your customer - bigdata innovation summit 2016
ANTS - 360 view of your customer - bigdata innovation summit 2016ANTS - 360 view of your customer - bigdata innovation summit 2016
ANTS - 360 view of your customer - bigdata innovation summit 2016
 
A Customer-Centric Banking Platform Powered by MongoDB
A Customer-Centric Banking Platform Powered by MongoDB A Customer-Centric Banking Platform Powered by MongoDB
A Customer-Centric Banking Platform Powered by MongoDB
 
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE) Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
Customer Event Hub – a modern Customer 360° view with DataStax Enterprise (DSE)
 
Solution Blueprint - Customer 360
Solution Blueprint - Customer 360Solution Blueprint - Customer 360
Solution Blueprint - Customer 360
 
Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360Using Big Data to Drive Customer 360
Using Big Data to Drive Customer 360
 
Data Driven-Toyota Customer 360 Insights on Apache Spark and MLlib-(Brian Kur...
Data Driven-Toyota Customer 360 Insights on Apache Spark and MLlib-(Brian Kur...Data Driven-Toyota Customer 360 Insights on Apache Spark and MLlib-(Brian Kur...
Data Driven-Toyota Customer 360 Insights on Apache Spark and MLlib-(Brian Kur...
 

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

More from Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 

Recently uploaded

EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noidabntitsolutionsrishis
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...OnePlan Solutions
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfLivetecs LLC
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentationvaddepallysandeep122
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
Best Web Development Agency- Idiosys USA.pdf
Best Web Development Agency- Idiosys USA.pdfBest Web Development Agency- Idiosys USA.pdf
Best Web Development Agency- Idiosys USA.pdfIdiosysTechnologies1
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 

Recently uploaded (20)

Advantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your BusinessAdvantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your Business
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdf
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentation
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
Best Web Development Agency- Idiosys USA.pdf
Best Web Development Agency- Idiosys USA.pdfBest Web Development Agency- Idiosys USA.pdf
Best Web Development Agency- Idiosys USA.pdf
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 

Big Value with Big Customer Data

  • 1. 1© Cloudera, Inc. All rights reserved. Amy O’Connor Big Data Evangelist / Business Value Enablement Big Value with Big Customer Data
  • 2. 2© Cloudera, Inc. All rights reserved. Real World Impact The great value of data Top Cancer Research Institutions Working to Cure Cancer Rocket Science Thorn Destroying Human Trafficking Networks Manning the Orion spacecraft as it orbits the earth
  • 3. 3© Cloudera, Inc. All rights reserved. Cloudera Fast Facts: An Innovative Technology Company 2008 Founded by former employees of 2009 First commercial Hadoop product 1200+ Employees 2300+ Partners ~$1B Investment
  • 4. 4© Cloudera, Inc. All rights reserved. Google 1999: Indexing the Web
  • 5. 5© Cloudera, Inc. All rights reserved. The Original Inspirations for Hadoop 2003 2004
  • 6. 6© Cloudera, Inc. All rights reserved. 2006 Core Hadoop (HDFS, MapReduce) The Beginning: Building Hadoop
  • 7. 7© Cloudera, Inc. All rights reserved. 2006 2008 2009 2010 2011 2012 2013 HBase ZooKeeper Solr Pig Core Hadoop Hive Mahout HBase ZooKeeper Solr Pig Core Hadoop Sqoop Avro Hive Mahout HBase ZooKeeper Solr Pig Core Hadoop Flume Bigtop Oozie HCatalog Hue Sqoop Avro Hive Mahout HBase ZooKeeper Solr Pig YARN Core Hadoop Spark Tez Impala Kafka Drill Flume Bigtop Oozie HCatalog Hue Sqoop Avro Hive Mahout HBase ZooKeeper Solr Pig YARN Core Hadoop Parquet Sentry Spark Tez Impala Kafka Drill Flume Bigtop Oozie HCatalog Hue Sqoop Avro Hive Mahout HBase ZooKeeper Solr Pig YARN Core Hadoop 2007 Solr Pig Core Hadoop Knox Flink Parquet Sentry Spark Tez Impala Kafka Drill Flume Bigtop Oozie HCatalog Hue Sqoop Avro Hive Mahout HBase ZooKeeper Solr Pig YARN Core Hadoop 2014 2015 Kudu RecordService Ibis Falcon Knox Flink Parquet Sentry Spark Tez Impala Kafka Drill Flume Bigtop Oozie HCatalog Hue Sqoop Avro Hive Mahout HBase ZooKeeper Solr Pig YARN Core Hadoop Core Hadoop (HDFS, MapReduce) A Decade of Hadoop
  • 8. 8© Cloudera, Inc. All rights reserved. Our relationship with data is changing. Hadoop Technology enables new ways of working.
  • 9. 9© Cloudera, Inc. All rights reserved. Requirements necessary to drive value from data 1. Economically feasible to store more data 2. Powered to predictably process large data sets 3. Ability to build your data asset at linear scale 4. Collect data in native format – enables agility 5. Build history of activity by collecting data prior to its use 6. You can have near real-time access to data, plus a view of history 7. Security at the data layer increases flexibility and ability to protect privacy 8. Create community data and use machine learning to drive innovation Extreme performance and efficiency Analytic agility
  • 10. 10© Cloudera, Inc. All rights reserved. Merging real-time & archived data Structured with unstructured External and internal sources Data stays where it’s born Not all can be in the cloud Partnerships with Amazon, Microsoft & Google Native Encryption Access Control Data Governance Regulatory Compliance Advanced Analytics Hybrid Cloud Data SecurityMulti-Workload Batch Computation Interactive SQL Machine Learning Stream Processing Search In-memory The ground has shifted, from “Storage + Compute” to:
  • 11. 11© Cloudera, Inc. All rights reserved. Our relationship with data is changing. From balance to blend: personal & professional lives. Understanding the customer journey
  • 12. 12© Cloudera, Inc. All rights reserved. How many here sleep with your smartphone? • One in three Australians sleep with their smartphones Source: Deloitte: Mobile Consumer Survey 2015 – The Australian Cut From balance to blend: Personal & Professional Lives • More than half the population checks their smartphone within 15 minutes of waking • More than 88% of Australians use their smartphones when talking to friends and 92% of Australians use their smartphones at work (92%)
  • 13. 13© Cloudera, Inc. All rights reserved. From balance to blend: Personal & Professional Lives Source: September 10, 2014|by Tasha Keeney, ARK Analyst|Devices / Gateways Tablet Smartphone Internet TV 70+ Telco / Internet Providers 286PB Data
  • 14. 14© Cloudera, Inc. All rights reserved. • New insights into customer behavior, abandoned online shopping cart behavior with unified customer data • Marketing spend optimization through channel attribution analysis • Improve supply chain and reduce inventory costs • Improved ability to predict returns DRIVE CUSTOMER INSIGHTS The Customer Journey
  • 15. 15© Cloudera, Inc. All rights reserved. • Leveraging EDH and predictive modeling, help clients optimize market, channel and offer. • With their solution, customers can auto serve offers tailored by consumer behavior, preference and transaction history. • Digital Alchemy customers include Virgin Mobil, Spark, RACQ, ASB and Rabbit Rewards. DRIVE CUSTOMER INSIGHTS Right Time, Right Offer, Right Channel
  • 16. 16© Cloudera, Inc. All rights reserved. Data and the Sharing Economy “Everything that we do in engineering is about creating great matches between people”. Machine Learning drove up booking rates by 4% - with first experiment.
  • 17. 17© Cloudera, Inc. All rights reserved. Our relationship with data is changing. From separate to converged digital and physical worlds. Building better products & services • Internet of Things (IoT) • Smart Cities • Augmented Reality • Virtual Reality • Precision Medicine • Precision Energy • Automated Logistics • 3D printing
  • 18. 18© Cloudera, Inc. All rights reserved. Source- Ovum: Understanding the IoT Opportunity: An Industry Perspective - 2015 Building better products with data from IoT
  • 19. 19© Cloudera, Inc. All rights reserved. • Deeper analytics from customer profile data • 50% increase in customer retention while 2x increase in policies issued • Ability to analyze 10s of millions of quotes in under a minute • $5 million claim cost reduction through fraud prevention Connected cars
  • 20. 20© Cloudera, Inc. All rights reserved. Moving people Discount people’s previous experience but put a heavy premium on their ability to solve the problems that your business has. Heavy emphaisis on the combination of creative and analytical skill sets.
  • 21. 21© Cloudera, Inc. All rights reserved. • Virtuous cycle: Identify features that facilitate sharing of content that drive new customers • Real-time streaming and batch data from product logs, web analytics, channel data and ERP •Impala connects to third-party data wrangling and BI tools for fast reporting Sharing entertainment experiences
  • 22. 22© Cloudera, Inc. All rights reserved. • Monitor the health of 180,000+ trucks in real-time : • OnCommand Connection collecting telematics and geolocation data across thousands of trucks • Identify and correct engine problems early, and increase fleet uptime • Reduced maintenance costs to $.03 per mile from $.12- $.15 per mile Connected logistics
  • 23. 23© Cloudera, Inc. All rights reserved. The IT world is changing. Disaggregated, distributed On-prem, hybrid, cloud Fast, easy, secure
  • 24. 24© Cloudera, Inc. All rights reserved. Hadoop deployments in cloud are accelerating: ● Executive mandate: minimize on-prem datacenter footprint ● Data provenance and data gravity ● Increased agility: end-user self-service ● Elasticity: optimize infrastructure usage ● Perceived lower overall TCO What’s driving data to Cloud and Hybrid Cloud? Enterprise customers using cloud for big data analytics
  • 25. 25© Cloudera, Inc. All rights reserved. Hadoop Expertise ◆ Most committers ◆ World-class innovation ◆ Enterprise-class stack ◆ Granular data security + governance ◆ Best support, services, training Flexible Deployments ◆ No vendor lock-in ◆ Multi-cloud and on-prem ◆ Transient and long-lived clusters Superior security Security separation from infrastructure leads to greater choice Flexible Pricing ◆ Pay-as-you-go cloud usage ◆ Traditional node-based licensing Why Cloudera in the Cloud: fast, easy, secure CDH is the most deployed distro in the cloud
  • 26. 26© Cloudera, Inc. All rights reserved. • Redshift “General Purpose Schema” - less modeled schema for general-purpose usage • Redshift “Fixed Reporting” – fixed-purpose schema tuned for this specific test workload Exploratory BI can be slow on Redshift Impala 4-10x faster than Redshift General Purpose Impala 42-90% faster than Redshift Fixed Reporting More Performant: Impala SQL on both EBS & S3 Multi-user queries
  • 27. 27© Cloudera, Inc. All rights reserved. • Redshift “General Purpose Schema” - schema for general-purpose usage • Redshift “Fixed Reporting” – fixed-purpose schema tuned for this specific test workload Impala >200% cheaper than Redshift General Purpose Impala 8-28% cheaper than Redshift Fixed Reporting Exploratory BI can be expensive on Redshift More cost effective: Impala SQL on both EBS & S3 ETL + Multi-user queries
  • 28. 28© Cloudera, Inc. All rights reserved. During the Rio Summer Olympics, delivered 2.7 million emails across 108 campaigns that were triggered by audience behavior with 29% increase in ave minutes streamed. •Enables varied & complex data to be stored for highly variable events •Provides extreme flexibility: Provisioned extra nodes a wk before games •Data was used in 7 Olympic App, and to track content on 200 variables to run email campaigns CUSTOMER 360 Audience Engagement in the Cloud
  • 29. 29© Cloudera, Inc. All rights reserved. Crunching 1,000+ Business Metrics per Customer with Sub-Second Responses •Enables granular targeting of customers •50% reduction in marketing cost execution at one •Stores & processes 1000s of critical events at scale & low cost •Provides flexibility, agility to support customer needs with Cloudera on Amazon Web Services and on premises CUSTOMER 360 Customer 360° in the Cloud
  • 30. 30© Cloudera, Inc. All rights reserved. Preventative Maintenance • To improve traveler satisfaction and safety, a European needed to reduce downtime for critical operational machines • Cloudera Enterprise on Azure captures and correlates sensor data with transactional data to proactively assess the health of its machines and deliver necessary fixes to prevent failure Flying Safer
  • 31. 31© Cloudera, Inc. All rights reserved. • Collect and analyze data from from thousands of diverse manufacturing systems in real-time • iTrak application using Cloudera on Azure to monitor the performance of individual manufacturing systems in real-time • Predictive Maintenance - Proactively identifying & fixing issues before they break Industrial IoT
  • 32. 32© Cloudera, Inc. All rights reserved. • 30 Billion events/data in Market graph database built on Cloudera & AWS • Rapid, interactive access to 2+ years’ data • Operational efficiencies resulting from the platform’s scalability result in net annual savings of $20 million Financial Compliance
  • 33. 33© Cloudera, Inc. All rights reserved. Best Practice to Successful Hadoop Adoption By 2017, Gartner “Predicts 2015: Big Data Challenges Move From Technology to the Organization” – November 2014 of big data projects will fail to go beyond the pilot phase60%
  • 34. 34© Cloudera, Inc. All rights reserved. Our Most Successful Customers do these Five Things 1. Build a Big Data Culture Led by an enabled executive sponsor(s). Communication methodologies. Advocating change. 2. Assemble the right team Tightly aligned team. Mix of seasoned experts and innovators 3. Become lean and iterative for data engineering, data science, analysis Successful projects start small, fail often and iterate to success approach. Roadmaps: Document expected direction, yet expect insights to create change 4. Efficiently operationalize insights Analytics -> Reports, Big Data -> Actions. Create a bridge between Dev and Ops 5. Govern the Data Rightsize and iteratively building towards maturity.
  • 35. 35© Cloudera, Inc. All rights reserved. Get Data Explore and Analyze Deploy 1. Get data you already have, or create new data. 2. Explore and analyze, quickly. 3. Deploy your application. …and repeat. Add: More data, more users, more use cases, more complex analytics; go real-time! Think Big. Start Small. Iterate to Success
  • 36. 36© Cloudera, Inc. All rights reserved. Product Innovation Open Source, Open Standards Training Services Customer Success Proactive, Predictive SupportPartner Ecosystem Cloudera is your global Big Data Partner
  • 37. 37© Cloudera, Inc. All rights reserved. Getting started is easy. Then iterative to success. ① ② Download or Deploy in the Cloud Signup for Training Contact us or a Partner to Start a Pilot Project ③
  • 38. 38© Cloudera, Inc. All rights reserved. Thank you @ImAmyO AmyO@cloudera.com