SlideShare a Scribd company logo
1 of 35
1
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019© Eckerson Group 2018 www.eckerson.com
WELCOME
Modernizing the Legacy Data Warehouse:
What, Why, and How
Dave Wells
Advisory Consultant
Eckerson Group
Eva Nahari
Director of Product Management
Cloudera
Sponsored & Hosted By
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019© Eckerson Group 2018 www.eckerson.com
Modernizing the Legacy Data Warehouse:
What, Why, and How
Dave Wells
dwells@eckerson.com
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Is the Data Warehouse Dead?
4
4
"The EDW is dead. Period.
Like a dodo. Like Monty
Python's parrot.”
Philip Howard
Bloor Research
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Is the Data Warehouse Dead?
5
How many data warehouses does your organization have?
2 – 5
(62%)
only 1
(8%)
6 or more
(28.3%)
none
(1.7%)
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
The Challenges of Legacy Data Warehousing
6
Scalability Grow with increasing data volume, users, and use cases
Elasticity Dynamically adapt to workload volatility and fluctuation
Data Variety Work well with differently structured (non-relational) data sources
Data Latency Satisfy demand for real-time and near-real-time data
Data Velocity Work well with streaming data sources
Adaptability Quickly adjust to changes without complex data model refactoring
Architectural Fit Complement and interoperate with data lake
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Data Warehousing in the Cloud
7
Data
Storage
Metadata
Management
Much more than just data storage
Fast, efficient loading, bulk and
Incremental, stream processing
Integrate, aggregate, standardize …
fast and scalable processing
Query optimizers a must … hybrid
relational & non-relational is valuable
Monitoring, tuning, configuration,
Prioritizing, workload balancing
Business, process, and technical
metadata … lineage, quality, etc.
Data sensitivity, PII, data privacy …
work with existing security infrastructure
Preventing loss and corruption …
backups, logging, replication, etc.
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Data Warehousing in the Cloud
8
STRENGTHS
Business impact?
Application of technology?
Advanced skills?
Expertise and specialized knowledge?
Most experienced people?
Innovative solutions?
WEAKNESSES
Business value?
User satisfaction?
Resources?
Tools and technology?
Knowledge and skills?
Workload vs. capacity?
OPPORTUNITIES
Business goals and needs?
Advancing BI and analytics?
User wants and expectations?
Technology utilization?
Reach into and across the business?
Related projects & initiatives?
THREATS
Obstacles and barriers?
Stakeholder engagement & participation?
Technological uncertainties?
Data & process uncertainties?
Scheduling & critical business events?
Competing projects & initiatives?
Should you migrate to cloud?
SWOT once for status quo.
SWOT again for migration.
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
ContinuousPlanning
Initial Planning
Data Warehousing in the Cloud
9
Step-by-step migration
Migration
Technology Selection
Migration Strategy
Architectural Assessment
Business Caseincrementalmigration
Scope, Timing, Resources, Schedule,
User Transparency, Testing Plan
Drivers, Costs, Benefits, Risk of Migrating,
Risk of Not Migrating
Reliability, Availability, Performance,
Scalability, Adaptability, Maintainability
Lift and Shift or Incremental by Workload,
Workload Breakdown and Priorities
Cloud Data Warehousing Platform,
Migration Tools
Schema, Data, Process, Metadata,
Users and Applications
Testing and Operationalization
Function Test, Performance Test, DQ Audit,
Scheduling, Monitoring, Support
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Analysis
Dashboards
Scorecards
OLAP
Reporting
Applications
Legacy Data
OLTP Data
External Data
Sources
Data Warehouses
Master Data Repository
Operational Data Store
Data Management
PublishETL
Data Warehousing and Modern Data Architecture
10
Legacy Data Warehousing Architecture
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Data Warehousing and Modern Data Architecture
11
Automation
Prescription
Prediction
Forecasting
Discovery
Exploration
Dashboards
Scorecards
OLAP
Reporting
Web
Open
Commercial
Social Media
Machine / IoT
Geospatial
Legacy
OLTP
External
Data Warehouses
Master Data Repository
Operational Data Store
Data Lake
Analytic Sandboxes
ApplicationsSources Data Management
Modern Data Management Architecture
© David L. Wells
Data PipelinesData PipelinesData PipelinesData PipelinesData Pipelines
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Data Warehousing and Modern Data Architecture
12
Automation
Prescription
Prediction
Forecasting
Discovery
Exploration
Dashboards
Scorecards
OLAP
Reporting
Web
Open
Commercial
Social Media
Machine / IoT
Geospatial
Legacy
OLTP
External
Data Warehouses
Master Data Repository
Operational Data Store
Data Lake
Analytic Sandboxes
ApplicationsSources Data Management
Report
Consumers
Data
Analysts
Business
Analysts
Data
Scientists
Apps and
Algorithms
More sources & types More ways to organize and store data More uses for data More consumers
© David L. Wells
Data PipelinesData PipelinesData PipelinesData PipelinesData Pipelines
More data flow and processing
More data pipelines and more complex data pipelines
Fast and on-demand data delivery
More Components / More Complexity
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Data Warehousing and the Data Lake
13
DataAccess&DataPreparation
withsecurity&governancecontrols
Reporting
OLAP
Scorecards
Dashboards
Exploration
Analytics
Applications
Legacy
Transaction
Web
3rd Party
Social Media
Machine
Geospatial
Sources
DataIngestion
ETL,ELT,BulkLoad,StreamProcessing
landing area for incoming data
raw data, refined data & sandboxes
security, sensitivity, and semantic tagging
classified by trust level (gold, silver, bronze)
relational, subject-oriented, historical
bus or hub-and-spoke architecture
integrated, cleansed, aggregated
includes master reference & metrics data
ETL/ELT Data Refinement
Data Lake
Data Warehouse
Data Warehouse Outside the Data Lake
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Data Warehousing and the Data Lake
14
Data Warehouse Inside the Data Lake
Data Lake
DataAccess&DataPreparation
withsecurity&governancecontrols
Reporting
OLAP
Scorecards
Dashboards
Exploration
Analytics
Applications
Legacy
Transaction
Web
3rd Party
Social Media
Machine
Geospatial
Sources
DataIngestion
ETL,ELT,BulkLoad,StreamProcessing
Raw Data Zone
ingest & lightly tag
Analytic Sandboxes
explore & discover
Refined Data Zone
curate, improve & fully tag
Data Warehouse
integrate & aggregate
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Data Warehousing and the Data Lake
15
Data Warehouse In Front of the Data Lake
Scorecards
Dashboards
Exploration
Analytics
Applications
Sources
Legacy
Transaction
Web
3rd Party
Social Media
Machine
Geospatial
subject-oriented
integrated
non-volatile
time-variant
Sources
DataIngestion
ETL,ELT,BulkLoad,StreamProcessing
Raw Data Zone
ingest & lightly tag
Analytic Sandboxes
explore & discover
Refined Data Zone
curate, improve & fully tag
Data Warehouse(s)
Data Lake
Reporting
OLAP
Applications
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Data Warehousing and the Data Lake
16
Multi-Warehouse Hybrid
Data Lake
DataAccess&DataPreparation
withsecurity&governancecontrols
ApplicationsSources
DataIngestion
ETL,ELT,BulkLoad,StreamProcessing
Raw Data Zone
ingest & lightly tag
Analytic Sandboxes
explore & discover
Refined Data Zone
curate, improve & fully tag
Data Warehouse A
integrate & aggregate
Web
Open
Commercial
Social Media
Machine / IoT
Geospatial
Legacy
OLTP
External
Automation
Prescription
Prediction
Forecasting
Discovery
Exploration
Dashboards
Scorecards
OLAP
Reporting
relational, subject-oriented, historical
bus or hub-and-spoke architecture
integrated, cleansed, aggregated
includes master reference & metrics data
ETL/ELT Data Refinement
Data Warehouse B
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Getting Started with Data Warehouse Modernization
17
Know Your Modernization Goals
✓ Complete, cohesive analytics ecosystem
✓ Complete, cohesive data management architecture
✓ Governed self-service with curated data
✓ Maximum reusability and reuse
✓ Maximum flexibility and agility
✓ Data for all data consumers
✓ Technological updates
✓ Architectural updates
✓ Automation opportunities and process optimization
✓ Deployment independence
© Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019
Getting Started with Data Warehouse Modernization
18
Plan and Execute
Assess
the
Current
State
Define
the
Future
State
Choose
The
Patterns
Look to
the
Future
Expect and
Prepare
for Change
Execute
One Step
at a Time
19
CLOUDERA DATA WAREHOUSE
Eva Nahari, Director, Product Management, Cloudera Data Warehouse
January, 2019
21 © Cloudera, Inc. All rights reserved.
NEW TRENDS IN DATA WAREHOUSING
Deeper Business Insights at Extreme Speed and Scale While Managing Cost
DEEPER
business insights
EXTREME
speed & scale
CONTROLLED
resources & costs
22 © Cloudera, Inc. All rights reserved.
NEW TRENDS IN DATA WAREHOUSING
Deeper Business Insights
Protect
● Proactive Fraud Prevention
● Keep up with Regulatory
Compliance
● Preempt Cyberthreats
Real-time response on
massive data volume
and variety
Optimize
● Improve Operational
Efficiency
● Support Internet of Things
(IoT)
New analytics
techniques
democratized to all
users
Grow
● Customer Sentiment
● Fault Prevention
● Improve Product Quality
● New Revenue Streams
Experimentation and
collaboration at scale
23 © Cloudera, Inc. All rights reserved.
NEW TRENDS IN DATA WAREHOUSING
Extreme Speed and Scale
More Data
● Massive amounts handled
faster at scale
● More variety from new
sources (social media, IoT)
● Insight within minutes of
new data arrival
Performance and
flexibility at scale
More Workloads
● 100’s of production grade
deployments
● Enterprise grade
dependability
● Strict security and
governance
On-demand scale out,
discovery,
collaboration
More People
● 1,000’s of new users and
new user types
● 1,000’s of new use cases
● All skill levels: Analytics,
Data Science, and Machine
Learning
All workloads with a
shared data
experience
24 © Cloudera, Inc. All rights reserved.
NEW TRENDS IN DATA WAREHOUSING
Managing Resources and Costs
Optimize Core
Processes
● Automation to reduce
pressure on organizational
bottlenecks
● Consistent user experience
Broaden data reach
without increasing IT
burden or costs
Self-Service
Everything
● Resource provisioning
● Workload development
● Optimizing and
troubleshooting
Deliver on increased
SLA pressures
without runaway cost
Dynamic Consumption
● Transient Workloads
● Short-lived Workloads
● Permanent Workloads
● Public, Private, Hybrid Cloud
Environmental
flexibility and adaptive
compute, storage
© Cloudera, Inc. All rights reserved. 25
Quickly enable business analytics by sharing petabytes of verified
data
across thousands of users while surpassing demands of SLAs and
costs
26 © Cloudera, Inc. All rights reserved.
TRADITIONAL DATA WAREHOUSE:
Structured Data
Sources
(ERP, CRM, SCM)
Transformations
EDW
Advanced
Analytics
Dashboards
Ad Hoc
Canned
Reports
Staging
Data Marts
Many Months
Master Schema
ETLODS
2 3
4
1 5
Struggle to handle volume
and variety
Limited
access
27 © Cloudera, Inc. All rights reserved.
MODERN DATA WAREHOUSE
Advanced
Analytics
Dashboards
Ad Hoc
Canned
Reports
Data Store
Within Days
Data Marts
1
2
Ingest & Store all data
at scale
Self-serve / On-
demand
Variety of data
sources/types
28 © Cloudera, Inc. All rights reserved.
WHAT CONCEPTS SURVIVE?
Data Modeling Security &
Governance
Reports & Dashboards
29 © Cloudera, Inc. All rights reserved.
WHAT HAS CHANGED?
Traditional DW Modern DW
Supporting Role Foundational Role
Primarily Internal Internal & External
Constrained, Structured
Freeform,
Multi-Structured
Planned ETLs On-Demand Pipelines
Users
Data Exploration
Data Curation
Data & Analytics
30 © Cloudera, Inc. All rights reserved.
WHAT IS NEW?
Experimentation
& Collaboration
Dynamic
Consumption
Self Service
Everything
31 © Cloudera, Inc. All rights reserved.
CLOUDERA MODERN DATA WAREHOUSE
The modern platform for machine learning and analytics optimized for the cloud
Object Store
S3, ADLS
Shared File
Storage
Time Series
Data Store
SECURITY GOVERNANCE
WORKLOAD
MANAGEMENT
INGEST &
REPLICATION
DATA CATALOG
Core
Services
Storage
Services
ANALYTICSDATA
SCIENCE
EXTENSIBLE
SERVICES
OPERATIONAL
DATABASE
DATA
ENGINEERING
32 © Cloudera, Inc. All rights reserved.
CLOUD NATIVE OPTION
● Quick time to value - no software or
clusters to manage
● Bring warehouse to the data with zero
copy simplicity
● Use your security policies with your
data - no proprietary stacks
● Apply enterprise governance to
transient workloads
● Shared data experience with SDX
● Optimized for Azure & AWS
DATA
WAREHOUSE
GOVERNANC
E
SECURITY
CONTROL
PLANE
LIFECYCLE
MANAGEMENT
MULTI-CLOUD
Amazon
S3
Microsoft
ADLS
MULTI-CLOUD PAAS SOLUTION
33 © Cloudera, Inc. All rights reserved.
TD BANK: Delivering “Legendary Customer Experience”
CHALLENGES
Significantly improve customer
experience with sentiment
analysis, behavioral patterns,
and predictive modeling
Current system couldn’t handle:
• Centralizing data from
thousands of sources
• Demands from increased
users and use cases
• Data cost and manageability
at scale
RESULTS
• 30% reduction in repeat
customer complaints
• 90% productivity
improvement for analytics
projects
• 60% decrease in data
management costs
• 98% decrease in per TB
storage costs
SOLUTION
Modern Data Warehouse for
customer marketing, fraud
analytics and cybersecurity
• Ingest data from 100+
corporate systems
• Centralized data into “the
hands of those that need it
much more quickly”
• Significantly reduce storage
and management costs
https://www.cloudera.com/more/customers/td-bank.html
34 © Cloudera, Inc. All rights reserved.
CLOUDERA DW - PARTING THOUGHTS
Hybrid Optimized Shared Data ExperiencePerformance @Scale
Shared Data
Exponential Use Cases, Successful Outcomes
THANK YOU
https://www.cloudera.com/products/data-warehouse.html

More Related Content

What's hot

2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise AnalyticsDATAVERSITY
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data MeshLibbySchulze
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data EngineeringC4Media
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDatabricks
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...DATAVERSITY
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...HostedbyConfluent
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
 
Using Knowledge Graphs to Predict Customer Needs and Improve Quality
Using Knowledge Graphs to Predict Customer Needs and Improve QualityUsing Knowledge Graphs to Predict Customer Needs and Improve Quality
Using Knowledge Graphs to Predict Customer Needs and Improve QualityNeo4j
 
Introducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseIntroducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseSnowflake Computing
 
Intro to Delta Lake
Intro to Delta LakeIntro to Delta Lake
Intro to Delta LakeDatabricks
 
Master the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMaster the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMatillion
 
Big Data & Analytics for Government - Case Studies
Big Data & Analytics for Government - Case StudiesBig Data & Analytics for Government - Case Studies
Big Data & Analytics for Government - Case StudiesJohn Palfreyman
 
Data Modeling and Relational to NoSQL
 Data Modeling and Relational to NoSQL  Data Modeling and Relational to NoSQL
Data Modeling and Relational to NoSQL DATAVERSITY
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
 
Collibra - Forrester Presentation : Data Governance 2.0
Collibra - Forrester Presentation : Data Governance 2.0Collibra - Forrester Presentation : Data Governance 2.0
Collibra - Forrester Presentation : Data Governance 2.0Guillaume LE GALIARD
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeKent Graziano
 
Introduction to Knowledge Graphs: Data Summit 2020
Introduction to Knowledge Graphs: Data Summit 2020Introduction to Knowledge Graphs: Data Summit 2020
Introduction to Knowledge Graphs: Data Summit 2020Enterprise Knowledge
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationDATAVERSITY
 

What's hot (20)

2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data Mesh
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data Engineering
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Using Knowledge Graphs to Predict Customer Needs and Improve Quality
Using Knowledge Graphs to Predict Customer Needs and Improve QualityUsing Knowledge Graphs to Predict Customer Needs and Improve Quality
Using Knowledge Graphs to Predict Customer Needs and Improve Quality
 
Introducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseIntroducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data Warehouse
 
Intro to Delta Lake
Intro to Delta LakeIntro to Delta Lake
Intro to Delta Lake
 
Master the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMaster the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - Snowflake
 
Big Data & Analytics for Government - Case Studies
Big Data & Analytics for Government - Case StudiesBig Data & Analytics for Government - Case Studies
Big Data & Analytics for Government - Case Studies
 
Data Modeling and Relational to NoSQL
 Data Modeling and Relational to NoSQL  Data Modeling and Relational to NoSQL
Data Modeling and Relational to NoSQL
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
Collibra - Forrester Presentation : Data Governance 2.0
Collibra - Forrester Presentation : Data Governance 2.0Collibra - Forrester Presentation : Data Governance 2.0
Collibra - Forrester Presentation : Data Governance 2.0
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on Snowflake
 
Introduction to Knowledge Graphs: Data Summit 2020
Introduction to Knowledge Graphs: Data Summit 2020Introduction to Knowledge Graphs: Data Summit 2020
Introduction to Knowledge Graphs: Data Summit 2020
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital Transformation
 

Similar to Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19

Metadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI ModernizationMetadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI ModernizationEric Kavanagh
 
The Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationThe Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationEric Kavanagh
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationDatabricks
 
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderEmbedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderDataconomy Media
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...Precisely
 
What Is My Enterprise Data Maturity 2021
What Is My Enterprise Data Maturity 2021What Is My Enterprise Data Maturity 2021
What Is My Enterprise Data Maturity 2021DATAVERSITY
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Denodo
 
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini
 
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityDATAVERSITY
 
Driven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics PlatformDriven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics PlatformArne Roßmann
 
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...Enterprise Management Associates
 
Big Data for Product Managers
Big Data for Product ManagersBig Data for Product Managers
Big Data for Product ManagersPentaho
 
A Journey to a Serverless Business Intelligence, Machine Learning and Big Dat...
A Journey to a Serverless Business Intelligence, Machine Learning and Big Dat...A Journey to a Serverless Business Intelligence, Machine Learning and Big Dat...
A Journey to a Serverless Business Intelligence, Machine Learning and Big Dat...DataWorks Summit
 
Creating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use casesCreating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use casesFrank Vullers
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesDataWorks Summit
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubCloudera, Inc.
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 
Insights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesInsights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesDataWorks Summit
 
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...Denodo
 

Similar to Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19 (20)

Metadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI ModernizationMetadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI Modernization
 
The Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationThe Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data Integration
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
 
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderEmbedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern Staender
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...
 
What Is My Enterprise Data Maturity 2021
What Is My Enterprise Data Maturity 2021What Is My Enterprise Data Maturity 2021
What Is My Enterprise Data Maturity 2021
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
 
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with Cloudera
 
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
 
Driven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics PlatformDriven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics Platform
 
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
 
Big Data for Product Managers
Big Data for Product ManagersBig Data for Product Managers
Big Data for Product Managers
 
A Journey to a Serverless Business Intelligence, Machine Learning and Big Dat...
A Journey to a Serverless Business Intelligence, Machine Learning and Big Dat...A Journey to a Serverless Business Intelligence, Machine Learning and Big Dat...
A Journey to a Serverless Business Intelligence, Machine Learning and Big Dat...
 
Creating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use casesCreating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use cases
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management Challenges
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
Insights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesInsights into Real World Data Management Challenges
Insights into Real World Data Management Challenges
 
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
 

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

More from Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 

Recently uploaded

WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 

Recently uploaded (20)

WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 

Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19

  • 1. 1
  • 2. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019© Eckerson Group 2018 www.eckerson.com WELCOME Modernizing the Legacy Data Warehouse: What, Why, and How Dave Wells Advisory Consultant Eckerson Group Eva Nahari Director of Product Management Cloudera Sponsored & Hosted By
  • 3. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019© Eckerson Group 2018 www.eckerson.com Modernizing the Legacy Data Warehouse: What, Why, and How Dave Wells dwells@eckerson.com
  • 4. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Is the Data Warehouse Dead? 4 4 "The EDW is dead. Period. Like a dodo. Like Monty Python's parrot.” Philip Howard Bloor Research
  • 5. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Is the Data Warehouse Dead? 5 How many data warehouses does your organization have? 2 – 5 (62%) only 1 (8%) 6 or more (28.3%) none (1.7%)
  • 6. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 The Challenges of Legacy Data Warehousing 6 Scalability Grow with increasing data volume, users, and use cases Elasticity Dynamically adapt to workload volatility and fluctuation Data Variety Work well with differently structured (non-relational) data sources Data Latency Satisfy demand for real-time and near-real-time data Data Velocity Work well with streaming data sources Adaptability Quickly adjust to changes without complex data model refactoring Architectural Fit Complement and interoperate with data lake
  • 7. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Data Warehousing in the Cloud 7 Data Storage Metadata Management Much more than just data storage Fast, efficient loading, bulk and Incremental, stream processing Integrate, aggregate, standardize … fast and scalable processing Query optimizers a must … hybrid relational & non-relational is valuable Monitoring, tuning, configuration, Prioritizing, workload balancing Business, process, and technical metadata … lineage, quality, etc. Data sensitivity, PII, data privacy … work with existing security infrastructure Preventing loss and corruption … backups, logging, replication, etc.
  • 8. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Data Warehousing in the Cloud 8 STRENGTHS Business impact? Application of technology? Advanced skills? Expertise and specialized knowledge? Most experienced people? Innovative solutions? WEAKNESSES Business value? User satisfaction? Resources? Tools and technology? Knowledge and skills? Workload vs. capacity? OPPORTUNITIES Business goals and needs? Advancing BI and analytics? User wants and expectations? Technology utilization? Reach into and across the business? Related projects & initiatives? THREATS Obstacles and barriers? Stakeholder engagement & participation? Technological uncertainties? Data & process uncertainties? Scheduling & critical business events? Competing projects & initiatives? Should you migrate to cloud? SWOT once for status quo. SWOT again for migration.
  • 9. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 ContinuousPlanning Initial Planning Data Warehousing in the Cloud 9 Step-by-step migration Migration Technology Selection Migration Strategy Architectural Assessment Business Caseincrementalmigration Scope, Timing, Resources, Schedule, User Transparency, Testing Plan Drivers, Costs, Benefits, Risk of Migrating, Risk of Not Migrating Reliability, Availability, Performance, Scalability, Adaptability, Maintainability Lift and Shift or Incremental by Workload, Workload Breakdown and Priorities Cloud Data Warehousing Platform, Migration Tools Schema, Data, Process, Metadata, Users and Applications Testing and Operationalization Function Test, Performance Test, DQ Audit, Scheduling, Monitoring, Support
  • 10. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Analysis Dashboards Scorecards OLAP Reporting Applications Legacy Data OLTP Data External Data Sources Data Warehouses Master Data Repository Operational Data Store Data Management PublishETL Data Warehousing and Modern Data Architecture 10 Legacy Data Warehousing Architecture
  • 11. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Data Warehousing and Modern Data Architecture 11 Automation Prescription Prediction Forecasting Discovery Exploration Dashboards Scorecards OLAP Reporting Web Open Commercial Social Media Machine / IoT Geospatial Legacy OLTP External Data Warehouses Master Data Repository Operational Data Store Data Lake Analytic Sandboxes ApplicationsSources Data Management Modern Data Management Architecture © David L. Wells Data PipelinesData PipelinesData PipelinesData PipelinesData Pipelines
  • 12. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Data Warehousing and Modern Data Architecture 12 Automation Prescription Prediction Forecasting Discovery Exploration Dashboards Scorecards OLAP Reporting Web Open Commercial Social Media Machine / IoT Geospatial Legacy OLTP External Data Warehouses Master Data Repository Operational Data Store Data Lake Analytic Sandboxes ApplicationsSources Data Management Report Consumers Data Analysts Business Analysts Data Scientists Apps and Algorithms More sources & types More ways to organize and store data More uses for data More consumers © David L. Wells Data PipelinesData PipelinesData PipelinesData PipelinesData Pipelines More data flow and processing More data pipelines and more complex data pipelines Fast and on-demand data delivery More Components / More Complexity
  • 13. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Data Warehousing and the Data Lake 13 DataAccess&DataPreparation withsecurity&governancecontrols Reporting OLAP Scorecards Dashboards Exploration Analytics Applications Legacy Transaction Web 3rd Party Social Media Machine Geospatial Sources DataIngestion ETL,ELT,BulkLoad,StreamProcessing landing area for incoming data raw data, refined data & sandboxes security, sensitivity, and semantic tagging classified by trust level (gold, silver, bronze) relational, subject-oriented, historical bus or hub-and-spoke architecture integrated, cleansed, aggregated includes master reference & metrics data ETL/ELT Data Refinement Data Lake Data Warehouse Data Warehouse Outside the Data Lake
  • 14. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Data Warehousing and the Data Lake 14 Data Warehouse Inside the Data Lake Data Lake DataAccess&DataPreparation withsecurity&governancecontrols Reporting OLAP Scorecards Dashboards Exploration Analytics Applications Legacy Transaction Web 3rd Party Social Media Machine Geospatial Sources DataIngestion ETL,ELT,BulkLoad,StreamProcessing Raw Data Zone ingest & lightly tag Analytic Sandboxes explore & discover Refined Data Zone curate, improve & fully tag Data Warehouse integrate & aggregate
  • 15. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Data Warehousing and the Data Lake 15 Data Warehouse In Front of the Data Lake Scorecards Dashboards Exploration Analytics Applications Sources Legacy Transaction Web 3rd Party Social Media Machine Geospatial subject-oriented integrated non-volatile time-variant Sources DataIngestion ETL,ELT,BulkLoad,StreamProcessing Raw Data Zone ingest & lightly tag Analytic Sandboxes explore & discover Refined Data Zone curate, improve & fully tag Data Warehouse(s) Data Lake Reporting OLAP Applications
  • 16. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Data Warehousing and the Data Lake 16 Multi-Warehouse Hybrid Data Lake DataAccess&DataPreparation withsecurity&governancecontrols ApplicationsSources DataIngestion ETL,ELT,BulkLoad,StreamProcessing Raw Data Zone ingest & lightly tag Analytic Sandboxes explore & discover Refined Data Zone curate, improve & fully tag Data Warehouse A integrate & aggregate Web Open Commercial Social Media Machine / IoT Geospatial Legacy OLTP External Automation Prescription Prediction Forecasting Discovery Exploration Dashboards Scorecards OLAP Reporting relational, subject-oriented, historical bus or hub-and-spoke architecture integrated, cleansed, aggregated includes master reference & metrics data ETL/ELT Data Refinement Data Warehouse B
  • 17. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Getting Started with Data Warehouse Modernization 17 Know Your Modernization Goals ✓ Complete, cohesive analytics ecosystem ✓ Complete, cohesive data management architecture ✓ Governed self-service with curated data ✓ Maximum reusability and reuse ✓ Maximum flexibility and agility ✓ Data for all data consumers ✓ Technological updates ✓ Architectural updates ✓ Automation opportunities and process optimization ✓ Deployment independence
  • 18. © Eckerson Group 2018 www.eckerson.com© Eckerson Group 2019 Getting Started with Data Warehouse Modernization 18 Plan and Execute Assess the Current State Define the Future State Choose The Patterns Look to the Future Expect and Prepare for Change Execute One Step at a Time
  • 19. 19
  • 20. CLOUDERA DATA WAREHOUSE Eva Nahari, Director, Product Management, Cloudera Data Warehouse January, 2019
  • 21. 21 © Cloudera, Inc. All rights reserved. NEW TRENDS IN DATA WAREHOUSING Deeper Business Insights at Extreme Speed and Scale While Managing Cost DEEPER business insights EXTREME speed & scale CONTROLLED resources & costs
  • 22. 22 © Cloudera, Inc. All rights reserved. NEW TRENDS IN DATA WAREHOUSING Deeper Business Insights Protect ● Proactive Fraud Prevention ● Keep up with Regulatory Compliance ● Preempt Cyberthreats Real-time response on massive data volume and variety Optimize ● Improve Operational Efficiency ● Support Internet of Things (IoT) New analytics techniques democratized to all users Grow ● Customer Sentiment ● Fault Prevention ● Improve Product Quality ● New Revenue Streams Experimentation and collaboration at scale
  • 23. 23 © Cloudera, Inc. All rights reserved. NEW TRENDS IN DATA WAREHOUSING Extreme Speed and Scale More Data ● Massive amounts handled faster at scale ● More variety from new sources (social media, IoT) ● Insight within minutes of new data arrival Performance and flexibility at scale More Workloads ● 100’s of production grade deployments ● Enterprise grade dependability ● Strict security and governance On-demand scale out, discovery, collaboration More People ● 1,000’s of new users and new user types ● 1,000’s of new use cases ● All skill levels: Analytics, Data Science, and Machine Learning All workloads with a shared data experience
  • 24. 24 © Cloudera, Inc. All rights reserved. NEW TRENDS IN DATA WAREHOUSING Managing Resources and Costs Optimize Core Processes ● Automation to reduce pressure on organizational bottlenecks ● Consistent user experience Broaden data reach without increasing IT burden or costs Self-Service Everything ● Resource provisioning ● Workload development ● Optimizing and troubleshooting Deliver on increased SLA pressures without runaway cost Dynamic Consumption ● Transient Workloads ● Short-lived Workloads ● Permanent Workloads ● Public, Private, Hybrid Cloud Environmental flexibility and adaptive compute, storage
  • 25. © Cloudera, Inc. All rights reserved. 25 Quickly enable business analytics by sharing petabytes of verified data across thousands of users while surpassing demands of SLAs and costs
  • 26. 26 © Cloudera, Inc. All rights reserved. TRADITIONAL DATA WAREHOUSE: Structured Data Sources (ERP, CRM, SCM) Transformations EDW Advanced Analytics Dashboards Ad Hoc Canned Reports Staging Data Marts Many Months Master Schema ETLODS 2 3 4 1 5 Struggle to handle volume and variety Limited access
  • 27. 27 © Cloudera, Inc. All rights reserved. MODERN DATA WAREHOUSE Advanced Analytics Dashboards Ad Hoc Canned Reports Data Store Within Days Data Marts 1 2 Ingest & Store all data at scale Self-serve / On- demand Variety of data sources/types
  • 28. 28 © Cloudera, Inc. All rights reserved. WHAT CONCEPTS SURVIVE? Data Modeling Security & Governance Reports & Dashboards
  • 29. 29 © Cloudera, Inc. All rights reserved. WHAT HAS CHANGED? Traditional DW Modern DW Supporting Role Foundational Role Primarily Internal Internal & External Constrained, Structured Freeform, Multi-Structured Planned ETLs On-Demand Pipelines Users Data Exploration Data Curation Data & Analytics
  • 30. 30 © Cloudera, Inc. All rights reserved. WHAT IS NEW? Experimentation & Collaboration Dynamic Consumption Self Service Everything
  • 31. 31 © Cloudera, Inc. All rights reserved. CLOUDERA MODERN DATA WAREHOUSE The modern platform for machine learning and analytics optimized for the cloud Object Store S3, ADLS Shared File Storage Time Series Data Store SECURITY GOVERNANCE WORKLOAD MANAGEMENT INGEST & REPLICATION DATA CATALOG Core Services Storage Services ANALYTICSDATA SCIENCE EXTENSIBLE SERVICES OPERATIONAL DATABASE DATA ENGINEERING
  • 32. 32 © Cloudera, Inc. All rights reserved. CLOUD NATIVE OPTION ● Quick time to value - no software or clusters to manage ● Bring warehouse to the data with zero copy simplicity ● Use your security policies with your data - no proprietary stacks ● Apply enterprise governance to transient workloads ● Shared data experience with SDX ● Optimized for Azure & AWS DATA WAREHOUSE GOVERNANC E SECURITY CONTROL PLANE LIFECYCLE MANAGEMENT MULTI-CLOUD Amazon S3 Microsoft ADLS MULTI-CLOUD PAAS SOLUTION
  • 33. 33 © Cloudera, Inc. All rights reserved. TD BANK: Delivering “Legendary Customer Experience” CHALLENGES Significantly improve customer experience with sentiment analysis, behavioral patterns, and predictive modeling Current system couldn’t handle: • Centralizing data from thousands of sources • Demands from increased users and use cases • Data cost and manageability at scale RESULTS • 30% reduction in repeat customer complaints • 90% productivity improvement for analytics projects • 60% decrease in data management costs • 98% decrease in per TB storage costs SOLUTION Modern Data Warehouse for customer marketing, fraud analytics and cybersecurity • Ingest data from 100+ corporate systems • Centralized data into “the hands of those that need it much more quickly” • Significantly reduce storage and management costs https://www.cloudera.com/more/customers/td-bank.html
  • 34. 34 © Cloudera, Inc. All rights reserved. CLOUDERA DW - PARTING THOUGHTS Hybrid Optimized Shared Data ExperiencePerformance @Scale Shared Data Exponential Use Cases, Successful Outcomes