3. … data warehousing has reached the most
significant tipping point since its inception.
The biggest, possibly most elaborate data
management system in IT is changing.
– Gartner, “The State of Data Warehousing in 2012”
Data sources
5. ETL Tool
(SSIS, etc)
EDW
(SQL Svr, Teradata, etc)
Extract
Original Data
Load
Transformed
Data
Transform
BI Tools
Data Marts
Data Lake(s)
Dashboards
Apps
6. ETL Tool
(SSIS, etc)
EDW
(SQL Svr, Teradata, etc)
Extract
Original Data
Load
Transformed
Data
Transform
BI Tools
Ingest (EL)
Original Data
Data Marts
Data Lake(s)
Dashboards
Apps
7. ETL Tool
(SSIS, etc)
EDW
(SQL Svr, Teradata, etc)
Extract
Original Data
Load
Transformed
Data
Transform
BI Tools
Ingest (EL)
Original Data
Scale-out
Storage &
Compute
(HDFS, Blob Storage,
etc)
Transform & Load
Data Marts
Data Lake(s)
Dashboards
Apps
Streaming data
8. ETL Tool
(SSIS, etc)
EDW
(SQL Svr, Teradata, etc)
Extract
Original Data
Load
Transformed
Data
Transform
BI Tools
Ingest (EL)
Original Data
Scale-out
Storage &
Compute
(HDFS, Blob Storage,
etc)
Transform & Load
Data Marts
Data Lake(s)
Dashboards
Apps
Streaming data
9. BI Tools
Data Marts
Data Lake(s)
Dashboards
Apps
Data Hub
(Storage & Compute)
Data Sources
(Import From)
Move data
among Hubs
Data Hub
(Storage & Compute)
Data Sources
(Import From)
Ingest
Connect & Collect Transform & Enrich Publish
Information Production:
Ingest
Move to data mart, etc
10. BI Tools
Data Marts
Data Lake(s)
Dashboards
Apps
Data Hub
(Storage & Compute)
Data Sources
(Import From)
Data Connector:
Import from source to
Hub
Data
Connector:
Import/Export
among Hubs
Data Hub
(Storage & Compute)
Data Sources
(Import From)
Data Connector:
Import from source to
Hub
Data Connector:
Export from Hub to data
store
Connect & Collect Transform & Enrich Publish
Information Production:
• Coordination & Scheduling
• Monitoring & Mgmt
• Data Lineage
18. On Premises SQL Server Azure Blob Storage
1000’s Log FilesNew User View
Azure Data Factory
19. On Premises SQL Server Azure Blob Storage
1000’s Log FilesNew User View
Azure Data FactoryViewOf
Game Usage
ViewOf
New Users
New User
Activity
20. ViewOf
On Premises SQL Server Azure Blob Storage
1000’s Log FilesNew User View
Copy “NewUsers” to
Blob Storage
Cloud New
Users
Azure Data FactoryViewOf
Game Usage
ViewOf
New Users
New User
Activity
Pipeline
21. On Premises SQL Server Azure Blob Storage
1000’s Log FilesNew User View
Copy NewUsers to
Blob Storage
Cloud New
Users
Azure Data FactoryViewOf
Game Usage
ViewOf
Mask & Geo-
Code
New Users
Geo Dictionary
Geo Coded
Game Usage
HDInsight
New User
Activity
Pipeline
Pipeline
22. On Premises SQL Server Azure Blob Storage
1000’s Log FilesNew User View
Copy NewUsers to
Blob Storage
Cloud New
Users
Azure Data FactoryViewOf
Game Usage
ViewOf
RunsOn
Mask & Geo-
Code
New Users
Geo Dictionary
Geo Coded
Game Usage
Join &
Aggregate
HDInsight
New User
Activity
ViewOf
Pipeline
Pipeline
Pipeline
23. On Premises SQL Server Azure Blob Storage
1000’s Log FilesNew User View
Copy NewUsers to
Blob Storage
Cloud New
Users
Azure Data FactoryViewOf
Game Usage
ViewOf
RunsOn
Mask & Geo-
Code
New Users
Geo Dictionary
Geo Coded
Game Usage
Join &
Aggregate
HDInsight
New User
Activity
ViewOf
Pipeline
Pipeline
Pipeline
29. • Is my data successfully getting produced?
• Is it produced on time?
• Am I alerted quickly of failures?
• What about troubleshooting information?
• Are there any policy warnings or errors?