SlideShare a Scribd company logo
1 of 18
Download to read offline
 Data Vault Modeling

 DW2.0 & Unstructured Data

 Big Data

 Ensemble Modeling

 Agile DW

Ensemble Modeling
& Data Vault

© 2014 Genesee Academy, LLC
USA +1 303 526 0340
Sweden 072 736 8700
Hans@GeneseeAcademy.com
www.GeneseeAcademy.com

2014
Ensemble Modeling & Data Vault
AGENDA

About Hans Hultgren:

Ensemble Modeling
&Unified Decomposition
Data Vault Ensemble
Colors of Data Vault
Data Vault Hubs, Links and
Satellites
• More Information
•
•
•
•
•

gohansgo
Author, Advisor, Speaker &
Industry Analyst; President
Genesee Academy LLC,
Principal at

Book available on Amazon.com
© 2014 Genesee Academy, LLC

2
A Saga of Data Warehousing
Once upon a time data warehousing was becoming more popular and
everyone was eager to build their own. But whenever they tried they failed.
They called upon their best to fix this but they just couldn’t solve the
problem.
They discovered that meeting the needs of the data warehouse meant that
the tables got too big and too hard to work with. They just could not handle
changes over time. If the smallest thing changed it always meant they had
to change the entire table. When just a single attribute was updated they
had to insert a record for all of the attributes. All seemed lost.
But around the world there were rebels who questioned the conventional
wisdom. And their voices were finally heard: Why not separate the things
that change from the things that don’t change?

© 2014 Genesee Academy, LLC

3
Ensemble Modeling™
• The constellation of component parts acts as a whole – an Ensemble.
All the parts of a thing taken together, so that
each part is considered only in relation to the whole.

• With Ensemble Modeling the Core Business Concepts that we define and
model are represented as a whole – an ensemble – including all of the
component parts.
© 2014 Genesee Academy, LLC

4
Based on Unified Decomposition™
• With the EDW, we break things out into parts for flexibility, agility, and
generally to facilitate the capture of things that are either interpreted in
different ways or changing independently of each other.

• At the same time a core premise of data warehousing is integration and
moving to a common standard view of unified concepts. So we also
want to tie things together – Unify.
© 2014 Genesee Academy, LLC

5
THE DATA VAULT ENSEMBLE:
APPLYING THE ENSEMBLE

© 2014 Genesee Academy, LLC

6
The Data Vault Ensemble
• The Data Vault Ensemble conforms to a single key – embodied in the Hub
construct.

• The component parts for the Data Vault Ensemble include:
– Hub
The Natural Business Key
– Link
The Natural Business Relationships
– Satellite
All Context, Descriptive Data and History
© 2014 Genesee Academy, LLC

7
The Data Vault modeling approach
3NF

Data Vault

HUB

SAT

LINK

Entity

Dimensional

SAT

Dim

Core Concept Business Keys
Associations / Relationships
Details / Context

© 2014 Genesee Academy, LLC

8
Modeling Comparison
Start Schema and Snow Flake Models:

Region

Store
Customer

Sale Fact

Associations
Business Keys
Details
Product

Facts contain all three types of data…

Employee

Vendor

Dimensions can also contain all types

*** Requires complex loading routines for key dependencies…

© 2014 Genesee Academy, LLC

9
Modeling Comparison
3rd Normal Form has the same issue: each construct – or Entity –
typically contains a business key, one or more associations
and also details (context, descriptive data)…

Region

Customer

Store
Sale

Sale LI

Employee

Product

Vendor

© 2014 Genesee Academy, LLC

10
Colors of the Data Vault
Sat
Sat Sat
Sat Sat
Sat Sat

Sat
Sat
Sat
Sat Sat
Sat
Sat
Sat
Sat
Sat

Sat
Sat
Sat
Sat
Sat
Sat
Region

Sat
Sat
Sat
Sat
Sat
Sat
Sat
Sat
Sat
Sat
Employee
Customer

Sat
Sat

Link

Store

Link
Sat
Sat
Sat
Sat

Sat
Sat
Sat

Link

Sat
Sat

Product

Sale

Link
Sat
Sat

Vendor

Sat
Sat Sat
© 2014 Genesee Academy, LLC

Sat
Sat
Sat Sat
Sat Sat
Sat
Sat
Sat
Sat

Sat
Sat
Sat
Sat

11
Data Vault means thinking differently
• The minimal construct then for an “entity”
such as “Customer” is now a
Customer

Hub with a set of Satellites

Customer

© 2014 Genesee Academy, LLC

12
Data Vault Modeling Process
• The Modeling Process for creating a Data Vault model includes
three primary steps:
1) Identify and Model your Core Business Concepts
• Business Interviews is at the heart of this step
What do you do?

What are the main things you work with?

• Also find best/target Natural Business Key

2) Identify and Model your Natural Business Relationships
• Specific Unique Relationships
• Be considerate of the Unit of Work and Grain

3) Analyze and Design your Context Satellites
• Consider Rate of Change, Type of Data
and also the Sources of your
data during design process
© 2014 Genesee Academy, LLC

13
Hubs
– A Hub Construct in Data Vault
• contains Business Key
• only the Business Key
• contains No Context
• is always 1:1 with EWBK

H_Customer
H_Customer_SID
Business Key 
Date/Time Stamp
Record source

– A Hub Table contains only
• Business Key
• Surrogate Key (Data Warehouse)
• Load Date / Time Stamp
• Record Source
© 2014 Genesee Academy, LLC

14
Links
H_Customer

– A Link Construct in Data Vault
• contains Relationship
• only a Relationship
• contains No Context
• is always 1:1 with Relationship
– A Link Table contains only
• 2-n FKs for the Relationship
• Surrogate Key (Data Warehouse)
• Load Date / Time Stamp
• Record Source
© 2014 Genesee Academy, LLC

H_Customer_SID
Business Key 
Date/Tim e Stamp

L_Cust_Class

Record source

L_Cust_Class_SID
H_Customer_SID
H_Sequence2_SID
Date/Time Stamp
Record source

– Unique
– Specific
– Natural
Business
Relationship

15
Satellites
– A Satellite Construct in Data Vault
• contains Context only
• has no FKs (no relationships)
• Designed by * Rate of Change
* Type of Data * System…

S_Customer
H_Customer_SID
Date/Time Stamp
Context A
Context B
Context C
Context D

– A Satellite Table contains only
• Business Key FK +
•
Load Date / Time Stamp
• Context Data…
• Record Source

© 2014 Genesee Academy, LLC

Record source

H_Customer
H_Customer_SID
Business Key 
Date/Tim e Stamp
Record source

16
About Data Vault Ensemble

Estimated 800 Data Vault based
Data Warehouses around the world

© 2014 Genesee Academy, LLC

17
Links and Information
CDVDM Training & Certification
www.GeneseeAcademy.com
Hans@GeneseeAcademy.com

gohansgo

Book DataVaultBook.blogspot.com
HansHultgren.WordPress.com
HansHultgren
DataVaultAcademy

Online video-lesson training

DataVaultAcademy.com
© 2014 Genesee Academy, LLC

18

More Related Content

Viewers also liked

Guru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best PracticesGuru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best PracticesCGI
 
DOAG 2016 Oracle Logon Security
DOAG 2016 Oracle Logon SecurityDOAG 2016 Oracle Logon Security
DOAG 2016 Oracle Logon SecurityLoopback.ORG
 
Introduction to data vault ilja dmitrijev
Introduction to data vault   ilja dmitrijevIntroduction to data vault   ilja dmitrijev
Introduction to data vault ilja dmitrijevIlja Dmitrijevs
 
IT Governance - Core Concepts for Business Managers
IT Governance - Core Concepts for Business ManagersIT Governance - Core Concepts for Business Managers
IT Governance - Core Concepts for Business ManagersWalter Adamson
 
Credit Scoring 101 Education
Credit Scoring 101 EducationCredit Scoring 101 Education
Credit Scoring 101 EducationData Facts, Inc
 
Modellierung agliler Data Warehouses mit Data Vault
Modellierung agliler Data Warehouses mit Data VaultModellierung agliler Data Warehouses mit Data Vault
Modellierung agliler Data Warehouses mit Data VaultTrivadis
 
Data Vault: What is it? Where does it fit? SQL Saturday #249
Data Vault: What is it?  Where does it fit?  SQL Saturday #249Data Vault: What is it?  Where does it fit?  SQL Saturday #249
Data Vault: What is it? Where does it fit? SQL Saturday #249Daniel Upton
 
Data vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshopData vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshopjohannesvdb
 
Ensemble modeling overview, Big Data meetup
Ensemble modeling overview, Big Data meetupEnsemble modeling overview, Big Data meetup
Ensemble modeling overview, Big Data meetupOptimalBI Limited
 
Lean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultLean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultDaniel Upton
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Empowered Holdings, LLC
 
Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)Andreas Buckenhofer
 
CDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der AutomobilindustrieCDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der AutomobilindustrieAndreas Buckenhofer
 
Data Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileData Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileDaniel Upton
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationVishal Kumar
 

Viewers also liked (17)

Guru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best PracticesGuru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best Practices
 
DOAG 2016 Oracle Logon Security
DOAG 2016 Oracle Logon SecurityDOAG 2016 Oracle Logon Security
DOAG 2016 Oracle Logon Security
 
Introduction to data vault ilja dmitrijev
Introduction to data vault   ilja dmitrijevIntroduction to data vault   ilja dmitrijev
Introduction to data vault ilja dmitrijev
 
IT Governance - Core Concepts for Business Managers
IT Governance - Core Concepts for Business ManagersIT Governance - Core Concepts for Business Managers
IT Governance - Core Concepts for Business Managers
 
Credit Scoring 101 Education
Credit Scoring 101 EducationCredit Scoring 101 Education
Credit Scoring 101 Education
 
Modellierung agliler Data Warehouses mit Data Vault
Modellierung agliler Data Warehouses mit Data VaultModellierung agliler Data Warehouses mit Data Vault
Modellierung agliler Data Warehouses mit Data Vault
 
Credit scorecard
Credit scorecardCredit scorecard
Credit scorecard
 
Data Vault: What is it? Where does it fit? SQL Saturday #249
Data Vault: What is it?  Where does it fit?  SQL Saturday #249Data Vault: What is it?  Where does it fit?  SQL Saturday #249
Data Vault: What is it? Where does it fit? SQL Saturday #249
 
Data vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshopData vault seminar May 5-6 Dommel - The factory and the workshop
Data vault seminar May 5-6 Dommel - The factory and the workshop
 
Ensemble modeling overview, Big Data meetup
Ensemble modeling overview, Big Data meetupEnsemble modeling overview, Big Data meetup
Ensemble modeling overview, Big Data meetup
 
Data Vault Overview
Data Vault OverviewData Vault Overview
Data Vault Overview
 
Lean Data Warehouse via Data Vault
Lean Data Warehouse via Data VaultLean Data Warehouse via Data Vault
Lean Data Warehouse via Data Vault
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012
 
Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)Metadaten und Data Vault (Meta Vault)
Metadaten und Data Vault (Meta Vault)
 
CDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der AutomobilindustrieCDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
CDC und Data Vault für den Aufbau eines DWH in der Automobilindustrie
 
Data Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileData Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes Agile
 
Agile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data PresentationAgile Data Warehouse Design for Big Data Presentation
Agile Data Warehouse Design for Big Data Presentation
 

Recently uploaded

MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 

Recently uploaded (20)

MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 

Ensemble Modeling and Data Vault 2014

  • 1.  Data Vault Modeling  DW2.0 & Unstructured Data  Big Data  Ensemble Modeling  Agile DW Ensemble Modeling & Data Vault © 2014 Genesee Academy, LLC USA +1 303 526 0340 Sweden 072 736 8700 Hans@GeneseeAcademy.com www.GeneseeAcademy.com 2014
  • 2. Ensemble Modeling & Data Vault AGENDA About Hans Hultgren: Ensemble Modeling &Unified Decomposition Data Vault Ensemble Colors of Data Vault Data Vault Hubs, Links and Satellites • More Information • • • • • gohansgo Author, Advisor, Speaker & Industry Analyst; President Genesee Academy LLC, Principal at Book available on Amazon.com © 2014 Genesee Academy, LLC 2
  • 3. A Saga of Data Warehousing Once upon a time data warehousing was becoming more popular and everyone was eager to build their own. But whenever they tried they failed. They called upon their best to fix this but they just couldn’t solve the problem. They discovered that meeting the needs of the data warehouse meant that the tables got too big and too hard to work with. They just could not handle changes over time. If the smallest thing changed it always meant they had to change the entire table. When just a single attribute was updated they had to insert a record for all of the attributes. All seemed lost. But around the world there were rebels who questioned the conventional wisdom. And their voices were finally heard: Why not separate the things that change from the things that don’t change? © 2014 Genesee Academy, LLC 3
  • 4. Ensemble Modeling™ • The constellation of component parts acts as a whole – an Ensemble. All the parts of a thing taken together, so that each part is considered only in relation to the whole. • With Ensemble Modeling the Core Business Concepts that we define and model are represented as a whole – an ensemble – including all of the component parts. © 2014 Genesee Academy, LLC 4
  • 5. Based on Unified Decomposition™ • With the EDW, we break things out into parts for flexibility, agility, and generally to facilitate the capture of things that are either interpreted in different ways or changing independently of each other. • At the same time a core premise of data warehousing is integration and moving to a common standard view of unified concepts. So we also want to tie things together – Unify. © 2014 Genesee Academy, LLC 5
  • 6. THE DATA VAULT ENSEMBLE: APPLYING THE ENSEMBLE © 2014 Genesee Academy, LLC 6
  • 7. The Data Vault Ensemble • The Data Vault Ensemble conforms to a single key – embodied in the Hub construct. • The component parts for the Data Vault Ensemble include: – Hub The Natural Business Key – Link The Natural Business Relationships – Satellite All Context, Descriptive Data and History © 2014 Genesee Academy, LLC 7
  • 8. The Data Vault modeling approach 3NF Data Vault HUB SAT LINK Entity Dimensional SAT Dim Core Concept Business Keys Associations / Relationships Details / Context © 2014 Genesee Academy, LLC 8
  • 9. Modeling Comparison Start Schema and Snow Flake Models: Region Store Customer Sale Fact Associations Business Keys Details Product Facts contain all three types of data… Employee Vendor Dimensions can also contain all types *** Requires complex loading routines for key dependencies… © 2014 Genesee Academy, LLC 9
  • 10. Modeling Comparison 3rd Normal Form has the same issue: each construct – or Entity – typically contains a business key, one or more associations and also details (context, descriptive data)… Region Customer Store Sale Sale LI Employee Product Vendor © 2014 Genesee Academy, LLC 10
  • 11. Colors of the Data Vault Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Region Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Employee Customer Sat Sat Link Store Link Sat Sat Sat Sat Sat Sat Sat Link Sat Sat Product Sale Link Sat Sat Vendor Sat Sat Sat © 2014 Genesee Academy, LLC Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat Sat 11
  • 12. Data Vault means thinking differently • The minimal construct then for an “entity” such as “Customer” is now a Customer Hub with a set of Satellites Customer © 2014 Genesee Academy, LLC 12
  • 13. Data Vault Modeling Process • The Modeling Process for creating a Data Vault model includes three primary steps: 1) Identify and Model your Core Business Concepts • Business Interviews is at the heart of this step What do you do? What are the main things you work with? • Also find best/target Natural Business Key 2) Identify and Model your Natural Business Relationships • Specific Unique Relationships • Be considerate of the Unit of Work and Grain 3) Analyze and Design your Context Satellites • Consider Rate of Change, Type of Data and also the Sources of your data during design process © 2014 Genesee Academy, LLC 13
  • 14. Hubs – A Hub Construct in Data Vault • contains Business Key • only the Business Key • contains No Context • is always 1:1 with EWBK H_Customer H_Customer_SID Business Key  Date/Time Stamp Record source – A Hub Table contains only • Business Key • Surrogate Key (Data Warehouse) • Load Date / Time Stamp • Record Source © 2014 Genesee Academy, LLC 14
  • 15. Links H_Customer – A Link Construct in Data Vault • contains Relationship • only a Relationship • contains No Context • is always 1:1 with Relationship – A Link Table contains only • 2-n FKs for the Relationship • Surrogate Key (Data Warehouse) • Load Date / Time Stamp • Record Source © 2014 Genesee Academy, LLC H_Customer_SID Business Key  Date/Tim e Stamp L_Cust_Class Record source L_Cust_Class_SID H_Customer_SID H_Sequence2_SID Date/Time Stamp Record source – Unique – Specific – Natural Business Relationship 15
  • 16. Satellites – A Satellite Construct in Data Vault • contains Context only • has no FKs (no relationships) • Designed by * Rate of Change * Type of Data * System… S_Customer H_Customer_SID Date/Time Stamp Context A Context B Context C Context D – A Satellite Table contains only • Business Key FK + • Load Date / Time Stamp • Context Data… • Record Source © 2014 Genesee Academy, LLC Record source H_Customer H_Customer_SID Business Key  Date/Tim e Stamp Record source 16
  • 17. About Data Vault Ensemble Estimated 800 Data Vault based Data Warehouses around the world © 2014 Genesee Academy, LLC 17
  • 18. Links and Information CDVDM Training & Certification www.GeneseeAcademy.com Hans@GeneseeAcademy.com gohansgo Book DataVaultBook.blogspot.com HansHultgren.WordPress.com HansHultgren DataVaultAcademy Online video-lesson training DataVaultAcademy.com © 2014 Genesee Academy, LLC 18