SlideShare a Scribd company logo
1 of 34
Download to read offline
Data Modeling & Business Intelligence
Donna Burbank
Global Data Strategy Ltd.
Lessons in Data Modeling DATAVERSITY Series
February 23rd, 2017
Global Data Strategy, Ltd. 2017
Donna Burbank
Donna is a recognised industry expert in
information management with over 20
years of experience in data strategy,
information management, data modeling,
metadata management, and enterprise
architecture. Her background is multi-
faceted across consulting, product
development, product management,
brand strategy, marketing, and business
leadership.
She is currently the Managing Director at
Global Data Strategy, Ltd., an international
information management consulting
company that specialises in the alignment
of business drivers with data-centric
technology. In past roles, she has served in
key brand strategy and product
management roles at CA Technologies and
Embarcadero Technologies for several of
the leading data management products in
the market.
As an active contributor to the data
management community, she is a long
time DAMA International member and is
the Past President & Advisor to the DAMA
Rocky Mountain chapter. She was also on
the review committee for the Object
Management Group’s Information
Management Metamodel (IMM) and a
member of the OMG’s Finalization
Taskforce for the Business Process
Modeling Notation (BPMN).
She has worked with dozens of Fortune
500 companies worldwide in the
Americas, Europe, Asia, and Africa and
speaks regularly at industry
conferences. She has co-authored two
books: Data Modeling for the
Business and Data Modeling Made Simple
with ERwin Data Modeler and is a regular
contributor to industry publications.
She can be reached at
donna.burbank@globaldatastrategy.com
Donna is based in Boulder, Colorado, USA.
2
Follow on Twitter @donnaburbank
Today’s hashtag: #LessonsDM
Global Data Strategy, Ltd. 2017
Lessons in Data Modeling Series
• January 26th How Data Modeling Fits Into an Overall Enterprise Architecture
• February 23rd Data Modeling and Business Intelligence
• March Conceptual Data Modeling – How to Get the Attention of Business Users
• April The Evolving Role of the Data Architect – What does it mean for your Career?
• May Data Modeling & Metadata Management
• June Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling
• July Data Modeling & Metadata for Graph Databases
• August Data Modeling & Data Integration
• September Data Modeling & MDM
• October Agile & Data Modeling – How Can They Work Together?
• December Data Modeling, Data Quality & Data Governance
3
This Year’s Line Up
Global Data Strategy, Ltd. 2017
Agenda
• Where does Data Modeling fit with the rise of Self-Service BI?
• How Data Modeling is the “Intelligence Behind Business Intelligence”
• Creating business meaning & context
• Understand source and target data systems
• Optimize data structures to align queries with reports
• Integration with other tools (BI, ETL, etc.)
• Summary
4
What we’ll cover today
Global Data Strategy, Ltd. 2017
The Importance of Data Modeling in Business Intelligence
5
If You Can Read This,
Thank a Data Modeler!
Global Data Strategy, Ltd. 2017
The Rise of the Data-Driven Business
Data, more than ever, is seen as a key business asset and strategic differentiator.
6
Global Data Strategy, Ltd. 2017
The Rise of Self-Service Business Intelligence
• As a result of this growing importance of data, the interest in self-service data
reporting has increased among data-savvy business users.
• The availability of tools & data sets has made it easier for business people to do their
own data manipulation & reporting
• Self Service BI & Data Manipulation – the tools are slick!
• Accessible Data & Open Data Sets – the amount of data available is amazing!
• Tech-Savvy Business Users – this isn’t any harder than a spreadsheet!
• While this offers great opportunities, it can also be fraught with challenges.
• Data modelers and the models & metadata they create can make the job of business
intelligence easier for both BI professionals and the casual BI reporting user.
7
Global Data Strategy, Ltd. 2017
Users are Often Frustrated with Self-Service BI
8
What they want
What they often get
Global Data Strategy, Ltd. 2017
Data is Only as Good as the Metadata
9
Open Data Example: Road Safety - Vehicles by Make and Model
Global Data Strategy, Ltd. 2017
Metadata Matters
With Self-Service BI and Analytics, attention needs to be paid to the
quality, context, & structure of data
Raw data used in Self-Service Analytics and BI environments is
often so poor that many data scientists and BI professionals
spend an estimated 50 – 90% of their time cleaning and
reformatting data to make it fit for purpose.(4
Source: DataCenterJournal.com
Correcting poor data quality is a Data Scientist’s least favorite
task, consuming on average 80% of their working day
Source: Forbes 2016
(aka Data Models & Metadata)
Global Data Strategy, Ltd. 2017
Business Intelligence is a Key Driver for Metadata Management
11
Global Data Strategy, Ltd. 2017
Data Modeling for Data Warehousing & Business Intelligence
• What is the definition of customer?
• Where is the data stored?
• How is it structured?
• Who uses or owns the data?
Data Warehouse BI Report:
Customers by Region
• What are the definitions of key business terms?
• What do I want to report on?
• How do I optimize the database for these reports?
Data Modeling helps answer:
For Data Warehousing For BI Reporting
Data Modeling helps answer:
• Data Modeling is the “Intelligence behind Business Intelligence”
• Creating business meaning & context
• Understand source and target data systems
• Optimize data structures to align queries with reports
Show me all
customers by region
Source Systems
Relational Model
Dimensional Model
Global Data Strategy, Ltd. 2017
Finding Balance – Model What Matters
13
• It’s important to find a balance between
• Managing & modeling “trusted data sets”
• Giving users the flexibility to explore.
• Most users will find these trusted data sets a welcome asset, but don’t want to be restricted from
doing data exploration when appropriate.
IoT
Log Files
Data Warehouse
Master Data
Reference Data
Structure Flexibility & Exploration
Global Data Strategy, Ltd. 2017
Data Models Levels – Both Business & Technical
14
Conceptual
Logical
Physical
Purpose
Communication & Definition of
Business Terms & Rules
Clarification & Detail
of Business Rules &
Data Structures
Technical
Implementation on
a Physical Database
Audience
Business Stakeholders
Data Architecture
Business Analysts
DBAs
Developers
Business Concepts
Data Entities
Physical Tables
• For Data Modeling for Business Intelligence, it’s important to focus on both the business & technical views.
Global Data Strategy, Ltd. 2017
Business Meaning & Context is Critical
15
Show me all
customers by region
Businessperson Data Architect
“Does this include current customers only? Or
lapsed customers as well?
“Do we have to obfuscate PII?”
Global Data Strategy, Ltd. 2017
The Importance of Business Definitions
From Data Modeling for the Business by
Hoberman, Burbank, Bradley, Technics
Publications, 2009
Global Data Strategy, Ltd. 2017
Conceptual Data Model
• Communication & definition of core data concepts & their definitions
Global Data Strategy, Ltd. 2017
Data Model Metadata Can Be Used by Many Roles
18
Business Person
(e.g. Finance)
What’s the definition of
“Regional Sales”
Auditor
How was “Total Sales”
calculated? Show me the
lineage.
Data Architect
What is the approved data
structure for storing customer
data?
Data
Warehouse
Architect
What are the source-to-target
mappings for the DW?
Business Person
(e.g. HR)
How can I get new staff up-to-
speed on our company’s
business terminology?
Global Data Strategy, Ltd. 2017
Data Model Design Layer Relationships
• Data model design layer mappings show the relationship between business terms and their
physical implementations on a database platform.
19
Showing Semantic Mapping
Conceptual
Logical
Physical
Business Concepts
Data Entities
Physical Tables
Client
Customer
DB2TeradataOracle
CUST CUSTOMER CTABLE_16
In a Conceptual data model, there may
be a concept called “Client” which is the
term businesspeople use to describe the
people they sell to and work with.
The Logical model might
use the term “Customer”
for that same concept.
Which may be implemented in a number
of physical tables with varying naming
conventions.
Conceptual
Logical
Physical
Business Concepts
Data Entities
Physical Tables
Global Data Strategy, Ltd. 2017
Data Modeling Creates an “Active Inventory” of Data Assets
• Know what data you have: Create a visual inventory of database systems
• Know what your data means: Communicate key business requirements between business and IT
stakeholders
• Support data consistency: Build consistent database structures & support data governance initiatives
Sybase
MySQL
Oracle
Data Models
Teradata
Sybase
SQL
Server
DB2
Teradata
SQL
Server DB2
MySQLSQL
Azure
SQL
Azure
Oracle
Global Data Strategy, Ltd. 2017
Metadata Adds Context & Definition
• Metadata stored in data models provides valuable business & technical context.
21
Global Data Strategy, Ltd. 2017
Technical & Business Metadata
• Technical Metadata describes the structure, format, and rules for storing data
• Business Metadata describes the business definitions, rules, and context for data.
• Data represents actual instances (e.g. John Smith)
22
CREATE TABLE EMPLOYEE (
employee_id INTEGER NOT NULL,
department_id INTEGER NOT NULL,
employee_fname VARCHAR(50) NULL,
employee_lname VARCHAR(50) NULL,
employee_ssn CHAR(9) NULL);
CREATE TABLE CUSTOMER (
customer_id INTEGER NOT NULL,
customer_name VARCHAR(50) NULL,
customer_address VARCHAR(150) NULL,
customer_city VARCHAR(50) NULL,
customer_state CHAR(2) NULL,
customer_zip CHAR(9) NULL);
Technical Metadata
John Smith
Business Metadata
Data
Term Definition
Employee
An employee is an individual who currently
works for the organization or who has been
recently employed within the past 6 months.
Customer
A customer is a person or organization who
has purchased from the organization within
the past 2 years and has an active loyalty card
or maintenance contract.
Global Data Strategy, Ltd. 2017
Data Warehousing – An Example
• In the data warehouse example below, metadata for CUSTOMER exists in a number tools & data
stores.
23
Sales Report
CUSTOMER
Database Table
CUST
Database Table
CUSTOMER
Database Table
CUSTOMER
Database Table
TBL_C1
Database Table
Business Glossary
ETL Tool ETL Tool
Physical Data Model
Physical Data Model
Logical Data Model
Dimensional
Data Model
BI Tool
Global Data Strategy, Ltd. 2017
Data Lineage
24
• Data Lineage shows the source to target mapping, or provenance for information.
• Many data modeling tools track this lineage through integration with ETL tools, or with internal
mapping functionality.
Global Data Strategy, Ltd. 2017
Why Model the Data Warehouse
• Proper modeling of a data warehouse creates data sets that are:
• Easy to use
• Fast to access
• Combined with other data warehousing best practices around data integration,
transformation, & governance, the data warehouse also creates data sets that:
• Contain high quality data
• Provide a broad, integrated set of data across the enterprise
25
Computing report…elapsed time 4 days,
10 hours, 27 seconds…
Global Data Strategy, Ltd. 2017
Modeling for BI Reporting – the Dimensional Model
• A common way to model the data warehouse is Dimensional Modeling using a Star Schema, based
on methodology spearheaded by Ralf Kimball.
• For Dimensional Modeling, think of what you’re reporting “by” (e.g. by Month, by Region, etc.)
• Dimensional modeling focuses on capturing and aggregating the metrics from daily operations that
enable the business to evaluate how well it is doing.
“What do I want to report by?”
(Apologies to grammarians!), e.g.
by month
by region
by quarter
by product
The lines on a dimensional
data model represent
navigation paths, not
business rules.
Global Data Strategy, Ltd. 2017
The Star Schema
Dimension
Dimension
Dimension
Dimension
Dimension
Fact
Facts: Contain the actual values to be reported upon.
e.g. Sales Figures
• Few attributes (with links/keys to the dimensions)
• Many values
Dimensions: Contain the details that describe the
central fact. e.g. Month, Region, Quarter
• Many attributes
• Few values
Global Data Strategy, Ltd. 2017
The Star Schema
• The following is a sample Star
Schema in a data modeling
tool showing:
• Internet Sales (Fact) by:
• Time Period (Dimension)
• Promotion (Dimension)
• Product (Dimension)
• Customer (Dimension)
Global Data Strategy, Ltd. 2017
Summary
• The rise of the “Data Driven Business” has increased demand for BI reporting, particularly Self-
Service Reporting
• BI Reporting is only as good as the underlying metadata, data structures, and data quality
• Data Models are a critical tool for
• Understanding the business meaning of data
• Making BI Reporting more intuitive
• Improving the performance of BI queries
• Understanding source & target systems and the resultant data lineage
• Find a balance – “Model What Matters”
• Modeling & metadata helps define key trusted data sets
• But not all data needs to be modeled – allow for exploration & discovery
Global Data Strategy, Ltd. 2017
Contact Info
• Email: donna.burbank@globaldatastrategy.com
• Twitter: @donnaburbank
@GlobalDataStrat
• Website: www.globaldatastrategy.com
• Company Linkedin: https://www.linkedin.com/company/global-data-strategy-ltd
• Personal Linkedin: https://www.linkedin.com/in/donnaburbank
30
Global Data Strategy, Ltd. 2017
About Global Data Strategy, Ltd
• Global Data Strategy is an international information management consulting company that specializes
in the alignment of business drivers with data-centric technology.
• Our passion is data, and helping organizations enrich their business opportunities through data and
information.
• Our core values center around providing solutions that are:
• Business-Driven: We put the needs of your business first, before we look at any technology solution.
• Clear & Relevant: We provide clear explanations using real-world examples.
• Customized & Right-Sized: Our implementations are based on the unique needs of your organization’s
size, corporate culture, and geography.
• High Quality & Technically Precise: We pride ourselves in excellence of execution, with years of
technical expertise in the industry.
31
Data-Driven Business Transformation
Business Strategy
Aligned With
Data Strategy
Visit www.globaldatastrategy.com for more information
Global Data Strategy, Ltd. 2017
DATAVERSITY Training Center
• Learn the basics of Metadata Management and practical tips on how to apply metadata
management in the real world. This online course hosted by DATAVERSITY provides a series of six
courses including:
• What is Metadata
• The Business Value of Metadata
• Sources of Metadata
• Metamodels and Metadata Standards
• Metadata Architecture, Integration, and Storage
• Metadata Strategy and Implementation
• Purchase all six courses for $399 or individually at $79 each.
Register here
• Other courses available on Data Governance & Data Quality
32
Online Training Courses
Metadata Management Course
Visit: http://training.dataversity.net/lms/
Global Data Strategy, Ltd. 2017
Lessons in Data Modeling Series
• January 26th How Data Modeling Fits Into an Overall Enterprise Architecture
• February 23rd Data Modeling and Business Intelligence
• March Conceptual Data Modeling – How to Get the Attention of Business Users
• April The Evolving Role of the Data Architect – What does it mean for your Career?
• May Data Modeling & Metadata Management
• June Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling
• July Data Modeling & Metadata for Graph Databases
• August Data Modeling & Data Integration
• September Data Modeling & MDM
• October Agile & Data Modeling – How Can They Work Together?
• December Data Modeling, Data Quality & Data Governance
33
This Year’s Line Up
Global Data Strategy, Ltd. 2017
Questions?
34
Thoughts? Ideas?

More Related Content

What's hot

What's hot (20)

Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
The what, why, and how of master data management
The what, why, and how of master data managementThe what, why, and how of master data management
The what, why, and how of master data management
 
Data Quality Management
Data Quality ManagementData Quality Management
Data Quality Management
 
Best Practices in Metadata Management
Best Practices in Metadata ManagementBest Practices in Metadata Management
Best Practices in Metadata Management
 
How to identify the correct Master Data subject areas & tooling for your MDM...
How to identify the correct Master Data subject areas & tooling for your MDM...How to identify the correct Master Data subject areas & tooling for your MDM...
How to identify the correct Master Data subject areas & tooling for your MDM...
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and Governance
 
Lessons in Data Modeling: Data Modeling & MDM
Lessons in Data Modeling: Data Modeling & MDMLessons in Data Modeling: Data Modeling & MDM
Lessons in Data Modeling: Data Modeling & MDM
 
Introduction to Data Governance
Introduction to Data GovernanceIntroduction to Data Governance
Introduction to Data Governance
 
Master Data Management methodology
Master Data Management methodologyMaster Data Management methodology
Master Data Management methodology
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Informatica MDM Presentation
Informatica MDM PresentationInformatica MDM Presentation
Informatica MDM Presentation
 
Modern Metadata Strategies
Modern Metadata StrategiesModern Metadata Strategies
Modern Metadata Strategies
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 
Glossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceGlossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data Governance
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...
 
Data modeling for the business
Data modeling for the businessData modeling for the business
Data modeling for the business
 
Mdm: why, when, how
Mdm: why, when, howMdm: why, when, how
Mdm: why, when, how
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 

Similar to LDM Webinar: Data Modeling & Business Intelligence

Similar to LDM Webinar: Data Modeling & Business Intelligence (20)

Data Modeling & Data Integration
Data Modeling & Data IntegrationData Modeling & Data Integration
Data Modeling & Data Integration
 
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
 
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling –...
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata Management
 
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?The Evolving Role of the Data Architect – What Does It Mean for Your Career?
The Evolving Role of the Data Architect – What Does It Mean for Your Career?
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
 
Data Modeling Techniques
Data Modeling TechniquesData Modeling Techniques
Data Modeling Techniques
 
Data Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityData Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data Quality
 
DAS Slides: Enterprise Architecture vs. Data Architecture
DAS Slides: Enterprise Architecture vs. Data ArchitectureDAS Slides: Enterprise Architecture vs. Data Architecture
DAS Slides: Enterprise Architecture vs. Data Architecture
 
These Are The Data You Are Looking For
These Are The Data You Are Looking ForThese Are The Data You Are Looking For
These Are The Data You Are Looking For
 
Business Centric Data Modeling
Business Centric Data ModelingBusiness Centric Data Modeling
Business Centric Data Modeling
 
LDM Webinar: Data Modeling & Metadata Management
LDM Webinar: Data Modeling & Metadata ManagementLDM Webinar: Data Modeling & Metadata Management
LDM Webinar: Data Modeling & Metadata Management
 
Data Modeling Best Practices - Business & Technical Approaches
Data Modeling Best Practices - Business & Technical ApproachesData Modeling Best Practices - Business & Technical Approaches
Data Modeling Best Practices - Business & Technical Approaches
 
Fuel your Data-Driven Ambitions with Data Governance
Fuel your Data-Driven Ambitions with Data GovernanceFuel your Data-Driven Ambitions with Data Governance
Fuel your Data-Driven Ambitions with Data Governance
 
Data Architecture Best Practices for Today’s Rapidly Changing Data Landscape
Data Architecture Best Practices for Today’s Rapidly Changing Data LandscapeData Architecture Best Practices for Today’s Rapidly Changing Data Landscape
Data Architecture Best Practices for Today’s Rapidly Changing Data Landscape
 
Data Governance & Data Architecture - Alignment and Synergies
Data Governance & Data Architecture - Alignment and SynergiesData Governance & Data Architecture - Alignment and Synergies
Data Governance & Data Architecture - Alignment and Synergies
 
Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM
 
Data-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMData-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDM
 
DAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
DAS Slides: Self-Service Reporting and Data Prep – Benefits & RisksDAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
DAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
 
DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...
 

More from DATAVERSITY

The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
DATAVERSITY
 

More from DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
 
Empowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceEmpowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business Intelligence
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 

LDM Webinar: Data Modeling & Business Intelligence

  • 1. Data Modeling & Business Intelligence Donna Burbank Global Data Strategy Ltd. Lessons in Data Modeling DATAVERSITY Series February 23rd, 2017
  • 2. Global Data Strategy, Ltd. 2017 Donna Burbank Donna is a recognised industry expert in information management with over 20 years of experience in data strategy, information management, data modeling, metadata management, and enterprise architecture. Her background is multi- faceted across consulting, product development, product management, brand strategy, marketing, and business leadership. She is currently the Managing Director at Global Data Strategy, Ltd., an international information management consulting company that specialises in the alignment of business drivers with data-centric technology. In past roles, she has served in key brand strategy and product management roles at CA Technologies and Embarcadero Technologies for several of the leading data management products in the market. As an active contributor to the data management community, she is a long time DAMA International member and is the Past President & Advisor to the DAMA Rocky Mountain chapter. She was also on the review committee for the Object Management Group’s Information Management Metamodel (IMM) and a member of the OMG’s Finalization Taskforce for the Business Process Modeling Notation (BPMN). She has worked with dozens of Fortune 500 companies worldwide in the Americas, Europe, Asia, and Africa and speaks regularly at industry conferences. She has co-authored two books: Data Modeling for the Business and Data Modeling Made Simple with ERwin Data Modeler and is a regular contributor to industry publications. She can be reached at donna.burbank@globaldatastrategy.com Donna is based in Boulder, Colorado, USA. 2 Follow on Twitter @donnaburbank Today’s hashtag: #LessonsDM
  • 3. Global Data Strategy, Ltd. 2017 Lessons in Data Modeling Series • January 26th How Data Modeling Fits Into an Overall Enterprise Architecture • February 23rd Data Modeling and Business Intelligence • March Conceptual Data Modeling – How to Get the Attention of Business Users • April The Evolving Role of the Data Architect – What does it mean for your Career? • May Data Modeling & Metadata Management • June Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling • July Data Modeling & Metadata for Graph Databases • August Data Modeling & Data Integration • September Data Modeling & MDM • October Agile & Data Modeling – How Can They Work Together? • December Data Modeling, Data Quality & Data Governance 3 This Year’s Line Up
  • 4. Global Data Strategy, Ltd. 2017 Agenda • Where does Data Modeling fit with the rise of Self-Service BI? • How Data Modeling is the “Intelligence Behind Business Intelligence” • Creating business meaning & context • Understand source and target data systems • Optimize data structures to align queries with reports • Integration with other tools (BI, ETL, etc.) • Summary 4 What we’ll cover today
  • 5. Global Data Strategy, Ltd. 2017 The Importance of Data Modeling in Business Intelligence 5 If You Can Read This, Thank a Data Modeler!
  • 6. Global Data Strategy, Ltd. 2017 The Rise of the Data-Driven Business Data, more than ever, is seen as a key business asset and strategic differentiator. 6
  • 7. Global Data Strategy, Ltd. 2017 The Rise of Self-Service Business Intelligence • As a result of this growing importance of data, the interest in self-service data reporting has increased among data-savvy business users. • The availability of tools & data sets has made it easier for business people to do their own data manipulation & reporting • Self Service BI & Data Manipulation – the tools are slick! • Accessible Data & Open Data Sets – the amount of data available is amazing! • Tech-Savvy Business Users – this isn’t any harder than a spreadsheet! • While this offers great opportunities, it can also be fraught with challenges. • Data modelers and the models & metadata they create can make the job of business intelligence easier for both BI professionals and the casual BI reporting user. 7
  • 8. Global Data Strategy, Ltd. 2017 Users are Often Frustrated with Self-Service BI 8 What they want What they often get
  • 9. Global Data Strategy, Ltd. 2017 Data is Only as Good as the Metadata 9 Open Data Example: Road Safety - Vehicles by Make and Model
  • 10. Global Data Strategy, Ltd. 2017 Metadata Matters With Self-Service BI and Analytics, attention needs to be paid to the quality, context, & structure of data Raw data used in Self-Service Analytics and BI environments is often so poor that many data scientists and BI professionals spend an estimated 50 – 90% of their time cleaning and reformatting data to make it fit for purpose.(4 Source: DataCenterJournal.com Correcting poor data quality is a Data Scientist’s least favorite task, consuming on average 80% of their working day Source: Forbes 2016 (aka Data Models & Metadata)
  • 11. Global Data Strategy, Ltd. 2017 Business Intelligence is a Key Driver for Metadata Management 11
  • 12. Global Data Strategy, Ltd. 2017 Data Modeling for Data Warehousing & Business Intelligence • What is the definition of customer? • Where is the data stored? • How is it structured? • Who uses or owns the data? Data Warehouse BI Report: Customers by Region • What are the definitions of key business terms? • What do I want to report on? • How do I optimize the database for these reports? Data Modeling helps answer: For Data Warehousing For BI Reporting Data Modeling helps answer: • Data Modeling is the “Intelligence behind Business Intelligence” • Creating business meaning & context • Understand source and target data systems • Optimize data structures to align queries with reports Show me all customers by region Source Systems Relational Model Dimensional Model
  • 13. Global Data Strategy, Ltd. 2017 Finding Balance – Model What Matters 13 • It’s important to find a balance between • Managing & modeling “trusted data sets” • Giving users the flexibility to explore. • Most users will find these trusted data sets a welcome asset, but don’t want to be restricted from doing data exploration when appropriate. IoT Log Files Data Warehouse Master Data Reference Data Structure Flexibility & Exploration
  • 14. Global Data Strategy, Ltd. 2017 Data Models Levels – Both Business & Technical 14 Conceptual Logical Physical Purpose Communication & Definition of Business Terms & Rules Clarification & Detail of Business Rules & Data Structures Technical Implementation on a Physical Database Audience Business Stakeholders Data Architecture Business Analysts DBAs Developers Business Concepts Data Entities Physical Tables • For Data Modeling for Business Intelligence, it’s important to focus on both the business & technical views.
  • 15. Global Data Strategy, Ltd. 2017 Business Meaning & Context is Critical 15 Show me all customers by region Businessperson Data Architect “Does this include current customers only? Or lapsed customers as well? “Do we have to obfuscate PII?”
  • 16. Global Data Strategy, Ltd. 2017 The Importance of Business Definitions From Data Modeling for the Business by Hoberman, Burbank, Bradley, Technics Publications, 2009
  • 17. Global Data Strategy, Ltd. 2017 Conceptual Data Model • Communication & definition of core data concepts & their definitions
  • 18. Global Data Strategy, Ltd. 2017 Data Model Metadata Can Be Used by Many Roles 18 Business Person (e.g. Finance) What’s the definition of “Regional Sales” Auditor How was “Total Sales” calculated? Show me the lineage. Data Architect What is the approved data structure for storing customer data? Data Warehouse Architect What are the source-to-target mappings for the DW? Business Person (e.g. HR) How can I get new staff up-to- speed on our company’s business terminology?
  • 19. Global Data Strategy, Ltd. 2017 Data Model Design Layer Relationships • Data model design layer mappings show the relationship between business terms and their physical implementations on a database platform. 19 Showing Semantic Mapping Conceptual Logical Physical Business Concepts Data Entities Physical Tables Client Customer DB2TeradataOracle CUST CUSTOMER CTABLE_16 In a Conceptual data model, there may be a concept called “Client” which is the term businesspeople use to describe the people they sell to and work with. The Logical model might use the term “Customer” for that same concept. Which may be implemented in a number of physical tables with varying naming conventions. Conceptual Logical Physical Business Concepts Data Entities Physical Tables
  • 20. Global Data Strategy, Ltd. 2017 Data Modeling Creates an “Active Inventory” of Data Assets • Know what data you have: Create a visual inventory of database systems • Know what your data means: Communicate key business requirements between business and IT stakeholders • Support data consistency: Build consistent database structures & support data governance initiatives Sybase MySQL Oracle Data Models Teradata Sybase SQL Server DB2 Teradata SQL Server DB2 MySQLSQL Azure SQL Azure Oracle
  • 21. Global Data Strategy, Ltd. 2017 Metadata Adds Context & Definition • Metadata stored in data models provides valuable business & technical context. 21
  • 22. Global Data Strategy, Ltd. 2017 Technical & Business Metadata • Technical Metadata describes the structure, format, and rules for storing data • Business Metadata describes the business definitions, rules, and context for data. • Data represents actual instances (e.g. John Smith) 22 CREATE TABLE EMPLOYEE ( employee_id INTEGER NOT NULL, department_id INTEGER NOT NULL, employee_fname VARCHAR(50) NULL, employee_lname VARCHAR(50) NULL, employee_ssn CHAR(9) NULL); CREATE TABLE CUSTOMER ( customer_id INTEGER NOT NULL, customer_name VARCHAR(50) NULL, customer_address VARCHAR(150) NULL, customer_city VARCHAR(50) NULL, customer_state CHAR(2) NULL, customer_zip CHAR(9) NULL); Technical Metadata John Smith Business Metadata Data Term Definition Employee An employee is an individual who currently works for the organization or who has been recently employed within the past 6 months. Customer A customer is a person or organization who has purchased from the organization within the past 2 years and has an active loyalty card or maintenance contract.
  • 23. Global Data Strategy, Ltd. 2017 Data Warehousing – An Example • In the data warehouse example below, metadata for CUSTOMER exists in a number tools & data stores. 23 Sales Report CUSTOMER Database Table CUST Database Table CUSTOMER Database Table CUSTOMER Database Table TBL_C1 Database Table Business Glossary ETL Tool ETL Tool Physical Data Model Physical Data Model Logical Data Model Dimensional Data Model BI Tool
  • 24. Global Data Strategy, Ltd. 2017 Data Lineage 24 • Data Lineage shows the source to target mapping, or provenance for information. • Many data modeling tools track this lineage through integration with ETL tools, or with internal mapping functionality.
  • 25. Global Data Strategy, Ltd. 2017 Why Model the Data Warehouse • Proper modeling of a data warehouse creates data sets that are: • Easy to use • Fast to access • Combined with other data warehousing best practices around data integration, transformation, & governance, the data warehouse also creates data sets that: • Contain high quality data • Provide a broad, integrated set of data across the enterprise 25 Computing report…elapsed time 4 days, 10 hours, 27 seconds…
  • 26. Global Data Strategy, Ltd. 2017 Modeling for BI Reporting – the Dimensional Model • A common way to model the data warehouse is Dimensional Modeling using a Star Schema, based on methodology spearheaded by Ralf Kimball. • For Dimensional Modeling, think of what you’re reporting “by” (e.g. by Month, by Region, etc.) • Dimensional modeling focuses on capturing and aggregating the metrics from daily operations that enable the business to evaluate how well it is doing. “What do I want to report by?” (Apologies to grammarians!), e.g. by month by region by quarter by product The lines on a dimensional data model represent navigation paths, not business rules.
  • 27. Global Data Strategy, Ltd. 2017 The Star Schema Dimension Dimension Dimension Dimension Dimension Fact Facts: Contain the actual values to be reported upon. e.g. Sales Figures • Few attributes (with links/keys to the dimensions) • Many values Dimensions: Contain the details that describe the central fact. e.g. Month, Region, Quarter • Many attributes • Few values
  • 28. Global Data Strategy, Ltd. 2017 The Star Schema • The following is a sample Star Schema in a data modeling tool showing: • Internet Sales (Fact) by: • Time Period (Dimension) • Promotion (Dimension) • Product (Dimension) • Customer (Dimension)
  • 29. Global Data Strategy, Ltd. 2017 Summary • The rise of the “Data Driven Business” has increased demand for BI reporting, particularly Self- Service Reporting • BI Reporting is only as good as the underlying metadata, data structures, and data quality • Data Models are a critical tool for • Understanding the business meaning of data • Making BI Reporting more intuitive • Improving the performance of BI queries • Understanding source & target systems and the resultant data lineage • Find a balance – “Model What Matters” • Modeling & metadata helps define key trusted data sets • But not all data needs to be modeled – allow for exploration & discovery
  • 30. Global Data Strategy, Ltd. 2017 Contact Info • Email: donna.burbank@globaldatastrategy.com • Twitter: @donnaburbank @GlobalDataStrat • Website: www.globaldatastrategy.com • Company Linkedin: https://www.linkedin.com/company/global-data-strategy-ltd • Personal Linkedin: https://www.linkedin.com/in/donnaburbank 30
  • 31. Global Data Strategy, Ltd. 2017 About Global Data Strategy, Ltd • Global Data Strategy is an international information management consulting company that specializes in the alignment of business drivers with data-centric technology. • Our passion is data, and helping organizations enrich their business opportunities through data and information. • Our core values center around providing solutions that are: • Business-Driven: We put the needs of your business first, before we look at any technology solution. • Clear & Relevant: We provide clear explanations using real-world examples. • Customized & Right-Sized: Our implementations are based on the unique needs of your organization’s size, corporate culture, and geography. • High Quality & Technically Precise: We pride ourselves in excellence of execution, with years of technical expertise in the industry. 31 Data-Driven Business Transformation Business Strategy Aligned With Data Strategy Visit www.globaldatastrategy.com for more information
  • 32. Global Data Strategy, Ltd. 2017 DATAVERSITY Training Center • Learn the basics of Metadata Management and practical tips on how to apply metadata management in the real world. This online course hosted by DATAVERSITY provides a series of six courses including: • What is Metadata • The Business Value of Metadata • Sources of Metadata • Metamodels and Metadata Standards • Metadata Architecture, Integration, and Storage • Metadata Strategy and Implementation • Purchase all six courses for $399 or individually at $79 each. Register here • Other courses available on Data Governance & Data Quality 32 Online Training Courses Metadata Management Course Visit: http://training.dataversity.net/lms/
  • 33. Global Data Strategy, Ltd. 2017 Lessons in Data Modeling Series • January 26th How Data Modeling Fits Into an Overall Enterprise Architecture • February 23rd Data Modeling and Business Intelligence • March Conceptual Data Modeling – How to Get the Attention of Business Users • April The Evolving Role of the Data Architect – What does it mean for your Career? • May Data Modeling & Metadata Management • June Self-Service Data Analysis, Data Wrangling, Data Munging, and Data Modeling • July Data Modeling & Metadata for Graph Databases • August Data Modeling & Data Integration • September Data Modeling & MDM • October Agile & Data Modeling – How Can They Work Together? • December Data Modeling, Data Quality & Data Governance 33 This Year’s Line Up
  • 34. Global Data Strategy, Ltd. 2017 Questions? 34 Thoughts? Ideas?