More Related Content Similar to DataEd Online: Data Architecture and Data Modeling Differences — Achieving a Common Understanding (20) More from DATAVERSITY (20) DataEd Online: Data Architecture and Data Modeling Differences — Achieving a Common Understanding1. © Copyright 2021 by Peter Aiken Slide # 1
paiken@plusanythingawesome.com+1.804.382.5957 Peter Aiken, PhD
Data Architecture and
Data Modeling Differences:
Data mapping from two perspectives–achieving a common understanding
Peter Aiken, Ph.D.
• I've been doing this a long time
• My work is recognized as useful
• Associate Professor of IS (vcu.edu)
• Institute for Defense Analyses (ida.org)
• DAMA International (dama.org)
• MIT CDO Society (iscdo.org)
• Anything Awesome (plusanythingawesome.com)
• 11 books and dozens of articles
• Experienced w/ 500+ data
management practices worldwide
• Multi-year immersions
– US DoD (DISA/Army/Marines/DLA)
– Nokia
– Deutsche Bank
– Wells Fargo
– Walmart …
© Copyright 2021 by Peter Aiken Slide # 2
https://plusanythingawesome.com
2. Event Pricing
© Copyright 2021 by Peter Aiken Slide # 3
https://plusanythingawesome.com
• 20% off
directly from the publisher on
select titles
• My Book Store @
http://plusanythingawwsome.com
• Enter the code
"anythingawesome" at the
Technics bookstore checkout
where it says to
"Apply Coupon"
anythingawesome
4
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
3. Data ...
• As a subject is
– Complex and detailed
– Taught inconsistently, and
– Poorly understood
• Data maps (models) are necessary but
insufficient prerequisites to data architectures
– Fully leveraging data assets
• Maps are incomplete without purpose statements
– More powerful than definitions
– Remedy
• Add purpose statements
• Validate resulting model
• Maps are required to share information about data
• Data architectures are composed of data models
– Data modeling is an engineering activity required to product data maps that are
necessary but insufficient prerequisites to leveraging data assets
© Copyright 2021 by Peter Aiken Slide # 5
https://plusanythingawesome.com
© Copyright 2021 by Peter Aiken Slide #
Metadata
Management
6
https://plusanythingawesome.com
Data
Management
Body
of
Knowledge
(DM
BoK
V2)
Practice
Areas
from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International
• Analysis
• Database Design
• Implementation
• Additional data
development
4. https://www.amazon.com/Infonomics-Monetize-Information-Competitive-Advantage/dp/1138090387
Data's Unique Properties
• Does not obey all of the laws of physics
– Not really visible (visualization expertise)
• Non rivalrous
– the cost of providing an additional copy is zero
• Non depleting
– Does not require replenishment
• Regenerative
• Nearly unlimited
• Low inventory and transportation/transmission costs
• More difficult to control and own
• Eco friendly
• Impossible to clean-up if you spill it
© Copyright 2021 by Peter Aiken Slide # 7
https://plusanythingawesome.com
Data modeling
• The process of discovering, analyzing, and
scoping data requirements
– Understand what the data things are?
– What do they do?
– How do they interact?
• Representing/communicating requirements in
a precise form called a data model
– Maps of critical business assets
– Compose and contain metadata essential to data
consumers
– Function as a kind of sheet music language
– Metadata is essential to other business functions
(definitions for governance, lineage for analytics, etc.)
• The process is iterative and may include a
conceptual, logical, and physical model
© Copyright 2021 by Peter Aiken Slide # 8
https://plusanythingawesome.com
5. process of discovering, analyzing, and scoping data requirements
© Copyright 2021 by Peter Aiken Slide # 9
https://plusanythingawesome.com
• List organizational places
• These are called Attributes
– Attributes are characteristics of "things"
• List organizational places that need to be
persons
places
things
created
read
updated
deleted
archived
https://plusanythingawesome.com
process of discovering, analyzing, and scoping data requirements
• An organization might decide to
characterize the parts of a THING as:
– Attributes: ID, description, status,
sex.to.be.assigned, reserve.reason
• Decisions to manage information
about each specific attribute has
direct consequences
– A decision to use the above data
attributes permits the organization to
determine if it has female THINGs are available to be reserved
• Characteristics can be shared
– All THINGs may have a status
– Many THINGs can be assigned to females
• Characteristics may be required to be unique
– ID permits identification every THING as distinct for every other THING
– Description is likely to be unique for each THING
© Copyright 2021 by Peter Aiken Slide # 10
https://plusanythingawesome.com
THING
Thing.Id #
Thing.Description
Thing.Status
Thing.Sex.To.Be.Assigned
Thing.Reserve.Reason
Attributes arranged into an
entity named "thing" – the
attribute Thing.Id is the means
used to identify a unique
occurrence of thing
6. Data modeling
• The process of discovering, analyzing, and
scoping data requirements
– Understand what the data things are?
– What do they do?
– How do they interact?
• Representing/communicating requirements in
a precise form called a data model
– Maps of critical business assets
– Compose and contain metadata essential to data
consumers
– Function as a kind of sheet music language
– Metadata is essential to other business functions
(definitions for governance, lineage for analytics, etc.)
• The process is iterative and may include a
conceptual, logical, and physical model
© Copyright 2021 by Peter Aiken Slide # 11
https://plusanythingawesome.com
representing and communicating these in a precise form called a data model
© Copyright 2021 by Peter Aiken Slide # 12
https://plusanythingawesome.com
Thing.Id #
Thing.Description
Thing.Status
Thing.Sex.To.Be.Assigned
Thing.Reserve.Reason
Thing 1
Thing.Id #
…
Thing 2
Each THING 2 must be accompanied by a THING 1
7. representing/communicating requirements in a precise form called a data model
© Copyright 2021 by Peter Aiken Slide # 13
https://plusanythingawesome.com
• Defines mandatory/optional
relationships between using
minimum/maximum
occurrences from one entity
to another
Data modeling
• The process of discovering, analyzing, and
scoping data requirements
– Understand what the data things are?
– What do they do?
– How do they interact?
• Representing/communicating requirements in
a precise form called a data model
– Maps of critical business assets
– Compose and contain metadata essential to data
consumers
– Function as a kind of sheet music language
– Metadata is essential to other business functions
(definitions for governance, lineage for analytics, etc.)
• The process is iterative and may include a
conceptual, logical, and physical model
© Copyright 2021 by Peter Aiken Slide # 14
https://plusanythingawesome.com
8. © Copyright 2021 by Peter Aiken Slide #
!
!
!
!
15
https://plusanythingawesome.com
Organizational Needs
become instantiated
and integrated into a
Data Models
Informa(on)System)
Requirements
authorizes and
articulates
satisfy
specific
organizational
needs
The process is iterative
Data models and data architectures are
developed in response to needs
Data modeling
• The process of discovering, analyzing, and
scoping data requirements
– Understand what the data things are?
– What do they do?
– How do they interact?
• Representing/communicating requirements in
a precise form called a data model
– Maps of critical business assets
– Compose and contain metadata essential to data
consumers
– Function as a kind of sheet music language
– Metadata is essential to other business functions
(definitions for governance, lineage for analytics, etc.)
• The process is iterative and may include a
conceptual, logical, and physical model
© Copyright 2021 by Peter Aiken Slide # 16
https://plusanythingawesome.com
9. ANSI-SPARC 3-Layer Schema
1. CONCEPTUAL - Allows independent
customized user views:
– Each should be able to access the same
data, but have a different customized view
of the data.
2. LOGICAL - This hides the physical
storage details from users:
– Users should not have to deal with physical
database storage details. They should be
allowed to work with the data itself, without
concern for how it is physically stored.
3. PHYSICAL - The database
administrator should be able to
change the database storage
structures without affecting the
users’ views:
– Changes to the structure of an
organization's data will be required. The
internal structure of the database should
be unaffected by changes to the physical
aspects of the storage.
© Copyright 2021 by Peter Aiken Slide # 17
https://plusanythingawesome.com
For example, a changeover to a new DBMS
technology. The database administrator
should be able to change the conceptual or
global structure of the database without
affecting the users.
Data modeling
• The process of discovering, analyzing, and
scoping data requirements
– Understand what the data things are?
– What do they do?
– How do they interact?
• Representing/communicating requirements in
a precise form called a data model
– Maps of critical business assets
– Compose and contain metadata essential to data
consumers
– Function as a kind of sheet music language
– Metadata is essential to other business functions
(definitions for governance, lineage for analytics, etc.)
• The process is iterative and may include a
conceptual, logical, and physical model
© Copyright 2021 by Peter Aiken Slide # 18
https://plusanythingawesome.com
10. Each data arrangement is a data structure
Some data structure characteristics:
• Grammar for data objects
– Grammar is the principles
or rules of an art, science,
or technique "a grammar
of the theater"
• Constraints for data objects
• Sequential order
• Uniqueness
• Order
– Hierarchical, relational,
network, lake, other
• Balance
• Optimality
© Copyright 2021 by Peter Aiken Slide # 19
https://plusanythingawesome.com
http://www.nist.gov/dads/HTML/datastructur.html
"An organization of information, usually in memory, for better algorithm efficiency, such
as queue, stack, linked list, heap, dictionary, and tree, or conceptual unity, such as the
name and address of a person. It may include redundant information, such as length of
the list or number of nodes in a subtree."
https://plusanythingawesome.com
Hierarchy
• A hierarchy is an
arrangement of items
(objects, names, values,
categories, etc.) in which
the items are represented
as being "above",
"below", or "at the same
level as" one another.
• Hierarchy is an important
concept in a wide variety
of fields, such as
philosophy, mathematics,
computer science,
organizational theory,
systems theory, and the
social sciences
(especially political
philosophy).
© Copyright 2021 by Peter Aiken Slide # 20
https://plusanythingawesome.com
Data Architecture
Data Maps
/Models
Mess
Data Maps
/Models
Data Maps
/Models
Data Maps
/Models
Data Maps
/Models
Model View
11. Social
© Copyright 2021 by Peter Aiken Slide # 21
https://plusanythingawesome.com
https://plusanythingawesome.com
Differences between Programs and Projects
• Programs are Ongoing, Projects End
– Managing a program involves long term strategic planning and
continuous process improvement is not required of a project
• Programs are Tied to the Financial Calendar
– Program managers are often responsible for delivering
results tied to the organization's financial calendar
• Program Management is Governance Intensive
– Programs are governed by a senior board that provides direction,
oversight, and control while projects tend to be less governance-intensive
• Programs Have Greater Scope of Financial Management
– Projects typically have a straight-forward budget and project financial management
is focused on spending to budget while program planning, management and control
is significantly more complex
• Program Change Management is an Executive Leadership Capability
– Projects employ a formal change management process while at the program level,
change management requires executive leadership skills and program change is
driven more by an organization's strategy and is subject to market conditions and
changing business goals
© Copyright 2021 by Peter Aiken Slide #
Adapted from http://top.idownloadnew.com/program_vs_project/ and http://management.simplicable.com/management/new/program-management-vs-project-management
Your data program must
last at least as long as
your HR program!
22
https://plusanythingawesome.com
12. What do we teach knowledge workers about data?
© Copyright 2021 by Peter Aiken Slide # 23
https://plusanythingawesome.com
What percentage of the deal with it daily?
Political
© Copyright 2021 by Peter Aiken Slide # 24
https://plusanythingawesome.com
https://plusanythingawesome.com
13. What do we teach IT professionals about data?
© Copyright 2021 by Peter Aiken Slide # 25
https://plusanythingawesome.com
• 1 course
– How to build a
new database
• What
impressions do IT
professionals get
from this
education?
– Data is a technical
skill that is needed
when developing
new databases
© Copyright 2021 by Peter Aiken Slide # 26
https://plusanythingawesome.com
If the only tool you
know is a hammer
you tend to see
every problem as a
nail (slightly reworded
from Abraham Maslow)
14. © Copyright 2021 by Peter Aiken Slide # 27
https://plusanythingawesome.com
Bad data decisions
Technical deci-
sion makers are not
data knowledgable
Business decision
makers are not
data knowledgable
Poor organizational outcomes
Poor treatment of
organizational data
assets
Poor
quality
data
Bad Data Decisions Spiral
Economic
© Copyright 2021 by Peter Aiken Slide # 28
https://plusanythingawesome.com
https://plusanythingawesome.com
15. Tacoma Narrows Bridge/Gallopin' Gertie
© Copyright 2021 by Peter Aiken Slide # 29
https://plusanythingawesome.com
https://plusanythingawesome.com
• World's 3rd longest suspension span
• Slender, elegant and graceful
• Opened on July 1st 1940, collapsed in a windstorm on 7 Nov 1940
• "The most dramatic failure in
bridge engineering history"
• Changed forever how engineers
design suspension bridges leading
to safer spans today.
© Copyright 2021 by Peter Aiken Slide # 30
https://plusanythingawesome.com
Tacoma Narrows Bridge/Gallopin' Gertie
16. © Copyright 2021 by Peter Aiken Slide # 31
https://plusanythingawesome.com
Similarly data failures cost organizations
minimally 20-40% of their IT budget
https://plusanythingawesome.com
Data incoherence is a hidden expense
• How does maltreated data cost money?
• Consider the opposite question:
– Were your systems explicitly designed to
be integrated or otherwise work together?
– If not then what is the likelihood that they
will work well together?
• Organizations spend 20-40% of their IT
budget evolving their data - including:
– Data migration
• Changing the location from one place to another
– Data conversion
• Changing data into another form, state, or product
– Data improving
• "Inspecting and manipulating, or re-keying data to prepare it for
subsequent use" - Source: John Zachman
© Copyright 2021 by Peter Aiken Slide # 32
https://plusanythingawesome.com
PETER AIKEN WITH JUANITA BILLINGS
FOREWORD BY JOHN BOTTEGA
MONETIZING
DATA MANAGEMENT
Unlocking the Value in Your Organization’s
Most Important Asset.
17. Doing a poor job with data
• Takes longer
• Costs more
• Delivers less
• Presents greater risk (with thanks to Tom DeMarco)
© Copyright 2021 by Peter Aiken Slide # 33
https://plusanythingawesome.com
34
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
18. 4 Minute Architecture Lesson from Steve Jobs, Introducing iCloud
© Copyright 2021 by Peter Aiken Slide # 35
https://plusanythingawesome.com
https://plusanythingawesome.com
Architecture is about ...
• Things
– (components)
• The functions of the things
– (individually)
• How the things interact
– (as a system,
– towards a goal)
© Copyright 2021 by Peter Aiken Slide # 36
https://plusanythingawesome.com
• Business
• Process
• Systems
• Security
• Technical
• Data / Information
19. Typically Managed Architectures
• Business Architecture
– Goals, strategies, roles, organizational structure, location(s)
• Process Architecture
– Arrangement of inputs -> transformations = value -> outputs
– Typical elements: Functions, activities, workflow, events, cycles, products, procedures
• Systems Architecture
– Applications, software components, interfaces, projects
• Security Architecture
– Arrangement of security controls relation to IT Architecture
• Technical Architecture/Tarchitecture
– Relation of software capabilities/technology stack
– Structure of the technology infrastructure of an enterprise, solution or system
– Typical elements: Networks, hardware, software platforms, standards/protocols
• Data / Information Architecture
– Arrangement of data assets supporting organizational strategy
– Typical elements: specifications expressed as entities, relationships, attributes,
definitions, values, vocabularies
© Copyright 2021 by Peter Aiken Slide # 37
https://plusanythingawesome.com
1 in 10 organizations manage 1
or more of the formally
Data Architectures: here, whether you like it or not
© Copyright 2021 by Peter Aiken Slide # 38
https://plusanythingawesome.com
deviantart.com
• All organizations
have data
architectures
– Some are better
understood and
documented (and
therefore more
useful to the
organization) than
others
Business
Process
Systems
Security
Technical
Data/Information
20. Data model focus is typically domain specific
© Copyright 2021 by Peter Aiken Slide # 39
https://plusanythingawesome.com
Program A
Program C
Program B
Focus of a software engineering effort
Underutilized
data modeling
effort
Database architecture focus Can vary
© Copyright 2021 by Peter Aiken Slide # 40
https://plusanythingawesome.com
Application
domain 1
Program A
Program C
Program B
Focus of a software engineering effort
Underutilized
data modeling
effort
Better utilized
data modeling
effort
ERPs and COTS are marketed as
being similarly integrated!
Program F
Program E
Program G
Program H
Program I
Application
domain 2
Application
domain 3
Program D
21. Application
domain 1
Program A
Program C
Program B
D
a
t
a
D
a
t
a
D
a
t
a
D
a
t
a
D
a
t
a
D
a
t
a
D
a
t
a
Program F
Program E
Program D
Program G
Program H
Program I
Application
domain 2
Application
domain 3
D
a
t
a
D
a
t
a
D
a
t
a
Data Architecture Focus has Greater Potential Value
• Broader focus than
either software
architecture or
database
architecture
• Analysis scope is
on the system
wide use of data
• Problems caused
by data exchange
or interface
problems
• Architectural goals
more strategic
than operational
© Copyright 2021 by Peter Aiken Slide # 41
https://plusanythingawesome.com
© Copyright 2021 by Peter Aiken Slide #
Metadata
Management
42
https://plusanythingawesome.com
Data
Management
Body
of
Knowledge
(DM
BoK
V2)
Practice
Areas
from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International
• Enterprise data
modeling
• Value chain analysis
• Related data
architecture
22. How are components expressed as architectures?
• Details are
organized into
larger components
• Larger components
are organized into
models
• Models are
organized into
architectures
(composed of
architectural
components)
© Copyright 2021 by Peter Aiken Slide # 43
https://plusanythingawesome.com
A B
C D
A B
C D
A
D
C
B
Intricate
Dependencies
Purposefulness
How are data structures expressed as architectures?
• Attributes are organized into entities/objects
– Attributes are characteristics of "things"
– Entitles/objects are "things" whose
information is managed in support of strategy
– Example(s)
• Entities/objects are organized into models
– Combinations of attributes and entities are
structured to represent information requirements
– Poorly structured data, constrains organizational
information delivery capabilities
– Example(s)
• Models are organized into architectures
– When building new systems, architectures are used to plan development
– More often, data managers do not know what existing architectures are and -
therefore - cannot make use of them in support of strategy implementation
• Why no examples?
© Copyright 2021 by Peter Aiken Slide # 44
https://plusanythingawesome.com
Intricate
Dependencies
Purposefulness
THING
Thing.Id #
Thing.Description
Thing.Status
Thing.Sex.To.Be.Assigned
Thing.Reserve.Reason
23. Data architectures are composed of data models
© Copyright 2021 by Peter Aiken Slide # 45
https://plusanythingawesome.com
Data
Data
Data
Information
Fact Meaning
Request
A model defining 3 important concepts composing a data architecture
© Copyright 2021 by Peter Aiken Slide #
[Built on definitions from Dan Appleton 1983]
Intelligence
Strategic Use
Data
Data
Data Data
46
https://plusanythingawesome.com
“You can have data without information, but
you cannot have information without data”
— Daniel Keys Moran, Science Fiction Writer
1. Each FACT combines with one or more MEANINGS.
2. Each specific FACT and MEANING combination is referred to as a DATUM.
3. An INFORMATION is one or more DATA that are returned in response to a specific REQUEST
4. INFORMATION REUSE is enabled when one FACT is combined with more than one MEANING.
5. INTELLIGENCE is INFORMATION associated with its STRATEGIC USES.
6. DATA/INFORMATION must formally arranged into an ARCHITECTURE.
Wisdom & knowledge are
often used synonymously
Useful Data
24. Data Architectures Determine
Interoperability
• Required to enable
self-correction/generation
capabilities
• Permits governance of data as an
asset
• Prerequisite to meaningful data
exchanges
• Lowers costs of organization-wide
and extra-organizational data
sharing
• Permits managed evolution - rapidly
responding to changing needs, new
partners, time criticality's
• Required for (full) role-based
security implementation
• Decreases the cost of maintaining
data inventories
Data Architectures
• Capture the business meaning of the
data required to run the organization
• Living document – constantly
evolving to meet upcoming and
discovered business requirements
• A potential entry point for
architecture engagements
• Validated data architectural
components can be used to
populate a business glossary
• Major collection of metadata
© Copyright 2021 by Peter Aiken Slide # 47
https://plusanythingawesome.com
Levels of Abstraction, Completeness and Utility
• Models more downward facing - detail
• Architecture is higher level of abstraction - integration
• In the past architecture attempted to gain complete (perfect)
understanding
– Not timely
– Not feasible
• Focus instead on
architectural components
– Governed by a framework
– More immediate utility
• http://www.architecturalcomponentsinc.com
© Copyright 2021 by Peter Aiken Slide # 48
https://plusanythingawesome.com
25. Data structures organized into an Architecture
• How do data structures support strategy?
• Consider the opposite question?
– Were your systems explicitly designed to be
integrated or otherwise work together?
– If not then what is the likelihood that they will
work well together?
– In all likelihood your organization is spending
between 20-40% of its IT budget compensating
for poor data structure integration
– They cannot be helpful as long as their
structure is unknown
• Two answers/two separate strategies
– Achieving efficiency and
effectiveness goals
– Providing organizational dexterity
for rapid implementation
© Copyright 2021 by Peter Aiken Slide # 49
https://plusanythingawesome.com
© Copyright 2021 by Peter Aiken Slide #
Engineering
Architecture
50
https://plusanythingawesome.com
Engineering/Architecting
Relationship
• Architecting is used to
create and build systems too
complex to be treated by
engineering analysis alone
– Require technical details as the
exception
• Engineers develop the
technical designs for
implementation
– Engineering/Crafts-persons
deliver work product
components supervised by:
• Manufacturer
• Building Contractor
26. You cannot architect after implementation!
© Copyright 2021 by Peter Aiken Slide # 51
https://plusanythingawesome.com
https://plusanythingawesome.com
USS Midway
& Pancakes
What is this?
• It is tall
• It has a clutch
• It was built in 1942
• It is cemented to the floor
• It is still in regular use!
© Copyright 2021 by Peter Aiken Slide # 52
https://plusanythingawesome.com
27. © Copyright 2021 by Peter Aiken Slide # 53
https://plusanythingawesome.com
Definition of Bed
https://plusanythingawesome.com
process of discovering, analyzing, and scoping data requirements
• An organization might decide to
characterize the parts of a BED as:
– Attributes: ID, description, status,
sex.to.be.assigned, reserve.reason
• Decisions to manage information
about each specific attribute has
direct consequences
– A decision to use the above data
attributes permits the organization to
determine if it has female beds are available to be reserved
• Characteristics can be shared
– All beds may have a status
– Many beds can be assigned to females
• Characteristics may be required to be unique
– ID permits identification every bed as distinct for every other bed
– Description is unlikely to be the same for each bed
© Copyright 2021 by Peter Aiken Slide # 54
https://plusanythingawesome.com
BED
Bed.Id #
Bed.Description
Bed.Status
Bed.Sex.To.Be.Assigned
Bed.Reserve.Reason
Attributes arranged into an
entity named "bed" – the
attribute Bed.Id is the means
used to identify a unique
occurrence of bed
28. Q: What is the proper relationship for these entities?
© Copyright 2021 by Peter Aiken Slide # 55
https://plusanythingawesome.com
Bed Room
Bed Room
Data Maps at the Entity Level ➜ Stored Facts
© Copyright 2021 by Peter Aiken Slide # 56
https://plusanythingawesome.com
Bed Room
a BED is related to a ROOM
More precision:
many BEDS are related to many ROOMS
Bed Room
Better information:
many BEDS may be contained in each ROOM and each room may contain many beds
What if beds can
be moved?
29. Eventually One or Many (optional)
Eventually One (optional)
Zero, or Many (optional)
One or Many (mandatory)
Exactly One (mandatory)
Possible Entity Relationship Cardinality Options
© Copyright 2021 by Peter Aiken Slide # 57
https://plusanythingawesome.com
Families of Modeling Notation Variants
© Copyright 2021 by Peter Aiken Slide # 58
https://plusanythingawesome.com
Information Engineering
30. What is a Relationship?
• Natural associations between two or more entities
© Copyright 2021 by Peter Aiken Slide # 59
https://plusanythingawesome.com
Ordinality & Cardinality
• Defines mandatory/optional relationships using minimum/
maximum occurrences from one entity to another
© Copyright 2021 by Peter Aiken Slide # 60
https://plusanythingawesome.com
A BED is placed
in one and only
one ROOM A ROOM
contains zero
or more BEDS
A BED is occupied by zero or more
PATIENTS
A PATIENT
occupies at least
one or more BEDS
ROOM
BED
PATIENT
31. Standard definition reporting does not provide conceptual context
© Copyright 2021 by Peter Aiken Slide # 61
https://plusanythingawesome.com
BED
Something you sleep in
Purpose statement incorporates motivations
Entity: BED
Data Asset Type: Principal Data Entity
Purpose: This is a substructure within the Room
substructure of the Facility Location. It contains
information about beds within rooms.
Source: Maintenance Manual for File and Table
Data (Software Version 3.0, Release 3.1)
Attributes: Bed.Description
Bed.Status
Bed.Sex.To.Be.Assigned
Bed.Reserve.Reason
Associations: >0-+ Room
Status: Validated
© Copyright 2021 by Peter Aiken Slide # 62
https://plusanythingawesome.com
Draft
A purpose statement describing
– Why the organization is maintaining information about this business concept;
– Sources of information about it;
– A partial list of the attributes or characteristics of the entity; and
– Associations with other data items(read as "One room contains zero or many beds.")
32. 63
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
BUILD?
WHAT? HOW?
As Is Requirements
Assets WHAT?
As Is Design Assets
HOW?
As Is Implementation
Assets AS BUILT
Forward Engineering
© Copyright 2021 by Peter Aiken Slide # 64
https://plusanythingawesome.com
New
Building new stuff - in this case, new databases
33. Systems Development Life Cycle (SDLC)
© Copyright 2021 by Peter Aiken Slide # 65
https://plusanythingawesome.com
WHAT?
HOW?
BUILD?
66
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
34. Data Representation is the Essence of Programming
• Mythical Man Month ➜ 9 parallel effort x 1 month each ≠ baby
• Fred Brooks Jr.'s observation
– Data representation is the essence of programming
– "Show me your flowchart and
conceal your tables, and
I shall continue to be mystified.
– Show me your tables, and
I won't usually need your flowchart;
it'll be obvious."
© Copyright 2021 by Peter Aiken Slide # 67
https://plusanythingawesome.com
As Is Requirements
Assets WHAT?
As Is Design Assets
HOW?
As Is Implementation
Assets AS BUILT
Existing
Reverse Engineering
© Copyright 2021 by Peter Aiken Slide # 68
https://plusanythingawesome.com
A structured technique aimed at recovering rigorous knowledge
of the existing system to leverage enhancement efforts
[Chikofsky & Cross 1990]
35. 69
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
As Is Requirements
Assets WHAT?
As Is Design Assets
HOW?
As Is Implementation
Assets AS BUILT
Existing
New
Reengineering
Reverse Engineering
Forward engineering
Reimplement
To Be
Implementation
Assets
To Be
Design
Assets
To Be Requirements
Assets
• First, reverse engineering the existing system
to understand its strengths/weaknesses
• Next, use this information to inform the design
of the new system
© Copyright 2021 by Peter Aiken Slide # 70
https://plusanythingawesome.com
36. Data Modeling Process
1. Identify entities
2. Identify key for each
entity
3. Draw rough draft of
entity relationship
data model
4. Identify data
attributes
5. Map data attributes
to entities
© Copyright 2021 by Peter Aiken Slide # 71
https://plusanythingawesome.com
Model evolution is good, at first ...
1. Identify entities
2. Identify key for each
entity
3. Draw rough draft of
entity relationship
data model
4. Identify data
attributes
5. Map data attributes
to entities
© Copyright 2021 by Peter Aiken Slide # 72
https://plusanythingawesome.com
37. Relative use of time allocated to tasks during modeling
© Copyright 2021 by Peter Aiken Slide # 73
https://plusanythingawesome.com
Preliminary Modeling Wrapup
Activity activities Cycles activities
Evidence Analysis
collection &
analysis Collection
Project
coordination
requirements Declining coordination requirements
Target
system
analysis Increasing amounts of target system analysis
Modeling Validation
cycle
focus Refinement
Source:http://dmreview.com/article_sub.cfm?articleID=1000941 used with permission
Meta Data Models
© Copyright 2021 by Peter Aiken Slide # 74
https://plusanythingawesome.com
38. 75
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
Take Aways
• Attributes are organized into entities/objects
– Attributes are characteristics of "things"
– Entitles/objects are "things" whose
information is managed in support of strategy
– Example(s)
• Entities/objects are organized into models
– Combinations of attributes and entities are
structured to represent information requirements
– Poorly structured data, constrains organizational
information delivery capabilities
– Example(s)
• Models are organized into architectures
– When building new systems, architectures are used to plan development
– More often, data managers do not know what existing architectures are and -
therefore - cannot make use of them in support of strategy implementation
– Why no examples?
© Copyright 2021 by Peter Aiken Slide # 76
https://plusanythingawesome.com
Entity: BED
Attributes: Bed.Description
Bed.Status
Bed.Sex.To.Be.Assigned
Bed.Reserve.Reason
Intricate
Dependencies
Purposefulness
39. Learning Objectives
• Both data architecture and data models are
made more useful by each other
– Power derived from their interdependence
– Data models are a primary means to achieve shared
understanding of data challenge specifics
– They are literally the pages that intersect data assets
and the organizational response
• Data architecture is the sum of the data
models
– Coverage is rarely complete
• Data models
– Documentation
– The currency of data coordination
– Used to verify integration
– Mandated input to any data systems evolution
• What is taught
– Forward engineering with a goal of building
• What is also needed
– Reverse engineering with a goal of understanding
© Copyright 2021 by Peter Aiken Slide # 77
https://plusanythingawesome.com
Upcoming Events (Webinars begin @ 19:00 UTC/2:00 PM NYC)
© Copyright 2021 by Peter Aiken Slide # 78
https://plusanythingawesome.com
Brought to you by:
Why Data Modeling
Is Fundamental
8 June 2021
Business Value through
Reference & Master Data Strategies
13 July 2021
Getting (Re)Started with Data Stewardship
10 August 2021