SlideShare a Scribd company logo
1 of 40
Download to read offline
© Copyright 2021 by Peter Aiken Slide # 1
paiken@plusanythingawesome.com+1.804.382.5957 Peter Aiken, PhD
Data Architecture and
Data Modeling Differences:
Data mapping from two perspectives–achieving a common understanding
Peter Aiken, Ph.D.
• I've been doing this a long time
• My work is recognized as useful
• Associate Professor of IS (vcu.edu)
• Institute for Defense Analyses (ida.org)
• DAMA International (dama.org)
• MIT CDO Society (iscdo.org)
• Anything Awesome (plusanythingawesome.com)
• 11 books and dozens of articles
• Experienced w/ 500+ data
management practices worldwide
• Multi-year immersions
– US DoD (DISA/Army/Marines/DLA)
– Nokia
– Deutsche Bank
– Wells Fargo
– Walmart …
© Copyright 2021 by Peter Aiken Slide # 2
https://plusanythingawesome.com
Event Pricing
© Copyright 2021 by Peter Aiken Slide # 3
https://plusanythingawesome.com
• 20% off
directly from the publisher on
select titles
• My Book Store @
http://plusanythingawwsome.com
• Enter the code
"anythingawesome" at the
Technics bookstore checkout
where it says to
"Apply Coupon"
anythingawesome
4
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
Data ...
• As a subject is
– Complex and detailed
– Taught inconsistently, and
– Poorly understood
• Data maps (models) are necessary but
insufficient prerequisites to data architectures
– Fully leveraging data assets
• Maps are incomplete without purpose statements
– More powerful than definitions
– Remedy
• Add purpose statements
• Validate resulting model
• Maps are required to share information about data
• Data architectures are composed of data models
– Data modeling is an engineering activity required to product data maps that are
necessary but insufficient prerequisites to leveraging data assets
© Copyright 2021 by Peter Aiken Slide # 5
https://plusanythingawesome.com
© Copyright 2021 by Peter Aiken Slide #
Metadata
Management
6
https://plusanythingawesome.com
Data
Management
Body
of
Knowledge
(DM
BoK
V2)
Practice
Areas
from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International
• Analysis
• Database Design
• Implementation
• Additional data
development
https://www.amazon.com/Infonomics-Monetize-Information-Competitive-Advantage/dp/1138090387
Data's Unique Properties
• Does not obey all of the laws of physics
– Not really visible (visualization expertise)
• Non rivalrous
– the cost of providing an additional copy is zero
• Non depleting
– Does not require replenishment
• Regenerative
• Nearly unlimited
• Low inventory and transportation/transmission costs
• More difficult to control and own
• Eco friendly
• Impossible to clean-up if you spill it
© Copyright 2021 by Peter Aiken Slide # 7
https://plusanythingawesome.com
Data modeling
• The process of discovering, analyzing, and
scoping data requirements
– Understand what the data things are?
– What do they do?
– How do they interact?
• Representing/communicating requirements in
a precise form called a data model
– Maps of critical business assets
– Compose and contain metadata essential to data
consumers
– Function as a kind of sheet music language
– Metadata is essential to other business functions
(definitions for governance, lineage for analytics, etc.)
• The process is iterative and may include a
conceptual, logical, and physical model
© Copyright 2021 by Peter Aiken Slide # 8
https://plusanythingawesome.com
process of discovering, analyzing, and scoping data requirements
© Copyright 2021 by Peter Aiken Slide # 9
https://plusanythingawesome.com
• List organizational places
• These are called Attributes
– Attributes are characteristics of "things"
• List organizational places that need to be
persons
places
things
created
read
updated
deleted
archived
https://plusanythingawesome.com
process of discovering, analyzing, and scoping data requirements
• An organization might decide to
characterize the parts of a THING as:
– Attributes: ID, description, status,
sex.to.be.assigned, reserve.reason
• Decisions to manage information
about each specific attribute has
direct consequences
– A decision to use the above data
attributes permits the organization to
determine if it has female THINGs are available to be reserved
• Characteristics can be shared
– All THINGs may have a status
– Many THINGs can be assigned to females
• Characteristics may be required to be unique
– ID permits identification every THING as distinct for every other THING
– Description is likely to be unique for each THING
© Copyright 2021 by Peter Aiken Slide # 10
https://plusanythingawesome.com
THING
Thing.Id #
Thing.Description
Thing.Status
Thing.Sex.To.Be.Assigned
Thing.Reserve.Reason
Attributes arranged into an
entity named "thing" – the
attribute Thing.Id is the means
used to identify a unique
occurrence of thing
Data modeling
• The process of discovering, analyzing, and
scoping data requirements
– Understand what the data things are?
– What do they do?
– How do they interact?
• Representing/communicating requirements in
a precise form called a data model
– Maps of critical business assets
– Compose and contain metadata essential to data
consumers
– Function as a kind of sheet music language
– Metadata is essential to other business functions
(definitions for governance, lineage for analytics, etc.)
• The process is iterative and may include a
conceptual, logical, and physical model
© Copyright 2021 by Peter Aiken Slide # 11
https://plusanythingawesome.com
representing and communicating these in a precise form called a data model
© Copyright 2021 by Peter Aiken Slide # 12
https://plusanythingawesome.com
Thing.Id #
Thing.Description
Thing.Status
Thing.Sex.To.Be.Assigned
Thing.Reserve.Reason
Thing 1
Thing.Id #
…
Thing 2
Each THING 2 must be accompanied by a THING 1
representing/communicating requirements in a precise form called a data model
© Copyright 2021 by Peter Aiken Slide # 13
https://plusanythingawesome.com
• Defines mandatory/optional
relationships between using
minimum/maximum
occurrences from one entity
to another
Data modeling
• The process of discovering, analyzing, and
scoping data requirements
– Understand what the data things are?
– What do they do?
– How do they interact?
• Representing/communicating requirements in
a precise form called a data model
– Maps of critical business assets
– Compose and contain metadata essential to data
consumers
– Function as a kind of sheet music language
– Metadata is essential to other business functions
(definitions for governance, lineage for analytics, etc.)
• The process is iterative and may include a
conceptual, logical, and physical model
© Copyright 2021 by Peter Aiken Slide # 14
https://plusanythingawesome.com
© Copyright 2021 by Peter Aiken Slide #
!

!

 !

!

15
https://plusanythingawesome.com
Organizational Needs
become instantiated
and integrated into a
Data Models
Informa(on)System)
Requirements
authorizes and
articulates
satisfy
specific
organizational
needs
The process is iterative
Data models and data architectures are
developed in response to needs
Data modeling
• The process of discovering, analyzing, and
scoping data requirements
– Understand what the data things are?
– What do they do?
– How do they interact?
• Representing/communicating requirements in
a precise form called a data model
– Maps of critical business assets
– Compose and contain metadata essential to data
consumers
– Function as a kind of sheet music language
– Metadata is essential to other business functions
(definitions for governance, lineage for analytics, etc.)
• The process is iterative and may include a
conceptual, logical, and physical model
© Copyright 2021 by Peter Aiken Slide # 16
https://plusanythingawesome.com
ANSI-SPARC 3-Layer Schema
1. CONCEPTUAL - Allows independent
customized user views:
– Each should be able to access the same
data, but have a different customized view
of the data.
2. LOGICAL - This hides the physical
storage details from users:
– Users should not have to deal with physical
database storage details. They should be
allowed to work with the data itself, without
concern for how it is physically stored.
3. PHYSICAL - The database
administrator should be able to
change the database storage
structures without affecting the
users’ views:
– Changes to the structure of an
organization's data will be required. The
internal structure of the database should
be unaffected by changes to the physical
aspects of the storage.
© Copyright 2021 by Peter Aiken Slide # 17
https://plusanythingawesome.com
For example, a changeover to a new DBMS
technology. The database administrator
should be able to change the conceptual or
global structure of the database without
affecting the users.
Data modeling
• The process of discovering, analyzing, and
scoping data requirements
– Understand what the data things are?
– What do they do?
– How do they interact?
• Representing/communicating requirements in
a precise form called a data model
– Maps of critical business assets
– Compose and contain metadata essential to data
consumers
– Function as a kind of sheet music language
– Metadata is essential to other business functions
(definitions for governance, lineage for analytics, etc.)
• The process is iterative and may include a
conceptual, logical, and physical model
© Copyright 2021 by Peter Aiken Slide # 18
https://plusanythingawesome.com
Each data arrangement is a data structure
Some data structure characteristics:
• Grammar for data objects
– Grammar is the principles
or rules of an art, science,
or technique "a grammar
of the theater"
• Constraints for data objects
• Sequential order
• Uniqueness
• Order
– Hierarchical, relational,
network, lake, other
• Balance
• Optimality
© Copyright 2021 by Peter Aiken Slide # 19
https://plusanythingawesome.com
http://www.nist.gov/dads/HTML/datastructur.html
"An organization of information, usually in memory, for better algorithm efficiency, such
as queue, stack, linked list, heap, dictionary, and tree, or conceptual unity, such as the
name and address of a person. It may include redundant information, such as length of
the list or number of nodes in a subtree."
https://plusanythingawesome.com
Hierarchy
• A hierarchy is an
arrangement of items
(objects, names, values,
categories, etc.) in which
the items are represented
as being "above",
"below", or "at the same
level as" one another.
• Hierarchy is an important
concept in a wide variety
of fields, such as
philosophy, mathematics,
computer science,
organizational theory,
systems theory, and the
social sciences
(especially political
philosophy).
© Copyright 2021 by Peter Aiken Slide # 20
https://plusanythingawesome.com
Data Architecture
Data Maps
/Models
Mess
Data Maps
/Models
Data Maps
/Models
Data Maps
/Models
Data Maps
/Models
Model View
Social
© Copyright 2021 by Peter Aiken Slide # 21
https://plusanythingawesome.com
https://plusanythingawesome.com
Differences between Programs and Projects
• Programs are Ongoing, Projects End
– Managing a program involves long term strategic planning and
continuous process improvement is not required of a project
• Programs are Tied to the Financial Calendar
– Program managers are often responsible for delivering
results tied to the organization's financial calendar
• Program Management is Governance Intensive
– Programs are governed by a senior board that provides direction,
oversight, and control while projects tend to be less governance-intensive
• Programs Have Greater Scope of Financial Management
– Projects typically have a straight-forward budget and project financial management
is focused on spending to budget while program planning, management and control
is significantly more complex
• Program Change Management is an Executive Leadership Capability
– Projects employ a formal change management process while at the program level,
change management requires executive leadership skills and program change is
driven more by an organization's strategy and is subject to market conditions and
changing business goals
© Copyright 2021 by Peter Aiken Slide #
Adapted from http://top.idownloadnew.com/program_vs_project/ and http://management.simplicable.com/management/new/program-management-vs-project-management
Your data program must
last at least as long as
your HR program!
22
https://plusanythingawesome.com
What do we teach knowledge workers about data?
© Copyright 2021 by Peter Aiken Slide # 23
https://plusanythingawesome.com
What percentage of the deal with it daily?
Political
© Copyright 2021 by Peter Aiken Slide # 24
https://plusanythingawesome.com
https://plusanythingawesome.com
What do we teach IT professionals about data?
© Copyright 2021 by Peter Aiken Slide # 25
https://plusanythingawesome.com
• 1 course
– How to build a
new database
• What
impressions do IT
professionals get
from this
education?
– Data is a technical
skill that is needed
when developing
new databases
© Copyright 2021 by Peter Aiken Slide # 26
https://plusanythingawesome.com
If the only tool you
know is a hammer
you tend to see
every problem as a
nail (slightly reworded
from Abraham Maslow)
© Copyright 2021 by Peter Aiken Slide # 27
https://plusanythingawesome.com
Bad data decisions
Technical deci-
sion makers are not
data knowledgable
Business decision
makers are not
data knowledgable
Poor organizational outcomes
Poor treatment of
organizational data
assets
Poor
quality
data
Bad Data Decisions Spiral
Economic
© Copyright 2021 by Peter Aiken Slide # 28
https://plusanythingawesome.com
https://plusanythingawesome.com
Tacoma Narrows Bridge/Gallopin' Gertie
© Copyright 2021 by Peter Aiken Slide # 29
https://plusanythingawesome.com
https://plusanythingawesome.com
• World's 3rd longest suspension span
• Slender, elegant and graceful
• Opened on July 1st 1940, collapsed in a windstorm on 7 Nov 1940
• "The most dramatic failure in
bridge engineering history"
• Changed forever how engineers
design suspension bridges leading
to safer spans today.
© Copyright 2021 by Peter Aiken Slide # 30
https://plusanythingawesome.com
Tacoma Narrows Bridge/Gallopin' Gertie
© Copyright 2021 by Peter Aiken Slide # 31
https://plusanythingawesome.com
Similarly data failures cost organizations
minimally 20-40% of their IT budget
https://plusanythingawesome.com
Data incoherence is a hidden expense
• How does maltreated data cost money?
• Consider the opposite question:
– Were your systems explicitly designed to
be integrated or otherwise work together?
– If not then what is the likelihood that they
will work well together?
• Organizations spend 20-40% of their IT
budget evolving their data - including:
– Data migration
• Changing the location from one place to another
– Data conversion
• Changing data into another form, state, or product
– Data improving
• "Inspecting and manipulating, or re-keying data to prepare it for
subsequent use" - Source: John Zachman
© Copyright 2021 by Peter Aiken Slide # 32
https://plusanythingawesome.com
PETER AIKEN WITH JUANITA BILLINGS
FOREWORD BY JOHN BOTTEGA
MONETIZING
DATA MANAGEMENT
Unlocking the Value in Your Organization’s
Most Important Asset.
Doing a poor job with data
• Takes longer
• Costs more
• Delivers less
• Presents greater risk (with thanks to Tom DeMarco)
© Copyright 2021 by Peter Aiken Slide # 33
https://plusanythingawesome.com
34
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
4 Minute Architecture Lesson from Steve Jobs, Introducing iCloud
© Copyright 2021 by Peter Aiken Slide # 35
https://plusanythingawesome.com
https://plusanythingawesome.com
Architecture is about ...
• Things
– (components)
• The functions of the things
– (individually)
• How the things interact
– (as a system,
– towards a goal)
© Copyright 2021 by Peter Aiken Slide # 36
https://plusanythingawesome.com
• Business
• Process
• Systems
• Security
• Technical
• Data / Information
Typically Managed Architectures
• Business Architecture
– Goals, strategies, roles, organizational structure, location(s)
• Process Architecture
– Arrangement of inputs -> transformations = value -> outputs
– Typical elements: Functions, activities, workflow, events, cycles, products, procedures
• Systems Architecture
– Applications, software components, interfaces, projects
• Security Architecture
– Arrangement of security controls relation to IT Architecture
• Technical Architecture/Tarchitecture
– Relation of software capabilities/technology stack
– Structure of the technology infrastructure of an enterprise, solution or system
– Typical elements: Networks, hardware, software platforms, standards/protocols
• Data / Information Architecture
– Arrangement of data assets supporting organizational strategy
– Typical elements: specifications expressed as entities, relationships, attributes,
definitions, values, vocabularies
© Copyright 2021 by Peter Aiken Slide # 37
https://plusanythingawesome.com
1 in 10 organizations manage 1
or more of the formally
Data Architectures: here, whether you like it or not
© Copyright 2021 by Peter Aiken Slide # 38
https://plusanythingawesome.com
deviantart.com
• All organizations
have data
architectures
– Some are better
understood and
documented (and
therefore more
useful to the
organization) than
others
Business
Process
Systems
Security
Technical
Data/Information
Data model focus is typically domain specific
© Copyright 2021 by Peter Aiken Slide # 39
https://plusanythingawesome.com
Program A
Program C
Program B
Focus of a software engineering effort
Underutilized
data modeling
effort
Database architecture focus Can vary
© Copyright 2021 by Peter Aiken Slide # 40
https://plusanythingawesome.com
Application
domain 1
Program A
Program C
Program B
Focus of a software engineering effort
Underutilized
data modeling
effort
Better utilized
data modeling
effort
ERPs and COTS are marketed as
being similarly integrated!
Program F
Program E
Program G
Program H
Program I
Application
domain 2
Application
domain 3
Program D
Application
domain 1
Program A
Program C
Program B
D
a
t
a
D
a
t
a
D
a
t
a
D
a
t
a
D
a
t
a
D
a
t
a
D
a
t
a
Program F
Program E
Program D
Program G
Program H
Program I
Application
domain 2
Application
domain 3
D
a
t
a
D
a
t
a
D
a
t
a
Data Architecture Focus has Greater Potential Value
• Broader focus than
either software
architecture or
database
architecture
• Analysis scope is
on the system
wide use of data
• Problems caused
by data exchange
or interface
problems
• Architectural goals
more strategic
than operational
© Copyright 2021 by Peter Aiken Slide # 41
https://plusanythingawesome.com
© Copyright 2021 by Peter Aiken Slide #
Metadata
Management
42
https://plusanythingawesome.com
Data
Management
Body
of
Knowledge
(DM
BoK
V2)
Practice
Areas
from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International
• Enterprise data
modeling
• Value chain analysis
• Related data
architecture
How are components expressed as architectures?
• Details are
organized into
larger components
• Larger components
are organized into
models
• Models are
organized into
architectures
(composed of
architectural
components)
© Copyright 2021 by Peter Aiken Slide # 43
https://plusanythingawesome.com
A B
C D
A B
C D
A
D
C
B
Intricate
Dependencies
Purposefulness
How are data structures expressed as architectures?
• Attributes are organized into entities/objects
– Attributes are characteristics of "things"
– Entitles/objects are "things" whose
information is managed in support of strategy
– Example(s)
• Entities/objects are organized into models
– Combinations of attributes and entities are
structured to represent information requirements
– Poorly structured data, constrains organizational
information delivery capabilities
– Example(s)
• Models are organized into architectures
– When building new systems, architectures are used to plan development
– More often, data managers do not know what existing architectures are and -
therefore - cannot make use of them in support of strategy implementation
• Why no examples?
© Copyright 2021 by Peter Aiken Slide # 44
https://plusanythingawesome.com
Intricate
Dependencies
Purposefulness
THING
Thing.Id #
Thing.Description
Thing.Status
Thing.Sex.To.Be.Assigned
Thing.Reserve.Reason
Data architectures are composed of data models
© Copyright 2021 by Peter Aiken Slide # 45
https://plusanythingawesome.com
Data
Data
Data
Information
Fact Meaning
Request
A model defining 3 important concepts composing a data architecture
© Copyright 2021 by Peter Aiken Slide #
[Built on definitions from Dan Appleton 1983]
Intelligence
Strategic Use
Data
Data
Data Data
46
https://plusanythingawesome.com
“You can have data without information, but
you cannot have information without data”
— Daniel Keys Moran, Science Fiction Writer
1. Each FACT combines with one or more MEANINGS.
2. Each specific FACT and MEANING combination is referred to as a DATUM.
3. An INFORMATION is one or more DATA that are returned in response to a specific REQUEST
4. INFORMATION REUSE is enabled when one FACT is combined with more than one MEANING.
5. INTELLIGENCE is INFORMATION associated with its STRATEGIC USES.
6. DATA/INFORMATION must formally arranged into an ARCHITECTURE.
Wisdom & knowledge are
often used synonymously
Useful Data
Data Architectures Determine
Interoperability
• Required to enable
self-correction/generation
capabilities
• Permits governance of data as an
asset
• Prerequisite to meaningful data
exchanges
• Lowers costs of organization-wide
and extra-organizational data
sharing
• Permits managed evolution - rapidly
responding to changing needs, new
partners, time criticality's
• Required for (full) role-based
security implementation
• Decreases the cost of maintaining
data inventories
Data Architectures
• Capture the business meaning of the
data required to run the organization
• Living document – constantly
evolving to meet upcoming and
discovered business requirements
• A potential entry point for
architecture engagements
• Validated data architectural
components can be used to
populate a business glossary
• Major collection of metadata
© Copyright 2021 by Peter Aiken Slide # 47
https://plusanythingawesome.com
Levels of Abstraction, Completeness and Utility
• Models more downward facing - detail
• Architecture is higher level of abstraction - integration
• In the past architecture attempted to gain complete (perfect)
understanding
– Not timely
– Not feasible
• Focus instead on
architectural components
– Governed by a framework
– More immediate utility
• http://www.architecturalcomponentsinc.com
© Copyright 2021 by Peter Aiken Slide # 48
https://plusanythingawesome.com
Data structures organized into an Architecture
• How do data structures support strategy?
• Consider the opposite question?
– Were your systems explicitly designed to be
integrated or otherwise work together?
– If not then what is the likelihood that they will
work well together?
– In all likelihood your organization is spending
between 20-40% of its IT budget compensating
for poor data structure integration
– They cannot be helpful as long as their
structure is unknown
• Two answers/two separate strategies
– Achieving efficiency and
effectiveness goals
– Providing organizational dexterity
for rapid implementation
© Copyright 2021 by Peter Aiken Slide # 49
https://plusanythingawesome.com
© Copyright 2021 by Peter Aiken Slide #
Engineering
Architecture
50
https://plusanythingawesome.com
Engineering/Architecting
Relationship
• Architecting is used to
create and build systems too
complex to be treated by
engineering analysis alone
– Require technical details as the
exception
• Engineers develop the
technical designs for
implementation
– Engineering/Crafts-persons
deliver work product
components supervised by:
• Manufacturer
• Building Contractor
You cannot architect after implementation!
© Copyright 2021 by Peter Aiken Slide # 51
https://plusanythingawesome.com
https://plusanythingawesome.com
USS Midway
& Pancakes
What is this?
• It is tall
• It has a clutch
• It was built in 1942
• It is cemented to the floor
• It is still in regular use!
© Copyright 2021 by Peter Aiken Slide # 52
https://plusanythingawesome.com
© Copyright 2021 by Peter Aiken Slide # 53
https://plusanythingawesome.com
Definition of Bed
https://plusanythingawesome.com
process of discovering, analyzing, and scoping data requirements
• An organization might decide to
characterize the parts of a BED as:
– Attributes: ID, description, status,
sex.to.be.assigned, reserve.reason
• Decisions to manage information
about each specific attribute has
direct consequences
– A decision to use the above data
attributes permits the organization to
determine if it has female beds are available to be reserved
• Characteristics can be shared
– All beds may have a status
– Many beds can be assigned to females
• Characteristics may be required to be unique
– ID permits identification every bed as distinct for every other bed
– Description is unlikely to be the same for each bed
© Copyright 2021 by Peter Aiken Slide # 54
https://plusanythingawesome.com
BED
Bed.Id #
Bed.Description
Bed.Status
Bed.Sex.To.Be.Assigned
Bed.Reserve.Reason
Attributes arranged into an
entity named "bed" – the
attribute Bed.Id is the means
used to identify a unique
occurrence of bed
Q: What is the proper relationship for these entities?
© Copyright 2021 by Peter Aiken Slide # 55
https://plusanythingawesome.com
Bed Room
Bed Room
Data Maps at the Entity Level ➜ Stored Facts
© Copyright 2021 by Peter Aiken Slide # 56
https://plusanythingawesome.com
Bed Room
a BED is related to a ROOM
More precision:
many BEDS are related to many ROOMS
Bed Room
Better information:
many BEDS may be contained in each ROOM and each room may contain many beds
What if beds can
be moved?
Eventually One or Many (optional)
Eventually One (optional)
Zero, or Many (optional)
One or Many (mandatory)
Exactly One (mandatory)
Possible Entity Relationship Cardinality Options
© Copyright 2021 by Peter Aiken Slide # 57
https://plusanythingawesome.com
Families of Modeling Notation Variants
© Copyright 2021 by Peter Aiken Slide # 58
https://plusanythingawesome.com
Information Engineering
What is a Relationship?
• Natural associations between two or more entities
© Copyright 2021 by Peter Aiken Slide # 59
https://plusanythingawesome.com
Ordinality & Cardinality
• Defines mandatory/optional relationships using minimum/
maximum occurrences from one entity to another
© Copyright 2021 by Peter Aiken Slide # 60
https://plusanythingawesome.com
A BED is placed
in one and only
one ROOM A ROOM
contains zero
or more BEDS
A BED is occupied by zero or more
PATIENTS
A PATIENT
occupies at least
one or more BEDS
ROOM
BED
PATIENT
Standard definition reporting does not provide conceptual context
© Copyright 2021 by Peter Aiken Slide # 61
https://plusanythingawesome.com
BED
Something you sleep in
Purpose statement incorporates motivations
Entity: BED
Data Asset Type: Principal Data Entity
Purpose: This is a substructure within the Room
substructure of the Facility Location. It contains
information about beds within rooms.
Source: Maintenance Manual for File and Table
Data (Software Version 3.0, Release 3.1)
Attributes: Bed.Description
Bed.Status
Bed.Sex.To.Be.Assigned
Bed.Reserve.Reason
Associations: >0-+ Room
Status: Validated
© Copyright 2021 by Peter Aiken Slide # 62
https://plusanythingawesome.com
Draft
A purpose statement describing
– Why the organization is maintaining information about this business concept;
– Sources of information about it;
– A partial list of the attributes or characteristics of the entity; and
– Associations with other data items(read as "One room contains zero or many beds.")
63
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
BUILD?
WHAT? HOW?
As Is Requirements
Assets WHAT?
As Is Design Assets
HOW?
As Is Implementation
Assets AS BUILT
Forward Engineering
© Copyright 2021 by Peter Aiken Slide # 64
https://plusanythingawesome.com
New
Building new stuff - in this case, new databases
Systems Development Life Cycle (SDLC)
© Copyright 2021 by Peter Aiken Slide # 65
https://plusanythingawesome.com
WHAT?
HOW?
BUILD?
66
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
Data Representation is the Essence of Programming
• Mythical Man Month ➜ 9 parallel effort x 1 month each ≠ baby
• Fred Brooks Jr.'s observation
– Data representation is the essence of programming
– "Show me your flowchart and
conceal your tables, and
I shall continue to be mystified.
– Show me your tables, and
I won't usually need your flowchart;
it'll be obvious."
© Copyright 2021 by Peter Aiken Slide # 67
https://plusanythingawesome.com
As Is Requirements
Assets WHAT?
As Is Design Assets
HOW?
As Is Implementation
Assets AS BUILT
Existing
Reverse Engineering
© Copyright 2021 by Peter Aiken Slide # 68
https://plusanythingawesome.com
A structured technique aimed at recovering rigorous knowledge
of the existing system to leverage enhancement efforts
[Chikofsky & Cross 1990]
69
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
As Is Requirements
Assets WHAT?
As Is Design Assets
HOW?
As Is Implementation
Assets AS BUILT
Existing
New
Reengineering
Reverse Engineering
Forward engineering
Reimplement
To Be
Implementation
Assets
To Be
Design
Assets
To Be Requirements
Assets
• First, reverse engineering the existing system
to understand its strengths/weaknesses
• Next, use this information to inform the design
of the new system
© Copyright 2021 by Peter Aiken Slide # 70
https://plusanythingawesome.com
Data Modeling Process
1. Identify entities
2. Identify key for each
entity
3. Draw rough draft of
entity relationship
data model
4. Identify data
attributes
5. Map data attributes
to entities
© Copyright 2021 by Peter Aiken Slide # 71
https://plusanythingawesome.com
Model evolution is good, at first ...
1. Identify entities
2. Identify key for each
entity
3. Draw rough draft of
entity relationship
data model
4. Identify data
attributes
5. Map data attributes
to entities
© Copyright 2021 by Peter Aiken Slide # 72
https://plusanythingawesome.com
Relative use of time allocated to tasks during modeling
© Copyright 2021 by Peter Aiken Slide # 73
https://plusanythingawesome.com
Preliminary Modeling Wrapup
Activity activities Cycles activities
Evidence Analysis
collection &
analysis Collection
Project
coordination
requirements Declining coordination requirements
Target
system
analysis Increasing amounts of target system analysis
Modeling Validation
cycle
focus Refinement
Source:http://dmreview.com/article_sub.cfm?articleID=1000941 used with permission
Meta Data Models
© Copyright 2021 by Peter Aiken Slide # 74
https://plusanythingawesome.com
75
© Copyright 2021 by Peter Aiken Slide #
https://plusanythingawesome.com
• Data Maps ➜ Models
– Why do we need them?
– How are they used?
– Challenges to increased use
(social, political, economic)
• Architecture/Engineering
– Two sides of same data coin
– Must operate on standard,
shared data of known quality
• From the Top
– Means: Forward engineering
– Goal: Composition/Building
• From the Bottom
– Means: Reverse engineering
– Goal: Understanding
• Working Together
– Functions required for effective
data management
– Need for simplicity
• Take Aways/Q&A
Data Architecture and
Data Modeling Differences:
Achieving a common understanding
Program
Take Aways
• Attributes are organized into entities/objects
– Attributes are characteristics of "things"
– Entitles/objects are "things" whose
information is managed in support of strategy
– Example(s)
• Entities/objects are organized into models
– Combinations of attributes and entities are
structured to represent information requirements
– Poorly structured data, constrains organizational
information delivery capabilities
– Example(s)
• Models are organized into architectures
– When building new systems, architectures are used to plan development
– More often, data managers do not know what existing architectures are and -
therefore - cannot make use of them in support of strategy implementation
– Why no examples?
© Copyright 2021 by Peter Aiken Slide # 76
https://plusanythingawesome.com
Entity: BED
Attributes: Bed.Description
Bed.Status
Bed.Sex.To.Be.Assigned
Bed.Reserve.Reason
Intricate
Dependencies
Purposefulness
Learning Objectives
• Both data architecture and data models are
made more useful by each other
– Power derived from their interdependence
– Data models are a primary means to achieve shared
understanding of data challenge specifics
– They are literally the pages that intersect data assets
and the organizational response
• Data architecture is the sum of the data
models
– Coverage is rarely complete
• Data models
– Documentation
– The currency of data coordination
– Used to verify integration
– Mandated input to any data systems evolution
• What is taught
– Forward engineering with a goal of building
• What is also needed
– Reverse engineering with a goal of understanding
© Copyright 2021 by Peter Aiken Slide # 77
https://plusanythingawesome.com
Upcoming Events (Webinars begin @ 19:00 UTC/2:00 PM NYC)
© Copyright 2021 by Peter Aiken Slide # 78
https://plusanythingawesome.com
Brought to you by:
Why Data Modeling
Is Fundamental
8 June 2021
Business Value through
Reference & Master Data Strategies
13 July 2021
Getting (Re)Started with Data Stewardship
10 August 2021
paiken@plusanythingawesome.com +1.804.382.5957
Questions?
Thank You!
© Copyright 2021 by Peter Aiken Slide # 79
Book a call with Peter to discuss anything - https://plusanythingawesome.com/OfficeHours.html
+ =

More Related Content

What's hot

What's hot (20)

RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data Catalog
 
Whitepaper on Master Data Management
Whitepaper on Master Data Management Whitepaper on Master Data Management
Whitepaper on Master Data Management
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Master Data Management – Aligning Data, Process, and Governance
Master Data Management – Aligning Data, Process, and GovernanceMaster Data Management – Aligning Data, Process, and Governance
Master Data Management – Aligning Data, Process, and Governance
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Best Practices in Metadata Management
Best Practices in Metadata ManagementBest Practices in Metadata Management
Best Practices in Metadata Management
 
Data-Ed Slides: Best Practices in Data Stewardship (Technical)
Data-Ed Slides: Best Practices in Data Stewardship (Technical)Data-Ed Slides: Best Practices in Data Stewardship (Technical)
Data-Ed Slides: Best Practices in Data Stewardship (Technical)
 
Enterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture Deliverables
 
The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of Metadata
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Master Data Management - Gartner Presentation
Master Data Management - Gartner PresentationMaster Data Management - Gartner Presentation
Master Data Management - Gartner Presentation
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital Transformation
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Data Management is Data Governance
Data Management is Data GovernanceData Management is Data Governance
Data Management is Data Governance
 
Metadata Strategies - Data Squared
Metadata Strategies - Data SquaredMetadata Strategies - Data Squared
Metadata Strategies - Data Squared
 
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
 

Similar to DataEd Online: Data Architecture and Data Modeling Differences — Achieving a Common Understanding

DataEd Slides: Data Architecture vs. Data Modeling – Compare and Contrast
DataEd Slides: Data Architecture vs. Data Modeling – Compare and ContrastDataEd Slides: Data Architecture vs. Data Modeling – Compare and Contrast
DataEd Slides: Data Architecture vs. Data Modeling – Compare and Contrast
DATAVERSITY
 

Similar to DataEd Online: Data Architecture and Data Modeling Differences — Achieving a Common Understanding (20)

Approaching Data Quality
Approaching Data QualityApproaching Data Quality
Approaching Data Quality
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Data-Ed: Essential Metadata Strategies
Data-Ed: Essential Metadata StrategiesData-Ed: Essential Metadata Strategies
Data-Ed: Essential Metadata Strategies
 
Why Data Modeling Is Fundamental
Why Data Modeling Is FundamentalWhy Data Modeling Is Fundamental
Why Data Modeling Is Fundamental
 
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data GardenData-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
 
DataEd Slides: Growing Practical Data Governance Programs
DataEd Slides: Growing Practical Data Governance ProgramsDataEd Slides: Growing Practical Data Governance Programs
DataEd Slides: Growing Practical Data Governance Programs
 
Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata Strategies
 
DataEd Slides: Data Architecture versus Data Modeling
DataEd Slides:  Data Architecture versus Data ModelingDataEd Slides:  Data Architecture versus Data Modeling
DataEd Slides: Data Architecture versus Data Modeling
 
DataEd Slides: Data Architecture vs. Data Modeling – Compare and Contrast
DataEd Slides: Data Architecture vs. Data Modeling – Compare and ContrastDataEd Slides: Data Architecture vs. Data Modeling – Compare and Contrast
DataEd Slides: Data Architecture vs. Data Modeling – Compare and Contrast
 
Data Management vs. Data Governance Program
Data Management vs. Data Governance ProgramData Management vs. Data Governance Program
Data Management vs. Data Governance Program
 
Business Value Through Reference and Master Data Strategies
Business Value Through Reference and Master Data StrategiesBusiness Value Through Reference and Master Data Strategies
Business Value Through Reference and Master Data Strategies
 
DataEd Slides: Data Management + Data Strategy = Interoperability
DataEd Slides: Data Management + Data Strategy = InteroperabilityDataEd Slides: Data Management + Data Strategy = Interoperability
DataEd Slides: Data Management + Data Strategy = Interoperability
 
Data Architecture Strategies
Data Architecture StrategiesData Architecture Strategies
Data Architecture Strategies
 
Essential Reference and Master Data Management
Essential Reference and Master Data ManagementEssential Reference and Master Data Management
Essential Reference and Master Data Management
 
Data-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsData-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture Requirements
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements  Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements
 
DataEd Slides: Getting (Re)Started with Data Stewardship
DataEd Slides: Getting (Re)Started with Data StewardshipDataEd Slides: Getting (Re)Started with Data Stewardship
DataEd Slides: Getting (Re)Started with Data Stewardship
 
DataEd Slides: Data Modeling is Fundamental
DataEd Slides:  Data Modeling is FundamentalDataEd Slides:  Data Modeling is Fundamental
DataEd Slides: Data Modeling is Fundamental
 
Data Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s HomeData Structures - The Cornerstone of Your Data’s Home
Data Structures - The Cornerstone of Your Data’s Home
 
Getting Started with Data Stewardship
Getting Started with Data StewardshipGetting Started with Data Stewardship
Getting Started with Data Stewardship
 

More from DATAVERSITY

The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
DATAVERSITY
 

More from DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
 
Empowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceEmpowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business Intelligence
 
Including All Your Mission-Critical Data in Modern Apps and Analytics
Including All Your Mission-Critical Data in Modern Apps and AnalyticsIncluding All Your Mission-Critical Data in Modern Apps and Analytics
Including All Your Mission-Critical Data in Modern Apps and Analytics
 

Recently uploaded

PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
cnajjemba
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Abortion pills in Riyadh +966572737505 get cytotec
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
ptikerjasaptiker
 
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
wsppdmt
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
q6pzkpark
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
vexqp
 

Recently uploaded (20)

PLE-statistics document for primary schs
PLE-statistics document for primary schsPLE-statistics document for primary schs
PLE-statistics document for primary schs
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Data Analyst Tasks to do the internship.pdf
Data Analyst Tasks to do the internship.pdfData Analyst Tasks to do the internship.pdf
Data Analyst Tasks to do the internship.pdf
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
 
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxThe-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
 

DataEd Online: Data Architecture and Data Modeling Differences — Achieving a Common Understanding

  • 1. © Copyright 2021 by Peter Aiken Slide # 1 paiken@plusanythingawesome.com+1.804.382.5957 Peter Aiken, PhD Data Architecture and Data Modeling Differences: Data mapping from two perspectives–achieving a common understanding Peter Aiken, Ph.D. • I've been doing this a long time • My work is recognized as useful • Associate Professor of IS (vcu.edu) • Institute for Defense Analyses (ida.org) • DAMA International (dama.org) • MIT CDO Society (iscdo.org) • Anything Awesome (plusanythingawesome.com) • 11 books and dozens of articles • Experienced w/ 500+ data management practices worldwide • Multi-year immersions – US DoD (DISA/Army/Marines/DLA) – Nokia – Deutsche Bank – Wells Fargo – Walmart … © Copyright 2021 by Peter Aiken Slide # 2 https://plusanythingawesome.com
  • 2. Event Pricing © Copyright 2021 by Peter Aiken Slide # 3 https://plusanythingawesome.com • 20% off directly from the publisher on select titles • My Book Store @ http://plusanythingawwsome.com • Enter the code "anythingawesome" at the Technics bookstore checkout where it says to "Apply Coupon" anythingawesome 4 © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com • Data Maps ➜ Models – Why do we need them? – How are they used? – Challenges to increased use (social, political, economic) • Architecture/Engineering – Two sides of same data coin – Must operate on standard, shared data of known quality • From the Top – Means: Forward engineering – Goal: Composition/Building • From the Bottom – Means: Reverse engineering – Goal: Understanding • Working Together – Functions required for effective data management – Need for simplicity • Take Aways/Q&A Data Architecture and Data Modeling Differences: Achieving a common understanding Program
  • 3. Data ... • As a subject is – Complex and detailed – Taught inconsistently, and – Poorly understood • Data maps (models) are necessary but insufficient prerequisites to data architectures – Fully leveraging data assets • Maps are incomplete without purpose statements – More powerful than definitions – Remedy • Add purpose statements • Validate resulting model • Maps are required to share information about data • Data architectures are composed of data models – Data modeling is an engineering activity required to product data maps that are necessary but insufficient prerequisites to leveraging data assets © Copyright 2021 by Peter Aiken Slide # 5 https://plusanythingawesome.com © Copyright 2021 by Peter Aiken Slide # Metadata Management 6 https://plusanythingawesome.com Data Management Body of Knowledge (DM BoK V2) Practice Areas from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International • Analysis • Database Design • Implementation • Additional data development
  • 4. https://www.amazon.com/Infonomics-Monetize-Information-Competitive-Advantage/dp/1138090387 Data's Unique Properties • Does not obey all of the laws of physics – Not really visible (visualization expertise) • Non rivalrous – the cost of providing an additional copy is zero • Non depleting – Does not require replenishment • Regenerative • Nearly unlimited • Low inventory and transportation/transmission costs • More difficult to control and own • Eco friendly • Impossible to clean-up if you spill it © Copyright 2021 by Peter Aiken Slide # 7 https://plusanythingawesome.com Data modeling • The process of discovering, analyzing, and scoping data requirements – Understand what the data things are? – What do they do? – How do they interact? • Representing/communicating requirements in a precise form called a data model – Maps of critical business assets – Compose and contain metadata essential to data consumers – Function as a kind of sheet music language – Metadata is essential to other business functions (definitions for governance, lineage for analytics, etc.) • The process is iterative and may include a conceptual, logical, and physical model © Copyright 2021 by Peter Aiken Slide # 8 https://plusanythingawesome.com
  • 5. process of discovering, analyzing, and scoping data requirements © Copyright 2021 by Peter Aiken Slide # 9 https://plusanythingawesome.com • List organizational places • These are called Attributes – Attributes are characteristics of "things" • List organizational places that need to be persons places things created read updated deleted archived https://plusanythingawesome.com process of discovering, analyzing, and scoping data requirements • An organization might decide to characterize the parts of a THING as: – Attributes: ID, description, status, sex.to.be.assigned, reserve.reason • Decisions to manage information about each specific attribute has direct consequences – A decision to use the above data attributes permits the organization to determine if it has female THINGs are available to be reserved • Characteristics can be shared – All THINGs may have a status – Many THINGs can be assigned to females • Characteristics may be required to be unique – ID permits identification every THING as distinct for every other THING – Description is likely to be unique for each THING © Copyright 2021 by Peter Aiken Slide # 10 https://plusanythingawesome.com THING Thing.Id # Thing.Description Thing.Status Thing.Sex.To.Be.Assigned Thing.Reserve.Reason Attributes arranged into an entity named "thing" – the attribute Thing.Id is the means used to identify a unique occurrence of thing
  • 6. Data modeling • The process of discovering, analyzing, and scoping data requirements – Understand what the data things are? – What do they do? – How do they interact? • Representing/communicating requirements in a precise form called a data model – Maps of critical business assets – Compose and contain metadata essential to data consumers – Function as a kind of sheet music language – Metadata is essential to other business functions (definitions for governance, lineage for analytics, etc.) • The process is iterative and may include a conceptual, logical, and physical model © Copyright 2021 by Peter Aiken Slide # 11 https://plusanythingawesome.com representing and communicating these in a precise form called a data model © Copyright 2021 by Peter Aiken Slide # 12 https://plusanythingawesome.com Thing.Id # Thing.Description Thing.Status Thing.Sex.To.Be.Assigned Thing.Reserve.Reason Thing 1 Thing.Id # … Thing 2 Each THING 2 must be accompanied by a THING 1
  • 7. representing/communicating requirements in a precise form called a data model © Copyright 2021 by Peter Aiken Slide # 13 https://plusanythingawesome.com • Defines mandatory/optional relationships between using minimum/maximum occurrences from one entity to another Data modeling • The process of discovering, analyzing, and scoping data requirements – Understand what the data things are? – What do they do? – How do they interact? • Representing/communicating requirements in a precise form called a data model – Maps of critical business assets – Compose and contain metadata essential to data consumers – Function as a kind of sheet music language – Metadata is essential to other business functions (definitions for governance, lineage for analytics, etc.) • The process is iterative and may include a conceptual, logical, and physical model © Copyright 2021 by Peter Aiken Slide # 14 https://plusanythingawesome.com
  • 8. © Copyright 2021 by Peter Aiken Slide # ! ! ! ! 15 https://plusanythingawesome.com Organizational Needs become instantiated and integrated into a Data Models Informa(on)System) Requirements authorizes and articulates satisfy specific organizational needs The process is iterative Data models and data architectures are developed in response to needs Data modeling • The process of discovering, analyzing, and scoping data requirements – Understand what the data things are? – What do they do? – How do they interact? • Representing/communicating requirements in a precise form called a data model – Maps of critical business assets – Compose and contain metadata essential to data consumers – Function as a kind of sheet music language – Metadata is essential to other business functions (definitions for governance, lineage for analytics, etc.) • The process is iterative and may include a conceptual, logical, and physical model © Copyright 2021 by Peter Aiken Slide # 16 https://plusanythingawesome.com
  • 9. ANSI-SPARC 3-Layer Schema 1. CONCEPTUAL - Allows independent customized user views: – Each should be able to access the same data, but have a different customized view of the data. 2. LOGICAL - This hides the physical storage details from users: – Users should not have to deal with physical database storage details. They should be allowed to work with the data itself, without concern for how it is physically stored. 3. PHYSICAL - The database administrator should be able to change the database storage structures without affecting the users’ views: – Changes to the structure of an organization's data will be required. The internal structure of the database should be unaffected by changes to the physical aspects of the storage. © Copyright 2021 by Peter Aiken Slide # 17 https://plusanythingawesome.com For example, a changeover to a new DBMS technology. The database administrator should be able to change the conceptual or global structure of the database without affecting the users. Data modeling • The process of discovering, analyzing, and scoping data requirements – Understand what the data things are? – What do they do? – How do they interact? • Representing/communicating requirements in a precise form called a data model – Maps of critical business assets – Compose and contain metadata essential to data consumers – Function as a kind of sheet music language – Metadata is essential to other business functions (definitions for governance, lineage for analytics, etc.) • The process is iterative and may include a conceptual, logical, and physical model © Copyright 2021 by Peter Aiken Slide # 18 https://plusanythingawesome.com
  • 10. Each data arrangement is a data structure Some data structure characteristics: • Grammar for data objects – Grammar is the principles or rules of an art, science, or technique "a grammar of the theater" • Constraints for data objects • Sequential order • Uniqueness • Order – Hierarchical, relational, network, lake, other • Balance • Optimality © Copyright 2021 by Peter Aiken Slide # 19 https://plusanythingawesome.com http://www.nist.gov/dads/HTML/datastructur.html "An organization of information, usually in memory, for better algorithm efficiency, such as queue, stack, linked list, heap, dictionary, and tree, or conceptual unity, such as the name and address of a person. It may include redundant information, such as length of the list or number of nodes in a subtree." https://plusanythingawesome.com Hierarchy • A hierarchy is an arrangement of items (objects, names, values, categories, etc.) in which the items are represented as being "above", "below", or "at the same level as" one another. • Hierarchy is an important concept in a wide variety of fields, such as philosophy, mathematics, computer science, organizational theory, systems theory, and the social sciences (especially political philosophy). © Copyright 2021 by Peter Aiken Slide # 20 https://plusanythingawesome.com Data Architecture Data Maps /Models Mess Data Maps /Models Data Maps /Models Data Maps /Models Data Maps /Models Model View
  • 11. Social © Copyright 2021 by Peter Aiken Slide # 21 https://plusanythingawesome.com https://plusanythingawesome.com Differences between Programs and Projects • Programs are Ongoing, Projects End – Managing a program involves long term strategic planning and continuous process improvement is not required of a project • Programs are Tied to the Financial Calendar – Program managers are often responsible for delivering results tied to the organization's financial calendar • Program Management is Governance Intensive – Programs are governed by a senior board that provides direction, oversight, and control while projects tend to be less governance-intensive • Programs Have Greater Scope of Financial Management – Projects typically have a straight-forward budget and project financial management is focused on spending to budget while program planning, management and control is significantly more complex • Program Change Management is an Executive Leadership Capability – Projects employ a formal change management process while at the program level, change management requires executive leadership skills and program change is driven more by an organization's strategy and is subject to market conditions and changing business goals © Copyright 2021 by Peter Aiken Slide # Adapted from http://top.idownloadnew.com/program_vs_project/ and http://management.simplicable.com/management/new/program-management-vs-project-management Your data program must last at least as long as your HR program! 22 https://plusanythingawesome.com
  • 12. What do we teach knowledge workers about data? © Copyright 2021 by Peter Aiken Slide # 23 https://plusanythingawesome.com What percentage of the deal with it daily? Political © Copyright 2021 by Peter Aiken Slide # 24 https://plusanythingawesome.com https://plusanythingawesome.com
  • 13. What do we teach IT professionals about data? © Copyright 2021 by Peter Aiken Slide # 25 https://plusanythingawesome.com • 1 course – How to build a new database • What impressions do IT professionals get from this education? – Data is a technical skill that is needed when developing new databases © Copyright 2021 by Peter Aiken Slide # 26 https://plusanythingawesome.com If the only tool you know is a hammer you tend to see every problem as a nail (slightly reworded from Abraham Maslow)
  • 14. © Copyright 2021 by Peter Aiken Slide # 27 https://plusanythingawesome.com Bad data decisions Technical deci- sion makers are not data knowledgable Business decision makers are not data knowledgable Poor organizational outcomes Poor treatment of organizational data assets Poor quality data Bad Data Decisions Spiral Economic © Copyright 2021 by Peter Aiken Slide # 28 https://plusanythingawesome.com https://plusanythingawesome.com
  • 15. Tacoma Narrows Bridge/Gallopin' Gertie © Copyright 2021 by Peter Aiken Slide # 29 https://plusanythingawesome.com https://plusanythingawesome.com • World's 3rd longest suspension span • Slender, elegant and graceful • Opened on July 1st 1940, collapsed in a windstorm on 7 Nov 1940 • "The most dramatic failure in bridge engineering history" • Changed forever how engineers design suspension bridges leading to safer spans today. © Copyright 2021 by Peter Aiken Slide # 30 https://plusanythingawesome.com Tacoma Narrows Bridge/Gallopin' Gertie
  • 16. © Copyright 2021 by Peter Aiken Slide # 31 https://plusanythingawesome.com Similarly data failures cost organizations minimally 20-40% of their IT budget https://plusanythingawesome.com Data incoherence is a hidden expense • How does maltreated data cost money? • Consider the opposite question: – Were your systems explicitly designed to be integrated or otherwise work together? – If not then what is the likelihood that they will work well together? • Organizations spend 20-40% of their IT budget evolving their data - including: – Data migration • Changing the location from one place to another – Data conversion • Changing data into another form, state, or product – Data improving • "Inspecting and manipulating, or re-keying data to prepare it for subsequent use" - Source: John Zachman © Copyright 2021 by Peter Aiken Slide # 32 https://plusanythingawesome.com PETER AIKEN WITH JUANITA BILLINGS FOREWORD BY JOHN BOTTEGA MONETIZING DATA MANAGEMENT Unlocking the Value in Your Organization’s Most Important Asset.
  • 17. Doing a poor job with data • Takes longer • Costs more • Delivers less • Presents greater risk (with thanks to Tom DeMarco) © Copyright 2021 by Peter Aiken Slide # 33 https://plusanythingawesome.com 34 © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com • Data Maps ➜ Models – Why do we need them? – How are they used? – Challenges to increased use (social, political, economic) • Architecture/Engineering – Two sides of same data coin – Must operate on standard, shared data of known quality • From the Top – Means: Forward engineering – Goal: Composition/Building • From the Bottom – Means: Reverse engineering – Goal: Understanding • Working Together – Functions required for effective data management – Need for simplicity • Take Aways/Q&A Data Architecture and Data Modeling Differences: Achieving a common understanding Program
  • 18. 4 Minute Architecture Lesson from Steve Jobs, Introducing iCloud © Copyright 2021 by Peter Aiken Slide # 35 https://plusanythingawesome.com https://plusanythingawesome.com Architecture is about ... • Things – (components) • The functions of the things – (individually) • How the things interact – (as a system, – towards a goal) © Copyright 2021 by Peter Aiken Slide # 36 https://plusanythingawesome.com • Business • Process • Systems • Security • Technical • Data / Information
  • 19. Typically Managed Architectures • Business Architecture – Goals, strategies, roles, organizational structure, location(s) • Process Architecture – Arrangement of inputs -> transformations = value -> outputs – Typical elements: Functions, activities, workflow, events, cycles, products, procedures • Systems Architecture – Applications, software components, interfaces, projects • Security Architecture – Arrangement of security controls relation to IT Architecture • Technical Architecture/Tarchitecture – Relation of software capabilities/technology stack – Structure of the technology infrastructure of an enterprise, solution or system – Typical elements: Networks, hardware, software platforms, standards/protocols • Data / Information Architecture – Arrangement of data assets supporting organizational strategy – Typical elements: specifications expressed as entities, relationships, attributes, definitions, values, vocabularies © Copyright 2021 by Peter Aiken Slide # 37 https://plusanythingawesome.com 1 in 10 organizations manage 1 or more of the formally Data Architectures: here, whether you like it or not © Copyright 2021 by Peter Aiken Slide # 38 https://plusanythingawesome.com deviantart.com • All organizations have data architectures – Some are better understood and documented (and therefore more useful to the organization) than others Business Process Systems Security Technical Data/Information
  • 20. Data model focus is typically domain specific © Copyright 2021 by Peter Aiken Slide # 39 https://plusanythingawesome.com Program A Program C Program B Focus of a software engineering effort Underutilized data modeling effort Database architecture focus Can vary © Copyright 2021 by Peter Aiken Slide # 40 https://plusanythingawesome.com Application domain 1 Program A Program C Program B Focus of a software engineering effort Underutilized data modeling effort Better utilized data modeling effort ERPs and COTS are marketed as being similarly integrated! Program F Program E Program G Program H Program I Application domain 2 Application domain 3 Program D
  • 21. Application domain 1 Program A Program C Program B D a t a D a t a D a t a D a t a D a t a D a t a D a t a Program F Program E Program D Program G Program H Program I Application domain 2 Application domain 3 D a t a D a t a D a t a Data Architecture Focus has Greater Potential Value • Broader focus than either software architecture or database architecture • Analysis scope is on the system wide use of data • Problems caused by data exchange or interface problems • Architectural goals more strategic than operational © Copyright 2021 by Peter Aiken Slide # 41 https://plusanythingawesome.com © Copyright 2021 by Peter Aiken Slide # Metadata Management 42 https://plusanythingawesome.com Data Management Body of Knowledge (DM BoK V2) Practice Areas from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International • Enterprise data modeling • Value chain analysis • Related data architecture
  • 22. How are components expressed as architectures? • Details are organized into larger components • Larger components are organized into models • Models are organized into architectures (composed of architectural components) © Copyright 2021 by Peter Aiken Slide # 43 https://plusanythingawesome.com A B C D A B C D A D C B Intricate Dependencies Purposefulness How are data structures expressed as architectures? • Attributes are organized into entities/objects – Attributes are characteristics of "things" – Entitles/objects are "things" whose information is managed in support of strategy – Example(s) • Entities/objects are organized into models – Combinations of attributes and entities are structured to represent information requirements – Poorly structured data, constrains organizational information delivery capabilities – Example(s) • Models are organized into architectures – When building new systems, architectures are used to plan development – More often, data managers do not know what existing architectures are and - therefore - cannot make use of them in support of strategy implementation • Why no examples? © Copyright 2021 by Peter Aiken Slide # 44 https://plusanythingawesome.com Intricate Dependencies Purposefulness THING Thing.Id # Thing.Description Thing.Status Thing.Sex.To.Be.Assigned Thing.Reserve.Reason
  • 23. Data architectures are composed of data models © Copyright 2021 by Peter Aiken Slide # 45 https://plusanythingawesome.com Data Data Data Information Fact Meaning Request A model defining 3 important concepts composing a data architecture © Copyright 2021 by Peter Aiken Slide # [Built on definitions from Dan Appleton 1983] Intelligence Strategic Use Data Data Data Data 46 https://plusanythingawesome.com “You can have data without information, but you cannot have information without data” — Daniel Keys Moran, Science Fiction Writer 1. Each FACT combines with one or more MEANINGS. 2. Each specific FACT and MEANING combination is referred to as a DATUM. 3. An INFORMATION is one or more DATA that are returned in response to a specific REQUEST 4. INFORMATION REUSE is enabled when one FACT is combined with more than one MEANING. 5. INTELLIGENCE is INFORMATION associated with its STRATEGIC USES. 6. DATA/INFORMATION must formally arranged into an ARCHITECTURE. Wisdom & knowledge are often used synonymously Useful Data
  • 24. Data Architectures Determine Interoperability • Required to enable self-correction/generation capabilities • Permits governance of data as an asset • Prerequisite to meaningful data exchanges • Lowers costs of organization-wide and extra-organizational data sharing • Permits managed evolution - rapidly responding to changing needs, new partners, time criticality's • Required for (full) role-based security implementation • Decreases the cost of maintaining data inventories Data Architectures • Capture the business meaning of the data required to run the organization • Living document – constantly evolving to meet upcoming and discovered business requirements • A potential entry point for architecture engagements • Validated data architectural components can be used to populate a business glossary • Major collection of metadata © Copyright 2021 by Peter Aiken Slide # 47 https://plusanythingawesome.com Levels of Abstraction, Completeness and Utility • Models more downward facing - detail • Architecture is higher level of abstraction - integration • In the past architecture attempted to gain complete (perfect) understanding – Not timely – Not feasible • Focus instead on architectural components – Governed by a framework – More immediate utility • http://www.architecturalcomponentsinc.com © Copyright 2021 by Peter Aiken Slide # 48 https://plusanythingawesome.com
  • 25. Data structures organized into an Architecture • How do data structures support strategy? • Consider the opposite question? – Were your systems explicitly designed to be integrated or otherwise work together? – If not then what is the likelihood that they will work well together? – In all likelihood your organization is spending between 20-40% of its IT budget compensating for poor data structure integration – They cannot be helpful as long as their structure is unknown • Two answers/two separate strategies – Achieving efficiency and effectiveness goals – Providing organizational dexterity for rapid implementation © Copyright 2021 by Peter Aiken Slide # 49 https://plusanythingawesome.com © Copyright 2021 by Peter Aiken Slide # Engineering Architecture 50 https://plusanythingawesome.com Engineering/Architecting Relationship • Architecting is used to create and build systems too complex to be treated by engineering analysis alone – Require technical details as the exception • Engineers develop the technical designs for implementation – Engineering/Crafts-persons deliver work product components supervised by: • Manufacturer • Building Contractor
  • 26. You cannot architect after implementation! © Copyright 2021 by Peter Aiken Slide # 51 https://plusanythingawesome.com https://plusanythingawesome.com USS Midway & Pancakes What is this? • It is tall • It has a clutch • It was built in 1942 • It is cemented to the floor • It is still in regular use! © Copyright 2021 by Peter Aiken Slide # 52 https://plusanythingawesome.com
  • 27. © Copyright 2021 by Peter Aiken Slide # 53 https://plusanythingawesome.com Definition of Bed https://plusanythingawesome.com process of discovering, analyzing, and scoping data requirements • An organization might decide to characterize the parts of a BED as: – Attributes: ID, description, status, sex.to.be.assigned, reserve.reason • Decisions to manage information about each specific attribute has direct consequences – A decision to use the above data attributes permits the organization to determine if it has female beds are available to be reserved • Characteristics can be shared – All beds may have a status – Many beds can be assigned to females • Characteristics may be required to be unique – ID permits identification every bed as distinct for every other bed – Description is unlikely to be the same for each bed © Copyright 2021 by Peter Aiken Slide # 54 https://plusanythingawesome.com BED Bed.Id # Bed.Description Bed.Status Bed.Sex.To.Be.Assigned Bed.Reserve.Reason Attributes arranged into an entity named "bed" – the attribute Bed.Id is the means used to identify a unique occurrence of bed
  • 28. Q: What is the proper relationship for these entities? © Copyright 2021 by Peter Aiken Slide # 55 https://plusanythingawesome.com Bed Room Bed Room Data Maps at the Entity Level ➜ Stored Facts © Copyright 2021 by Peter Aiken Slide # 56 https://plusanythingawesome.com Bed Room a BED is related to a ROOM More precision: many BEDS are related to many ROOMS Bed Room Better information: many BEDS may be contained in each ROOM and each room may contain many beds What if beds can be moved?
  • 29. Eventually One or Many (optional) Eventually One (optional) Zero, or Many (optional) One or Many (mandatory) Exactly One (mandatory) Possible Entity Relationship Cardinality Options © Copyright 2021 by Peter Aiken Slide # 57 https://plusanythingawesome.com Families of Modeling Notation Variants © Copyright 2021 by Peter Aiken Slide # 58 https://plusanythingawesome.com Information Engineering
  • 30. What is a Relationship? • Natural associations between two or more entities © Copyright 2021 by Peter Aiken Slide # 59 https://plusanythingawesome.com Ordinality & Cardinality • Defines mandatory/optional relationships using minimum/ maximum occurrences from one entity to another © Copyright 2021 by Peter Aiken Slide # 60 https://plusanythingawesome.com A BED is placed in one and only one ROOM A ROOM contains zero or more BEDS A BED is occupied by zero or more PATIENTS A PATIENT occupies at least one or more BEDS ROOM BED PATIENT
  • 31. Standard definition reporting does not provide conceptual context © Copyright 2021 by Peter Aiken Slide # 61 https://plusanythingawesome.com BED Something you sleep in Purpose statement incorporates motivations Entity: BED Data Asset Type: Principal Data Entity Purpose: This is a substructure within the Room substructure of the Facility Location. It contains information about beds within rooms. Source: Maintenance Manual for File and Table Data (Software Version 3.0, Release 3.1) Attributes: Bed.Description Bed.Status Bed.Sex.To.Be.Assigned Bed.Reserve.Reason Associations: >0-+ Room Status: Validated © Copyright 2021 by Peter Aiken Slide # 62 https://plusanythingawesome.com Draft A purpose statement describing – Why the organization is maintaining information about this business concept; – Sources of information about it; – A partial list of the attributes or characteristics of the entity; and – Associations with other data items(read as "One room contains zero or many beds.")
  • 32. 63 © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com • Data Maps ➜ Models – Why do we need them? – How are they used? – Challenges to increased use (social, political, economic) • Architecture/Engineering – Two sides of same data coin – Must operate on standard, shared data of known quality • From the Top – Means: Forward engineering – Goal: Composition/Building • From the Bottom – Means: Reverse engineering – Goal: Understanding • Working Together – Functions required for effective data management – Need for simplicity • Take Aways/Q&A Data Architecture and Data Modeling Differences: Achieving a common understanding Program BUILD? WHAT? HOW? As Is Requirements Assets WHAT? As Is Design Assets HOW? As Is Implementation Assets AS BUILT Forward Engineering © Copyright 2021 by Peter Aiken Slide # 64 https://plusanythingawesome.com New Building new stuff - in this case, new databases
  • 33. Systems Development Life Cycle (SDLC) © Copyright 2021 by Peter Aiken Slide # 65 https://plusanythingawesome.com WHAT? HOW? BUILD? 66 © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com • Data Maps ➜ Models – Why do we need them? – How are they used? – Challenges to increased use (social, political, economic) • Architecture/Engineering – Two sides of same data coin – Must operate on standard, shared data of known quality • From the Top – Means: Forward engineering – Goal: Composition/Building • From the Bottom – Means: Reverse engineering – Goal: Understanding • Working Together – Functions required for effective data management – Need for simplicity • Take Aways/Q&A Data Architecture and Data Modeling Differences: Achieving a common understanding Program
  • 34. Data Representation is the Essence of Programming • Mythical Man Month ➜ 9 parallel effort x 1 month each ≠ baby • Fred Brooks Jr.'s observation – Data representation is the essence of programming – "Show me your flowchart and conceal your tables, and I shall continue to be mystified. – Show me your tables, and I won't usually need your flowchart; it'll be obvious." © Copyright 2021 by Peter Aiken Slide # 67 https://plusanythingawesome.com As Is Requirements Assets WHAT? As Is Design Assets HOW? As Is Implementation Assets AS BUILT Existing Reverse Engineering © Copyright 2021 by Peter Aiken Slide # 68 https://plusanythingawesome.com A structured technique aimed at recovering rigorous knowledge of the existing system to leverage enhancement efforts [Chikofsky & Cross 1990]
  • 35. 69 © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com • Data Maps ➜ Models – Why do we need them? – How are they used? – Challenges to increased use (social, political, economic) • Architecture/Engineering – Two sides of same data coin – Must operate on standard, shared data of known quality • From the Top – Means: Forward engineering – Goal: Composition/Building • From the Bottom – Means: Reverse engineering – Goal: Understanding • Working Together – Functions required for effective data management – Need for simplicity • Take Aways/Q&A Data Architecture and Data Modeling Differences: Achieving a common understanding Program As Is Requirements Assets WHAT? As Is Design Assets HOW? As Is Implementation Assets AS BUILT Existing New Reengineering Reverse Engineering Forward engineering Reimplement To Be Implementation Assets To Be Design Assets To Be Requirements Assets • First, reverse engineering the existing system to understand its strengths/weaknesses • Next, use this information to inform the design of the new system © Copyright 2021 by Peter Aiken Slide # 70 https://plusanythingawesome.com
  • 36. Data Modeling Process 1. Identify entities 2. Identify key for each entity 3. Draw rough draft of entity relationship data model 4. Identify data attributes 5. Map data attributes to entities © Copyright 2021 by Peter Aiken Slide # 71 https://plusanythingawesome.com Model evolution is good, at first ... 1. Identify entities 2. Identify key for each entity 3. Draw rough draft of entity relationship data model 4. Identify data attributes 5. Map data attributes to entities © Copyright 2021 by Peter Aiken Slide # 72 https://plusanythingawesome.com
  • 37. Relative use of time allocated to tasks during modeling © Copyright 2021 by Peter Aiken Slide # 73 https://plusanythingawesome.com Preliminary Modeling Wrapup Activity activities Cycles activities Evidence Analysis collection & analysis Collection Project coordination requirements Declining coordination requirements Target system analysis Increasing amounts of target system analysis Modeling Validation cycle focus Refinement Source:http://dmreview.com/article_sub.cfm?articleID=1000941 used with permission Meta Data Models © Copyright 2021 by Peter Aiken Slide # 74 https://plusanythingawesome.com
  • 38. 75 © Copyright 2021 by Peter Aiken Slide # https://plusanythingawesome.com • Data Maps ➜ Models – Why do we need them? – How are they used? – Challenges to increased use (social, political, economic) • Architecture/Engineering – Two sides of same data coin – Must operate on standard, shared data of known quality • From the Top – Means: Forward engineering – Goal: Composition/Building • From the Bottom – Means: Reverse engineering – Goal: Understanding • Working Together – Functions required for effective data management – Need for simplicity • Take Aways/Q&A Data Architecture and Data Modeling Differences: Achieving a common understanding Program Take Aways • Attributes are organized into entities/objects – Attributes are characteristics of "things" – Entitles/objects are "things" whose information is managed in support of strategy – Example(s) • Entities/objects are organized into models – Combinations of attributes and entities are structured to represent information requirements – Poorly structured data, constrains organizational information delivery capabilities – Example(s) • Models are organized into architectures – When building new systems, architectures are used to plan development – More often, data managers do not know what existing architectures are and - therefore - cannot make use of them in support of strategy implementation – Why no examples? © Copyright 2021 by Peter Aiken Slide # 76 https://plusanythingawesome.com Entity: BED Attributes: Bed.Description Bed.Status Bed.Sex.To.Be.Assigned Bed.Reserve.Reason Intricate Dependencies Purposefulness
  • 39. Learning Objectives • Both data architecture and data models are made more useful by each other – Power derived from their interdependence – Data models are a primary means to achieve shared understanding of data challenge specifics – They are literally the pages that intersect data assets and the organizational response • Data architecture is the sum of the data models – Coverage is rarely complete • Data models – Documentation – The currency of data coordination – Used to verify integration – Mandated input to any data systems evolution • What is taught – Forward engineering with a goal of building • What is also needed – Reverse engineering with a goal of understanding © Copyright 2021 by Peter Aiken Slide # 77 https://plusanythingawesome.com Upcoming Events (Webinars begin @ 19:00 UTC/2:00 PM NYC) © Copyright 2021 by Peter Aiken Slide # 78 https://plusanythingawesome.com Brought to you by: Why Data Modeling Is Fundamental 8 June 2021 Business Value through Reference & Master Data Strategies 13 July 2021 Getting (Re)Started with Data Stewardship 10 August 2021
  • 40. paiken@plusanythingawesome.com +1.804.382.5957 Questions? Thank You! © Copyright 2021 by Peter Aiken Slide # 79 Book a call with Peter to discuss anything - https://plusanythingawesome.com/OfficeHours.html + =