SlideShare a Scribd company logo
1 of 39
Download to read offline
10-Step Methodology to
Building a Single View
MatKeep,DirectorofProduct&MarketAnalysis. mat.keep@mongodb.com @matkeep
JonRangel,DirectorofProfessionalServices,EMEA. jon.rangel@mongodb.com
What You
Will Learn
1.  Single View: Opportunities & Challenges
2.  Repeatable 10-Step Methodology
3.  Required Technical Capabilities
Why Single View
Single View Defined
•  What
–  Single, real-time representation of a business entity or
domain
–  Customer, product, supply chain, financial asset class,
& more
•  How
–  Gathers and organizes data from multiple,
disconnected sources;
–  Aggregates information into a standardized format
and joint information model
•  Why
–  Improves business visibility
–  Serve operational applications
–  Foundation for analytics
Single View Use Cases
•  Comparative view of
traders or products
•  Firm-wide view of
asset exposure
•  Aggregated
transactions for fraud
models
•  Omni-channel view of
customers for
personalized marketing
•  Inventory control &
management
•  Single view of product
across channels &
demographics
•  Management of patient
medical records for
treatment plans
•  Macro-analysis view for
public health
•  Medical history to
identify insurance risk
Finance Retail Healthcare
Challenges
•  Current State
–  Data dispersed across multitude of systems
–  Different structures, different attributes
–  Apps built to meet specific business requirements, not
integrated
–  New data sources from new apps, M&A
•  Governance Processes
–  How to deliver & maintain single view in face of
constant business change
•  Technology Limitations
–  Traditional databases not well suited to single view
required capabilities
10-Step Methodology
ETLorMessageQueue
Web
Mobile
CRM
Mainframe
Single View
Call Center
Analytics
Technical
Support
Billing
Source Systems Consuming Systems
Load Reads
High Level Architecture
10-Step Methodology
Step 1:
Define Scope
Step 4:
Appoint
Data Stewards
Step 5:
Develop
Data Model
Step 6:
Load &
Standardize
Step 7:
Merge,
Test & Reconcile
Step 8:
Infrastructure
Design
Step 3:
Identify
Data Producers
Step 2:
Identify
Data Consumers
Step 9:
Modify Consuming
Systems
Step 10:
Maintenance
Processes
Discover
Develop
Deploy
Step 1: Define Scope & Sponsorship
•  Scope needs to be realistic, defined by specific success metric
–  Long term: aggregate all customer data into a single view, serving all
business functions
–  Initial phase: collecting all customer interactions on digital channels over
past 3-months to improve call center MTTR
•  Appoint executive sponsors
–  Senior: allocate resources and command credibility
–  Combination of senior title from the business, and from the technology
group
Discover
Web
Mobile
CRM
Mainframe
Source Systems
Steps 2 & 3:
Identify Data Consumers & Producers
•  Single View Consumers Define
–  Typical queries and SLAs
–  Required data attributes
–  Current data sources
•  Identify apps generating the source data
–  Identify application owners + associated databases
–  Profile apps: operational, analytical
Step 2: Data Consumers
Step 3: Data Producers
Discover
Step 4: Appoint Data Stewards
•  Data steward appointed for each data
source.
•  Deep knowledge of:
–  Source system schema
–  Which tables store required attributes, what format
–  Clients and apps that generate & consume the
source data
•  Advise on data loading strategies
Develop
Step 5: Develop Single View Data Model
•  Key inputs
–  Required data attributes
–  Query patterns
•  Define common fields & data types
–  Create rules to validate common data
•  Define primary & secondary indexes
•  Identify dynamic fields
–  No need to pre-declare when using a document database
•  Localize data into a single document (where
appropriate)
{
_id : “mark.smith@mongodb.com”,
first_name : "Mark",
last_name : "Smith",
city : "San Francisco",
phones: [ {
number : “1-212-777-1212”,
dnc : true,
type : “home”
},
{
number : “1-212-777-1213”,
type : “cell”
}]}
Single View
Develop
Resources to Support Schema Design
MongoDB
Documentation
MongoDB
Development Rapid Start
Develop
Step 6: Load
2 phases: Initial Load & Delta Load
Emit JSON to preserve data types. Use Extended JSON
Load
ETLorMessageQueue
Single View
Develop
Initial Load
•  ETL Tools
•  Custom Loaders
Delta Load
•  Batch loads: use tools above
•  Real-time loads: Message queue
Step 6 (cont’d): Standardize
Data	
  Source	
  A	
   Data	
  Source	
  B	
   Data	
  Source	
  C	
  
14	
  
77	
  
26	
  
cust_id:	
  14	
  
f_name:	
  James	
  
l_name:	
  Bond	
  
dob:	
  07/14/1968	
  
eMail:	
  007@spook.com	
  
fno:	
  77	
  
first:	
  Jim	
  
last:	
  Bond	
  
born:	
  1968-­‐07-­‐14	
  
email:	
  007@spook.com	
  
xc_id:	
  26	
  
name:	
  James	
  Bind	
  
bdate:	
  July	
  14,	
  68	
  
Email:	
  007@spook.com	
  
Develop
Step 7: Match, Merge & Reconcile
Develop
cust_id:	
  14	
  
f_name:	
  James	
  
l_name:	
  Bond	
  
dob:	
  07/14/1968	
  
eMail:	
  007@spook.com	
  
xc_id:	
  26	
  
name:	
  James	
  Bind	
  
bdate:	
  July	
  14,	
  68	
  
Email:	
  007@spook.com	
  
source_id:	
  A_14	
  
first_name:	
  James	
  
last_name:	
  Bond	
  
dob:	
  1968-­‐07-­‐14	
  
eMail:	
  007@spook.com	
  
source_id:	
  B_77	
  
first_name:	
  Jim	
  
last_name:	
  Bond	
  
dob:	
  1968-­‐07-­‐14	
  
eMail:	
  007@spook.com	
  
source_id:	
  C_26	
  
first_name:	
  James	
  
last_name:	
  Bind	
  
dob:	
  1968-­‐07-­‐14	
  
eMail:	
  007@spook.com	
  
_id:	
  007@spook.com	
  
first_name:	
  James	
  
last_name:	
  Bond	
  
dob:	
  1968-­‐07-­‐14	
  
Source	
  
Data	
  
Standardized	
  Data	
  
Field	
  names	
  &	
  data	
  types	
  
Single	
  View	
  
Data	
  merged,	
  	
  
tested	
  &	
  reconciled	
  
fno:	
  77	
  
first:	
  Jim	
  
last:	
  Bond	
  
born:	
  1968-­‐07-­‐14	
  
email:	
  007@spook.com	
  
Step 7 (cont’d): Match, Merge & Reconcile
•  Use iterative grouping functions to cluster records with similar
attributes
1.  Match against unique, authoritative attributes (email address, credit card #)
2.  Match by combining attributes (last name, DoB, zip code)
3.  Use fuzzy matching to catch errors in source data (i.e. different spellings of customer
name)
•  Apply confidence factor to dictate merging
–  Automatically merge records with 95%+ confidence
–  Manually inspect records with lower confidence
Develop
Step 7 (cont’d): MongoDB Tools
•  Workers framework to parallelize document comparisons
•  Grouping tool to cluster documents based on attribute similarity
–  Levenshtein to calculate distances, single-linkage clustering for matching
Develop
Step 8: Architecture Design
Deploy
•  Deployment infrastructure
•  MongoDB Production Readiness Consulting
Package provides recommendations:
–  Hardware sizing
–  HA/DR strategies
–  Scaling
–  Security for corporate and regulatory compliance
•  Follow-on services for implementation
Step 9: Modify Consuming Systems
Deploy
•  Modify the apps that consume the
single view
–  Create an API that exposes the single view (i.e.
RESTful web service)
–  Re-point apps to the web service (reads initially)
•  Modify one consuming application at
time
Call Center
Analytics
Technical
Support
Billing
Consuming
Systems
Reads
Single View
Step 10: Implement Maintenance Processes
Deploy
•  Frequency of application launch & evolution
is accelerating
•  Impacts to single view
–  Adding new attributes from source systems
–  Onboarding new data sources or digital channels
–  Creating new apps that consume the single view
•  Single view team needs to institutionalize
governance around on-going maintenance
–  Repeat the 10-step process
–  Dynamic schema is HUGE!
Single View Maturity Model
Scope
BusinessBenefits
Transactions are written first to the single view, which
propagates the data back to the source system of record.
Writes are performed concurrently to the source systems as
well as the single view
The single view data model is enriched with additional
sources to serve more applications, including real-time
analytics. The single view becomes a platform serving
multiple applications
Single View
Platform
Records are copied via ETL or message queue
mechanisms from the source systems into the single view,
serving read queries. The single view serves one specific
application
Single View
Application
Single View First
Dual Writes
Read
Centric
Transforming the role of
the single view
Reads & Writes
Single View Maturity Model
•  Advantages of writing to the single view
–  Fresher data
–  Reduced app complexity
–  Improved application agility
Architecture for Writes to the Single View
ETLorMessageQueue
Web
Mobile
CRM
Mainframe
Single View Call Center
Analytics
Technical
Support
Billing
Update
Queue
Reads
Writes
Source Systems Consuming Systems
Load
Required Capabilities for Single View
Single View with a Relational Database
Required Database Capabilities
•  Data model flexibility with a dynamic schema
•  Real-time analytics
•  Performance, scale & always-on
•  Enterprise deployment model
MongoDB Compass MongoDB Connector for BI
MongoDB Enterprise Server
Enterprise Deployment Model 	
  24x7Support
(1hourSLA)
CommercialLicense
(NoAGPLCopyleftRestrictions)
Platform
Certifications
MongoDB Ops Manager
Monitoring	
  &	
  
AlerBng	
  
Query	
  
OpBmizaBon	
  
Backup	
  &	
  
Recovery	
  
AutomaBon	
  &	
  
ConfiguraBon	
  
Schema	
  VisualizaBon	
  
Data	
  ExploraBon	
  
Ad-­‐Hoc	
  Queries	
  
VisualizaBon	
  
Analysis	
  
ReporBng	
  
AuthorizaBon	
   AudiBng	
  
EncrypBon	
  
(In	
  Flight	
  &	
  at	
  Rest)	
  
AuthenBcaBon	
  
REST	
  API	
  Emergency
Patches
Customer
Success
Program
On-Demand
Online Training
Warranty
Limitation of
Liability
Indemnification
Single View In Action
Single View of Customer
Insurance leader generates coveted single view of
customers in 90 days – “The Wall”
Problem	
   Why	
  MongoDB	
   Results	
  Problem Solution Results
No single view of customer, leading
to poor customer experience and
churn
145 years of policy data, 70+
systems, 24 800 numbers, 15+
front-end apps that are not
integrated
Spent 2 years, $25M trying build
single view with RDBMS – failed
Built “The Wall,” pulling in disparate
data and serving single view to
customer service reps in real time
Flexible data model to aggregate
disparate data into single data
store
Expressive query language and
secondary indexes to serve any
field in real time
Prototyped in 2 weeks
Deployed to production in 90 days
Decreased churn and improved
ability to upsell/cross-sell
Single View of LHC Analytics
Data aggregation system to accelerate scientific research &
discovery
Problem	
   Why	
  MongoDB	
   Results	
  Problem Solution Results
Raw data from LHC & experiments
distributed across multitude of
source systems
Scientists don’t know location of
source data, or how to extract it
Relational databases rigid data
model prevented aggregation of
data from different sources
Data Aggregation System built on
MongoDB, consolidating analytics
into a single view
Dynamic schema represents data
of any structure
MongoDB query language
supports simple lookups to
complex search, traversals &
analytics
A single query to MongoDB can
return 10,000 documents from
different data sources for real time
analytics
Accelerates scientific time to
insight
Accessed by 3,000 physicists from
200 research institutions across
the globe
Wrap Up
Where to Go from Here?
•  Single view projects are challenging
–  Partner with a vendor offering proven methodology,
tools & technologies
•  Learn More
–  Download the whitepaper
–  10-Step Methodology to Building a Single View
•  Engage
–  MongoDB Global Consulting Services can help you
scope the project and get started
–  Book a workshop
10-Step Methodology to
Building a Single View
Single View of the Customer
360° view of the customer increases customer satisfaction,
cross-sell & up-sell with MongoDB, Spark, & Hadoop
Problem	
   Why	
  MongoDB	
   Results	
  Problem Solution Results
Customer data scattered across
100+ different systems
Poor customer experience: no
personalization, no consistent
experience across brands or
devices
No way to analyze customer
behavior to deliver targeted offers
Single View application on MongoDB
flexible data model, expressive query
language, secondary indexes, &
horizontal scalability
Data from old relational systems fed
into Spark for analysis and then stored
in MongoDB to support real-time CRM
Customer data synced from MongoDB
to Hadoop for nightly batch jobs, then
fed back to MongoDB for personalized
recommendations
Single view serves customers from
any channel
Stores 10s of TBs of customer data
across multiple data centers
Increased revenues from improved
customer intimacy, driving cross-
sell and upsell
Global
Airline
Data Model Flexibility
…Mobile
App
Web
Call
Centre CRM Social
Feed
COMMON FIELDS
CustomerID | eMail |
DYNAMIC FIELDS
Can vary from record to record: location, action
Single View
Customer
Service Application
MongoDB	
  Primary	
  Replica	
  
Single	
  View	
  
BI & Reporting
	
  
	
  
REST Data Services
	
  
	
  
Real-time
Data Services for
Regulators & Partners
VisualisationsQueries
& Updates
Aggregates	
  
Predictive Analytics
MongoDB	
  Secondary	
  Replica	
  
Single	
  View	
  MongoDB	
  Secondary	
  Replica	
  
Single	
  View	
  MongoDB	
  Secondary	
  Replica	
  
Single	
  View	
  MongoDB	
  Secondary	
  Replica	
  
Single	
  View	
  MongoDB	
  Secondary	
  Replica	
  
MongoDB	
  Secondary	
  Replica	
  
Data Analytics
Pipeline
Real-Time Analytics
Predictable Scale & Always-On
Shard 1
Horizontally Scalable
Shard 2 Shard 3 Shard n

More Related Content

What's hot

What's hot (8)

Msbi online training
Msbi online trainingMsbi online training
Msbi online training
 
Sap business Objects certification note paper1
Sap business Objects certification note paper1Sap business Objects certification note paper1
Sap business Objects certification note paper1
 
My Experience of Salesforce Project
My Experience of Salesforce ProjectMy Experience of Salesforce Project
My Experience of Salesforce Project
 
SharePoint 2010 Managed Metadata Service
SharePoint 2010 Managed Metadata ServiceSharePoint 2010 Managed Metadata Service
SharePoint 2010 Managed Metadata Service
 
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing TagSPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
SPSTCDC - Managed Metadata and Taxonomies in SharePoint 2010 - Playing Tag
 
Ibm info sphere datastage tutorial part 1 architecture examples
Ibm info sphere datastage tutorial part 1  architecture examplesIbm info sphere datastage tutorial part 1  architecture examples
Ibm info sphere datastage tutorial part 1 architecture examples
 
SharePoint 2010 Managed Metadata
SharePoint 2010 Managed MetadataSharePoint 2010 Managed Metadata
SharePoint 2010 Managed Metadata
 
Sap business objects 4 quick start manual
Sap business objects 4 quick start manualSap business objects 4 quick start manual
Sap business objects 4 quick start manual
 

Viewers also liked

Blockchain & the IoT
Blockchain & the IoTBlockchain & the IoT
Blockchain & the IoTMat Keep
 
Webinar: 10-Step Guide to Creating a Single View of your Business
Webinar: 10-Step Guide to Creating a Single View of your BusinessWebinar: 10-Step Guide to Creating a Single View of your Business
Webinar: 10-Step Guide to Creating a Single View of your BusinessMongoDB
 
Bitcoin, Blockchain, and IoT
Bitcoin, Blockchain, and IoTBitcoin, Blockchain, and IoT
Bitcoin, Blockchain, and IoTRobin Teigland
 
IoT and Blockchain Convergence
IoT and Blockchain ConvergenceIoT and Blockchain Convergence
IoT and Blockchain ConvergenceAhmed Banafa
 
Gouvernance Office 365 avec et sans PowerShell - SharePoint Days 2017 Tunis
Gouvernance Office 365 avec et sans PowerShell - SharePoint Days 2017 TunisGouvernance Office 365 avec et sans PowerShell - SharePoint Days 2017 Tunis
Gouvernance Office 365 avec et sans PowerShell - SharePoint Days 2017 TunisEtienne Bailly
 
Writing Groups in Language Teaching
Writing Groups in Language Teaching Writing Groups in Language Teaching
Writing Groups in Language Teaching Malu Sciamarelli
 
Can We Achieve the SDGs?
Can We Achieve the SDGs? Can We Achieve the SDGs?
Can We Achieve the SDGs? socprog
 
1ST YEAR Infographics about team sport
 1ST YEAR Infographics about team sport 1ST YEAR Infographics about team sport
1ST YEAR Infographics about team sportCiclos Formativos
 
Human toxicity, environmental impact and legal implications of water fluorida...
Human toxicity, environmental impact and legal implications of water fluorida...Human toxicity, environmental impact and legal implications of water fluorida...
Human toxicity, environmental impact and legal implications of water fluorida...Declan Waugh
 
These words I share, written from despair, read them, speak them, but do so w...
These words I share, written from despair, read them, speak them, but do so w...These words I share, written from despair, read them, speak them, but do so w...
These words I share, written from despair, read them, speak them, but do so w...Blair Stuart
 
Isu isu trenda terkini dalam teknologi pendidikan
Isu isu trenda terkini dalam teknologi pendidikanIsu isu trenda terkini dalam teknologi pendidikan
Isu isu trenda terkini dalam teknologi pendidikanRenee Evelyn
 
正規言語について
正規言語について正規言語について
正規言語についてJumpei Ogawa
 
Hazop gijutsushikai chubu koukuukai
Hazop gijutsushikai chubu koukuukai Hazop gijutsushikai chubu koukuukai
Hazop gijutsushikai chubu koukuukai Kiyoshi Ogawa
 
Yapc Asia 2009 ペパボでのPerlの使い方
Yapc Asia 2009 ペパボでのPerlの使い方Yapc Asia 2009 ペパボでのPerlの使い方
Yapc Asia 2009 ペパボでのPerlの使い方hiboma
 
The New Framework for Information Literacy for Higher Education
The New Framework for Information Literacy for Higher EducationThe New Framework for Information Literacy for Higher Education
The New Framework for Information Literacy for Higher EducationTrudi Jacobson
 
Ruby HTTP clients comparison
Ruby HTTP clients comparisonRuby HTTP clients comparison
Ruby HTTP clients comparisonHiroshi Nakamura
 
GBM Group Based Marketing: Marketing to Groups
GBM Group Based Marketing: Marketing to GroupsGBM Group Based Marketing: Marketing to Groups
GBM Group Based Marketing: Marketing to GroupsScott Levine
 

Viewers also liked (20)

Blockchain & the IoT
Blockchain & the IoTBlockchain & the IoT
Blockchain & the IoT
 
Webinar: 10-Step Guide to Creating a Single View of your Business
Webinar: 10-Step Guide to Creating a Single View of your BusinessWebinar: 10-Step Guide to Creating a Single View of your Business
Webinar: 10-Step Guide to Creating a Single View of your Business
 
Bitcoin, Blockchain, and IoT
Bitcoin, Blockchain, and IoTBitcoin, Blockchain, and IoT
Bitcoin, Blockchain, and IoT
 
IoT and Blockchain Convergence
IoT and Blockchain ConvergenceIoT and Blockchain Convergence
IoT and Blockchain Convergence
 
Gouvernance Office 365 avec et sans PowerShell - SharePoint Days 2017 Tunis
Gouvernance Office 365 avec et sans PowerShell - SharePoint Days 2017 TunisGouvernance Office 365 avec et sans PowerShell - SharePoint Days 2017 Tunis
Gouvernance Office 365 avec et sans PowerShell - SharePoint Days 2017 Tunis
 
Top Content Marketing Formats
Top Content Marketing FormatsTop Content Marketing Formats
Top Content Marketing Formats
 
Writing Groups in Language Teaching
Writing Groups in Language Teaching Writing Groups in Language Teaching
Writing Groups in Language Teaching
 
Can We Achieve the SDGs?
Can We Achieve the SDGs? Can We Achieve the SDGs?
Can We Achieve the SDGs?
 
1ST YEAR Infographics about team sport
 1ST YEAR Infographics about team sport 1ST YEAR Infographics about team sport
1ST YEAR Infographics about team sport
 
Human toxicity, environmental impact and legal implications of water fluorida...
Human toxicity, environmental impact and legal implications of water fluorida...Human toxicity, environmental impact and legal implications of water fluorida...
Human toxicity, environmental impact and legal implications of water fluorida...
 
These words I share, written from despair, read them, speak them, but do so w...
These words I share, written from despair, read them, speak them, but do so w...These words I share, written from despair, read them, speak them, but do so w...
These words I share, written from despair, read them, speak them, but do so w...
 
Isu isu trenda terkini dalam teknologi pendidikan
Isu isu trenda terkini dalam teknologi pendidikanIsu isu trenda terkini dalam teknologi pendidikan
Isu isu trenda terkini dalam teknologi pendidikan
 
正規言語について
正規言語について正規言語について
正規言語について
 
C# & AWS Lambda
C# & AWS LambdaC# & AWS Lambda
C# & AWS Lambda
 
Hazop gijutsushikai chubu koukuukai
Hazop gijutsushikai chubu koukuukai Hazop gijutsushikai chubu koukuukai
Hazop gijutsushikai chubu koukuukai
 
Yapc Asia 2009 ペパボでのPerlの使い方
Yapc Asia 2009 ペパボでのPerlの使い方Yapc Asia 2009 ペパボでのPerlの使い方
Yapc Asia 2009 ペパボでのPerlの使い方
 
The New Framework for Information Literacy for Higher Education
The New Framework for Information Literacy for Higher EducationThe New Framework for Information Literacy for Higher Education
The New Framework for Information Literacy for Higher Education
 
Ruby HTTP clients comparison
Ruby HTTP clients comparisonRuby HTTP clients comparison
Ruby HTTP clients comparison
 
GBM Group Based Marketing: Marketing to Groups
GBM Group Based Marketing: Marketing to GroupsGBM Group Based Marketing: Marketing to Groups
GBM Group Based Marketing: Marketing to Groups
 
How to Kill a Word
How to Kill a WordHow to Kill a Word
How to Kill a Word
 

Similar to 10-Step Methodology to Building a Single View with MongoDB

MongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB
 
How to deliver a Single View in Financial Services
 How to deliver a Single View in Financial Services How to deliver a Single View in Financial Services
How to deliver a Single View in Financial ServicesMongoDB
 
Big Data Paris - A Modern Enterprise Architecture
Big Data Paris - A Modern Enterprise ArchitectureBig Data Paris - A Modern Enterprise Architecture
Big Data Paris - A Modern Enterprise ArchitectureMongoDB
 
Methodology - Conceptual Database Design Transparencies .pptx
Methodology -  Conceptual Database Design Transparencies .pptxMethodology -  Conceptual Database Design Transparencies .pptx
Methodology - Conceptual Database Design Transparencies .pptxbirhanugirmay559
 
Single View of the Customer
Single View of the Customer Single View of the Customer
Single View of the Customer MongoDB
 
Pascua Yaqui Tribe App in a day and dashboard in day
Pascua Yaqui Tribe App in a day and dashboard in dayPascua Yaqui Tribe App in a day and dashboard in day
Pascua Yaqui Tribe App in a day and dashboard in dayVishal Pawar
 
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)Piet J.H. Daas
 
The Evolution of a Scrappy Startup to a Successful Web Service
The Evolution of a Scrappy Startup to a Successful Web ServiceThe Evolution of a Scrappy Startup to a Successful Web Service
The Evolution of a Scrappy Startup to a Successful Web ServicePoornima Vijayashanker
 
Emvigo Data Visualization - E Commerce Deck
Emvigo Data Visualization - E Commerce DeckEmvigo Data Visualization - E Commerce Deck
Emvigo Data Visualization - E Commerce DeckEmvigo Technologies
 
Ecommerce by bhawani nandan prasad
Ecommerce by bhawani nandan prasadEcommerce by bhawani nandan prasad
Ecommerce by bhawani nandan prasadBhawani N Prasad
 
Unlock your core business assets for the hybrid cloud with addi webinar dec...
Unlock your core business assets for the hybrid cloud with addi   webinar dec...Unlock your core business assets for the hybrid cloud with addi   webinar dec...
Unlock your core business assets for the hybrid cloud with addi webinar dec...Sherri Hanna
 
Methodology conceptual databases design roll no. 99 & 111
Methodology conceptual databases design roll no. 99 & 111Methodology conceptual databases design roll no. 99 & 111
Methodology conceptual databases design roll no. 99 & 111Manoj Nolkha
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysNEWYORKSYS-IT SOLUTIONS
 
Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Sri Ambati
 
Productionalize content recommendation engine
Productionalize content recommendation engine Productionalize content recommendation engine
Productionalize content recommendation engine Kim Ming Teh
 
AWS July Webinar Series: Amazon Redshift Reporting and Advanced Analytics
AWS July Webinar Series: Amazon Redshift Reporting and Advanced AnalyticsAWS July Webinar Series: Amazon Redshift Reporting and Advanced Analytics
AWS July Webinar Series: Amazon Redshift Reporting and Advanced AnalyticsAmazon Web Services
 
Ojoconsulting Oy Nimbus Monitoring Service description v1.2 public
Ojoconsulting Oy Nimbus Monitoring Service description v1.2 publicOjoconsulting Oy Nimbus Monitoring Service description v1.2 public
Ojoconsulting Oy Nimbus Monitoring Service description v1.2 publicOjoconsulting Oy
 
Winning with data
Winning with dataWinning with data
Winning with dataNUS-ISS
 

Similar to 10-Step Methodology to Building a Single View with MongoDB (20)

MongoDB in a Mainframe World
MongoDB in a Mainframe WorldMongoDB in a Mainframe World
MongoDB in a Mainframe World
 
How to deliver a Single View in Financial Services
 How to deliver a Single View in Financial Services How to deliver a Single View in Financial Services
How to deliver a Single View in Financial Services
 
Big Data Paris - A Modern Enterprise Architecture
Big Data Paris - A Modern Enterprise ArchitectureBig Data Paris - A Modern Enterprise Architecture
Big Data Paris - A Modern Enterprise Architecture
 
Methodology - Conceptual Database Design Transparencies .pptx
Methodology -  Conceptual Database Design Transparencies .pptxMethodology -  Conceptual Database Design Transparencies .pptx
Methodology - Conceptual Database Design Transparencies .pptx
 
Single View of the Customer
Single View of the Customer Single View of the Customer
Single View of the Customer
 
Pascua Yaqui Tribe App in a day and dashboard in day
Pascua Yaqui Tribe App in a day and dashboard in dayPascua Yaqui Tribe App in a day and dashboard in day
Pascua Yaqui Tribe App in a day and dashboard in day
 
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
 
The Evolution of a Scrappy Startup to a Successful Web Service
The Evolution of a Scrappy Startup to a Successful Web ServiceThe Evolution of a Scrappy Startup to a Successful Web Service
The Evolution of a Scrappy Startup to a Successful Web Service
 
VINOD_6yrs
VINOD_6yrsVINOD_6yrs
VINOD_6yrs
 
Emvigo Data Visualization - E Commerce Deck
Emvigo Data Visualization - E Commerce DeckEmvigo Data Visualization - E Commerce Deck
Emvigo Data Visualization - E Commerce Deck
 
Ecommerce by bhawani nandan prasad
Ecommerce by bhawani nandan prasadEcommerce by bhawani nandan prasad
Ecommerce by bhawani nandan prasad
 
Unlock your core business assets for the hybrid cloud with addi webinar dec...
Unlock your core business assets for the hybrid cloud with addi   webinar dec...Unlock your core business assets for the hybrid cloud with addi   webinar dec...
Unlock your core business assets for the hybrid cloud with addi webinar dec...
 
Methodology conceptual databases design roll no. 99 & 111
Methodology conceptual databases design roll no. 99 & 111Methodology conceptual databases design roll no. 99 & 111
Methodology conceptual databases design roll no. 99 & 111
 
Rohan_Purekar
Rohan_PurekarRohan_Purekar
Rohan_Purekar
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
 
Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...
 
Productionalize content recommendation engine
Productionalize content recommendation engine Productionalize content recommendation engine
Productionalize content recommendation engine
 
AWS July Webinar Series: Amazon Redshift Reporting and Advanced Analytics
AWS July Webinar Series: Amazon Redshift Reporting and Advanced AnalyticsAWS July Webinar Series: Amazon Redshift Reporting and Advanced Analytics
AWS July Webinar Series: Amazon Redshift Reporting and Advanced Analytics
 
Ojoconsulting Oy Nimbus Monitoring Service description v1.2 public
Ojoconsulting Oy Nimbus Monitoring Service description v1.2 publicOjoconsulting Oy Nimbus Monitoring Service description v1.2 public
Ojoconsulting Oy Nimbus Monitoring Service description v1.2 public
 
Winning with data
Winning with dataWinning with data
Winning with data
 

More from Mat Keep

MongoDB at Baidu
MongoDB at BaiduMongoDB at Baidu
MongoDB at BaiduMat Keep
 
MongoDB_Spark
MongoDB_SparkMongoDB_Spark
MongoDB_SparkMat Keep
 
Business of iot_mongodb_spark
Business of iot_mongodb_sparkBusiness of iot_mongodb_spark
Business of iot_mongodb_sparkMat Keep
 
Mongo db 2.6_security_architecture
Mongo db 2.6_security_architectureMongo db 2.6_security_architecture
Mongo db 2.6_security_architectureMat Keep
 
PayPal Big Data and MySQL Cluster
PayPal Big Data and MySQL ClusterPayPal Big Data and MySQL Cluster
PayPal Big Data and MySQL ClusterMat Keep
 
MySQL HA Solutions
MySQL HA SolutionsMySQL HA Solutions
MySQL HA SolutionsMat Keep
 
MySQL Cluster NoSQL Memcached API
MySQL Cluster NoSQL Memcached APIMySQL Cluster NoSQL Memcached API
MySQL Cluster NoSQL Memcached APIMat Keep
 
MySQL Cluster performance best practices
MySQL Cluster performance best practicesMySQL Cluster performance best practices
MySQL Cluster performance best practicesMat Keep
 
My sql 5.6_replwebinar_may12
My sql 5.6_replwebinar_may12My sql 5.6_replwebinar_may12
My sql 5.6_replwebinar_may12Mat Keep
 
NoSQL and MySQL webinar - best of both worlds
NoSQL and MySQL webinar - best of both worldsNoSQL and MySQL webinar - best of both worlds
NoSQL and MySQL webinar - best of both worldsMat Keep
 

More from Mat Keep (10)

MongoDB at Baidu
MongoDB at BaiduMongoDB at Baidu
MongoDB at Baidu
 
MongoDB_Spark
MongoDB_SparkMongoDB_Spark
MongoDB_Spark
 
Business of iot_mongodb_spark
Business of iot_mongodb_sparkBusiness of iot_mongodb_spark
Business of iot_mongodb_spark
 
Mongo db 2.6_security_architecture
Mongo db 2.6_security_architectureMongo db 2.6_security_architecture
Mongo db 2.6_security_architecture
 
PayPal Big Data and MySQL Cluster
PayPal Big Data and MySQL ClusterPayPal Big Data and MySQL Cluster
PayPal Big Data and MySQL Cluster
 
MySQL HA Solutions
MySQL HA SolutionsMySQL HA Solutions
MySQL HA Solutions
 
MySQL Cluster NoSQL Memcached API
MySQL Cluster NoSQL Memcached APIMySQL Cluster NoSQL Memcached API
MySQL Cluster NoSQL Memcached API
 
MySQL Cluster performance best practices
MySQL Cluster performance best practicesMySQL Cluster performance best practices
MySQL Cluster performance best practices
 
My sql 5.6_replwebinar_may12
My sql 5.6_replwebinar_may12My sql 5.6_replwebinar_may12
My sql 5.6_replwebinar_may12
 
NoSQL and MySQL webinar - best of both worlds
NoSQL and MySQL webinar - best of both worldsNoSQL and MySQL webinar - best of both worlds
NoSQL and MySQL webinar - best of both worlds
 

Recently uploaded

Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfFerryKemperman
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...Akihiro Suda
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfStefano Stabellini
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odishasmiwainfosol
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...OnePlan Solutions
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)jennyeacort
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZABSYZ Inc
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commercemanigoyal112
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentationvaddepallysandeep122
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 

Recently uploaded (20)

Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdf
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
20240415 [Container Plumbing Days] Usernetes Gen2 - Kubernetes in Rootless Do...
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdf
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZ
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commerce
 
PREDICTING RIVER WATER QUALITY ppt presentation
PREDICTING  RIVER  WATER QUALITY  ppt presentationPREDICTING  RIVER  WATER QUALITY  ppt presentation
PREDICTING RIVER WATER QUALITY ppt presentation
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 

10-Step Methodology to Building a Single View with MongoDB

  • 1. 10-Step Methodology to Building a Single View MatKeep,DirectorofProduct&MarketAnalysis. mat.keep@mongodb.com @matkeep JonRangel,DirectorofProfessionalServices,EMEA. jon.rangel@mongodb.com
  • 2. What You Will Learn 1.  Single View: Opportunities & Challenges 2.  Repeatable 10-Step Methodology 3.  Required Technical Capabilities
  • 4. Single View Defined •  What –  Single, real-time representation of a business entity or domain –  Customer, product, supply chain, financial asset class, & more •  How –  Gathers and organizes data from multiple, disconnected sources; –  Aggregates information into a standardized format and joint information model •  Why –  Improves business visibility –  Serve operational applications –  Foundation for analytics
  • 5. Single View Use Cases •  Comparative view of traders or products •  Firm-wide view of asset exposure •  Aggregated transactions for fraud models •  Omni-channel view of customers for personalized marketing •  Inventory control & management •  Single view of product across channels & demographics •  Management of patient medical records for treatment plans •  Macro-analysis view for public health •  Medical history to identify insurance risk Finance Retail Healthcare
  • 6. Challenges •  Current State –  Data dispersed across multitude of systems –  Different structures, different attributes –  Apps built to meet specific business requirements, not integrated –  New data sources from new apps, M&A •  Governance Processes –  How to deliver & maintain single view in face of constant business change •  Technology Limitations –  Traditional databases not well suited to single view required capabilities
  • 9. 10-Step Methodology Step 1: Define Scope Step 4: Appoint Data Stewards Step 5: Develop Data Model Step 6: Load & Standardize Step 7: Merge, Test & Reconcile Step 8: Infrastructure Design Step 3: Identify Data Producers Step 2: Identify Data Consumers Step 9: Modify Consuming Systems Step 10: Maintenance Processes Discover Develop Deploy
  • 10. Step 1: Define Scope & Sponsorship •  Scope needs to be realistic, defined by specific success metric –  Long term: aggregate all customer data into a single view, serving all business functions –  Initial phase: collecting all customer interactions on digital channels over past 3-months to improve call center MTTR •  Appoint executive sponsors –  Senior: allocate resources and command credibility –  Combination of senior title from the business, and from the technology group Discover
  • 11. Web Mobile CRM Mainframe Source Systems Steps 2 & 3: Identify Data Consumers & Producers •  Single View Consumers Define –  Typical queries and SLAs –  Required data attributes –  Current data sources •  Identify apps generating the source data –  Identify application owners + associated databases –  Profile apps: operational, analytical Step 2: Data Consumers Step 3: Data Producers Discover
  • 12. Step 4: Appoint Data Stewards •  Data steward appointed for each data source. •  Deep knowledge of: –  Source system schema –  Which tables store required attributes, what format –  Clients and apps that generate & consume the source data •  Advise on data loading strategies Develop
  • 13. Step 5: Develop Single View Data Model •  Key inputs –  Required data attributes –  Query patterns •  Define common fields & data types –  Create rules to validate common data •  Define primary & secondary indexes •  Identify dynamic fields –  No need to pre-declare when using a document database •  Localize data into a single document (where appropriate) { _id : “mark.smith@mongodb.com”, first_name : "Mark", last_name : "Smith", city : "San Francisco", phones: [ { number : “1-212-777-1212”, dnc : true, type : “home” }, { number : “1-212-777-1213”, type : “cell” }]} Single View Develop
  • 14. Resources to Support Schema Design MongoDB Documentation MongoDB Development Rapid Start Develop
  • 15. Step 6: Load 2 phases: Initial Load & Delta Load Emit JSON to preserve data types. Use Extended JSON Load ETLorMessageQueue Single View Develop Initial Load •  ETL Tools •  Custom Loaders Delta Load •  Batch loads: use tools above •  Real-time loads: Message queue
  • 16. Step 6 (cont’d): Standardize Data  Source  A   Data  Source  B   Data  Source  C   14   77   26   cust_id:  14   f_name:  James   l_name:  Bond   dob:  07/14/1968   eMail:  007@spook.com   fno:  77   first:  Jim   last:  Bond   born:  1968-­‐07-­‐14   email:  007@spook.com   xc_id:  26   name:  James  Bind   bdate:  July  14,  68   Email:  007@spook.com   Develop
  • 17. Step 7: Match, Merge & Reconcile Develop cust_id:  14   f_name:  James   l_name:  Bond   dob:  07/14/1968   eMail:  007@spook.com   xc_id:  26   name:  James  Bind   bdate:  July  14,  68   Email:  007@spook.com   source_id:  A_14   first_name:  James   last_name:  Bond   dob:  1968-­‐07-­‐14   eMail:  007@spook.com   source_id:  B_77   first_name:  Jim   last_name:  Bond   dob:  1968-­‐07-­‐14   eMail:  007@spook.com   source_id:  C_26   first_name:  James   last_name:  Bind   dob:  1968-­‐07-­‐14   eMail:  007@spook.com   _id:  007@spook.com   first_name:  James   last_name:  Bond   dob:  1968-­‐07-­‐14   Source   Data   Standardized  Data   Field  names  &  data  types   Single  View   Data  merged,     tested  &  reconciled   fno:  77   first:  Jim   last:  Bond   born:  1968-­‐07-­‐14   email:  007@spook.com  
  • 18. Step 7 (cont’d): Match, Merge & Reconcile •  Use iterative grouping functions to cluster records with similar attributes 1.  Match against unique, authoritative attributes (email address, credit card #) 2.  Match by combining attributes (last name, DoB, zip code) 3.  Use fuzzy matching to catch errors in source data (i.e. different spellings of customer name) •  Apply confidence factor to dictate merging –  Automatically merge records with 95%+ confidence –  Manually inspect records with lower confidence Develop
  • 19. Step 7 (cont’d): MongoDB Tools •  Workers framework to parallelize document comparisons •  Grouping tool to cluster documents based on attribute similarity –  Levenshtein to calculate distances, single-linkage clustering for matching Develop
  • 20. Step 8: Architecture Design Deploy •  Deployment infrastructure •  MongoDB Production Readiness Consulting Package provides recommendations: –  Hardware sizing –  HA/DR strategies –  Scaling –  Security for corporate and regulatory compliance •  Follow-on services for implementation
  • 21. Step 9: Modify Consuming Systems Deploy •  Modify the apps that consume the single view –  Create an API that exposes the single view (i.e. RESTful web service) –  Re-point apps to the web service (reads initially) •  Modify one consuming application at time Call Center Analytics Technical Support Billing Consuming Systems Reads Single View
  • 22. Step 10: Implement Maintenance Processes Deploy •  Frequency of application launch & evolution is accelerating •  Impacts to single view –  Adding new attributes from source systems –  Onboarding new data sources or digital channels –  Creating new apps that consume the single view •  Single view team needs to institutionalize governance around on-going maintenance –  Repeat the 10-step process –  Dynamic schema is HUGE!
  • 24. Scope BusinessBenefits Transactions are written first to the single view, which propagates the data back to the source system of record. Writes are performed concurrently to the source systems as well as the single view The single view data model is enriched with additional sources to serve more applications, including real-time analytics. The single view becomes a platform serving multiple applications Single View Platform Records are copied via ETL or message queue mechanisms from the source systems into the single view, serving read queries. The single view serves one specific application Single View Application Single View First Dual Writes Read Centric Transforming the role of the single view Reads & Writes Single View Maturity Model •  Advantages of writing to the single view –  Fresher data –  Reduced app complexity –  Improved application agility
  • 25. Architecture for Writes to the Single View ETLorMessageQueue Web Mobile CRM Mainframe Single View Call Center Analytics Technical Support Billing Update Queue Reads Writes Source Systems Consuming Systems Load
  • 27. Single View with a Relational Database
  • 28. Required Database Capabilities •  Data model flexibility with a dynamic schema •  Real-time analytics •  Performance, scale & always-on •  Enterprise deployment model
  • 29. MongoDB Compass MongoDB Connector for BI MongoDB Enterprise Server Enterprise Deployment Model  24x7Support (1hourSLA) CommercialLicense (NoAGPLCopyleftRestrictions) Platform Certifications MongoDB Ops Manager Monitoring  &   AlerBng   Query   OpBmizaBon   Backup  &   Recovery   AutomaBon  &   ConfiguraBon   Schema  VisualizaBon   Data  ExploraBon   Ad-­‐Hoc  Queries   VisualizaBon   Analysis   ReporBng   AuthorizaBon   AudiBng   EncrypBon   (In  Flight  &  at  Rest)   AuthenBcaBon   REST  API  Emergency Patches Customer Success Program On-Demand Online Training Warranty Limitation of Liability Indemnification
  • 30. Single View In Action
  • 31. Single View of Customer Insurance leader generates coveted single view of customers in 90 days – “The Wall” Problem   Why  MongoDB   Results  Problem Solution Results No single view of customer, leading to poor customer experience and churn 145 years of policy data, 70+ systems, 24 800 numbers, 15+ front-end apps that are not integrated Spent 2 years, $25M trying build single view with RDBMS – failed Built “The Wall,” pulling in disparate data and serving single view to customer service reps in real time Flexible data model to aggregate disparate data into single data store Expressive query language and secondary indexes to serve any field in real time Prototyped in 2 weeks Deployed to production in 90 days Decreased churn and improved ability to upsell/cross-sell
  • 32. Single View of LHC Analytics Data aggregation system to accelerate scientific research & discovery Problem   Why  MongoDB   Results  Problem Solution Results Raw data from LHC & experiments distributed across multitude of source systems Scientists don’t know location of source data, or how to extract it Relational databases rigid data model prevented aggregation of data from different sources Data Aggregation System built on MongoDB, consolidating analytics into a single view Dynamic schema represents data of any structure MongoDB query language supports simple lookups to complex search, traversals & analytics A single query to MongoDB can return 10,000 documents from different data sources for real time analytics Accelerates scientific time to insight Accessed by 3,000 physicists from 200 research institutions across the globe
  • 34. Where to Go from Here? •  Single view projects are challenging –  Partner with a vendor offering proven methodology, tools & technologies •  Learn More –  Download the whitepaper –  10-Step Methodology to Building a Single View •  Engage –  MongoDB Global Consulting Services can help you scope the project and get started –  Book a workshop
  • 36. Single View of the Customer 360° view of the customer increases customer satisfaction, cross-sell & up-sell with MongoDB, Spark, & Hadoop Problem   Why  MongoDB   Results  Problem Solution Results Customer data scattered across 100+ different systems Poor customer experience: no personalization, no consistent experience across brands or devices No way to analyze customer behavior to deliver targeted offers Single View application on MongoDB flexible data model, expressive query language, secondary indexes, & horizontal scalability Data from old relational systems fed into Spark for analysis and then stored in MongoDB to support real-time CRM Customer data synced from MongoDB to Hadoop for nightly batch jobs, then fed back to MongoDB for personalized recommendations Single view serves customers from any channel Stores 10s of TBs of customer data across multiple data centers Increased revenues from improved customer intimacy, driving cross- sell and upsell Global Airline
  • 37. Data Model Flexibility …Mobile App Web Call Centre CRM Social Feed COMMON FIELDS CustomerID | eMail | DYNAMIC FIELDS Can vary from record to record: location, action Single View
  • 38. Customer Service Application MongoDB  Primary  Replica   Single  View   BI & Reporting     REST Data Services     Real-time Data Services for Regulators & Partners VisualisationsQueries & Updates Aggregates   Predictive Analytics MongoDB  Secondary  Replica   Single  View  MongoDB  Secondary  Replica   Single  View  MongoDB  Secondary  Replica   Single  View  MongoDB  Secondary  Replica   Single  View  MongoDB  Secondary  Replica   MongoDB  Secondary  Replica   Data Analytics Pipeline Real-Time Analytics
  • 39. Predictable Scale & Always-On Shard 1 Horizontally Scalable Shard 2 Shard 3 Shard n