SlideShare a Scribd company logo
1 of 18
Download to read offline
Tracking data lineage
with Neo4j and
Linkurious.
SAS founded in 2013 in Paris | http://linkurio.us | @linkurious
French startup specialized in graph-visualization.
CTO
Web-scale archiving
Université de
Technologie de
Compiègne
CMO
>5 years in consulting
Sciences Po + Ecole
de Guerre
Economique
Jean
Villedieu
Sébastien
Heymann
David
Rapin
CEO
Created Gephi
Phd in CS and
complex systems
from UPMC
What is a graph?
PERSON
name: Séb
age: 29
PERSON
name: Jean
age: 31
LOCATION
name: Paris
Lives
in Lives
in
Knows
A graph is a set of nodes and relationships.
This is a node
This is a
relationship
PERSON
name: Séb
age: 29
PERSON
name: Jean
age: 31
LOCATION
name: Paris
This is a
property
What is data lineage?
“Data lineage is defined as a
data life cycle that includes the
data's origins and where it
moves over time. It describes
what happens to data as it goes
through diverse processes”
- Wikipedia
A real-world data pipeline.
Top 5 data lineage questions.
1. Where is this data coming from?
2. Who has access to that information?
3. Do we have sensitive data that’s being propagated
unsafely?
4. Is my database still being used in an important
company process or can I remove it?
5. What systems and reports would be impacted by a
change in that particular process?
Traditional databases are not adapted to data lineage.
Hard to query
Querying connected data
through SQL is a hard and
error-prone process.
Slow
Slow performances for
questions requiring looking
up multiple connections.
Too rigid
Hard to accommodate an
evolving data model in a
relational database.
The cost of bad data lineage.
● A general lack of confidence in data;
● Potential legal exposure;
● Finding answers and making decisions
becomes complex and time-
consuming;
...it results in wasted time, money,
opportunities, etc.
Graph DBs are perfect
for data lineage.
● Easy to model the flow of data in a graph;
● Query relationships with ease and in real-time;
● Adapt your schema to accommodate new data
and relationships;
● Popularity of graph databases has increased
500% in the last 2 years and our partner Neo4j
is the leader.
Linkurious brings
ability to find answers.
● Tech and business users can search the data
lineage intuitively and find answers;
● Visualization brings ability to understand and
communicate complex connections;
● Accelerate and improve decisions.
Unique ability to store and analyse your data lineage.
Neo4j
Your data lineage is a large
graph. Store and query it
quickly with Neo4j.
Linkurious
Search and find answers
easily through a visual
interface.
Metadata Process
System
Process Metadata ReportMetadata
System System
Example: a graph model for data lineage.
Question #1: what’s the data lineage of this report?
Our business people need to know what data was
used to generate this month’s sales report. I need
to understand which metadata, which systems
and which processes were involved.
IT Analyst
Question #1: visualize the data lineage of a report.
It only takes a few minutes to search a report and analyse its lineage. No need to
be an expert!
Question #2: what is this database used for?
We’re relocating our datacenter and need to move
a server on which a database is stored? Can we
decommission it? I need to understand what
processes and reports rely on this server.
IT Analyst
Question #2: visualize an impact analysis.
We can visualize and inspect the complex set of relationships involved in the
impact analysis.
Conclusion.
Contact us to discuss your projects
at contact@linkurio.us

More Related Content

More from Linkurious

More from Linkurious (20)

GraphTech Ecosystem - part 2: Graph Analytics
 GraphTech Ecosystem - part 2: Graph Analytics GraphTech Ecosystem - part 2: Graph Analytics
GraphTech Ecosystem - part 2: Graph Analytics
 
GraphTech Ecosystem - part 1: Graph Databases
GraphTech Ecosystem - part 1: Graph DatabasesGraphTech Ecosystem - part 1: Graph Databases
GraphTech Ecosystem - part 1: Graph Databases
 
3 types of fraud graph analytics can help defeat
3 types of fraud graph analytics can help defeat3 types of fraud graph analytics can help defeat
3 types of fraud graph analytics can help defeat
 
Graph analytics in Linkurious Enterprise
Graph analytics in Linkurious EnterpriseGraph analytics in Linkurious Enterprise
Graph analytics in Linkurious Enterprise
 
Graph technology and data-journalism: the case of the Paradise Papers
Graph technology and data-journalism: the case of the Paradise PapersGraph technology and data-journalism: the case of the Paradise Papers
Graph technology and data-journalism: the case of the Paradise Papers
 
Visualize the Knowledge Graph and Unleash Your Data
Visualize the Knowledge Graph and Unleash Your DataVisualize the Knowledge Graph and Unleash Your Data
Visualize the Knowledge Graph and Unleash Your Data
 
Graph-based Product Lifecycle Management
Graph-based Product Lifecycle ManagementGraph-based Product Lifecycle Management
Graph-based Product Lifecycle Management
 
Fraudes Financières: Méthodes de Prévention et Détection
Fraudes Financières: Méthodes de Prévention et DétectionFraudes Financières: Méthodes de Prévention et Détection
Fraudes Financières: Méthodes de Prévention et Détection
 
Detecting eCommerce Fraud with Neo4j and Linkurious
Detecting eCommerce Fraud with Neo4j and LinkuriousDetecting eCommerce Fraud with Neo4j and Linkurious
Detecting eCommerce Fraud with Neo4j and Linkurious
 
Graph-based Network & IT Management.
Graph-based Network & IT Management.Graph-based Network & IT Management.
Graph-based Network & IT Management.
 
Graph-powered data lineage in Finance
Graph-powered data lineage in FinanceGraph-powered data lineage in Finance
Graph-powered data lineage in Finance
 
Using Linkurious in your Enterprise Architecture projects
Using Linkurious in your Enterprise Architecture projectsUsing Linkurious in your Enterprise Architecture projects
Using Linkurious in your Enterprise Architecture projects
 
Linkurious SDK: Build enterprise-ready graph applications faster
Linkurious SDK: Build enterprise-ready graph applications fasterLinkurious SDK: Build enterprise-ready graph applications faster
Linkurious SDK: Build enterprise-ready graph applications faster
 
Fighting financial crime with graph analysis at BIWA Summit 2017
Fighting financial crime with graph analysis at BIWA Summit 2017Fighting financial crime with graph analysis at BIWA Summit 2017
Fighting financial crime with graph analysis at BIWA Summit 2017
 
Reinforcing AML systems with graph technologies.
Reinforcing AML systems with graph technologies.Reinforcing AML systems with graph technologies.
Reinforcing AML systems with graph technologies.
 
Using graphs technologies for intelligence analysis.
Using graphs technologies for intelligence analysis. Using graphs technologies for intelligence analysis.
Using graphs technologies for intelligence analysis.
 
The 8 most common graph visualization mistakes
The 8 most common graph visualization mistakesThe 8 most common graph visualization mistakes
The 8 most common graph visualization mistakes
 
Panama papers: how ICIJ used Linkurious to investigate the Mossack Fonseca leaks
Panama papers: how ICIJ used Linkurious to investigate the Mossack Fonseca leaksPanama papers: how ICIJ used Linkurious to investigate the Mossack Fonseca leaks
Panama papers: how ICIJ used Linkurious to investigate the Mossack Fonseca leaks
 
La visualisation au service de la lutte contre la fraude
La visualisation au service de la lutte contre la fraudeLa visualisation au service de la lutte contre la fraude
La visualisation au service de la lutte contre la fraude
 
Finding answers through visualization (GraphDay Barcelona Feb 2016)
Finding answers through visualization (GraphDay Barcelona Feb 2016)Finding answers through visualization (GraphDay Barcelona Feb 2016)
Finding answers through visualization (GraphDay Barcelona Feb 2016)
 

Recently uploaded

怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
ptikerjasaptiker
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
vexqp
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 

Recently uploaded (20)

DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
SR-101-01012024-EN.docx  Federal Constitution  of the Swiss ConfederationSR-101-01012024-EN.docx  Federal Constitution  of the Swiss Confederation
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptx
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 

Tracking data lineage with neo4j and Linkurious

  • 1. Tracking data lineage with Neo4j and Linkurious. SAS founded in 2013 in Paris | http://linkurio.us | @linkurious
  • 2. French startup specialized in graph-visualization. CTO Web-scale archiving Université de Technologie de Compiègne CMO >5 years in consulting Sciences Po + Ecole de Guerre Economique Jean Villedieu Sébastien Heymann David Rapin CEO Created Gephi Phd in CS and complex systems from UPMC
  • 3. What is a graph? PERSON name: Séb age: 29 PERSON name: Jean age: 31 LOCATION name: Paris Lives in Lives in Knows
  • 4. A graph is a set of nodes and relationships. This is a node This is a relationship PERSON name: Séb age: 29 PERSON name: Jean age: 31 LOCATION name: Paris This is a property
  • 5. What is data lineage? “Data lineage is defined as a data life cycle that includes the data's origins and where it moves over time. It describes what happens to data as it goes through diverse processes” - Wikipedia
  • 6. A real-world data pipeline.
  • 7. Top 5 data lineage questions. 1. Where is this data coming from? 2. Who has access to that information? 3. Do we have sensitive data that’s being propagated unsafely? 4. Is my database still being used in an important company process or can I remove it? 5. What systems and reports would be impacted by a change in that particular process?
  • 8. Traditional databases are not adapted to data lineage. Hard to query Querying connected data through SQL is a hard and error-prone process. Slow Slow performances for questions requiring looking up multiple connections. Too rigid Hard to accommodate an evolving data model in a relational database.
  • 9. The cost of bad data lineage. ● A general lack of confidence in data; ● Potential legal exposure; ● Finding answers and making decisions becomes complex and time- consuming; ...it results in wasted time, money, opportunities, etc.
  • 10. Graph DBs are perfect for data lineage. ● Easy to model the flow of data in a graph; ● Query relationships with ease and in real-time; ● Adapt your schema to accommodate new data and relationships; ● Popularity of graph databases has increased 500% in the last 2 years and our partner Neo4j is the leader.
  • 11. Linkurious brings ability to find answers. ● Tech and business users can search the data lineage intuitively and find answers; ● Visualization brings ability to understand and communicate complex connections; ● Accelerate and improve decisions.
  • 12. Unique ability to store and analyse your data lineage. Neo4j Your data lineage is a large graph. Store and query it quickly with Neo4j. Linkurious Search and find answers easily through a visual interface.
  • 13. Metadata Process System Process Metadata ReportMetadata System System Example: a graph model for data lineage.
  • 14. Question #1: what’s the data lineage of this report? Our business people need to know what data was used to generate this month’s sales report. I need to understand which metadata, which systems and which processes were involved. IT Analyst
  • 15. Question #1: visualize the data lineage of a report. It only takes a few minutes to search a report and analyse its lineage. No need to be an expert!
  • 16. Question #2: what is this database used for? We’re relocating our datacenter and need to move a server on which a database is stored? Can we decommission it? I need to understand what processes and reports rely on this server. IT Analyst
  • 17. Question #2: visualize an impact analysis. We can visualize and inspect the complex set of relationships involved in the impact analysis.
  • 18. Conclusion. Contact us to discuss your projects at contact@linkurio.us