SlideShare a Scribd company logo
1 of 28
Graph Data Science Demo for
Fraud Analysis
Joe Depeau
Sr. Presales Consultant, UK
28th April, 2020
@joedepeau
http://linkedin.com/in/joedepeau
• Review of the Neo4j Graph Data Science Library
• Demo Data Overview
• Review of Graph Algorithms for Demo
• Demo
• Q&A
2
Agenda
Neo4j Graph Data Science
Library
3
4
Neo4j Graph Algorithms Overview
Pathfinding
& Search
Centrality /
Importance
Community
Detection
Link Prediction
Finds optimal paths
or evaluates route availability and
quality.
Determines the importance of
dis9nct nodes in the network.
Detects group clustering or
partition options.
Evaluates how alike nodes are
by neighbors and
relationships.
Estimates the likelihood of
nodes forming a future
relationship.
Similarity
Graph and ML algorithms in Neo4j
• Minimum Weight Spanning Tree
• Shortest Path
• Single Source Shortest Path
• All Pairs Shortest Path
• A*
• Yen’s K-shortest Paths
• Random Walk
• Breadth First Search
• Depth First Search
• Degree Centrality
• Closeness Centrality
• Betweenness Centrality
• PageRank
• ArticleRank
• Eigenvector Centrality
• Triangle Count / Clustering Coefficient
• Weakly Connected Components
• Strongly Connected Components
• Label PropagaJon
• Louvain Modularity
• K-1 Colouring
• Modularity OpJmisaJon
• Node Similarity
• Approximate Nearest Neighbours
• Cosine Similarity
• Euclidean Similarity
• Jaccard Similarity
• Overlap Similarity
• Pearson Similarity
Pathfinding
& Search
Centrality /
Importance
Community
Detec<on
Similarity
https://neo4j.com/docs/graph-data-science/1.0/
Link
Prediction
• Adamic Adar
• Common Neighbours
• PreferenJal ARachment
• Resource AllocaJons
• Same Community
• Total Neighbours
5
Demo Data Overview
6
7
Some Examples of Typical Bank Data
Event DataProduct and
Services Data
Customer DataOrganisational
Data
3rd Party Data
Documentation
Employee
Data
Processes
Systems and
Databases
KPIs and Reports
Address
Personal Data
Documents
Relationships
Assets
Documentation
Processes
Product / Service
Details
Product / Service
Hierarchy
Pricing
Money
Movements
Web / App Activity
Customer Contact
Social Media
Credit Rating
Agencies
Market Data
Organisational
Hierarchy
Corporate Data
8
Some Examples of Typical Bank Data
Event DataProduct and
Services Data
Customer DataOrganisational
Data
3rd Party Data
Documentation
Employee
Data
Processes
Systems and
Databases
KPIs and Reports
Address
Personal Data
Documents
Relationships
Assets
Documentation
Processes
Product / Service
Details
Product / Service
Hierarchy
Pricing
Money
Movements
Web / App Activity
Customer Contact
Social Media
Credit Rating
Agencies
Market Data
Organisational
Hierarchy
Corporate Data
9
Our Graph Model
Total Nodes : 3.4m
Total Relationships : 10.2m
10
Three ways a Client node can be Flagged
Performed a transaction flagged as fraud Share a SSN with another Client
Have more than one SSN on file
Graph Algorithms for
Demonstra>on
11
PageRank
What: Finds important nodes based
on their relationships.
Why: Identify important or
influential Client nodes by
quantifying the flows of money
towards them.
Uses:
- Fraud detection
- Anti-money Laundering
- Inform prioritization during analysis
and investigation12
13
The PageRank Algorithm
PageRank: what nodes can be considered
‘important’ in our graph based on money flows ?
14
The PageRank Algorithm
PageRank: what nodes can be considered
‘important’ in our graph based on money flows ?
Inputs
.pagerank
Property Output
Betweenness Centrality
15
What: The sum of the % shortest paths that
pass through a node, calculated by pairs.
Why: Identify well connected nodes within
the graph, or bridges between
communities.
Uses:
- Fraud detection
- Anti-money Laundering
- Inform prioritization during analysis and
investigation
16
The Betweenness Centrality Algorithm
Betweenness Centrality: what nodes can be
considered ‘important’ in our graph based on
how many ‘shortest paths’ they are present in?
17
The Betweenness Centrality Algorithm
Betweenness Centrality: what nodes can be
considered ‘important’ in our graph based on
how many ‘shortest paths’ they are present in?
Inputs
.centrality
Property Output
Weakly Connected Components
What: Finds disconnected
community subgraphs in our data.
Why: Identify communities based
on connections with shared pieces
of identity.
Uses:
- Householding
- Synthetic identities
- Stolen identities
18
19
The Weakly Connected Components Algorithm
Weakly Connected
Components: what
communities exist in the
data based on connections
to pieces of identity ?
20
The Weakly Connected Components Algorithm
Weakly Connected
Components: what
communiFes exist in the
data based on connecFons
to pieces of idenFty ?
.component_id
Property Output
Inputs
Louvain Modularity
What: Finds communi?es in our
graph who are connected. Can
return intermediate results.
Why: Useful for iden?fying
communi?es based on transac?on
behaviour rather than iden?ty.
Uses:
- Fraud ring detecHon
- AnH-money Laundering
21
22
The Louvain Algorithm
Louvain: what communiFes of nodes transact
amongst themselves ?
23
The Louvain Algorithm
Louvain: what communities of nodes transact
amongst themselves ?
Inputs
.louvain_community
Property Output
Node Similarity
What: Similarity between nodes
based on neighbours. Writes a new
relationship to the graph.
Why: Identify similar nodes who
share common pieces of identity.
Uses:
- Entity Resolution
- Synthetic identities
- Stolen identities
24
25
The Node Similarity Algorithm
Node Similarity : how
similar are two Client
nodes based on the non-
Merchant accounts they
transact with ?
26
The Node Similarity Algorithm
Node Similarity : how
similar are two Client
nodes based on the non-
Merchant accounts they
transact with ?
SIMILAR Relationship Output
with .score property
Inputs
Demo
27
28
Thank you!

More Related Content

What's hot

The Neo4j Data Platform for Today & Tomorrow.pdf
The Neo4j Data Platform for Today & Tomorrow.pdfThe Neo4j Data Platform for Today & Tomorrow.pdf
The Neo4j Data Platform for Today & Tomorrow.pdf
Neo4j
 
Introduction to Neo4j for the Emirates & Bahrain
Introduction to Neo4j for the Emirates & BahrainIntroduction to Neo4j for the Emirates & Bahrain
Introduction to Neo4j for the Emirates & Bahrain
Neo4j
 
Maps and Meaning: Graph-based Entity Resolution in Apache Spark & GraphX
Maps and Meaning: Graph-based Entity Resolution in Apache Spark & GraphXMaps and Meaning: Graph-based Entity Resolution in Apache Spark & GraphX
Maps and Meaning: Graph-based Entity Resolution in Apache Spark & GraphX
Databricks
 

What's hot (20)

How Kafka Powers the World's Most Popular Vector Database System with Charles...
How Kafka Powers the World's Most Popular Vector Database System with Charles...How Kafka Powers the World's Most Popular Vector Database System with Charles...
How Kafka Powers the World's Most Popular Vector Database System with Charles...
 
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceGet Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
 
Neo4j GraphSummit London - The Path To Success With Graph Database and Data S...
Neo4j GraphSummit London - The Path To Success With Graph Database and Data S...Neo4j GraphSummit London - The Path To Success With Graph Database and Data S...
Neo4j GraphSummit London - The Path To Success With Graph Database and Data S...
 
Supply Chain Twin Demo - Companion Deck
Supply Chain Twin Demo - Companion DeckSupply Chain Twin Demo - Companion Deck
Supply Chain Twin Demo - Companion Deck
 
Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...
Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...
Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...
 
SplunkLive! München - Flughafen München
SplunkLive! München - Flughafen MünchenSplunkLive! München - Flughafen München
SplunkLive! München - Flughafen München
 
The Neo4j Data Platform for Today & Tomorrow.pdf
The Neo4j Data Platform for Today & Tomorrow.pdfThe Neo4j Data Platform for Today & Tomorrow.pdf
The Neo4j Data Platform for Today & Tomorrow.pdf
 
Knowledge Graphs Overview
Knowledge Graphs OverviewKnowledge Graphs Overview
Knowledge Graphs Overview
 
Top 10 Cypher Tuning Tips & Tricks
Top 10 Cypher Tuning Tips & TricksTop 10 Cypher Tuning Tips & Tricks
Top 10 Cypher Tuning Tips & Tricks
 
Bertelsmann: BeTrend – Building a Trend Aggregation Machine.pdf
Bertelsmann: BeTrend – Building a Trend Aggregation Machine.pdfBertelsmann: BeTrend – Building a Trend Aggregation Machine.pdf
Bertelsmann: BeTrend – Building a Trend Aggregation Machine.pdf
 
The Path To Success With Graph Database and Analytics
The Path To Success With Graph Database and AnalyticsThe Path To Success With Graph Database and Analytics
The Path To Success With Graph Database and Analytics
 
Demystifying Graph Neural Networks
Demystifying Graph Neural NetworksDemystifying Graph Neural Networks
Demystifying Graph Neural Networks
 
Introduction to Neo4j for the Emirates & Bahrain
Introduction to Neo4j for the Emirates & BahrainIntroduction to Neo4j for the Emirates & Bahrain
Introduction to Neo4j for the Emirates & Bahrain
 
Observability for Data Pipelines With OpenLineage
Observability for Data Pipelines With OpenLineageObservability for Data Pipelines With OpenLineage
Observability for Data Pipelines With OpenLineage
 
Applying Network Analytics in KYC
Applying Network Analytics in KYCApplying Network Analytics in KYC
Applying Network Analytics in KYC
 
Maps and Meaning: Graph-based Entity Resolution in Apache Spark & GraphX
Maps and Meaning: Graph-based Entity Resolution in Apache Spark & GraphXMaps and Meaning: Graph-based Entity Resolution in Apache Spark & GraphX
Maps and Meaning: Graph-based Entity Resolution in Apache Spark & GraphX
 
Encrypting and Protecting Your Data in Neo4j(Jeff_Tallman).pptx
Encrypting and Protecting Your Data in Neo4j(Jeff_Tallman).pptxEncrypting and Protecting Your Data in Neo4j(Jeff_Tallman).pptx
Encrypting and Protecting Your Data in Neo4j(Jeff_Tallman).pptx
 
Natural Language Processing with Graph Databases and Neo4j
Natural Language Processing with Graph Databases and Neo4jNatural Language Processing with Graph Databases and Neo4j
Natural Language Processing with Graph Databases and Neo4j
 
Workshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data ScienceWorkshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data Science
 
Druid + Kafka: transform your data-in-motion to analytics-in-motion | Gian Me...
Druid + Kafka: transform your data-in-motion to analytics-in-motion | Gian Me...Druid + Kafka: transform your data-in-motion to analytics-in-motion | Gian Me...
Druid + Kafka: transform your data-in-motion to analytics-in-motion | Gian Me...
 

Similar to Graph Data Science DEMO for fraud analysis

BIA 658 – Social Network Analysis - Final report Kanad Chatterjee
BIA 658 – Social Network Analysis - Final report Kanad ChatterjeeBIA 658 – Social Network Analysis - Final report Kanad Chatterjee
BIA 658 – Social Network Analysis - Final report Kanad Chatterjee
Kanad Chatterjee
 
Graphs for Finance - AML with Neo4j Graph Data Science
Graphs for Finance - AML with Neo4j Graph Data Science Graphs for Finance - AML with Neo4j Graph Data Science
Graphs for Finance - AML with Neo4j Graph Data Science
Neo4j
 
992 sms10 social_media_services
992 sms10 social_media_services992 sms10 social_media_services
992 sms10 social_media_services
siyaza
 

Similar to Graph Data Science DEMO for fraud analysis (20)

GraphTour London 2020 - Graphs for AI, Amy Hodler
GraphTour London 2020  - Graphs for AI, Amy HodlerGraphTour London 2020  - Graphs for AI, Amy Hodler
GraphTour London 2020 - Graphs for AI, Amy Hodler
 
Thwart Fraud Using Graph-Enhanced Machine Learning and AI
Thwart Fraud Using Graph-Enhanced Machine Learning and AIThwart Fraud Using Graph-Enhanced Machine Learning and AI
Thwart Fraud Using Graph-Enhanced Machine Learning and AI
 
Big social data analytics - social network analysis
Big social data analytics - social network analysis Big social data analytics - social network analysis
Big social data analytics - social network analysis
 
Graph Data Science Training - Alicia Frame Presentation
Graph Data Science Training - Alicia Frame PresentationGraph Data Science Training - Alicia Frame Presentation
Graph Data Science Training - Alicia Frame Presentation
 
Detecting eCommerce Fraud with Neo4j and Linkurious
Detecting eCommerce Fraud with Neo4j and LinkuriousDetecting eCommerce Fraud with Neo4j and Linkurious
Detecting eCommerce Fraud with Neo4j and Linkurious
 
BIA 658 – Social Network Analysis - Final report Kanad Chatterjee
BIA 658 – Social Network Analysis - Final report Kanad ChatterjeeBIA 658 – Social Network Analysis - Final report Kanad Chatterjee
BIA 658 – Social Network Analysis - Final report Kanad Chatterjee
 
Improving Machine Learning using Graph Algorithms
Improving Machine Learning using Graph AlgorithmsImproving Machine Learning using Graph Algorithms
Improving Machine Learning using Graph Algorithms
 
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
 
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4jAI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
AI, ML and Graph Algorithms: Real Life Use Cases with Neo4j
 
Graph Algorithms for Developers
Graph Algorithms for DevelopersGraph Algorithms for Developers
Graph Algorithms for Developers
 
Graphs for Finance - AML with Neo4j Graph Data Science
Graphs for Finance - AML with Neo4j Graph Data Science Graphs for Finance - AML with Neo4j Graph Data Science
Graphs for Finance - AML with Neo4j Graph Data Science
 
Leveraging Graphs for AI and ML - Alicia Frame, Neo4j
Leveraging Graphs for AI and ML - Alicia Frame, Neo4jLeveraging Graphs for AI and ML - Alicia Frame, Neo4j
Leveraging Graphs for AI and ML - Alicia Frame, Neo4j
 
3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning
 
SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
 
Identical Users in Different Social Media Provides Uniform Network Structure ...
Identical Users in Different Social Media Provides Uniform Network Structure ...Identical Users in Different Social Media Provides Uniform Network Structure ...
Identical Users in Different Social Media Provides Uniform Network Structure ...
 
Social Friend Overlying Communities Based on Social Network Context
Social Friend Overlying Communities Based on Social Network ContextSocial Friend Overlying Communities Based on Social Network Context
Social Friend Overlying Communities Based on Social Network Context
 
Web Mining .ppt
Web Mining .pptWeb Mining .ppt
Web Mining .ppt
 
Web Mining .ppt
Web Mining .pptWeb Mining .ppt
Web Mining .ppt
 
Second review
Second reviewSecond review
Second review
 
992 sms10 social_media_services
992 sms10 social_media_services992 sms10 social_media_services
992 sms10 social_media_services
 

More from Neo4j

More from Neo4j (20)

From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptxFrom Knowledge Graphs via Lego Bricks to scientific conversations.pptx
From Knowledge Graphs via Lego Bricks to scientific conversations.pptx
 
Novo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNovo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMs
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG time
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge Graphs
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

Graph Data Science DEMO for fraud analysis

  • 1. Graph Data Science Demo for Fraud Analysis Joe Depeau Sr. Presales Consultant, UK 28th April, 2020 @joedepeau http://linkedin.com/in/joedepeau
  • 2. • Review of the Neo4j Graph Data Science Library • Demo Data Overview • Review of Graph Algorithms for Demo • Demo • Q&A 2 Agenda
  • 3. Neo4j Graph Data Science Library 3
  • 4. 4 Neo4j Graph Algorithms Overview Pathfinding & Search Centrality / Importance Community Detection Link Prediction Finds optimal paths or evaluates route availability and quality. Determines the importance of dis9nct nodes in the network. Detects group clustering or partition options. Evaluates how alike nodes are by neighbors and relationships. Estimates the likelihood of nodes forming a future relationship. Similarity
  • 5. Graph and ML algorithms in Neo4j • Minimum Weight Spanning Tree • Shortest Path • Single Source Shortest Path • All Pairs Shortest Path • A* • Yen’s K-shortest Paths • Random Walk • Breadth First Search • Depth First Search • Degree Centrality • Closeness Centrality • Betweenness Centrality • PageRank • ArticleRank • Eigenvector Centrality • Triangle Count / Clustering Coefficient • Weakly Connected Components • Strongly Connected Components • Label PropagaJon • Louvain Modularity • K-1 Colouring • Modularity OpJmisaJon • Node Similarity • Approximate Nearest Neighbours • Cosine Similarity • Euclidean Similarity • Jaccard Similarity • Overlap Similarity • Pearson Similarity Pathfinding & Search Centrality / Importance Community Detec<on Similarity https://neo4j.com/docs/graph-data-science/1.0/ Link Prediction • Adamic Adar • Common Neighbours • PreferenJal ARachment • Resource AllocaJons • Same Community • Total Neighbours 5
  • 7. 7 Some Examples of Typical Bank Data Event DataProduct and Services Data Customer DataOrganisational Data 3rd Party Data Documentation Employee Data Processes Systems and Databases KPIs and Reports Address Personal Data Documents Relationships Assets Documentation Processes Product / Service Details Product / Service Hierarchy Pricing Money Movements Web / App Activity Customer Contact Social Media Credit Rating Agencies Market Data Organisational Hierarchy Corporate Data
  • 8. 8 Some Examples of Typical Bank Data Event DataProduct and Services Data Customer DataOrganisational Data 3rd Party Data Documentation Employee Data Processes Systems and Databases KPIs and Reports Address Personal Data Documents Relationships Assets Documentation Processes Product / Service Details Product / Service Hierarchy Pricing Money Movements Web / App Activity Customer Contact Social Media Credit Rating Agencies Market Data Organisational Hierarchy Corporate Data
  • 9. 9 Our Graph Model Total Nodes : 3.4m Total Relationships : 10.2m
  • 10. 10 Three ways a Client node can be Flagged Performed a transaction flagged as fraud Share a SSN with another Client Have more than one SSN on file
  • 12. PageRank What: Finds important nodes based on their relationships. Why: Identify important or influential Client nodes by quantifying the flows of money towards them. Uses: - Fraud detection - Anti-money Laundering - Inform prioritization during analysis and investigation12
  • 13. 13 The PageRank Algorithm PageRank: what nodes can be considered ‘important’ in our graph based on money flows ?
  • 14. 14 The PageRank Algorithm PageRank: what nodes can be considered ‘important’ in our graph based on money flows ? Inputs .pagerank Property Output
  • 15. Betweenness Centrality 15 What: The sum of the % shortest paths that pass through a node, calculated by pairs. Why: Identify well connected nodes within the graph, or bridges between communities. Uses: - Fraud detection - Anti-money Laundering - Inform prioritization during analysis and investigation
  • 16. 16 The Betweenness Centrality Algorithm Betweenness Centrality: what nodes can be considered ‘important’ in our graph based on how many ‘shortest paths’ they are present in?
  • 17. 17 The Betweenness Centrality Algorithm Betweenness Centrality: what nodes can be considered ‘important’ in our graph based on how many ‘shortest paths’ they are present in? Inputs .centrality Property Output
  • 18. Weakly Connected Components What: Finds disconnected community subgraphs in our data. Why: Identify communities based on connections with shared pieces of identity. Uses: - Householding - Synthetic identities - Stolen identities 18
  • 19. 19 The Weakly Connected Components Algorithm Weakly Connected Components: what communities exist in the data based on connections to pieces of identity ?
  • 20. 20 The Weakly Connected Components Algorithm Weakly Connected Components: what communiFes exist in the data based on connecFons to pieces of idenFty ? .component_id Property Output Inputs
  • 21. Louvain Modularity What: Finds communi?es in our graph who are connected. Can return intermediate results. Why: Useful for iden?fying communi?es based on transac?on behaviour rather than iden?ty. Uses: - Fraud ring detecHon - AnH-money Laundering 21
  • 22. 22 The Louvain Algorithm Louvain: what communiFes of nodes transact amongst themselves ?
  • 23. 23 The Louvain Algorithm Louvain: what communities of nodes transact amongst themselves ? Inputs .louvain_community Property Output
  • 24. Node Similarity What: Similarity between nodes based on neighbours. Writes a new relationship to the graph. Why: Identify similar nodes who share common pieces of identity. Uses: - Entity Resolution - Synthetic identities - Stolen identities 24
  • 25. 25 The Node Similarity Algorithm Node Similarity : how similar are two Client nodes based on the non- Merchant accounts they transact with ?
  • 26. 26 The Node Similarity Algorithm Node Similarity : how similar are two Client nodes based on the non- Merchant accounts they transact with ? SIMILAR Relationship Output with .score property Inputs