The newly released Neo4j Connector for Apache Spark can be used to read and write data between the two systems.
In this demo I show how to use the investigative Data from the FinCEN files to have a full pipeline up an running.
Notebook is in https://github.com/jexp/fincen
3. Connections in Data are as
Valuable as the Data Itself
Networks of People Transaction Networks Knowledge Networks
E.g., Risk management, Supply
chain, Payments
E.g., Employees, Customers,
Suppliers, Partners,
Influencers
E.g., Enterprise content,
Domain specific content,
eCommerce content
5. 5
The FinCEN files?
obtained and published Suspicious Activity Reports (SARs)
submitted by global financial institutions to the
6. ● Insider trading
● Transactions linked to money laundering,
terrorism financing or other crimes.
● Odd dealings, also involving shell companies
● Transactions by individuals known or suspected to have links to
criminal or terrorist organizations, or on sanction lists
These activities are required to be reported WITHIN 30-60 days.
In the FinCEN files - this is rarely the case.
6
Suspicious Activity
Reports
7. Property Graph - Simply Powerful
Entity
Sender
Entity
Receiver
Nodes represent
objects (nouns)
Relationships are directional
Relationships connect nodes
are represent actions (verbs)
Relationships can have
properties (name/value pairs)
Nodes can have
properties
(name/value pairs)
name: A Global Bank
address: New York City
amount: 1208209000
date: 2015-03-01
name: A Shady Bank
address: n/a
:TRANSFERRED
8. Analytics
Tooling
Graph Transactions
Data Integration
Dev.
& Admin
Drivers & APIs Discovery & Visualization
Graph Analytics &
Data Science
Developers
Admins
Applications Business Users
Data Analysts
Data Scientists
Native Graph Technology for Applications & Analytics