Using Neo4j and Machine Learning to Create a Decision Engine, CluedIn
1. Taking the "Magic" Out of Machine Learning
Building a decision engine with Neo4j and Machine Learning Techniques
2. Tim Ward
Engineer at CluedIn
@jerrong / tiw@cluedin.com
Using Neo4j for 5+ years
Started on 1.6
WHO AM I?
3. WHAT DO WE DO?
We help our customers achieve the connected enterprise
The average company uses 30+ SaaS tools
We connect them, and the data in them, automatically (SaaS)
8. DISCOVERED
The thing you learn through all of this is that machine
learning techniques are good at solving certain problems,
but they are not a magic bullet for all problems.
9. THE SIMPLE IDEA
Have a weighted decision engine that can persist
Have the ability to fork graph decisions asynchronously
Does not need to be super fast or real-time
Get something from nothing
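As a minimal sketch of what such a weighted decision engine might look like (all names, weights, and thresholds here are illustrative assumptions, not CluedIn's actual implementation):

```python
# Hypothetical sketch of a weighted decision engine: each piece of
# evidence contributes a small weight toward the decision that two
# records refer to the same entity. Decisions are plain objects that
# can be persisted and re-evaluated later (no real-time requirement).
from dataclasses import dataclass, field

@dataclass
class Decision:
    subject: str                                  # e.g. "crm:person/42 == erp:contact/7"
    weights: list = field(default_factory=list)   # (evidence, weight) pairs

    def add_evidence(self, evidence: str, weight: float) -> None:
        self.weights.append((evidence, weight))

    @property
    def confidence(self) -> float:
        # Naive capped sum; a real engine would combine weights
        # statistically rather than just adding them up.
        return min(1.0, sum(w for _, w in self.weights))

d = Decision("crm:person/42 == erp:contact/7")
d.add_evidence("same email domain", 0.02)
d.add_evidence("same phone number", 0.45)
d.add_evidence("shared employer node in graph", 0.30)
print(round(d.confidence, 2))  # 0.77 -> borderline, worth surfacing to a human
```

Because each decision keeps its evidence list, it can be re-scored or forked later when a new clue arrives, which is the "does not need to be real-time" part of the idea.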
10. WHY
To separate the noise from valuable data
To reverse-engineer how two things are related
To connect the enterprise... automatically
11. THE SIMPLE APPROACHES CAN GET YOU VERY FAR!
We combine the best parts of the graph with the backing of
a neural network to learn from its decisions.
Pattern matching combined with statistical models.
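One way to read "pattern matching combined with statistical models" is sketched below (an illustrative toy, not the actual CluedIn pipeline): a structural signal from the graph (shared neighbours) is blended with a statistical signal (string similarity) into a single matching score.

```python
# Illustrative sketch: combine a graph signal (shared neighbours in a
# toy adjacency map) with a statistical signal (string similarity)
# into one score. Names, data, and the 50/50 weighting are assumptions.
from difflib import SequenceMatcher

graph = {
    "tim ward":  {"CluedIn", "Neo4j", "Copenhagen"},
    "t. ward":   {"CluedIn", "Neo4j"},
    "tina wong": {"AcmeCorp"},
}

def shared_neighbour_score(a: str, b: str) -> float:
    na, nb = graph[a], graph[b]
    return len(na & nb) / len(na | nb)   # Jaccard overlap of neighbours

def name_similarity(a: str, b: str) -> float:
    return SequenceMatcher(None, a, b).ratio()

def match_score(a: str, b: str) -> float:
    # Equal weighting of the two signals is an arbitrary choice here.
    return 0.5 * shared_neighbour_score(a, b) + 0.5 * name_similarity(a, b)

print(match_score("tim ward", "t. ward") > match_score("tim ward", "tina wong"))  # True
```

The point of the combination: either signal alone can be fooled (similar names for different people, or coincidental shared neighbours), but agreement between the structural and statistical views adds much more weight.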
14. PRE-PROCESSING PIPELINE
15. Martin Hyldahl, CTO
“The graph is the new secret in machine learning as most
models are dots on a chart or rows in a model. Besides
clustering algorithms there are not a lot of algorithms
where the dots are related in a strong and meaningful way.
Although this typically requires a lot more processing, we
found that this tapers off over time. The pre-processing
that we do to get data into a connected graph before we
make the decision tree allows our engine to be statistically
correct more than any known approach today.”
27. WHY FOR THE BUSINESS?
Talk to Amalie (ale@cluedin.com)
Right to be forgotten
Data Privacy Act
cluedin.com/sales
Editor's Notes
Important that you know a little bit about what we do, to understand why we built the solution we are about to show you.
Why? Because when you have something, you know the results will be great.
TensorFlow started using a graph backend recently.
Optimization Process.
The output is fed into a data model for a neural net.
All of these steps add a preliminary weight, e.g. 2%, 0.8%, etc.
Some things are less volatile, e.g. a company doesn't change its industry often and people don't change their email often, but the relationships to those objects do change: moving jobs, changing tasks, changing positions.
Where does the machine learning part come in? It comes in two ways.
Manual observations are placed on the graph through human interaction, i.e. decisions that hover between 70-90% confidence are surfaced to the user to self-correct. Once again, this only adds a weight; it doesn't make the decision right. Clustering algorithms like k-means allow you to add observations, much as you might predict "if I toss two coins, I think both will be heads". The observations we make are simply: which features of these two things led to the decision that they are the same thing? An observation of "something else" is also an observation (it tells our decision tree to try new things), e.g. "I know this is a TV show that John likes, but it isn't a photo of him".
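The surfacing rule described in this note might be sketched as follows (the 70-90% band comes from the note; the routing names and the observation weight are illustrative assumptions): mid-confidence decisions are queued for a person, and the human answer is recorded as one more weighted observation rather than a hard override.

```python
# Sketch of the human-in-the-loop rule: decisions between 0.70 and
# 0.90 confidence go to a human; the answer adds a weight, it does
# not force the decision. HUMAN_WEIGHT is an illustrative value.
REVIEW_LOW, REVIEW_HIGH = 0.70, 0.90
HUMAN_WEIGHT = 0.15  # a human answer nudges the score, it doesn't decide it

def route(confidence: float) -> str:
    if confidence >= REVIEW_HIGH:
        return "accept"
    if confidence >= REVIEW_LOW:
        return "ask-human"
    return "reject"

def apply_observation(confidence: float, human_says_same: bool) -> float:
    delta = HUMAN_WEIGHT if human_says_same else -HUMAN_WEIGHT
    return max(0.0, min(1.0, confidence + delta))

print(route(0.95))                                 # accept
print(route(0.78))                                 # ask-human
print(round(apply_observation(0.78, True), 2))     # 0.93
```

Note that even after a positive human observation the score only moves toward certainty; a "something else" answer would simply be a negative-weight observation in the same mechanism.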
The solution is to do as much as possible in memory, which requires you to build a graph model, and that takes a little time. The good thing is that we don't need this to be real-time.
e.g. imagine someone sitting next to you saying, "The sky is green, no wait, it's red, wait, it's black, no, got it now, it's definitely blue." You would rather have someone say, "The sky is blue, it is a well-qualified fact, I am 100% sure of it, but I took two seconds more to tell you."
So all the graph work in the demo is done in memory and then persisted to the graph. i.e. have you ever left state in your brain until, two hours later, you get an extra clue, it loads back into main memory, and you answer it?
Imagine plugging this engine into your enterprise, this is what we do e.g. Your CRM.
There is no doubt that we miss so much at work. You have experienced the wow moment of the default Neo4j Movie sample when you run your first ShortestPath query; now you get to apply this at work. All of this is to make sure the right graph is built up. With the Panama Papers example, if you had a weak link in the chain, it would have led to some weird results.
A reverse-engineering machine, i.e. given a problem and a solution: how did I get there?