SlideShare a Scribd company logo
1 of 18
Download to read offline
© 2021, Amazon Web Services, Inc. or its Affiliates.
Graph Abstractions Matter
(a Personal View)
Dr. Ora Lassila
Principal Technologist
Amazon Neptune
© 2021, Amazon Web Services, Inc. or its Affiliates.
Game plan: 1. Some graph history
2. Graph abstractions, queries
and access patterns
3. Graph as an abstraction:
“One graph to rule them all”
4. The way forward
© 2021, Amazon Web Services, Inc. or its Affiliates.
A brief history of graphs and ontologies
1730s: Graph theory (Euler)
1870s: Library classification (Dewey)
1900: Semantics, ontology and logic (Husserl)
1950s and onwards: Graphs as the essential
underpinning of computer science
1960s-1970s: Network databases (CODASYL), semantic
networks (Quillian et al)
1970s-1990s: Predicate logic as the foundation of
Knowledge Representation (Hayes et al)
1997 and onwards: The Semantic Web, RDF, OWL, etc. (Lassila et al)
Today: Modern knowledge graphs and graph databases
3rd Century BCE: Categories
& logic (Aristotle)
1730s: Taxonomical classification of plants and
animals (Linnaeus)
1960s: Social networks, “small-world experiment”,
Erdős number (Milgram et al)
© 2021, Amazon Web Services, Inc. or its Affiliates.
1730s: Graph theory (Euler)
1870s: Library classification (Dewey)
1900: Semantics, ontology and logic (Husserl)
1950s and onwards: Graphs as the essential
underpinning of computer science
1960s-1970s: Network databases (CODASYL), semantic
networks (Quillian et al)
1970s-1990s: Predicate logic as the foundation of
Knowledge Representation (Hayes et al)
Today: Modern knowledge graphs and graph databases
3rd Century BCE: Categories
& logic (Aristotle)
1730s: Taxonomical classification of plants and
animals (Linnaeus)
1960s: Social networks, “small-world experiment”,
Erdős number (Milgram et al)
A brief history of graphs and ontologies
1997 and onwards: The Semantic Web, RDF, OWL, etc. (Lassila et al)
Despite all this history,
programming with graphs and
ontologies is still cumbersome
© 2021, Amazon Web Services, Inc. or its Affiliates.
RDF Labeled property graphs
Features:
• vertices and edge types: identifiers
• graphs decompose into “triples”
• scalar data types from XSD, values
are special vertices
• no composite data types; objects
use graph structure
Query languages:
• SPARQL (declarative)
Features:
• vertices and edges: objects
• no common identifier scheme
• scalar and composite data types
borrowed from implementation
languages
Query languages:
• Gremlin (imperative)
• Cypher (declarative)
© 2021, Amazon Web Services, Inc. or its Affiliates.
Graph abstractions
Triples are a very low-level abstraction of graphs
• akin to assembly language
Mapping graphs to trees does not necessarily work
• metamodel mismatch
• RDF/XML and JSON-LD as warning examples
• (also: syntax does not matter!)
“Object-graph mapping”…?
• how to determine what constitutes an object?
• in RDF, blank nodes can be difficult to handle
© 2021, Amazon Web Services, Inc. or its Affiliates.
Graph queries (or: “how do I get there from here?”)
Traversal is an excellent graph access paradigm
• “where can I go from here given this path pattern?”
• “are these two nodes connected, and how?”
• using path patterns for traversal is akin to pattern matching [Lassila 2007]
SPARQL is a ”very big hammer” for many access problems
We need simple APIs:
• search, faceted search (filtering)
• “give me everything you know about this node” (cf. object-graph mapping)
• traversal
© 2021, Amazon Web Services, Inc. or its Affiliates.
Models and ontologies
Problems:
1. LPGs (by and large) do not have schema languages
2. RDF has too many schema languages (RDFS, OWL, SHACL)
3. how to map models to code?
What does mapping your data to a shared ontology “buy” you?
Can we support ontologies with predefined libraries?
• ”ontology engines”
© 2021, Amazon Web Services, Inc. or its Affiliates.
What about reasoning?
Symbolic reasoning:
• generative reasoning can be easily hidden from the application/client
• restrictions as a means of data validation (SHACL rather than OWL)
Non-symbolic reasoning:
• ML techniques can be used much like symbolic reasoning
We need an expanded view of reasoning, and we need to rethink what an
“application” is:
Logic should be associated with data
and models, not with an application
© 2021, Amazon Web Services, Inc. or its Affiliates.
Graphs as a logical abstraction
Modern enterprise data practice is messy
Data silos are commonplace
Data integration often happens via
bespoke and ad hoc solutions
We need a unifying logical view of data (a
lingua franca for data integration) or risk
ongoing and ever-increasing
fragmentation
Until some better option shows up, RDF
and OWL can provide this view
“Tower of Babel”, Pieter Brueghel the Elder, 1563; Kunsthistorisches Museum, Wien
We are here
© 2021, Amazon Web Services, Inc. or its Affiliates.
RDF vs. labeled property graphs…?
Confuses users (when given the choice)
Should not matter
The real issue may in fact be
Graph as a logical representation vs. graph as a data structure
The practical problem is still that we have two separate ecosystems
© 2021, Amazon Web Services, Inc. or its Affiliates.
RDF Labeled property graphs
Pros:
• W3C standard, well established
• easy to use external data sources
• schema language(s)
• formal semantics for graph merging
• supports reasoning
Cons:
• often considered “too academic”
• no easy “on-ramp”
Pros:
• intuitive for software developers
• integrates well with programming
languages (esp. Gremlin)
Cons:
• no standard, many divergent
(proprietary) implementations
• no schema language
• no formal semantics
© 2021, Amazon Web Services, Inc. or its Affiliates.
RDF Labeled property graphs
Pros:
• W3C standard, well established
• easy to use external data sources
• schema language(s)
• formal semantics for graph merging
• supports reasoning
Pros:
• intuitive for software developers
• integrates well with programming
languages (esp. Gremlin)
We want both!
© 2021, Amazon Web Services, Inc. or its Affiliates.
Graph as an abstraction
What we really want is a unifying metamodel that covers both RDF and LPGs
• helps adoption à important for the graph industry
• gives users more choices (e.g., Gremlin over RDF)
We are working on this (codename “One Graph”)
This is not an easy undertaking, however…
© 2021, Amazon Web Services, Inc. or its Affiliates.
Some challenges in unifying RDF and LPGs
RDF and RDF-star
semantics
• triples are unique, but LPG edges are not
• (“old style” reification does not have this problem)
Lack of formal
semantics for LPGs
• difficult to determine “correct” semantics
• also: diversity of scalar datatypes
Graph partitioning
• no named graphs in current LPGs
• in RDF vertices are just identifiers (no structure), but
in LPGs vertices are structured objects
Update semantics
• deleting non-unique triples: what happens?
• cascading delete for edge properties?
© 2021, Amazon Web Services, Inc. or its Affiliates.
The way forward (or: what are the real pain points?)
How do I write software that leverages a
knowledge graph?
• bridging the “graph ←→ code -gap”
• low-level interfaces (lots of boilerplate
code, etc.)
I mapped my data to an ontology… now what?
(or: what is “ontology support”…?)
• role of reasoning
• “hiding” reasoning works well
• transitive taxonomies & owl:sameAs
(including inverse functional properties)
New interfaces
and abstractions
Libraries, “ontology
engine(s)”…?
© 2021, Amazon Web Services, Inc. or its Affiliates.
The way forward (or: what are the real pain points?)
What about property graphs, Gremlin, …?
• choosing between RDF and property graph
models is hard and leads to confusion
• RDF-star to the rescue?
Is there a way out of the silos that does not
involve never-ending bespoke integrations?
• modern tools do not offer a single unified
view of all data
Graphs as a unifying
logical view
“One Graph”
unified metamodel
Lassila et al: “Graph? Yes! Which one?
Help!”, SCG2021 @ Semantics 2021
(arXiv:2110.13348)
© 2021, Amazon Web Services, Inc. or its Affiliates.
Questions?
Contact: ora@amazon.com
More info: https://aws.amazon.com/neptune/knowledge-graphs-on-aws/
https://arxiv.org/abs/2110.13348

More Related Content

What's hot

Elasticsearch From the Bottom Up
Elasticsearch From the Bottom UpElasticsearch From the Bottom Up
Elasticsearch From the Bottom Upfoundsearch
 
On-boarding with JanusGraph Performance
On-boarding with JanusGraph PerformanceOn-boarding with JanusGraph Performance
On-boarding with JanusGraph PerformanceChin Huang
 
1. introduction to no sql
1. introduction to no sql1. introduction to no sql
1. introduction to no sqlAnuja Gunale
 
How to Choose The Right Database on AWS - Berlin Summit - 2019
How to Choose The Right Database on AWS - Berlin Summit - 2019How to Choose The Right Database on AWS - Berlin Summit - 2019
How to Choose The Right Database on AWS - Berlin Summit - 2019Randall Hunt
 
Introduction to column oriented databases
Introduction to column oriented databasesIntroduction to column oriented databases
Introduction to column oriented databasesArangoDB Database
 
Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudDataWorks Summit
 
Spark Interview Questions and Answers | Apache Spark Interview Questions | Sp...
Spark Interview Questions and Answers | Apache Spark Interview Questions | Sp...Spark Interview Questions and Answers | Apache Spark Interview Questions | Sp...
Spark Interview Questions and Answers | Apache Spark Interview Questions | Sp...Edureka!
 
Introduction to elasticsearch
Introduction to elasticsearchIntroduction to elasticsearch
Introduction to elasticsearchpmanvi
 
Oracle to Postgres Schema Migration Hustle
Oracle to Postgres Schema Migration HustleOracle to Postgres Schema Migration Hustle
Oracle to Postgres Schema Migration HustleEDB
 
stackconf 2021 | Weaviate Vector Search Engine – Introduction
stackconf 2021 | Weaviate Vector Search Engine – Introductionstackconf 2021 | Weaviate Vector Search Engine – Introduction
stackconf 2021 | Weaviate Vector Search Engine – IntroductionNETWAYS
 
ACID ORC, Iceberg, and Delta Lake—An Overview of Table Formats for Large Scal...
ACID ORC, Iceberg, and Delta Lake—An Overview of Table Formats for Large Scal...ACID ORC, Iceberg, and Delta Lake—An Overview of Table Formats for Large Scal...
ACID ORC, Iceberg, and Delta Lake—An Overview of Table Formats for Large Scal...Databricks
 
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data ScienceAI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data ScienceOptum
 
How To Become A Big Data Engineer? Edureka
How To Become A Big Data Engineer? EdurekaHow To Become A Big Data Engineer? Edureka
How To Become A Big Data Engineer? EdurekaEdureka!
 
GraphConnect 2022 - Top 10 Cypher Tuning Tips & Tricks.pptx
GraphConnect 2022 - Top 10 Cypher Tuning Tips & Tricks.pptxGraphConnect 2022 - Top 10 Cypher Tuning Tips & Tricks.pptx
GraphConnect 2022 - Top 10 Cypher Tuning Tips & Tricks.pptxjexp
 
From distributed caches to in-memory data grids
From distributed caches to in-memory data gridsFrom distributed caches to in-memory data grids
From distributed caches to in-memory data gridsMax Alexejev
 
An Intro to NoSQL Databases
An Intro to NoSQL DatabasesAn Intro to NoSQL Databases
An Intro to NoSQL DatabasesRajith Pemabandu
 

What's hot (20)

Elasticsearch
ElasticsearchElasticsearch
Elasticsearch
 
Elasticsearch From the Bottom Up
Elasticsearch From the Bottom UpElasticsearch From the Bottom Up
Elasticsearch From the Bottom Up
 
On-boarding with JanusGraph Performance
On-boarding with JanusGraph PerformanceOn-boarding with JanusGraph Performance
On-boarding with JanusGraph Performance
 
1. introduction to no sql
1. introduction to no sql1. introduction to no sql
1. introduction to no sql
 
How to Choose The Right Database on AWS - Berlin Summit - 2019
How to Choose The Right Database on AWS - Berlin Summit - 2019How to Choose The Right Database on AWS - Berlin Summit - 2019
How to Choose The Right Database on AWS - Berlin Summit - 2019
 
Introduction to column oriented databases
Introduction to column oriented databasesIntroduction to column oriented databases
Introduction to column oriented databases
 
Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloud
 
Spark Interview Questions and Answers | Apache Spark Interview Questions | Sp...
Spark Interview Questions and Answers | Apache Spark Interview Questions | Sp...Spark Interview Questions and Answers | Apache Spark Interview Questions | Sp...
Spark Interview Questions and Answers | Apache Spark Interview Questions | Sp...
 
Introduction to elasticsearch
Introduction to elasticsearchIntroduction to elasticsearch
Introduction to elasticsearch
 
Oracle to Postgres Schema Migration Hustle
Oracle to Postgres Schema Migration HustleOracle to Postgres Schema Migration Hustle
Oracle to Postgres Schema Migration Hustle
 
DBpedia InsideOut
DBpedia InsideOutDBpedia InsideOut
DBpedia InsideOut
 
stackconf 2021 | Weaviate Vector Search Engine – Introduction
stackconf 2021 | Weaviate Vector Search Engine – Introductionstackconf 2021 | Weaviate Vector Search Engine – Introduction
stackconf 2021 | Weaviate Vector Search Engine – Introduction
 
ACID ORC, Iceberg, and Delta Lake—An Overview of Table Formats for Large Scal...
ACID ORC, Iceberg, and Delta Lake—An Overview of Table Formats for Large Scal...ACID ORC, Iceberg, and Delta Lake—An Overview of Table Formats for Large Scal...
ACID ORC, Iceberg, and Delta Lake—An Overview of Table Formats for Large Scal...
 
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data ScienceAI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
 
TypeScript VS JavaScript.pptx
TypeScript VS JavaScript.pptxTypeScript VS JavaScript.pptx
TypeScript VS JavaScript.pptx
 
How To Become A Big Data Engineer? Edureka
How To Become A Big Data Engineer? EdurekaHow To Become A Big Data Engineer? Edureka
How To Become A Big Data Engineer? Edureka
 
Unit 3 MongDB
Unit 3 MongDBUnit 3 MongDB
Unit 3 MongDB
 
GraphConnect 2022 - Top 10 Cypher Tuning Tips & Tricks.pptx
GraphConnect 2022 - Top 10 Cypher Tuning Tips & Tricks.pptxGraphConnect 2022 - Top 10 Cypher Tuning Tips & Tricks.pptx
GraphConnect 2022 - Top 10 Cypher Tuning Tips & Tricks.pptx
 
From distributed caches to in-memory data grids
From distributed caches to in-memory data gridsFrom distributed caches to in-memory data grids
From distributed caches to in-memory data grids
 
An Intro to NoSQL Databases
An Intro to NoSQL DatabasesAn Intro to NoSQL Databases
An Intro to NoSQL Databases
 

Similar to Graph Abstractions Matter by Ora Lassila

Graph database in sv meetup
Graph database in sv meetupGraph database in sv meetup
Graph database in sv meetupJoshua Bae
 
Arches Getty Brownbag Talk
Arches Getty Brownbag TalkArches Getty Brownbag Talk
Arches Getty Brownbag Talkbenosteen
 
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...Jean Ihm
 
Making the semantic web work
Making the semantic web workMaking the semantic web work
Making the semantic web workPaul Houle
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital.AI
 
Graph Database and Neo4j
Graph Database and Neo4jGraph Database and Neo4j
Graph Database and Neo4jSina Khorami
 
Sharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data LessonsSharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data LessonsGeorge Stathis
 
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)Emil Eifrem
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?Samet KILICTAS
 
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)Emil Eifrem
 
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SFTed Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SFMLconf
 
Evolution of the Graph Schema
Evolution of the Graph SchemaEvolution of the Graph Schema
Evolution of the Graph SchemaJoshua Shinavier
 
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open DataMuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data21Style
 
Data Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneData Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneDoug Needham
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data ModelingVital.AI
 
CSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialCSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialLeeFeigenbaum
 
Large Scale Graph Analytics with JanusGraph
Large Scale Graph Analytics with JanusGraphLarge Scale Graph Analytics with JanusGraph
Large Scale Graph Analytics with JanusGraphP. Taylor Goetz
 

Similar to Graph Abstractions Matter by Ora Lassila (20)

Graph database in sv meetup
Graph database in sv meetupGraph database in sv meetup
Graph database in sv meetup
 
Arches Getty Brownbag Talk
Arches Getty Brownbag TalkArches Getty Brownbag Talk
Arches Getty Brownbag Talk
 
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
 
Making the semantic web work
Making the semantic web workMaking the semantic web work
Making the semantic web work
 
JSON-LD and SHACL for Knowledge Graphs
JSON-LD and SHACL for Knowledge GraphsJSON-LD and SHACL for Knowledge Graphs
JSON-LD and SHACL for Knowledge Graphs
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
 
Graph Database and Neo4j
Graph Database and Neo4jGraph Database and Neo4j
Graph Database and Neo4j
 
Sharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data LessonsSharing a Startup’s Big Data Lessons
Sharing a Startup’s Big Data Lessons
 
Presentation1
Presentation1Presentation1
Presentation1
 
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
NOSQL Overview, Neo4j Intro And Production Example (QCon London 2010)
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?
 
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
NOSQL overview and intro to graph databases with Neo4j (Geeknight May 2010)
 
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SFTed Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
 
Some NoSQL
Some NoSQLSome NoSQL
Some NoSQL
 
Evolution of the Graph Schema
Evolution of the Graph SchemaEvolution of the Graph Schema
Evolution of the Graph Schema
 
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open DataMuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
 
Data Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneData Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZone
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data Modeling
 
CSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web TutorialCSHALS 2010 W3C Semanic Web Tutorial
CSHALS 2010 W3C Semanic Web Tutorial
 
Large Scale Graph Analytics with JanusGraph
Large Scale Graph Analytics with JanusGraphLarge Scale Graph Analytics with JanusGraph
Large Scale Graph Analytics with JanusGraph
 

More from Connected Data World

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenConnected Data World
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Connected Data World
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine LearningConnected Data World
 
The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is hereConnected Data World
 
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2Connected Data World
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3Connected Data World
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data ModelConnected Data World
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseConnected Data World
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Connected Data World
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Connected Data World
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleConnected Data World
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Connected Data World
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the WebConnected Data World
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsConnected Data World
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsConnected Data World
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...Connected Data World
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGOConnected Data World
 
What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?Connected Data World
 

More from Connected Data World (20)

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van Harmelen
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine Learning
 
Graphs in sustainable finance
Graphs in sustainable financeGraphs in sustainable finance
Graphs in sustainable finance
 
The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is here
 
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data Model
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
 
Graph Realities
Graph RealitiesGraph Realities
Graph Realities
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scale
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the Web
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needs
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property Graphs
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGO
 
What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?
 

Recently uploaded

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 

Recently uploaded (20)

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

Graph Abstractions Matter by Ora Lassila

  • 1. © 2021, Amazon Web Services, Inc. or its Affiliates. Graph Abstractions Matter (a Personal View) Dr. Ora Lassila Principal Technologist Amazon Neptune
  • 2. © 2021, Amazon Web Services, Inc. or its Affiliates. Game plan: 1. Some graph history 2. Graph abstractions, queries and access patterns 3. Graph as an abstraction: “One graph to rule them all” 4. The way forward
  • 3. © 2021, Amazon Web Services, Inc. or its Affiliates. A brief history of graphs and ontologies 1730s: Graph theory (Euler) 1870s: Library classification (Dewey) 1900: Semantics, ontology and logic (Husserl) 1950s and onwards: Graphs as the essential underpinning of computer science 1960s-1970s: Network databases (CODASYL), semantic networks (Quillian et al) 1970s-1990s: Predicate logic as the foundation of Knowledge Representation (Hayes et al) 1997 and onwards: The Semantic Web, RDF, OWL, etc. (Lassila et al) Today: Modern knowledge graphs and graph databases 3rd Century BCE: Categories & logic (Aristotle) 1730s: Taxonomical classification of plants and animals (Linnaeus) 1960s: Social networks, “small-world experiment”, Erdős number (Milgram et al)
  • 4. © 2021, Amazon Web Services, Inc. or its Affiliates. 1730s: Graph theory (Euler) 1870s: Library classification (Dewey) 1900: Semantics, ontology and logic (Husserl) 1950s and onwards: Graphs as the essential underpinning of computer science 1960s-1970s: Network databases (CODASYL), semantic networks (Quillian et al) 1970s-1990s: Predicate logic as the foundation of Knowledge Representation (Hayes et al) Today: Modern knowledge graphs and graph databases 3rd Century BCE: Categories & logic (Aristotle) 1730s: Taxonomical classification of plants and animals (Linnaeus) 1960s: Social networks, “small-world experiment”, Erdős number (Milgram et al) A brief history of graphs and ontologies 1997 and onwards: The Semantic Web, RDF, OWL, etc. (Lassila et al) Despite all this history, programming with graphs and ontologies is still cumbersome
  • 5. © 2021, Amazon Web Services, Inc. or its Affiliates. RDF Labeled property graphs Features: • vertices and edge types: identifiers • graphs decompose into “triples” • scalar data types from XSD, values are special vertices • no composite data types; objects use graph structure Query languages: • SPARQL (declarative) Features: • vertices and edges: objects • no common identifier scheme • scalar and composite data types borrowed from implementation languages Query languages: • Gremlin (imperative) • Cypher (declarative)
  • 6. © 2021, Amazon Web Services, Inc. or its Affiliates. Graph abstractions Triples are a very low-level abstraction of graphs • akin to assembly language Mapping graphs to trees does not necessarily work • metamodel mismatch • RDF/XML and JSON-LD as warning examples • (also: syntax does not matter!) “Object-graph mapping”…? • how to determine what constitutes an object? • in RDF, blank nodes can be difficult to handle
  • 7. © 2021, Amazon Web Services, Inc. or its Affiliates. Graph queries (or: “how do I get there from here?”) Traversal is an excellent graph access paradigm • “where can I go from here given this path pattern?” • “are these two nodes connected, and how?” • using path patterns for traversal is akin to pattern matching [Lassila 2007] SPARQL is a ”very big hammer” for many access problems We need simple APIs: • search, faceted search (filtering) • “give me everything you know about this node” (cf. object-graph mapping) • traversal
  • 8. © 2021, Amazon Web Services, Inc. or its Affiliates. Models and ontologies Problems: 1. LPGs (by and large) do not have schema languages 2. RDF has too many schema languages (RDFS, OWL, SHACL) 3. how to map models to code? What does mapping your data to a shared ontology “buy” you? Can we support ontologies with predefined libraries? • ”ontology engines”
  • 9. © 2021, Amazon Web Services, Inc. or its Affiliates. What about reasoning? Symbolic reasoning: • generative reasoning can be easily hidden from the application/client • restrictions as a means of data validation (SHACL rather than OWL) Non-symbolic reasoning: • ML techniques can be used much like symbolic reasoning We need an expanded view of reasoning, and we need to rethink what an “application” is: Logic should be associated with data and models, not with an application
  • 10. © 2021, Amazon Web Services, Inc. or its Affiliates. Graphs as a logical abstraction Modern enterprise data practice is messy Data silos are commonplace Data integration often happens via bespoke and ad hoc solutions We need a unifying logical view of data (a lingua franca for data integration) or risk ongoing and ever-increasing fragmentation Until some better option shows up, RDF and OWL can provide this view “Tower of Babel”, Pieter Brueghel the Elder, 1563; Kunsthistorisches Museum, Wien We are here
  • 11. © 2021, Amazon Web Services, Inc. or its Affiliates. RDF vs. labeled property graphs…? Confuses users (when given the choice) Should not matter The real issue may in fact be Graph as a logical representation vs. graph as a data structure The practical problem is still that we have two separate ecosystems
  • 12. © 2021, Amazon Web Services, Inc. or its Affiliates. RDF Labeled property graphs Pros: • W3C standard, well established • easy to use external data sources • schema language(s) • formal semantics for graph merging • supports reasoning Cons: • often considered “too academic” • no easy “on-ramp” Pros: • intuitive for software developers • integrates well with programming languages (esp. Gremlin) Cons: • no standard, many divergent (proprietary) implementations • no schema language • no formal semantics
  • 13. © 2021, Amazon Web Services, Inc. or its Affiliates. RDF Labeled property graphs Pros: • W3C standard, well established • easy to use external data sources • schema language(s) • formal semantics for graph merging • supports reasoning Pros: • intuitive for software developers • integrates well with programming languages (esp. Gremlin) We want both!
  • 14. © 2021, Amazon Web Services, Inc. or its Affiliates. Graph as an abstraction What we really want is a unifying metamodel that covers both RDF and LPGs • helps adoption à important for the graph industry • gives users more choices (e.g., Gremlin over RDF) We are working on this (codename “One Graph”) This is not an easy undertaking, however…
  • 15. © 2021, Amazon Web Services, Inc. or its Affiliates. Some challenges in unifying RDF and LPGs RDF and RDF-star semantics • triples are unique, but LPG edges are not • (“old style” reification does not have this problem) Lack of formal semantics for LPGs • difficult to determine “correct” semantics • also: diversity of scalar datatypes Graph partitioning • no named graphs in current LPGs • in RDF vertices are just identifiers (no structure), but in LPGs vertices are structured objects Update semantics • deleting non-unique triples: what happens? • cascading delete for edge properties?
  • 16. © 2021, Amazon Web Services, Inc. or its Affiliates. The way forward (or: what are the real pain points?) How do I write software that leverages a knowledge graph? • bridging the “graph ←→ code -gap” • low-level interfaces (lots of boilerplate code, etc.) I mapped my data to an ontology… now what? (or: what is “ontology support”…?) • role of reasoning • “hiding” reasoning works well • transitive taxonomies & owl:sameAs (including inverse functional properties) New interfaces and abstractions Libraries, “ontology engine(s)”…?
  • 17. © 2021, Amazon Web Services, Inc. or its Affiliates. The way forward (or: what are the real pain points?) What about property graphs, Gremlin, …? • choosing between RDF and property graph models is hard and leads to confusion • RDF-star to the rescue? Is there a way out of the silos that does not involve never-ending bespoke integrations? • modern tools do not offer a single unified view of all data Graphs as a unifying logical view “One Graph” unified metamodel Lassila et al: “Graph? Yes! Which one? Help!”, SCG2021 @ Semantics 2021 (arXiv:2110.13348)
  • 18. © 2021, Amazon Web Services, Inc. or its Affiliates. Questions? Contact: ora@amazon.com More info: https://aws.amazon.com/neptune/knowledge-graphs-on-aws/ https://arxiv.org/abs/2110.13348