Introduction to Nebula Graph, an Open-Source Distributed Graph Database
https://nebula-graph.io
An Open-Source Distributed Graph Database
Sherman Ye
Founder & CEO
sherman.ye@vesoft.com
Agenda
• Who We Are?
• What is a Graph Database?
• Why Open Source?
• Architecture
• Advantages
  • In Architecture
  • In Data Amount
  • In Performance
• Adopters
• Product Roadmap
• Summary
Founder: A Graph Database Forerunner
Sherman Ye, Founder & CEO
Infra Software Engineer (2010-2015)
Started and led Dragon, a high-performance distributed graph database project
• The first distributed graph database in the industry
• Deployed on more than 500 nodes by Jan. 2015
• Together with the write-through cache system TAO, Dragon served all relationship queries across the entire Facebook application stack
Principal Software Engineer (2015-2018)
Formed the team that built the high-performance graph database GeaBase from the ground up
• GeaBase is the only technical product that originated in Ant Financial and was widely adopted across Alibaba Group
• Deployed on more than 1,000 nodes and served more than one billion queries every day, with an average latency of about 40 ms
• Served almost one hundred scenarios across business groups ranging from Ant Financial to AliExpress, GaoDe Map, AliMama, TianMao, CaiNiao, UC, etc.
Who We Are?
Team
• 40+ full-time employees, more than 30 of them in technical roles
• More than half of the technical employees come from Alibaba, Huawei, NetEase, etc.
History
• vesoft Inc. was founded in Oct. 2018; Nebula Graph became open source in May 2019
• Finalized $3M angel funding: Matrix Partners China
• Raised $17M Pre-A funding: Source Code Capital, Redpoint Ventures, Matrix Partners China
What is a Graph Database?
• A typical database consists of tables filled with data of the same type, which makes retrieving that data quick.
• A graph database can uncover deep relationships across many data sets.
• In a graph, a VERTEX (or NODE) represents an item, an EDGE connects it with other items or data sets (one-way, two-way, or in multiple ways), and PROPERTIES further describe a VERTEX for more granular data mining.
• So, instead of just looking up people named SMITH, data scientists can reveal how SMITH might relate to JONES or DOE at a given time, in a given location, and more.
• These capabilities attract big users: Adobe, Facebook, Microsoft, Netflix, and many more.
• The graph database market is expected to grow from $650M (2018) to $4.13B by 2026 (Verified Market Research).
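The vertex, edge, and property vocabulary above can be made concrete with a small in-memory model. This is a minimal Python sketch for illustration only, not Nebula Graph's data model or API: the class names, the `relations_between` helper, and the sample data are all hypothetical, and attaching properties to edges (not just vertices) is an assumption borrowed from common property-graph models.

```python
from dataclasses import dataclass, field


@dataclass
class Vertex:
    """An item in the graph, with free-form properties."""
    vid: str
    properties: dict = field(default_factory=dict)


@dataclass
class Edge:
    """A typed, directed connection between two vertices."""
    src: str
    dst: str
    edge_type: str
    properties: dict = field(default_factory=dict)


class PropertyGraph:
    """Hypothetical toy container; real graph databases index edges by vertex."""

    def __init__(self):
        self.vertices = {}
        self.edges = []

    def add_vertex(self, vid, **properties):
        self.vertices[vid] = Vertex(vid, properties)

    def add_edge(self, src, dst, edge_type, **properties):
        self.edges.append(Edge(src, dst, edge_type, properties))

    def relations_between(self, a, b):
        """Return every edge that links a and b, in either direction."""
        return [e for e in self.edges if {e.src, e.dst} == {a, b}]


graph = PropertyGraph()
graph.add_vertex("smith", name="SMITH", city="Boston")
graph.add_vertex("jones", name="JONES", city="Boston")
graph.add_vertex("doe", name="DOE", city="Austin")
graph.add_edge("smith", "jones", "works_with", since=2019)
graph.add_edge("jones", "smith", "follows", since=2020)
graph.add_edge("smith", "doe", "transfers_to", amount=250)

# How does SMITH relate to JONES? List each connecting edge and its properties.
for edge in graph.relations_between("smith", "jones"):
    print(edge.src, edge.edge_type, edge.dst, edge.properties)
```

A table lookup would answer "who is named SMITH"; the `relations_between` call answers "how are SMITH and JONES connected", which is the kind of question the slide has in mind.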
Why Open Source?
• To make graph technology more accessible to the world
• To build a healthy ecosystem around Nebula Graph
• To expand globally
GitHub stars: 5,200+
WeChat group members: 1,000+
Contributors: 50+
Forum posts: 1,100/month
Architecture
Three Components:
• Meta Service
• Query Service
• Storage Service
Advantages:
• Proven highest performance
• The most scalable
• Industry's highest availability
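One way to picture how three such services could cooperate is the toy Python model below. It is a sketch under stated assumptions, not Nebula Graph's implementation: it assumes the meta service records which storage host owns each partition, that vertices are hash-partitioned by ID, and that query nodes hold no data of their own. All class and method names and the host names are hypothetical.

```python
import zlib


class MetaService:
    """Toy stand-in: records which storage host owns each partition."""

    def __init__(self, hosts, num_partitions):
        self.num_partitions = num_partitions
        self.partition_owner = {
            p: hosts[p % len(hosts)] for p in range(num_partitions)
        }

    def partition_of(self, vertex_id):
        # crc32 is deterministic across runs, unlike Python's built-in hash().
        return zlib.crc32(vertex_id.encode("utf-8")) % self.num_partitions

    def host_of(self, vertex_id):
        return self.partition_owner[self.partition_of(vertex_id)]


class StorageService:
    """Toy stand-in: each host keeps only the data of its own partitions."""

    def __init__(self, hosts):
        self.data = {host: {} for host in hosts}

    def put(self, host, vertex_id, neighbors):
        self.data[host][vertex_id] = list(neighbors)

    def get(self, host, vertex_id):
        return self.data[host].get(vertex_id, [])


class QueryService:
    """Toy stand-in: stateless compute node that routes requests by vertex ID."""

    def __init__(self, meta, storage):
        self.meta = meta
        self.storage = storage

    def insert(self, vertex_id, neighbors):
        self.storage.put(self.meta.host_of(vertex_id), vertex_id, neighbors)

    def neighbors(self, vertex_id):
        return self.storage.get(self.meta.host_of(vertex_id), vertex_id)


hosts = ["storage-0", "storage-1", "storage-2"]
meta = MetaService(hosts, num_partitions=12)
storage = StorageService(hosts)
query_a = QueryService(meta, storage)
query_b = QueryService(meta, storage)  # a second compute node with no state of its own

query_a.insert("smith", ["jones", "doe"])
print(query_b.neighbors("smith"))
```

Because the query nodes keep no state in this model, a write through `query_a` is immediately readable through `query_b`, and compute nodes could be added or removed without moving data.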
Advantages
Compared with other graph database solutions, Nebula Graph has the following advantages:
In Architecture
• Shared-nothing structure: ensures high availability
• Separation of storage and computation: ensures high scalability and cloud readiness
Data Amount in Example:
• Data amount: 150 TB
• Graph size: one trillion edges/connections
• An hourly update of 10 billion connections
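The data-amount figures on this slide (150 TB, one trillion edges, 10 billion connection updates per hour) are easier to judge once converted to per-edge and per-second rates. The short Python calculation below is a back-of-the-envelope sketch; it assumes decimal terabytes and treats all 150 TB as if it were edge data, which overstates the true per-edge size because vertices and indexes also take space.

```python
TB = 10**12  # decimal terabyte; an assumption, the slide does not say TB vs. TiB

data_bytes = 150 * TB
edge_count = 10**12           # one trillion edges
hourly_updates = 10 * 10**9   # 10 billion connections updated per hour

bytes_per_edge = data_bytes / edge_count
updates_per_second = hourly_updates / 3600
hours_to_match_graph_size = edge_count / hourly_updates

print(f"{bytes_per_edge:.0f} bytes per edge on average")
print(f"{updates_per_second:,.0f} edge updates per second")
print(f"{hours_to_match_graph_size:.0f} hours of updates to equal the edge count")
```

Under these assumptions the workload averages about 150 bytes per edge and roughly 2.8 million edge updates per second, sustained.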
In Performance: Meituan
We invite you to read a large customer's own performance benchmark, conducted by the NLP team at Meituan: Nebula Graph vs. Dgraph vs. HugeGraph
Link to the topic on the forum:
https://discuss.nebula-graph.io/t/benchmarking-the-mainstream-open-source-distributed-graph-databases-at-meituan-nebula-graph-vs-dgraph-vs-hugegraph/715
[Chart: Real-Time Write]
In Performance: Meituan (Cont’d)
[Charts: N-Hop Queries; Shared Friends Queries]
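For readers unfamiliar with the two query types named here, the Python sketch below shows one plausible reading of each on an in-memory adjacency list. It is illustrative only: the benchmark's exact definitions may differ (for example, whether an N-hop query deduplicates vertices reached by shorter paths), and the function names and sample graph are hypothetical.

```python
from collections import deque


def n_hop_neighbors(adjacency, start, hops):
    """Return vertices whose shortest distance from start is exactly `hops`."""
    distance = {start: 0}
    queue = deque([start])
    while queue:
        vertex = queue.popleft()
        if distance[vertex] == hops:
            continue  # do not expand beyond the requested depth
        for neighbor in adjacency.get(vertex, ()):
            if neighbor not in distance:
                distance[neighbor] = distance[vertex] + 1
                queue.append(neighbor)
    return {v for v, d in distance.items() if d == hops}


def shared_friends(adjacency, a, b):
    """Return vertices that are direct neighbors of both a and b."""
    return set(adjacency.get(a, ())) & set(adjacency.get(b, ()))


# Hypothetical undirected friendship graph, stored as adjacency lists.
friends = {
    "ann": ["bob", "cat"],
    "bob": ["ann", "dan"],
    "cat": ["ann", "dan"],
    "dan": ["bob", "cat", "eve"],
    "eve": ["dan"],
}

print(sorted(n_hop_neighbors(friends, "ann", 1)))
print(sorted(n_hop_neighbors(friends, "ann", 2)))
print(sorted(shared_friends(friends, "bob", "cat")))
```

The cost of an N-hop query grows with the number of vertices reached at each level, which is why such queries are a common stress test for graph databases.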
In Performance: Tencent Cloud
Performance comparison conducted by the Tencent Cloud team:
NebulaGraph vs. Neo4j vs. HugeGraph
Link to the topic on the forum:
https://discuss.nebula-graph.io/t/performance-comparison-neo4j-vs-nebula-graph-vs-janusgraph/619
[Charts: Data import; 1-degree friends query; 2-degree friends query; Common friends query]
In Performance: 360 Digitech
360 Digitech has shared its experience migrating from JanusGraph to Nebula Graph and the large performance gains after the migration.
Link to the topic on the forum:
https://discuss.nebula-graph.io/t/data-migration-from-janusgraph-to-nebula-graph-practice-at-360-finance/672
[Charts: HBase network I/O vs. Nebula Graph network I/O; HBase disk I/O vs. Nebula Graph disk I/O]
In Performance: 360 Digitech (Cont’d)
Test Results from 360 Digitech
• Nebula Graph significantly outperforms the HBase-backed JanusGraph deployment in both disk and network I/O
• These results were achieved with only 30% of the machine resources of the HBase cluster
• Queries that take JanusGraph 2-3 seconds take Nebula Graph about 100 ms
• Queries that take JanusGraph 10-20 seconds take Nebula Graph about 2 seconds
• Overall, Nebula Graph delivers more than a 20-fold performance improvement
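The per-query latencies quoted above can be turned into speedup factors with simple division. The Python snippet below is only arithmetic on the slide's own numbers; the function name is hypothetical, and latencies are expressed in milliseconds to keep the division exact.

```python
def speedup_range(before_ms, after_ms):
    """Return (min, max) speedup for a (low, high) latency range in milliseconds."""
    low, high = before_ms
    return low / after_ms, high / after_ms


# Latencies as quoted on the slide, converted to milliseconds.
short_queries = speedup_range((2000, 3000), 100)
long_queries = speedup_range((10000, 20000), 2000)

print(f"2-3 s -> 100 ms: {short_queries[0]:.0f}x to {short_queries[1]:.0f}x faster")
print(f"10-20 s -> 2 s: {long_queries[0]:.0f}x to {long_queries[1]:.0f}x faster")
```

On these figures the shorter queries come out 20 to 30 times faster and the longer ones 5 to 10 times faster.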
Adopters
• Real-Time Recommendations
• Fraud Detection
• Cyber Security
• Knowledge Graph
Product Roadmap
Summary
• Nebula Graph is a VC-funded solution already adopted by some of the world's largest Internet companies
• Nebula Graph has proven to be the world's highest-performing graph database
• It can store and process hundreds of billions of data points with trillions of relational connections in a shared-nothing distributed architecture
• The graph database market is projected to grow more than sixfold by 2026 (from $650M in 2018 to $4.13B)
Unleash the Power of Connections!
Thank You
GitHub: vesoft-inc/nebula
Twitter: @NebulaGraph
Facebook: @NebulaGraph
https://discuss.nebula-graph.io
https://nebula-graph.io

More Related Content

What's hot

Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
 
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...Simplilearn
 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphIoan Toma
 
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...Simplilearn
 
Demystifying data engineering
Demystifying data engineeringDemystifying data engineering
Demystifying data engineeringThang Bui (Bob)
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseDatabricks
 
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...Certus Solutions
 
Data Modeling & Metadata for Graph Databases
Data Modeling & Metadata for Graph DatabasesData Modeling & Metadata for Graph Databases
Data Modeling & Metadata for Graph DatabasesDATAVERSITY
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...DATAVERSITY
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data MeshLibbySchulze
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptxAlex Ivy
 
Databricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI InitiativesDatabricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI InitiativesDatabricks
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerDatabricks
 
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...Simplilearn
 
Snowflake SnowPro Core Cert CheatSheet.pdf
Snowflake SnowPro Core Cert CheatSheet.pdfSnowflake SnowPro Core Cert CheatSheet.pdf
Snowflake SnowPro Core Cert CheatSheet.pdfDustin Liu
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesNeo4j
 
Transparent Data Encryption for SharePoint Content Databases
Transparent Data Encryption for SharePoint Content DatabasesTransparent Data Encryption for SharePoint Content Databases
Transparent Data Encryption for SharePoint Content DatabasesMichael Noel
 
A 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with SnowflakeA 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with SnowflakeSnowflake Computing
 
Accelerating Data Ingestion with Databricks Autoloader
Accelerating Data Ingestion with Databricks AutoloaderAccelerating Data Ingestion with Databricks Autoloader
Accelerating Data Ingestion with Databricks AutoloaderDatabricks
 

What's hot (20)

Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge Graph
 
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
 
Demystifying data engineering
Demystifying data engineeringDemystifying data engineering
Demystifying data engineering
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a Lakehouse
 
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
 
Data Modeling & Metadata for Graph Databases
Data Modeling & Metadata for Graph DatabasesData Modeling & Metadata for Graph Databases
Data Modeling & Metadata for Graph Databases
 
Introduction to HBase
Introduction to HBaseIntroduction to HBase
Introduction to HBase
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data Mesh
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptx
 
Databricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI InitiativesDatabricks + Snowflake: Catalyzing Data and AI Initiatives
Databricks + Snowflake: Catalyzing Data and AI Initiatives
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics Primer
 
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...
Hive Tutorial | Hive Architecture | Hive Tutorial For Beginners | Hive In Had...
 
Snowflake SnowPro Core Cert CheatSheet.pdf
Snowflake SnowPro Core Cert CheatSheet.pdfSnowflake SnowPro Core Cert CheatSheet.pdf
Snowflake SnowPro Core Cert CheatSheet.pdf
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
 
Transparent Data Encryption for SharePoint Content Databases
Transparent Data Encryption for SharePoint Content DatabasesTransparent Data Encryption for SharePoint Content Databases
Transparent Data Encryption for SharePoint Content Databases
 
A 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with SnowflakeA 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with Snowflake
 
Accelerating Data Ingestion with Databricks Autoloader
Accelerating Data Ingestion with Databricks AutoloaderAccelerating Data Ingestion with Databricks Autoloader
Accelerating Data Ingestion with Databricks Autoloader
 

Similar to Introduction to Nebula Graph, an Open-Source Distributed Graph Database

GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?Neo4j
 
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...InfluxData
 
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSABetter Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSAPRBETTER
 
Introduction to the graph technologies landscape
Introduction to the graph technologies landscapeIntroduction to the graph technologies landscape
Introduction to the graph technologies landscapeLinkurious
 
Introduction to the graph technologies landscape
Introduction to the graph technologies landscapeIntroduction to the graph technologies landscape
Introduction to the graph technologies landscapeLinkurious
 
The power of polyglot searching
The power of polyglot searchingThe power of polyglot searching
The power of polyglot searchingGraphAware
 
Enabling Data centric Teams
Enabling Data centric TeamsEnabling Data centric Teams
Enabling Data centric TeamsData Con LA
 
Big Data & Open Source - Neil Jadhav
Big Data & Open Source - Neil JadhavBig Data & Open Source - Neil Jadhav
Big Data & Open Source - Neil JadhavSwapnil (Neil) Jadhav
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesCédric Fauvet
 
Deep Learning State of the Art (2019) - MIT by Lex Fridman
Deep Learning State of the Art (2019) - MIT by Lex FridmanDeep Learning State of the Art (2019) - MIT by Lex Fridman
Deep Learning State of the Art (2019) - MIT by Lex FridmanPeerasak C.
 
Deep learning state_of_the_art- Autonomous Driving
Deep learning state_of_the_art- Autonomous DrivingDeep learning state_of_the_art- Autonomous Driving
Deep learning state_of_the_art- Autonomous DrivingAlok Jain
 
Graph-Oriented NoSQL Databases
Graph-Oriented NoSQL  Databases Graph-Oriented NoSQL  Databases
Graph-Oriented NoSQL Databases Abdelkader OUARED
 
Apache AGE and the synergy effect in the combination of Postgres and NoSQL
 Apache AGE and the synergy effect in the combination of Postgres and NoSQL Apache AGE and the synergy effect in the combination of Postgres and NoSQL
Apache AGE and the synergy effect in the combination of Postgres and NoSQLEDB
 
Multiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier DominguezMultiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier DominguezBig Data Spain
 
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...Neo4j
 
Anaconda and PyData Solutions
Anaconda and PyData SolutionsAnaconda and PyData Solutions
Anaconda and PyData SolutionsTravis Oliphant
 
How Celtra Optimizes its Advertising Platform with Databricks
How Celtra Optimizes its Advertising Platformwith DatabricksHow Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platform with DatabricksGrega Kespret
 
20181019 code.talks graph_analytics_k_patenge
20181019 code.talks graph_analytics_k_patenge20181019 code.talks graph_analytics_k_patenge
20181019 code.talks graph_analytics_k_patengeKarin Patenge
 
Introduction to Neo4j
Introduction to Neo4jIntroduction to Neo4j
Introduction to Neo4jNeo4j
 

Similar to Introduction to Nebula Graph, an Open-Source Distributed Graph Database (20)

GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?
 
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...
Discover How IBM Uses InfluxDB and Grafana to Help Clients Monitor Large Prod...
 
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSABetter Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
Better Hackathon 2020 - Fraunhofer IAIS - Semantic geo-clustering with SANSA
 
Introduction to the graph technologies landscape
Introduction to the graph technologies landscapeIntroduction to the graph technologies landscape
Introduction to the graph technologies landscape
 
Introduction to the graph technologies landscape
Introduction to the graph technologies landscapeIntroduction to the graph technologies landscape
Introduction to the graph technologies landscape
 
The power of polyglot searching
The power of polyglot searchingThe power of polyglot searching
The power of polyglot searching
 
Power of Polyglot Search
Power of Polyglot SearchPower of Polyglot Search
Power of Polyglot Search
 
Enabling Data centric Teams
Enabling Data centric TeamsEnabling Data centric Teams
Enabling Data centric Teams
 
Big Data & Open Source - Neil Jadhav
Big Data & Open Source - Neil JadhavBig Data & Open Source - Neil Jadhav
Big Data & Open Source - Neil Jadhav
 
La bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphesLa bi, l'informatique décisionnelle et les graphes
La bi, l'informatique décisionnelle et les graphes
 
Deep Learning State of the Art (2019) - MIT by Lex Fridman
Deep Learning State of the Art (2019) - MIT by Lex FridmanDeep Learning State of the Art (2019) - MIT by Lex Fridman
Deep Learning State of the Art (2019) - MIT by Lex Fridman
 
Deep learning state_of_the_art- Autonomous Driving
Deep learning state_of_the_art- Autonomous DrivingDeep learning state_of_the_art- Autonomous Driving
Deep learning state_of_the_art- Autonomous Driving
 
Graph-Oriented NoSQL Databases
Graph-Oriented NoSQL  Databases Graph-Oriented NoSQL  Databases
Graph-Oriented NoSQL Databases
 
Apache AGE and the synergy effect in the combination of Postgres and NoSQL
 Apache AGE and the synergy effect in the combination of Postgres and NoSQL Apache AGE and the synergy effect in the combination of Postgres and NoSQL
Apache AGE and the synergy effect in the combination of Postgres and NoSQL
 
Multiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier DominguezMultiplatform Spark solution for Graph datasources by Javier Dominguez
Multiplatform Spark solution for Graph datasources by Javier Dominguez
 
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
New Opportunities for Connected Data - Emil Eifrem @ GraphConnect Boston + Ch...
 
Anaconda and PyData Solutions
Anaconda and PyData SolutionsAnaconda and PyData Solutions
Anaconda and PyData Solutions
 
How Celtra Optimizes its Advertising Platform with Databricks
How Celtra Optimizes its Advertising Platformwith DatabricksHow Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platform with Databricks
 
20181019 code.talks graph_analytics_k_patenge
20181019 code.talks graph_analytics_k_patenge20181019 code.talks graph_analytics_k_patenge
20181019 code.talks graph_analytics_k_patenge
 
Introduction to Neo4j
Introduction to Neo4jIntroduction to Neo4j
Introduction to Neo4j
 

Recently uploaded

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER

Introduction to Nebula Graph, an Open-Source Distributed Graph Database

  • 1. An Open-Source Distributed Graph Database
      • Sherman Ye, Founder & CEO
      • sherman.ye@vesoft.com
      • https://nebula-graph.io
  • 2. Agenda
      • Who We Are?
      • What is a Graph Database?
      • Why Open Source?
      • Architecture
      • Advantages
          • In Architecture
          • In Data Amount
          • In Performance
      • Adopters
      • Product Roadmap
      • Summary
  • 3. Founder: A Graph Database Forerunner (Sherman Ye, Founder & CEO)
      • Infra Software Engineer (2010-2015): started and led Dragon, a high-performance distributed graph database project
          • First distributed graph database ever in the industry
          • Deployed on more than 500 nodes by Jan. 2015
          • Together with the write-through cache system TAO, Dragon served all relationship queries across the entire Facebook application stack
      • Principal Software Engineer (2015-2018): formed the team that built the high-performance graph database GeaBase from the ground up
          • GeaBase is the only technical product that was rooted in Ant Financial and widely adopted in Alibaba Group
          • Deployed on more than 1000 nodes and served more than one billion queries every day, with an average latency of about 40 ms
          • Served almost one hundred scenarios and covered business groups from Ant Financial to AliExpress, GaoDe Map, AliMama, TianMao, CaiNiao, UC, etc.
  • 4. Who We Are?
      • Team
          • 40+ full-time employees, more than 30 of them technical staff
          • More than half of the technical employees come from Alibaba, Huawei, NetEase, etc.
      • History
          • vesoft Inc. was founded in Oct. 2018; Nebula Graph became open source in May 2019
          • Finalized $3M angel funding: Matrix Partners China
          • Raised $17M Pre-A funding: Source Code Capital, Red Point Ventures, Matrix Partners China
  • 5. What is a Graph Database?
      • A typical database consists of tables filled with the same type of data, which is useful for quick retrieval of such data.
      • A graph database can uncover deep relationships between many data sets.
      • In a graph, a VERTEX (or NODE) defines an item, an EDGE connects it with other items or data sets (one-way, two-way, or multiple ways), and PROPERTIES further classify a VERTEX for more granular data mining.
      • So, instead of just calling up people with the name SMITH, data scientists can reveal how SMITH might relate to JONES or DOE at a given time, in a given location, and more.
      • These capabilities have attracted big users: Adobe, Facebook, Microsoft, Netflix, and many more.
      • The graph database market is expected to grow from $650M (2018) to $4.13B by 2026 (Verified Market Research).
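The vertex, edge, and property model on this slide can be illustrated with a minimal in-memory sketch. This is not Nebula Graph's storage model or API; the class names, the `relations` helper, and the sample people and properties are all hypothetical, chosen only to mirror the SMITH/JONES/DOE example.

```python
from dataclasses import dataclass, field


@dataclass
class Vertex:
    """An item in the graph, with free-form properties (hypothetical type)."""
    vid: str
    properties: dict = field(default_factory=dict)


@dataclass
class Edge:
    """A directed, labeled connection between two vertices (hypothetical type)."""
    src: str
    dst: str
    label: str
    properties: dict = field(default_factory=dict)


class PropertyGraph:
    """Toy property graph: vertices by id, outgoing edges grouped by source."""

    def __init__(self):
        self.vertices = {}
        self.out_edges = {}

    def add_vertex(self, vid, **properties):
        self.vertices[vid] = Vertex(vid, properties)
        self.out_edges.setdefault(vid, [])

    def add_edge(self, src, dst, label, **properties):
        self.out_edges.setdefault(src, []).append(Edge(src, dst, label, properties))

    def relations(self, src, dst):
        """Return every edge that leads directly from src to dst."""
        return [e for e in self.out_edges.get(src, []) if e.dst == dst]


graph = PropertyGraph()
graph.add_vertex("SMITH", city="Boston")
graph.add_vertex("JONES", city="Boston")
graph.add_vertex("DOE", city="Austin")
# Edge properties carry the "given time, given location" context.
graph.add_edge("SMITH", "JONES", "works_with", since=2019, location="Boston")
graph.add_edge("SMITH", "DOE", "paid", year=2020)

smith_to_jones = graph.relations("SMITH", "JONES")
for edge in smith_to_jones:
    print(edge.label, edge.properties)
```

A table lookup would return the row for SMITH; the graph lookup returns how SMITH is connected to JONES, together with the properties attached to that connection.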
  • 6. Why Open Source?
      • To make graph technology more accessible to the world
      • To build a healthy ecosystem around Nebula Graph
      • To expand globally
      • GitHub stars: 5,200+; WeChat group members: 1,000+; contributors: 50+; forum posts: 1,100/month
  • 7. Architecture
      • Three components:
          • Meta Service
          • Query Service
          • Storage Service
      • Advantages:
          • Proven Highest Performance
          • The Most Scalable
          • Industry's Highest Availability
  • 8. Advantages
      • Compared with other graph database solutions, Nebula Graph has the following advantages:
      • In Architecture
          • Shared-nothing structure: ensures high availability
          • Separation of storage and computation: ensures high scalability and cloud readiness
      • Data Amount in Example
          • Data amount: 150 TB
          • Graph size: one trillion edges/connections
          • An hourly update of 10 billion connections
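To put the example's data amounts in perspective, the short calculation below derives an average footprint per edge and the sustained update rate. It is only back-of-the-envelope arithmetic on the slide's figures, under two assumptions that the slide does not state: that TB means 10^12 bytes, and that the whole 150 TB is divided evenly across the edges (in practice vertices, indexes, and replicas also take space).

```python
# Figures quoted on the slide.
TOTAL_BYTES = 150 * 10**12        # 150 TB, assuming decimal terabytes
EDGE_COUNT = 10**12               # one trillion edges/connections
UPDATES_PER_HOUR = 10 * 10**9     # 10 billion connections updated hourly

# Average storage per edge if all data were attributed to edges (an assumption).
bytes_per_edge = TOTAL_BYTES / EDGE_COUNT

# Sustained write rate needed to absorb the hourly update.
updates_per_second = UPDATES_PER_HOUR / 3600

print(f"{bytes_per_edge:.0f} bytes per edge on average")
print(f"{updates_per_second:,.0f} edge updates per second")
```

Under these assumptions the example works out to roughly 150 bytes per edge and close to 2.8 million edge updates per second.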
  • 9. In Performance: Meituan
      • We invite you to read a benchmark run by a large real-world customer, the NLP team at Meituan: Nebula Graph vs. Dgraph vs. HugeGraph
      • Link to the topic on the forum: https://discuss.nebula-graph.io/t/benchmarking-the-mainstream-open-source-distributed-graph-databases-at-meituan-nebula-graph-vs-dgraph-vs-hugegraph/715
      • [Chart: Real-Time Write]
  • 10. In Performance: Meituan (Cont'd)
      • [Charts: N-Hop Queries; Shared Friends Queries]
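For readers unfamiliar with the two query types benchmarked on this slide, here is a minimal sketch of what they compute, written against a plain adjacency dictionary. It is not how Nebula Graph, Dgraph, or HugeGraph execute these queries, and it is not nGQL. The function names and the sample friendship data are hypothetical, and "N-hop" is taken here to mean the vertices first reached at exactly N hops, which is an assumption about the benchmark's definition.

```python
def n_hop_neighbors(adjacency, start, n):
    """Return the vertices first reached exactly n hops from start (BFS by level)."""
    visited = {start}
    frontier = {start}
    for _ in range(n):
        next_frontier = set()
        for vertex in frontier:
            for neighbor in adjacency.get(vertex, ()):
                if neighbor not in visited:
                    visited.add(neighbor)
                    next_frontier.add(neighbor)
        frontier = next_frontier
    return frontier


def shared_friends(adjacency, a, b):
    """Return the vertices that are direct neighbors of both a and b."""
    return set(adjacency.get(a, ())) & set(adjacency.get(b, ()))


# Hypothetical undirected friendship graph, stored as adjacency lists.
friends = {
    "ann": ["bob", "cat"],
    "bob": ["ann", "dan"],
    "cat": ["ann", "dan"],
    "dan": ["bob", "cat", "eve"],
    "eve": ["dan"],
}

print(n_hop_neighbors(friends, "ann", 2))   # vertices two hops from ann
print(shared_friends(friends, "ann", "dan"))
```

The frontier can grow quickly with each hop, which is why N-hop latency is a common stress test for graph databases.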
  • 11. In Performance: Tencent Cloud
      • Performance comparison conducted by the Tencent Cloud team: Nebula Graph vs. Neo4j vs. HugeGraph
      • Link to the topic on the forum: https://discuss.nebula-graph.io/t/performance-comparison-neo4j-vs-nebula-graph-vs-janusgraph/619
      • [Charts: Data import; 1-degree friends query; 2-degree friends query; Common friends query]
  • 12. In Performance: 360 Digitech
      • 360 Digitech has shared their experience migrating from JanusGraph to Nebula Graph and the huge performance gains after the migration.
      • Link to the topic on the forum: https://discuss.nebula-graph.io/t/data-migration-from-janusgraph-to-nebula-graph-practice-at-360-finance/672
      • [Charts: HBase network I/O; Nebula Graph network I/O; HBase disk I/O; Nebula Graph disk I/O]
  • 13. In Performance: 360 Digitech (Cont'd)
      • Test results from 360 Digitech
          • Nebula Graph significantly outperforms in both disk and network I/O
          • This performance was achieved using only 30% of the machine resources of the HBase cluster
          • Where JanusGraph needs 2-3 seconds per query, Nebula Graph needs about 100 ms
          • Where JanusGraph needs 10-20 seconds per query, Nebula Graph needs 2 seconds
          • Overall, Nebula Graph performs more than 20 times better than the others
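The latency pairs quoted above can be turned into speedup factors with a few lines of arithmetic. This is only a sketch over the numbers on the slide; the helper name `speedup_range` is hypothetical, and the slide does not say which workloads its overall figure covers.

```python
def speedup_range(baseline_ms, new_ms):
    """Return (low, high) speedup given a baseline latency range and a new latency."""
    low, high = baseline_ms
    return (low / new_ms, high / new_ms)


# JanusGraph 2-3 s per query vs. Nebula Graph about 100 ms.
lighter_queries = speedup_range((2000, 3000), 100)

# JanusGraph 10-20 s per query vs. Nebula Graph 2 s.
heavier_queries = speedup_range((10000, 20000), 2000)

print(lighter_queries)
print(heavier_queries)
```

From these two pairs alone, the lighter queries come out 20 to 30 times faster and the heavier ones 5 to 10 times faster, so the "more than 20 times" overall figure presumably rests on measurements beyond the two quoted here.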
  • 16. Summary
      • Nebula Graph is a VC-funded solution already adopted by some of the world's largest Internet companies
      • Nebula Graph has proven to be the world's highest-performing graph database
      • It can store and process hundreds of billions of data points with trillions of relational connections in a shared-nothing distributed architecture
      • The graph database market is expected to quadruple in size by 2026
  • 18. Thank You
      • GitHub: vesoft-inc/nebula
      • Twitter: @NebulaGraph
      • Facebook: @NebulaGraph
      • Forum: https://discuss.nebula-graph.io
      • Website: https://nebula-graph.io