SlideShare a Scribd company logo
1 of 56
Download to read offline
Artem Aliev and Russell Spitzer, DataStax
A Tale of Two Graph Frameworks
on Spark: 

GraphFrames and Tinkerpop
OLAP
#EUeco3
#EUeco3
Pierrot and Harlequin
• Artem
• Graph Analytics Expert
• Earth
• Russell
• Distributed Systems Enthusiast
• Earth
2
Tinkerpop and GraphFrames provide
Complimentary Approaches for Graph Analytics
DataSet Catalyst
GraphFrames
3#EUeco3
Graphs are Vertices and Edges
4
Vertices are things and edges represent their relations to one another
#EUeco3
Graphs are Vertices and Edges
5
Registry: USS Enterprise (NCC-1701-C)
Class: Ambassador
Service: 2332[11] – 2344 (12 Years)
Registry: USS Enterprise (NCC-1701-D)
Class: Galaxy
Service: 2363–2371 (8 Years)
Registry: USS Enterprise (NCC-1701)
Class: Constitution class[6]
Service: 2245–2285 (40 Years)
Registry: USS Enterprise (NCC-1701-A)
Class: Enterprise class[8][9]
Service: 2286–2293 (7 Years)
#EUeco3
Graphs are Vertices and Edges
6
Registry: USS Enterprise (NCC-1701-C)
Class: Ambassador
Service: 2332[11] – 2344 (12 Years)
Registry: USS Enterprise (NCC-1701-D)
Class: Galaxy
Service: 2363–2371 (8 Years)
Registry: USS Enterprise (NCC-1701)
Class: Constitution class[6]
Service: 2245–2285 (40 Years)
Registry: USS Enterprise (NCC-1701-A)
Class: Enterprise class[8][9]
Service: 2286–2293 (7 Years)
Vertex
Properties
#EUeco3
Graphs are Vertices and Edges
7
Registry: USS Enterprise (NCC-1701-C)
Class: Ambassador
Service: 2332[11] – 2344 (12 Years)
Registry: USS Enterprise (NCC-1701-D)
Class: Galaxy
Service: 2363–2371 (8 Years)
Registry: USS Enterprise (NCC-1701)
Class: Constitution class[6]
Service: 2245–2285 (40 Years)
Registry: USS Enterprise (NCC-1701-A)
Class: Enterprise class[8][9]
Service: 2286–2293 (7 Years)
succeeded by
succeeded by
succeeded by
#EUeco3
Graphs are Vertices and Edges
8
Registry: USS Enterprise (NCC-1701-C)
Class: Ambassador
Service: 2332[11] – 2344 (12 Years)
Registry: USS Enterprise (NCC-1701-D)
Class: Galaxy
Service: 2363–2371 (8 Years)
Registry: USS Enterprise (NCC-1701)
Class: Constitution class[6]
Service: 2245–2285 (40 Years)
Registry: USS Enterprise (NCC-1701-A)
Class: Enterprise class[8][9]
Service: 2286–2293 (7 Years)
Edge
Edge Labelsucceeded by
succeeded by
succeeded by
#EUeco3
Graphs are Vertices and Edges
9
Registry: USS Enterprise (NCC-1701-C)
Class: Ambassador
Service: 2332[11] – 2344 (12 Years)
Registry: USS Enterprise (NCC-1701-D)
Class: Galaxy
Service: 2363–2371 (8 Years)
Registry: USS Enterprise (NCC-1701)
Class: Constitution class[6]
Service: 2245–2285 (40 Years)
Registry: USS Enterprise (NCC-1701-A)
Class: Enterprise class[8][9]
Service: 2286–2293 (7 Years)
Ship
Ship
Ship
Ship
Vertex Label
succeeded by
succeeded by
succeeded by
#EUeco3
Graphs are Vertices and Edges
10
Registry: USS Enterprise (NCC-1701-C)
Class: Ambassador
Service: 2332[11] – 2344 (12 Years)
Registry: USS Enterprise (NCC-1701-D)
Class: Galaxy
Service: 2363–2371 (8 Years)
Registry: USS Enterprise (NCC-1701)
Class: Constitution class
Service: 2245–2285 (40 Years)
Ship
Ship
Ship
Ship
Position: Captain

Name: Kirk
Position: Captain

Name: Picard
Crew
Crew
succeeded by
succeeded by
succeeded by
#EUeco3
Graphs are Vertices and Edges
11
Registry: USS Enterprise (NCC-1701-C)
Class: Ambassador
Service: 2332[11] – 2344 (12 Years)
Registry: USS Enterprise (NCC-1701-D)
Class: Galaxy
Service: 2363–2371 (8 Years)
Registry: USS Enterprise (NCC-1701)
Class: Constitution class
Service: 2245–2285 (40 Years)
Registry: USS Enterprise (NCC-1701-A)
Class: Enterprise class
Service: 2286–2293 (7 Years)
Ship
Ship
Ship
Ship
Position: Captain

Name: Kirk
Position: Captain

Name: Picard
Crew
Crew
succeeded by
succeeded by
succeeded by
served on
served on
served on
served on
#EUeco3
Graphs are Vertices and Edges
12
Registry: USS Enterprise (NCC-1701-C)
Class: Ambassador
Service: 2332[11] – 2344 (12 Years)
Registry: USS Enterprise (NCC-1701-D)
Class: Galaxy
Service: 2363–2371 (8 Years)
Registry: USS Enterprise (NCC-1701)
Class: Constitution class
Service: 2245–2285 (40 Years)
Registry: USS Enterprise (NCC-1701-A)
Class: Enterprise class
Service: 2286–2293 (7 Years)
Ship
Ship
Ship
Ship
Position: Captain

Name: Kirk
Position: Captain

Name: Picard
Crew
Crew
succeeded by
succeeded by
succeeded by
served on
served on
served on
served on
But why do I
want this?
#EUeco3
Graphs let us ask questions about our data based
on their relations
13
What Captain Served After Kirk?
What Ship was two after the
NCC-1701?
#EUeco3
Traversals involve following paths through the
Graph
14
Registry: USS Enterprise (NCC-1701-C)
Class: Ambassador
Service: 2332[11] – 2344 (12 Years)
Registry: USS Enterprise (NCC-1701-D)
Class: Galaxy
Service: 2363–2371 (8 Years)
Registry: USS Enterprise (NCC-1701)
Class: Constitution class
Service: 2245–2285 (40 Years)
Registry: USS Enterprise (NCC-1701-A)
Class: Enterprise class
Service: 2286–2293 (7 Years)
Ship
Ship
Ship
Ship
Position: Captain

Name: Kirk
Position: Captain

Name: Picard
Crew
Crew
succeeded by
succeeded by
succeeded by
served on
served on
served on
served on
#EUeco3
What Captain was After Kirk?
15
Registry: USS Enterprise (NCC-1701-C)
Class: Ambassador
Service: 2332[11] – 2344 (12 Years)
Registry: USS Enterprise (NCC-1701-A)
Class: Enterprise class
Service: 2286–2293 (7 Years)
Ship
Ship
Position: Captain

Name: Kirk
Position: Captain

Name: Picard
Crew
Crewsucceeded by
served on
served on
#EUeco3
What Ship was two after the NCC-1701?
16
Registry: USS Enterprise (NCC-1701-C)
Class: Ambassador
Service: 2332[11] – 2344 (12 Years)
Registry: USS Enterprise (NCC-1701)
Class: Constitution class
Service: 2245–2285 (40 Years)
Registry: USS Enterprise (NCC-1701-A)
Class: Enterprise class
Service: 2286–2293 (7 Years)
Ship
Ship
Ship
succeeded by
succeeded by
#EUeco3
Tinkerpop is a Powerful and Flexible Graph
Framework
• Server, Language, Connectors
• Graph Framework for 

OLAP and OLTP
• Node Centric Representations
• Fluent API (Gremlin)
• Fully Self Contained Framework
17#EUeco3
OLTP Examples
18#EUeco3 18
Movie Lens
Example
Schema
19
https://grouplens.org/datasets/movielens/
#EUeco3 19
20
#EUeco3
What happens when you have too much data?
21
#EUeco3
Tinkerpop Spark OLAP Mechanism
• Instead of one traversal we traverse starting from all nodes simultaneously
22
Distribution Requires Partitioning
23
?
Big Data
Independent Chunks
of Data#EUeco3
#EUeco3
Vertex Stored in a PairRDD
Id -> StarVertex(Edge and Property Information)
24
1
A
C
D
Star Vertex: Adjacency list representation

1: "A", "Kirk"

A: "C", "Kirk"

C: "D", "Picard"

D: "Picard"
 Just Id 

Of Connected 

Vertex
#EUeco3
Vertex Program Runs Initializing Traverser for
every Vertex
25
1
A
C
D
SparkMemory - Accumulator - Used for GlobalState
#EUeco3
Then we cycle through a message Passing
Algorithm
26
1
A
C
D
1
A
C
D
1
A
C
D
SparkMemory - Accumulator - Used for GlobalState
#EUeco3
Then we cycle through a message Passing
Algorithm
27
1
A
C
D
1
A
C
D
1
A
C
D
SparkMemory - Accumulator - Used for GlobalState
Passes messages from one Vertex to another with a join
#EUeco3
Then we cycle through a message Passing
Algorithm
28
1
A
C
D
1
A
C
D
1
A
C
D
SparkMemory - Accumulator - Used for GlobalState
Repeat
#EUeco3
Then we cycle through a message Passing
Algorithm
29
1
A
C
D
1
A
C
D
1
A
C
D
SparkMemory - Accumulator - Used for GlobalState
All Traversers Halt

Or
Program Terminates
Result!
#EUeco3
Example OLAP Traversals
30
#EUeco3
Tinkerpop Spark OLAP Pros/Cons
Pros
• Every message pass requires only a single shuffle
• Edges and edge properties accessible without a step
• Very Flexible, Many Provider Specific Shortcuts possible
• Internal properties can be any Java type
• All in one, Server already ready for multiple clients
Cons
• Limited in ability to connect to external sources/other spark applications
• Flexibility of framework allows for many platform specific shortcuts to be added
• Genericness provides difficulty in making some optimizations
• Edges co-partitioned with vertices, high degree nodes can cause memory issues
31
#EUeco3
GraphFrames Background
• Third Party Package
• https://graphframes.github.io/
• Integrates with Dataset/Dataframe in Spark
• Relational under the hood
32
#EUeco3
GraphFrames are built of two DataFrames
33
Row
Column
#EUeco3
GraphFrames are built of two DataFrames
34
id job species
Geordi Chief
Engineer
Human
Data Science
Officer
Android
Vertex DataFrame
src dst relationship
Geordi Data Friend
Edge DataFrame
Friend
#EUeco3
GraphFrames are built of two DataFrames
35
id job species
Geordi Chief
Engineer
Human
Data Science
Officer
Android
Vertex DataFrame
src dst relationship
Geordi Data Friend
Edge DataFrame
Friend
Can Only Be Spark Types
#EUeco3
GraphFrames are built of two DataFrames
36
id job species
Geordi Chief
Engineer
Human
Data Science
Officer
Android
Vertex DataFrame
src dst relationship
Geordi Data Friend
Edge DataFrame
Friend
No Built in Labels
#EUeco3
Catalyst Optimizes any Requests
• Simple requests using DataFrame api don't do
anything special
• Some methods fall back to GraphX (RDD Based)
• Others use pure DataFrame methods
37
#EUeco3
GraphFrames Motif Matching
38
GraphFrame
(a)-[e]->(b)
V E
#EUeco3
GraphFrames Motif Matching
39
GraphFrame
(a)-[e]->(b)
Vertex (a) Vertices as a UDT "A"V E
A: <VertexRow>
#EUeco3
GraphFrames Motif Matching
40
GraphFrame
(a)-[e]->(b)
Vertex (a) Vertices as a UDT "A"
Edge [b] 

Edges as UDT "E"

Join with edges
where A.id = E.src
V E
A: <VertexRow>
Join
A: <VertexRow>,
E: <EdgeRow>
#EUeco3
GraphFrames Motif Matching
41
GraphFrame
(a)-[e]->(b)
Vertex (a) Vertices as a UDT "A"
[e]
Vertices as UDT "B"
Join with edges where
E.dst = B.id
Edge
Vertex
[b] 

Edges as UDT "E"

Join with edges
where A.id = E.src
V E
A: <VertexRow>
A: <VertexRow>,
E: <EdgeRow>
Join
JoinA: <VertexRow>,
E: <EdgeRow>,
B: <VertexRow>
#EUeco3
GraphFrames Motif Matching
42
GraphFrame
(a)-[e]->(b)
Vertex (a) Vertices as a UDT "A"
[e]
Vertices as UDT "B"
Join with edges where
E.dst = B.id
Edge
Vertex
[b] 

Edges as UDT "E"

Join with edges
where A.id = E.src
V E
A: <VertexRow>
A: <VertexRow>,
E: <EdgeRow>
Join
JoinA: <VertexRow>,
E: <EdgeRow>,
B: <VertexRow>
THAT'S SO
MANY JOINS
#EUeco3 43
Vertex
Edge
Vertex
A: <VertexRow>
A: <VertexRow>,
E: <EdgeRow>
A: <VertexRow>,
E: <EdgeRow>,
B: <VertexRow>
DataFrames means Optimizations are Automatic
#EUeco3 44
Vertex
Edge
Vertex
A: <VertexRow>
A: <VertexRow>,
E: <EdgeRow>
A: <VertexRow>,
E: <EdgeRow>,
B: <VertexRow>
Select A.ID
Columns Pruned and Predicates Pushed
45
Vertex
Edge
Vertex
A: <VertexRow>
A: <VertexRow>,
E: <EdgeRow>
A: <VertexRow>,
E: <EdgeRow>,
B: <VertexRow>
Select A.ID
Columns Pruned and Predicates Pushed
#EUeco3
46
Vertex
Edge
Vertex
A: <VertexRow>
A: <VertexRow>,
E: <EdgeRow>
A: <VertexRow>,
E: <EdgeRow>,
B: <VertexRow>
Select A.ID
Columns Pruned and Predicates Pushed
#EUeco3
47
Vertex
Edge
Vertex
A: <VertexRow>
A: <VertexRow>,
E: <EdgeRow>
A: <VertexRow>,
E: <EdgeRow>,
B: <VertexRow>
Select A.ID
Columns Pruned and Predicates Pushed
#EUeco3
#EUeco3
All of the normal optimizations happen within this
FrameWork
48
Vertex
Edge
Vertex
A: <VertexRow>
A: <VertexRow>,
E: <EdgeRow>
A: <VertexRow>,
E: <EdgeRow>,
B: <VertexRow>
Broadcast?
Broadcast?
#EUeco3
Code Generation and Internal Rows
49
Vertex
Edge
Vertex
A: <VertexRow>
A: <VertexRow>,
E: <EdgeRow>
A: <VertexRow>,
E: <EdgeRow>,
B: <VertexRow>
Code
Generation
Code
Generation
Code
Generation
Code
Generation
Code
Generation
#EUeco3
GraphFrames Examples
50
#EUeco3
GraphFrame Pros Cons
Pros
• Much Faster on basic counts
• Powerful optimizations + CodeGen
• Easy to connect to other sources


Cons
• Slower on complex traversals (2 Joins per hop)
• Relational Model not as Flexible
51
#EUeco3
Choosing the Right Framework
52
Choose TinkerPop OLAP For Long Paths
• More complicated queries
• Traversals that require many hops
• g.V().out.out.out.out 

• Avoid for simple counts and aggregations
• Avoid if you have very high degree Vertices
53#EUeco3
Choose GraphFrames for Interoperability and
Short Paths
• General Edge/Vertex stats groupCount, min, max
• Connecting to other sources
• Short paths
• High Degree Vertices
• Avoid
• Long path algorithms
54#EUeco3
#EUeco3
Choosing the Right Framework
55
Gremlin on

Graphframes
OLTP backed
by DSE Graph
Built in Spark
We write it!
Search Built In!
Advanced
Security
#EUeco3
Thanks for Listening
56
Datastax Academy Graph Course
https://academy.datastax.com/resources/ds330-datastax-enterprise-graph

Try out Datastax Enterprise!
https://academy.datastax.com/quick-downloads



Apache Tinkerpop

http://tinkerpop.apache.org/


GraphFrames Link
https://graphframes.github.io/

More Related Content

What's hot

A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...Databricks
 
The Apache Spark File Format Ecosystem
The Apache Spark File Format EcosystemThe Apache Spark File Format Ecosystem
The Apache Spark File Format EcosystemDatabricks
 
Graph processing - Pregel
Graph processing - PregelGraph processing - Pregel
Graph processing - PregelAmir Payberah
 
Cql – cassandra query language
Cql – cassandra query languageCql – cassandra query language
Cql – cassandra query languageCourtney Robinson
 
Full Page Writes in PostgreSQL PGCONFEU 2022
Full Page Writes in PostgreSQL PGCONFEU 2022Full Page Writes in PostgreSQL PGCONFEU 2022
Full Page Writes in PostgreSQL PGCONFEU 2022Grant McAlister
 
Data Modeling & Metadata for Graph Databases
Data Modeling & Metadata for Graph DatabasesData Modeling & Metadata for Graph Databases
Data Modeling & Metadata for Graph DatabasesDATAVERSITY
 
201804 neo4 j_cypher_guide
201804 neo4 j_cypher_guide201804 neo4 j_cypher_guide
201804 neo4 j_cypher_guideJunyi Song
 
The Graph Database Universe: Neo4j Overview
The Graph Database Universe: Neo4j OverviewThe Graph Database Universe: Neo4j Overview
The Graph Database Universe: Neo4j OverviewNeo4j
 
The Top Five Mistakes Made When Writing Streaming Applications with Mark Grov...
The Top Five Mistakes Made When Writing Streaming Applications with Mark Grov...The Top Five Mistakes Made When Writing Streaming Applications with Mark Grov...
The Top Five Mistakes Made When Writing Streaming Applications with Mark Grov...Databricks
 
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...Flink Forward
 
Spark rdd vs data frame vs dataset
Spark rdd vs data frame vs datasetSpark rdd vs data frame vs dataset
Spark rdd vs data frame vs datasetAnkit Beohar
 
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureServerless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureKai Wähner
 
CockroachDB: Architecture of a Geo-Distributed SQL Database
CockroachDB: Architecture of a Geo-Distributed SQL DatabaseCockroachDB: Architecture of a Geo-Distributed SQL Database
CockroachDB: Architecture of a Geo-Distributed SQL DatabaseC4Media
 
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceGet Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceNeo4j
 
Powers of Ten Redux
Powers of Ten ReduxPowers of Ten Redux
Powers of Ten ReduxJason Plurad
 
How the Neanex digital twin solution delivers on both speed and scale to the ...
How the Neanex digital twin solution delivers on both speed and scale to the ...How the Neanex digital twin solution delivers on both speed and scale to the ...
How the Neanex digital twin solution delivers on both speed and scale to the ...Neo4j
 
Apache Sparkにおけるメモリ - アプリケーションを落とさないメモリ設計手法 -
Apache Sparkにおけるメモリ - アプリケーションを落とさないメモリ設計手法 -Apache Sparkにおけるメモリ - アプリケーションを落とさないメモリ設計手法 -
Apache Sparkにおけるメモリ - アプリケーションを落とさないメモリ設計手法 -Yoshiyasu SAEKI
 
The DDS Tutorial Part II
The DDS Tutorial Part IIThe DDS Tutorial Part II
The DDS Tutorial Part IIAngelo Corsaro
 
Introduction to Cypher
Introduction to Cypher Introduction to Cypher
Introduction to Cypher Neo4j
 

What's hot (20)

A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules ...
 
The Apache Spark File Format Ecosystem
The Apache Spark File Format EcosystemThe Apache Spark File Format Ecosystem
The Apache Spark File Format Ecosystem
 
Graph processing - Pregel
Graph processing - PregelGraph processing - Pregel
Graph processing - Pregel
 
Cql – cassandra query language
Cql – cassandra query languageCql – cassandra query language
Cql – cassandra query language
 
Full Page Writes in PostgreSQL PGCONFEU 2022
Full Page Writes in PostgreSQL PGCONFEU 2022Full Page Writes in PostgreSQL PGCONFEU 2022
Full Page Writes in PostgreSQL PGCONFEU 2022
 
Data Modeling & Metadata for Graph Databases
Data Modeling & Metadata for Graph DatabasesData Modeling & Metadata for Graph Databases
Data Modeling & Metadata for Graph Databases
 
201804 neo4 j_cypher_guide
201804 neo4 j_cypher_guide201804 neo4 j_cypher_guide
201804 neo4 j_cypher_guide
 
The Graph Database Universe: Neo4j Overview
The Graph Database Universe: Neo4j OverviewThe Graph Database Universe: Neo4j Overview
The Graph Database Universe: Neo4j Overview
 
The Top Five Mistakes Made When Writing Streaming Applications with Mark Grov...
The Top Five Mistakes Made When Writing Streaming Applications with Mark Grov...The Top Five Mistakes Made When Writing Streaming Applications with Mark Grov...
The Top Five Mistakes Made When Writing Streaming Applications with Mark Grov...
 
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
 
Spark rdd vs data frame vs dataset
Spark rdd vs data frame vs datasetSpark rdd vs data frame vs dataset
Spark rdd vs data frame vs dataset
 
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse ArchitectureServerless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
 
CockroachDB: Architecture of a Geo-Distributed SQL Database
CockroachDB: Architecture of a Geo-Distributed SQL DatabaseCockroachDB: Architecture of a Geo-Distributed SQL Database
CockroachDB: Architecture of a Geo-Distributed SQL Database
 
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data ScienceGet Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
 
Powers of Ten Redux
Powers of Ten ReduxPowers of Ten Redux
Powers of Ten Redux
 
How the Neanex digital twin solution delivers on both speed and scale to the ...
How the Neanex digital twin solution delivers on both speed and scale to the ...How the Neanex digital twin solution delivers on both speed and scale to the ...
How the Neanex digital twin solution delivers on both speed and scale to the ...
 
Apache Sparkにおけるメモリ - アプリケーションを落とさないメモリ設計手法 -
Apache Sparkにおけるメモリ - アプリケーションを落とさないメモリ設計手法 -Apache Sparkにおけるメモリ - アプリケーションを落とさないメモリ設計手法 -
Apache Sparkにおけるメモリ - アプリケーションを落とさないメモリ設計手法 -
 
The DDS Tutorial Part II
The DDS Tutorial Part IIThe DDS Tutorial Part II
The DDS Tutorial Part II
 
Scala and spark
Scala and sparkScala and spark
Scala and spark
 
Introduction to Cypher
Introduction to Cypher Introduction to Cypher
Introduction to Cypher
 

Viewers also liked

How to Share State Across Multiple Apache Spark Jobs using Apache Ignite with...
How to Share State Across Multiple Apache Spark Jobs using Apache Ignite with...How to Share State Across Multiple Apache Spark Jobs using Apache Ignite with...
How to Share State Across Multiple Apache Spark Jobs using Apache Ignite with...Spark Summit
 
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...Spark Summit
 
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data  with Ramya RaghavendraImproving Traffic Prediction Using Weather Data  with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data with Ramya RaghavendraSpark Summit
 
Indicium: Interactive Querying at Scale Using Apache Spark, Zeppelin, and Spa...
Indicium: Interactive Querying at Scale Using Apache Spark, Zeppelin, and Spa...Indicium: Interactive Querying at Scale Using Apache Spark, Zeppelin, and Spa...
Indicium: Interactive Querying at Scale Using Apache Spark, Zeppelin, and Spa...Spark Summit
 
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...Spark Summit
 
Histogram Equalized Heat Maps from Log Data via Apache Spark with Arvind Rao
Histogram Equalized Heat Maps from Log Data via Apache Spark with Arvind RaoHistogram Equalized Heat Maps from Log Data via Apache Spark with Arvind Rao
Histogram Equalized Heat Maps from Log Data via Apache Spark with Arvind RaoSpark Summit
 
Natural Language Understanding at Scale with Spark-Native NLP, Spark ML, and ...
Natural Language Understanding at Scale with Spark-Native NLP, Spark ML, and ...Natural Language Understanding at Scale with Spark-Native NLP, Spark ML, and ...
Natural Language Understanding at Scale with Spark-Native NLP, Spark ML, and ...Spark Summit
 
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...Spark Summit
 
Apache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingApache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingSpark Summit
 
Storage Engine Considerations for Your Apache Spark Applications with Mladen ...
Storage Engine Considerations for Your Apache Spark Applications with Mladen ...Storage Engine Considerations for Your Apache Spark Applications with Mladen ...
Storage Engine Considerations for Your Apache Spark Applications with Mladen ...Spark Summit
 
Accelerating Shuffle: A Tailor-Made RDMA Solution for Apache Spark with Yuval...
Accelerating Shuffle: A Tailor-Made RDMA Solution for Apache Spark with Yuval...Accelerating Shuffle: A Tailor-Made RDMA Solution for Apache Spark with Yuval...
Accelerating Shuffle: A Tailor-Made RDMA Solution for Apache Spark with Yuval...Spark Summit
 
Building Machine Learning Algorithms on Apache Spark with William Benton
Building Machine Learning Algorithms on Apache Spark with William BentonBuilding Machine Learning Algorithms on Apache Spark with William Benton
Building Machine Learning Algorithms on Apache Spark with William BentonSpark Summit
 
Feature Hashing for Scalable Machine Learning with Nick Pentreath
Feature Hashing for Scalable Machine Learning with Nick PentreathFeature Hashing for Scalable Machine Learning with Nick Pentreath
Feature Hashing for Scalable Machine Learning with Nick PentreathSpark Summit
 
Low Touch Machine Learning with Leah McGuire (Salesforce)
Low Touch Machine Learning with Leah McGuire (Salesforce)Low Touch Machine Learning with Leah McGuire (Salesforce)
Low Touch Machine Learning with Leah McGuire (Salesforce)Spark Summit
 
Experimental Design for Distributed Machine Learning with Myles Baker
Experimental Design for Distributed Machine Learning with Myles BakerExperimental Design for Distributed Machine Learning with Myles Baker
Experimental Design for Distributed Machine Learning with Myles BakerDatabricks
 
Art of Feature Engineering for Data Science with Nabeel Sarwar
Art of Feature Engineering for Data Science with Nabeel SarwarArt of Feature Engineering for Data Science with Nabeel Sarwar
Art of Feature Engineering for Data Science with Nabeel SarwarSpark Summit
 
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim HunterDeep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim HunterDatabricks
 
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang Spark Summit
 

Viewers also liked (18)

How to Share State Across Multiple Apache Spark Jobs using Apache Ignite with...
How to Share State Across Multiple Apache Spark Jobs using Apache Ignite with...How to Share State Across Multiple Apache Spark Jobs using Apache Ignite with...
How to Share State Across Multiple Apache Spark Jobs using Apache Ignite with...
 
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
 
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data  with Ramya RaghavendraImproving Traffic Prediction Using Weather Data  with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
 
Indicium: Interactive Querying at Scale Using Apache Spark, Zeppelin, and Spa...
Indicium: Interactive Querying at Scale Using Apache Spark, Zeppelin, and Spa...Indicium: Interactive Querying at Scale Using Apache Spark, Zeppelin, and Spa...
Indicium: Interactive Querying at Scale Using Apache Spark, Zeppelin, and Spa...
 
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
 
Histogram Equalized Heat Maps from Log Data via Apache Spark with Arvind Rao
Histogram Equalized Heat Maps from Log Data via Apache Spark with Arvind RaoHistogram Equalized Heat Maps from Log Data via Apache Spark with Arvind Rao
Histogram Equalized Heat Maps from Log Data via Apache Spark with Arvind Rao
 
Natural Language Understanding at Scale with Spark-Native NLP, Spark ML, and ...
Natural Language Understanding at Scale with Spark-Native NLP, Spark ML, and ...Natural Language Understanding at Scale with Spark-Native NLP, Spark ML, and ...
Natural Language Understanding at Scale with Spark-Native NLP, Spark ML, and ...
 
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
 
Apache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingApache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim Dowling
 
Storage Engine Considerations for Your Apache Spark Applications with Mladen ...
Storage Engine Considerations for Your Apache Spark Applications with Mladen ...Storage Engine Considerations for Your Apache Spark Applications with Mladen ...
Storage Engine Considerations for Your Apache Spark Applications with Mladen ...
 
Accelerating Shuffle: A Tailor-Made RDMA Solution for Apache Spark with Yuval...
Accelerating Shuffle: A Tailor-Made RDMA Solution for Apache Spark with Yuval...Accelerating Shuffle: A Tailor-Made RDMA Solution for Apache Spark with Yuval...
Accelerating Shuffle: A Tailor-Made RDMA Solution for Apache Spark with Yuval...
 
Building Machine Learning Algorithms on Apache Spark with William Benton
Building Machine Learning Algorithms on Apache Spark with William BentonBuilding Machine Learning Algorithms on Apache Spark with William Benton
Building Machine Learning Algorithms on Apache Spark with William Benton
 
Feature Hashing for Scalable Machine Learning with Nick Pentreath
Feature Hashing for Scalable Machine Learning with Nick PentreathFeature Hashing for Scalable Machine Learning with Nick Pentreath
Feature Hashing for Scalable Machine Learning with Nick Pentreath
 
Low Touch Machine Learning with Leah McGuire (Salesforce)
Low Touch Machine Learning with Leah McGuire (Salesforce)Low Touch Machine Learning with Leah McGuire (Salesforce)
Low Touch Machine Learning with Leah McGuire (Salesforce)
 
Experimental Design for Distributed Machine Learning with Myles Baker
Experimental Design for Distributed Machine Learning with Myles BakerExperimental Design for Distributed Machine Learning with Myles Baker
Experimental Design for Distributed Machine Learning with Myles Baker
 
Art of Feature Engineering for Data Science with Nabeel Sarwar
Art of Feature Engineering for Data Science with Nabeel SarwarArt of Feature Engineering for Data Science with Nabeel Sarwar
Art of Feature Engineering for Data Science with Nabeel Sarwar
 
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim HunterDeep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
 
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
 

Similar to A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem Aliev and Russell Spitzer

Web-Scale Graph Analytics with Apache Spark with Tim Hunter
Web-Scale Graph Analytics with Apache Spark with Tim HunterWeb-Scale Graph Analytics with Apache Spark with Tim Hunter
Web-Scale Graph Analytics with Apache Spark with Tim HunterDatabricks
 
High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingNesreen K. Ahmed
 
2013 Hello GCC:The Theory, History and Future of System Linkers
2013 Hello GCC:The Theory, History and Future of System Linkers2013 Hello GCC:The Theory, History and Future of System Linkers
2013 Hello GCC:The Theory, History and Future of System LinkersChing-Yi Chen
 
Lisa Kaplan Resume-15-03
Lisa Kaplan Resume-15-03Lisa Kaplan Resume-15-03
Lisa Kaplan Resume-15-03Lisa Kaplan
 
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14Yuichiro Yasui
 
Playtrip: a CQRS/ES architecture in Erlang.
Playtrip: a CQRS/ES architecture in Erlang.Playtrip: a CQRS/ES architecture in Erlang.
Playtrip: a CQRS/ES architecture in Erlang.Nicola Fiorillo
 
Building a Business Logic Translation Engine with Spark Streaming for Communi...
Building a Business Logic Translation Engine with Spark Streaming for Communi...Building a Business Logic Translation Engine with Spark Streaming for Communi...
Building a Business Logic Translation Engine with Spark Streaming for Communi...Spark Summit
 
Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™Databricks
 
Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™Databricks
 
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...MDC_UNICA
 
SF Big Analytics_20190612: Scaling Apache Spark on Kubernetes at Lyft
SF Big Analytics_20190612: Scaling Apache Spark on Kubernetes at LyftSF Big Analytics_20190612: Scaling Apache Spark on Kubernetes at Lyft
SF Big Analytics_20190612: Scaling Apache Spark on Kubernetes at LyftChester Chen
 
Big Data Redis Mongodb Dynamodb Sharding
Big Data Redis Mongodb Dynamodb ShardingBig Data Redis Mongodb Dynamodb Sharding
Big Data Redis Mongodb Dynamodb ShardingAraf Karsh Hamid
 
Project StarGate An End-to-End 10Gbps HPC to User Cyberinfrastructure ANL * C...
Project StarGate An End-to-End 10Gbps HPC to User Cyberinfrastructure ANL * C...Project StarGate An End-to-End 10Gbps HPC to User Cyberinfrastructure ANL * C...
Project StarGate An End-to-End 10Gbps HPC to User Cyberinfrastructure ANL * C...Larry Smarr
 
OrientDB - the 2nd generation of (Multi-Model) NoSQL - J On The Beach 2016
OrientDB - the 2nd generation of (Multi-Model) NoSQL  - J On The Beach 2016OrientDB - the 2nd generation of (Multi-Model) NoSQL  - J On The Beach 2016
OrientDB - the 2nd generation of (Multi-Model) NoSQL - J On The Beach 2016Luigi Dell'Aquila
 
Cyclone DDS Unleashed: ROS & Cyclone DDS.pdf
Cyclone DDS Unleashed: ROS & Cyclone DDS.pdfCyclone DDS Unleashed: ROS & Cyclone DDS.pdf
Cyclone DDS Unleashed: ROS & Cyclone DDS.pdfZettaScaleTechnology
 
Challenging Web-Scale Graph Analytics with Apache Spark with Xiangrui Meng
Challenging Web-Scale Graph Analytics with Apache Spark with Xiangrui MengChallenging Web-Scale Graph Analytics with Apache Spark with Xiangrui Meng
Challenging Web-Scale Graph Analytics with Apache Spark with Xiangrui MengDatabricks
 
Challenging Web-Scale Graph Analytics with Apache Spark
Challenging Web-Scale Graph Analytics with Apache SparkChallenging Web-Scale Graph Analytics with Apache Spark
Challenging Web-Scale Graph Analytics with Apache SparkDatabricks
 
2018 GIS in Development: FOSS4G in the Government (Proof of Concept)
2018 GIS in Development: FOSS4G in the Government (Proof of Concept)2018 GIS in Development: FOSS4G in the Government (Proof of Concept)
2018 GIS in Development: FOSS4G in the Government (Proof of Concept)GIS in the Rockies
 

Similar to A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem Aliev and Russell Spitzer (20)

Web-Scale Graph Analytics with Apache Spark with Tim Hunter
Web-Scale Graph Analytics with Apache Spark with Tim HunterWeb-Scale Graph Analytics with Apache Spark with Tim Hunter
Web-Scale Graph Analytics with Apache Spark with Tim Hunter
 
High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and Modeling
 
2013 Hello GCC:The Theory, History and Future of System Linkers
2013 Hello GCC:The Theory, History and Future of System Linkers2013 Hello GCC:The Theory, History and Future of System Linkers
2013 Hello GCC:The Theory, History and Future of System Linkers
 
Lisa Kaplan Resume-15-03
Lisa Kaplan Resume-15-03Lisa Kaplan Resume-15-03
Lisa Kaplan Resume-15-03
 
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14
 
Sema History and Overview
Sema History and OverviewSema History and Overview
Sema History and Overview
 
Playtrip: a CQRS/ES architecture in Erlang.
Playtrip: a CQRS/ES architecture in Erlang.Playtrip: a CQRS/ES architecture in Erlang.
Playtrip: a CQRS/ES architecture in Erlang.
 
Building a Business Logic Translation Engine with Spark Streaming for Communi...
Building a Business Logic Translation Engine with Spark Streaming for Communi...Building a Business Logic Translation Engine with Spark Streaming for Communi...
Building a Business Logic Translation Engine with Spark Streaming for Communi...
 
Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™
 
Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™
 
Resume
ResumeResume
Resume
 
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...
Automated Design Flow for Coarse-Grained Reconfigurable Platforms: an RVC-CAL...
 
SF Big Analytics_20190612: Scaling Apache Spark on Kubernetes at Lyft
SF Big Analytics_20190612: Scaling Apache Spark on Kubernetes at LyftSF Big Analytics_20190612: Scaling Apache Spark on Kubernetes at Lyft
SF Big Analytics_20190612: Scaling Apache Spark on Kubernetes at Lyft
 
Big Data Redis Mongodb Dynamodb Sharding
Big Data Redis Mongodb Dynamodb ShardingBig Data Redis Mongodb Dynamodb Sharding
Big Data Redis Mongodb Dynamodb Sharding
 
Project StarGate An End-to-End 10Gbps HPC to User Cyberinfrastructure ANL * C...
Project StarGate An End-to-End 10Gbps HPC to User Cyberinfrastructure ANL * C...Project StarGate An End-to-End 10Gbps HPC to User Cyberinfrastructure ANL * C...
Project StarGate An End-to-End 10Gbps HPC to User Cyberinfrastructure ANL * C...
 
OrientDB - the 2nd generation of (Multi-Model) NoSQL - J On The Beach 2016
OrientDB - the 2nd generation of (Multi-Model) NoSQL  - J On The Beach 2016OrientDB - the 2nd generation of (Multi-Model) NoSQL  - J On The Beach 2016
OrientDB - the 2nd generation of (Multi-Model) NoSQL - J On The Beach 2016
 
Cyclone DDS Unleashed: ROS & Cyclone DDS.pdf
Cyclone DDS Unleashed: ROS & Cyclone DDS.pdfCyclone DDS Unleashed: ROS & Cyclone DDS.pdf
Cyclone DDS Unleashed: ROS & Cyclone DDS.pdf
 
Challenging Web-Scale Graph Analytics with Apache Spark with Xiangrui Meng
Challenging Web-Scale Graph Analytics with Apache Spark with Xiangrui MengChallenging Web-Scale Graph Analytics with Apache Spark with Xiangrui Meng
Challenging Web-Scale Graph Analytics with Apache Spark with Xiangrui Meng
 
Challenging Web-Scale Graph Analytics with Apache Spark
Challenging Web-Scale Graph Analytics with Apache SparkChallenging Web-Scale Graph Analytics with Apache Spark
Challenging Web-Scale Graph Analytics with Apache Spark
 
2018 GIS in Development: FOSS4G in the Government (Proof of Concept)
2018 GIS in Development: FOSS4G in the Government (Proof of Concept)2018 GIS in Development: FOSS4G in the Government (Proof of Concept)
2018 GIS in Development: FOSS4G in the Government (Proof of Concept)
 

More from Spark Summit

VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...Spark Summit
 
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang Wu
Apache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang WuApache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang Wu
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang WuSpark Summit
 
Apache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingApache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingSpark Summit
 
Next CERN Accelerator Logging Service with Jakub Wozniak
Next CERN Accelerator Logging Service with Jakub WozniakNext CERN Accelerator Logging Service with Jakub Wozniak
Next CERN Accelerator Logging Service with Jakub WozniakSpark Summit
 
Powering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin KimPowering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin KimSpark Summit
 
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya RaghavendraImproving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya RaghavendraSpark Summit
 
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...Spark Summit
 
How Nielsen Utilized Databricks for Large-Scale Research and Development with...
How Nielsen Utilized Databricks for Large-Scale Research and Development with...How Nielsen Utilized Databricks for Large-Scale Research and Development with...
How Nielsen Utilized Databricks for Large-Scale Research and Development with...Spark Summit
 
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...Spark Summit
 
Goal Based Data Production with Sim Simeonov
Goal Based Data Production with Sim SimeonovGoal Based Data Production with Sim Simeonov
Goal Based Data Production with Sim SimeonovSpark Summit
 
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...Spark Summit
 
Getting Ready to Use Redis with Apache Spark with Dvir Volk
Getting Ready to Use Redis with Apache Spark with Dvir VolkGetting Ready to Use Redis with Apache Spark with Dvir Volk
Getting Ready to Use Redis with Apache Spark with Dvir VolkSpark Summit
 
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...Spark Summit
 
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...Spark Summit
 
Apache Spark-Bench: Simulate, Test, Compare, Exercise, and Yes, Benchmark wit...
Apache Spark-Bench: Simulate, Test, Compare, Exercise, and Yes, Benchmark wit...Apache Spark-Bench: Simulate, Test, Compare, Exercise, and Yes, Benchmark wit...
Apache Spark-Bench: Simulate, Test, Compare, Exercise, and Yes, Benchmark wit...Spark Summit
 
Variant-Apache Spark for Bioinformatics with Piotr Szul
Variant-Apache Spark for Bioinformatics with Piotr SzulVariant-Apache Spark for Bioinformatics with Piotr Szul
Variant-Apache Spark for Bioinformatics with Piotr SzulSpark Summit
 
Running Spark Inside Containers with Haohai Ma and Khalid Ahmed
Running Spark Inside Containers with Haohai Ma and Khalid Ahmed Running Spark Inside Containers with Haohai Ma and Khalid Ahmed
Running Spark Inside Containers with Haohai Ma and Khalid Ahmed Spark Summit
 
Best Practices for Using Alluxio with Apache Spark with Gene Pang
Best Practices for Using Alluxio with Apache Spark with Gene PangBest Practices for Using Alluxio with Apache Spark with Gene Pang
Best Practices for Using Alluxio with Apache Spark with Gene PangSpark Summit
 
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg Schad
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg SchadSmack Stack and Beyond—Building Fast Data Pipelines with Jorg Schad
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg SchadSpark Summit
 
Hardware Acceleration of Apache Spark on Energy-Efficient FPGAs with Christof...
Hardware Acceleration of Apache Spark on Energy-Efficient FPGAs with Christof...Hardware Acceleration of Apache Spark on Energy-Efficient FPGAs with Christof...
Hardware Acceleration of Apache Spark on Energy-Efficient FPGAs with Christof...Spark Summit
 

More from Spark Summit (20)

VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
 
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang Wu
Apache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang WuApache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang Wu
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang Wu
 
Apache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim DowlingApache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim Dowling
 
Next CERN Accelerator Logging Service with Jakub Wozniak
Next CERN Accelerator Logging Service with Jakub WozniakNext CERN Accelerator Logging Service with Jakub Wozniak
Next CERN Accelerator Logging Service with Jakub Wozniak
 
Powering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin KimPowering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin Kim
 
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya RaghavendraImproving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
 
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
 
How Nielsen Utilized Databricks for Large-Scale Research and Development with...
How Nielsen Utilized Databricks for Large-Scale Research and Development with...How Nielsen Utilized Databricks for Large-Scale Research and Development with...
How Nielsen Utilized Databricks for Large-Scale Research and Development with...
 
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
 
Goal Based Data Production with Sim Simeonov
Goal Based Data Production with Sim SimeonovGoal Based Data Production with Sim Simeonov
Goal Based Data Production with Sim Simeonov
 
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
 
Getting Ready to Use Redis with Apache Spark with Dvir Volk
Getting Ready to Use Redis with Apache Spark with Dvir VolkGetting Ready to Use Redis with Apache Spark with Dvir Volk
Getting Ready to Use Redis with Apache Spark with Dvir Volk
 
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
 
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
 
Apache Spark-Bench: Simulate, Test, Compare, Exercise, and Yes, Benchmark wit...
Apache Spark-Bench: Simulate, Test, Compare, Exercise, and Yes, Benchmark wit...Apache Spark-Bench: Simulate, Test, Compare, Exercise, and Yes, Benchmark wit...
Apache Spark-Bench: Simulate, Test, Compare, Exercise, and Yes, Benchmark wit...
 
Variant-Apache Spark for Bioinformatics with Piotr Szul
Variant-Apache Spark for Bioinformatics with Piotr SzulVariant-Apache Spark for Bioinformatics with Piotr Szul
Variant-Apache Spark for Bioinformatics with Piotr Szul
 
Running Spark Inside Containers with Haohai Ma and Khalid Ahmed
Running Spark Inside Containers with Haohai Ma and Khalid Ahmed Running Spark Inside Containers with Haohai Ma and Khalid Ahmed
Running Spark Inside Containers with Haohai Ma and Khalid Ahmed
 
Best Practices for Using Alluxio with Apache Spark with Gene Pang
Best Practices for Using Alluxio with Apache Spark with Gene PangBest Practices for Using Alluxio with Apache Spark with Gene Pang
Best Practices for Using Alluxio with Apache Spark with Gene Pang
 
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg Schad
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg SchadSmack Stack and Beyond—Building Fast Data Pipelines with Jorg Schad
Smack Stack and Beyond—Building Fast Data Pipelines with Jorg Schad
 
Hardware Acceleration of Apache Spark on Energy-Efficient FPGAs with Christof...
Hardware Acceleration of Apache Spark on Energy-Efficient FPGAs with Christof...Hardware Acceleration of Apache Spark on Energy-Efficient FPGAs with Christof...
Hardware Acceleration of Apache Spark on Energy-Efficient FPGAs with Christof...
 

Recently uploaded

Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men 🔝Ongole🔝 Escorts S...
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men  🔝Ongole🔝   Escorts S...➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men  🔝Ongole🔝   Escorts S...
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men 🔝Ongole🔝 Escorts S...amitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...gajnagarg
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachBoston Institute of Analytics
 
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...gajnagarg
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men  🔝Sambalpur🔝   Esc...➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men  🔝Sambalpur🔝   Esc...
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...amitlee9823
 

Recently uploaded (20)

Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
 
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men 🔝Ongole🔝 Escorts S...
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men  🔝Ongole🔝   Escorts S...➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men  🔝Ongole🔝   Escorts S...
➥🔝 7737669865 🔝▻ Ongole Call-girls in Women Seeking Men 🔝Ongole🔝 Escorts S...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men  🔝Sambalpur🔝   Esc...➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men  🔝Sambalpur🔝   Esc...
➥🔝 7737669865 🔝▻ Sambalpur Call-girls in Women Seeking Men 🔝Sambalpur🔝 Esc...
 

A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem Aliev and Russell Spitzer

  • 1. Artem Aliev and Russell Spitzer, DataStax A Tale of Two Graph Frameworks on Spark: 
 GraphFrames and Tinkerpop OLAP #EUeco3
  • 2. #EUeco3 Pierrot and Harlequin • Artem • Graph Analytics Expert • Earth • Russell • Distributed Systems Enthusiast • Earth 2
  • 3. Tinkerpop and GraphFrames provide Complimentary Approaches for Graph Analytics DataSet Catalyst GraphFrames 3#EUeco3
  • 4. Graphs are Vertices and Edges 4 Vertices are things and edges represent their relations to one another #EUeco3
  • 5. Graphs are Vertices and Edges 5 Registry: USS Enterprise (NCC-1701-C) Class: Ambassador Service: 2332[11] – 2344 (12 Years) Registry: USS Enterprise (NCC-1701-D) Class: Galaxy Service: 2363–2371 (8 Years) Registry: USS Enterprise (NCC-1701) Class: Constitution class[6] Service: 2245–2285 (40 Years) Registry: USS Enterprise (NCC-1701-A) Class: Enterprise class[8][9] Service: 2286–2293 (7 Years) #EUeco3
  • 6. Graphs are Vertices and Edges 6 Registry: USS Enterprise (NCC-1701-C) Class: Ambassador Service: 2332[11] – 2344 (12 Years) Registry: USS Enterprise (NCC-1701-D) Class: Galaxy Service: 2363–2371 (8 Years) Registry: USS Enterprise (NCC-1701) Class: Constitution class[6] Service: 2245–2285 (40 Years) Registry: USS Enterprise (NCC-1701-A) Class: Enterprise class[8][9] Service: 2286–2293 (7 Years) Vertex Properties #EUeco3
  • 7. Graphs are Vertices and Edges 7 Registry: USS Enterprise (NCC-1701-C) Class: Ambassador Service: 2332[11] – 2344 (12 Years) Registry: USS Enterprise (NCC-1701-D) Class: Galaxy Service: 2363–2371 (8 Years) Registry: USS Enterprise (NCC-1701) Class: Constitution class[6] Service: 2245–2285 (40 Years) Registry: USS Enterprise (NCC-1701-A) Class: Enterprise class[8][9] Service: 2286–2293 (7 Years) succeeded by succeeded by succeeded by #EUeco3
  • 8. Graphs are Vertices and Edges 8 Registry: USS Enterprise (NCC-1701-C) Class: Ambassador Service: 2332[11] – 2344 (12 Years) Registry: USS Enterprise (NCC-1701-D) Class: Galaxy Service: 2363–2371 (8 Years) Registry: USS Enterprise (NCC-1701) Class: Constitution class[6] Service: 2245–2285 (40 Years) Registry: USS Enterprise (NCC-1701-A) Class: Enterprise class[8][9] Service: 2286–2293 (7 Years) Edge Edge Labelsucceeded by succeeded by succeeded by #EUeco3
  • 9. Graphs are Vertices and Edges 9 Registry: USS Enterprise (NCC-1701-C) Class: Ambassador Service: 2332[11] – 2344 (12 Years) Registry: USS Enterprise (NCC-1701-D) Class: Galaxy Service: 2363–2371 (8 Years) Registry: USS Enterprise (NCC-1701) Class: Constitution class[6] Service: 2245–2285 (40 Years) Registry: USS Enterprise (NCC-1701-A) Class: Enterprise class[8][9] Service: 2286–2293 (7 Years) Ship Ship Ship Ship Vertex Label succeeded by succeeded by succeeded by #EUeco3
  • 10. Graphs are Vertices and Edges 10 Registry: USS Enterprise (NCC-1701-C) Class: Ambassador Service: 2332[11] – 2344 (12 Years) Registry: USS Enterprise (NCC-1701-D) Class: Galaxy Service: 2363–2371 (8 Years) Registry: USS Enterprise (NCC-1701) Class: Constitution class Service: 2245–2285 (40 Years) Ship Ship Ship Ship Position: Captain
 Name: Kirk Position: Captain
 Name: Picard Crew Crew succeeded by succeeded by succeeded by #EUeco3
  • 11. Graphs are Vertices and Edges 11 Registry: USS Enterprise (NCC-1701-C) Class: Ambassador Service: 2332[11] – 2344 (12 Years) Registry: USS Enterprise (NCC-1701-D) Class: Galaxy Service: 2363–2371 (8 Years) Registry: USS Enterprise (NCC-1701) Class: Constitution class Service: 2245–2285 (40 Years) Registry: USS Enterprise (NCC-1701-A) Class: Enterprise class Service: 2286–2293 (7 Years) Ship Ship Ship Ship Position: Captain
 Name: Kirk Position: Captain
 Name: Picard Crew Crew succeeded by succeeded by succeeded by served on served on served on served on #EUeco3
  • 12. Graphs are Vertices and Edges 12 Registry: USS Enterprise (NCC-1701-C) Class: Ambassador Service: 2332[11] – 2344 (12 Years) Registry: USS Enterprise (NCC-1701-D) Class: Galaxy Service: 2363–2371 (8 Years) Registry: USS Enterprise (NCC-1701) Class: Constitution class Service: 2245–2285 (40 Years) Registry: USS Enterprise (NCC-1701-A) Class: Enterprise class Service: 2286–2293 (7 Years) Ship Ship Ship Ship Position: Captain
 Name: Kirk Position: Captain
 Name: Picard Crew Crew succeeded by succeeded by succeeded by served on served on served on served on But why do I want this? #EUeco3
  • 13. Graphs let us ask questions about our data based on their relations 13 What Captain Served After Kirk? What Ship was two after the NCC-1701? #EUeco3
  • 14. Traversals involve following paths through the Graph 14 Registry: USS Enterprise (NCC-1701-C) Class: Ambassador Service: 2332[11] – 2344 (12 Years) Registry: USS Enterprise (NCC-1701-D) Class: Galaxy Service: 2363–2371 (8 Years) Registry: USS Enterprise (NCC-1701) Class: Constitution class Service: 2245–2285 (40 Years) Registry: USS Enterprise (NCC-1701-A) Class: Enterprise class Service: 2286–2293 (7 Years) Ship Ship Ship Ship Position: Captain
 Name: Kirk Position: Captain
 Name: Picard Crew Crew succeeded by succeeded by succeeded by served on served on served on served on #EUeco3
  • 15. What Captain was After Kirk? 15 Registry: USS Enterprise (NCC-1701-C) Class: Ambassador Service: 2332[11] – 2344 (12 Years) Registry: USS Enterprise (NCC-1701-A) Class: Enterprise class Service: 2286–2293 (7 Years) Ship Ship Position: Captain
 Name: Kirk Position: Captain
 Name: Picard Crew Crewsucceeded by served on served on #EUeco3
  • 16. What Ship was two after the NCC-1701? 16 Registry: USS Enterprise (NCC-1701-C) Class: Ambassador Service: 2332[11] – 2344 (12 Years) Registry: USS Enterprise (NCC-1701) Class: Constitution class Service: 2245–2285 (40 Years) Registry: USS Enterprise (NCC-1701-A) Class: Enterprise class Service: 2286–2293 (7 Years) Ship Ship Ship succeeded by succeeded by #EUeco3
  • 17. Tinkerpop is a Powerful and Flexible Graph Framework • Server, Language, Connectors • Graph Framework for 
 OLAP and OLTP • Node Centric Representations • Fluent API (Gremlin) • Fully Self Contained Framework 17#EUeco3
  • 20. 20
  • 21. #EUeco3 What happens when you have too much data? 21
  • 22. #EUeco3 Tinkerpop Spark OLAP Mechanism • Instead of one traversal we traverse starting from all nodes simultaneously 22
  • 23. Distribution Requires Partitioning 23 ? Big Data Independent Chunks of Data#EUeco3
  • 24. #EUeco3 Vertex Stored in a PairRDD Id -> StarVertex(Edge and Property Information) 24 1 A C D Star Vertex: Adjacency list representation
 1: "A", "Kirk"
 A: "C", "Kirk"
 C: "D", "Picard"
 D: "Picard"
 Just Id 
 Of Connected 
 Vertex
  • 25. #EUeco3 Vertex Program Runs Initializing Traverser for every Vertex 25 1 A C D SparkMemory - Accumulator - Used for GlobalState
  • 26. #EUeco3 Then we cycle through a message Passing Algorithm 26 1 A C D 1 A C D 1 A C D SparkMemory - Accumulator - Used for GlobalState
  • 27. #EUeco3 Then we cycle through a message Passing Algorithm 27 1 A C D 1 A C D 1 A C D SparkMemory - Accumulator - Used for GlobalState Passes messages from one Vertex to another with a join
  • 28. #EUeco3 Then we cycle through a message Passing Algorithm 28 1 A C D 1 A C D 1 A C D SparkMemory - Accumulator - Used for GlobalState Repeat
  • 29. #EUeco3 Then we cycle through a message Passing Algorithm 29 1 A C D 1 A C D 1 A C D SparkMemory - Accumulator - Used for GlobalState All Traversers Halt
 Or Program Terminates Result!
  • 31. #EUeco3 Tinkerpop Spark OLAP Pros/Cons Pros • Every message pass requires only a single shuffle • Edges and edge properties accessible without a step • Very Flexible, Many Provider Specific Shortcuts possible • Internal properties can be any Java type • All in one, Server already ready for multiple clients Cons • Limited in ability to connect to external sources/other spark applications • Flexibility of framework allows for many platform specific shortcuts to be added • Genericness provides difficulty in making some optimizations • Edges co-partitioned with vertices, high degree nodes can cause memory issues 31
  • 32. #EUeco3 GraphFrames Background • Third Party Package • https://graphframes.github.io/ • Integrates with Dataset/Dataframe in Spark • Relational under the hood 32
  • 33. #EUeco3 GraphFrames are built of two DataFrames 33 Row Column
  • 34. #EUeco3 GraphFrames are built of two DataFrames 34 id job species Geordi Chief Engineer Human Data Science Officer Android Vertex DataFrame src dst relationship Geordi Data Friend Edge DataFrame Friend
  • 35. #EUeco3 GraphFrames are built of two DataFrames 35 id job species Geordi Chief Engineer Human Data Science Officer Android Vertex DataFrame src dst relationship Geordi Data Friend Edge DataFrame Friend Can Only Be Spark Types
  • 36. #EUeco3 GraphFrames are built of two DataFrames 36 id job species Geordi Chief Engineer Human Data Science Officer Android Vertex DataFrame src dst relationship Geordi Data Friend Edge DataFrame Friend No Built in Labels
  • 37. #EUeco3 Catalyst Optimizes any Requests • Simple requests using DataFrame api don't do anything special • Some methods fall back to GraphX (RDD Based) • Others use pure DataFrame methods 37
  • 39. #EUeco3 GraphFrames Motif Matching 39 GraphFrame (a)-[e]->(b) Vertex (a) Vertices as a UDT "A"V E A: <VertexRow>
  • 40. #EUeco3 GraphFrames Motif Matching 40 GraphFrame (a)-[e]->(b) Vertex (a) Vertices as a UDT "A" Edge [b] 
 Edges as UDT "E"
 Join with edges where A.id = E.src V E A: <VertexRow> Join A: <VertexRow>, E: <EdgeRow>
  • 41. #EUeco3 GraphFrames Motif Matching 41 GraphFrame (a)-[e]->(b) Vertex (a) Vertices as a UDT "A" [e] Vertices as UDT "B" Join with edges where E.dst = B.id Edge Vertex [b] 
 Edges as UDT "E"
 Join with edges where A.id = E.src V E A: <VertexRow> A: <VertexRow>, E: <EdgeRow> Join JoinA: <VertexRow>, E: <EdgeRow>, B: <VertexRow>
  • 42. #EUeco3 GraphFrames Motif Matching 42 GraphFrame (a)-[e]->(b) Vertex (a) Vertices as a UDT "A" [e] Vertices as UDT "B" Join with edges where E.dst = B.id Edge Vertex [b] 
 Edges as UDT "E"
 Join with edges where A.id = E.src V E A: <VertexRow> A: <VertexRow>, E: <EdgeRow> Join JoinA: <VertexRow>, E: <EdgeRow>, B: <VertexRow> THAT'S SO MANY JOINS
  • 43. #EUeco3 43 Vertex Edge Vertex A: <VertexRow> A: <VertexRow>, E: <EdgeRow> A: <VertexRow>, E: <EdgeRow>, B: <VertexRow> DataFrames means Optimizations are Automatic
  • 44. #EUeco3 44 Vertex Edge Vertex A: <VertexRow> A: <VertexRow>, E: <EdgeRow> A: <VertexRow>, E: <EdgeRow>, B: <VertexRow> Select A.ID Columns Pruned and Predicates Pushed
  • 45. 45 Vertex Edge Vertex A: <VertexRow> A: <VertexRow>, E: <EdgeRow> A: <VertexRow>, E: <EdgeRow>, B: <VertexRow> Select A.ID Columns Pruned and Predicates Pushed #EUeco3
  • 46. 46 Vertex Edge Vertex A: <VertexRow> A: <VertexRow>, E: <EdgeRow> A: <VertexRow>, E: <EdgeRow>, B: <VertexRow> Select A.ID Columns Pruned and Predicates Pushed #EUeco3
  • 47. 47 Vertex Edge Vertex A: <VertexRow> A: <VertexRow>, E: <EdgeRow> A: <VertexRow>, E: <EdgeRow>, B: <VertexRow> Select A.ID Columns Pruned and Predicates Pushed #EUeco3
  • 48. #EUeco3 All of the normal optimizations happen within this FrameWork 48 Vertex Edge Vertex A: <VertexRow> A: <VertexRow>, E: <EdgeRow> A: <VertexRow>, E: <EdgeRow>, B: <VertexRow> Broadcast? Broadcast?
  • 49. #EUeco3 Code Generation and Internal Rows 49 Vertex Edge Vertex A: <VertexRow> A: <VertexRow>, E: <EdgeRow> A: <VertexRow>, E: <EdgeRow>, B: <VertexRow> Code Generation Code Generation Code Generation Code Generation Code Generation
  • 51. #EUeco3 GraphFrame Pros Cons Pros • Much Faster on basic counts • Powerful optimizations + CodeGen • Easy to connect to other sources 
 Cons • Slower on complex traversals (2 Joins per hop) • Relational Model not as Flexible 51
  • 53. Choose TinkerPop OLAP For Long Paths • More complicated queries • Traversals that require many hops • g.V().out.out.out.out 
 • Avoid for simple counts and aggregations • Avoid if you have very high degree Vertices 53#EUeco3
  • 54. Choose GraphFrames for Interoperability and Short Paths • General Edge/Vertex stats groupCount, min, max • Connecting to other sources • Short paths • High Degree Vertices • Avoid • Long path algorithms 54#EUeco3
  • 55. #EUeco3 Choosing the Right Framework 55 Gremlin on
 Graphframes OLTP backed by DSE Graph Built in Spark We write it! Search Built In! Advanced Security
  • 56. #EUeco3 Thanks for Listening 56 Datastax Academy Graph Course https://academy.datastax.com/resources/ds330-datastax-enterprise-graph
 Try out Datastax Enterprise! https://academy.datastax.com/quick-downloads
 
 Apache Tinkerpop
 http://tinkerpop.apache.org/ 
 GraphFrames Link https://graphframes.github.io/