[QCon.ai 2019] People You May Know: Fast Recommendations Over Massive Data

People You May Know
Fast Recommendations Over Massive Data
Jeff Weiner
Chief Executive Officer
Sumit Rangwala
Artificial Intelligence
Felix GV
Data Infrastructure

My Professional Network
Professional network in real world
Sumit Felix
Amol
GaojiePeter

Professional network in real world Professional network on LinkedIn
Sumit Felix
Peter
Amol
Gaojie
Sumit Felix
Peter
Amol
Gaojie

Professional network in real world Professional network on LinkedIn
Sumit Felix
Peter
Amol
Gaojie
Sumit Felix
Peter
Amol
Gaojie
Predicting
real world
connections

Helps grow member’s professional network
Recommends people that one might know
People You May Know
Enables many other LinkedIn services

Talk Outline
People You May Know
PYMK: Generating Recommendations
PYMK Architecture Evolution
PYMK Rebirth
Insights and Road Ahead

PYMK: Generating Recommendations

PYMK: Prediction Strategy
Data Mining
• LinkedIn’s Economic Graph
• Member’s activities and profile
LinkedIn Economic Graph
Sumit Felix
Peter
Amol
Gaojie

PYMK: Prediction Strategy
Data Mining
• LinkedIn’s Economic Graph
• Member’s activities and profile
Felix
Peter
Amol
Gaojie
Microsoft
USC
Sumit

Recommendation System
Candidate Generation
Feature Generation
Scoring

PYMK: Candidate Generation
Using commonalities in
economic graph
• Friends of my friends
(triangle closing)
Amol
Peter Gaojie
Sumit Felix

PYMK: Candidate Generation
Using commonalities in
economic graph
• Friends of my friends
(triangle closing)
• Coworkers
• Personalized Page Rank
Amol
Peter Gaojie
Felix
Microsoft
Sumit

PYMK: Feature Generation
Using economic graph
characteristics
• Number of common friends
Using member
activities/profile
• Common work location
Amol
Peter Gaojie
Felix
Microsoft
Sumit

PYMK: Recommendation System
Candidate
Generation
Feature
Generation

Candidate
Generation
Feature
Generation
Sumit might know Amol’s friend Felix
Sumit and Felix has one common friend
Sumit and Felix both work in Bay Area

Candidate
Generation
Feature
Generation
Graph processing
Data processing

Pre-compute recommendations
A P P R O A C H

PYMK: The Beginning
Problem Space
• 10s of millions of members
Architecture
• Pre-compute using SQL
Shortcomings
• Staleness of 6 weeks to 6 months
• Extraneous computation
Oracle

PYMK: The Beginning
Problem Space
• 10s of millions of members
Architecture
• Pre-compute using SQL
Shortcomings
• Staleness of 6 weeks to 6 months
Oracle PYMK
Service
Online service request

PYMK: Keeping up with Growth
Problem space
• Low 100s of millions of members
Architecture
• Pre-compute using Hadoop MR
• Push to a key-value store
Shortcomings
• Staleness of 2-3 days
Voldemort
PYMK
Service

PYMK: Pushing the Technology Limits
Problem Space
• Mid 100s of millions of members
Architecture
• Pre-compute using Spark1
• Push to a key-value store
Shortcomings
• Staleness of 1-2 days
• Excessive computation cost
Venice
[1] Managing Exploding Big Data
PYMK
Service

PYMK: Exploring Data Freshness
Problem Space
• Use up to date member data
Architecture
• Hybrid offline-online approach
Shortcomings
• Split-brain design
• Didn’t scale
Venice
Realtime signals
PYMK
Service

Key Realization
Freshness
matters
Pre-computation
is costly

Compute recommendations on demand
A P P R O A C H

Candidate
Generation
Feature
Generation
Online Graph Traversal
Fast Data Access

An online graph processing system

G A I A
A generic service for executing complex graph algorithms
with low latency on massive graphs

Gaia: Overview
Gaia
Any kind of graph
A snapshot
on HDFS

Gaia: Overview
Gaia
Any kind of graph
Updates to graph
A snapshot
on HDFS
Via Kafka, etc.

Gaia: Overview
Gaia
Any kind of graph
Updates to graph
Graph algorithm code
A snapshot
on HDFS
Via Kafka, etc.
Using
compute
framework
e.g., triangle closing,
random graph walks

Design Choice
Gaia
• Single server architecture with replicas
• Full in-memory graph for fast execution

Gaia: Architecture
Server Server Server
Gaia

Gaia: Architecture
Algo Algo Algo
Gaia

Gaia: Architecture
Graph snapshot on
disk
Algo Algo Algo
Gaia

Gaia: Architecture
Graph snapshot on
disk Graph updates via
Kafka, etc.
Algo Algo Algo
Gaia

PYMK
Gaia
• Candidate generation using triangle
closing and common connection count
• 10s of milliseconds (p90)

A key-value store with scoring capability

At a glance
Venice
• Tailored for serving ML jobs’ output
• High throughput ingestion
• Fast lookups
• Self-service onboarding

Supported Ingestion Modes in Venice
Batch
Hadoop Push Job

Batch Incremental
Hadoop Push Job
Samza Streaming Job

Batch Incremental
Hadoop Push Job Push Job
Samza Reprocessing Job
(Kappa Architecture)
Streaming Job

Batch Incremental
Hadoop Push Job Push Job
Samza Reprocessing Job
(Kappa Architecture)
Streaming Job
Hybrid Any Batch Job + Streaming Job
(Lambda Architecture)

Online Feature Retrieval
F i r s t P Y M K U s e C a s e

Requirements
• Millions of lookups / sec at peak
• ~1000 keys / query
• Thousands of queries / sec
• ~80B / value

Before / After
• Base latency
• 4 seconds (p99)
• Changed storage engine to RocksDB
• 60 ms (p99)

Embeddings
S e c o n d P Y M K U s e C a s e

Requirements
Embeddings
• Millions of lookups / sec at peak
• ~1000 keys / query
• Thousands of queries / sec
• ~800B / value
• 10x the previous size

Before / After
Embeddings
• Base latency
• 275 ms (p99)
• Server-side computation
• 60 ms (p99)

At a glance
Server-side Computation
• Simple vector operations
• Smaller response size
• Big input (vector)
• Small output (scalar)
• Declarative API
• No arbitrary code

More tuning
Fast Avro
• Online feature retrieval
• 60 to 40 ms (p99)
• Embeddings w/ computation
• 60 to 35 ms (p99)
• Now open-source!
• github.com/linkedin/avro-util

PYMK Today
P u t t i n g i t a l l t o g e t h e r

Candidate
Generation
Sumit might know Amol’s friend, Felix
Sumit and Felix have one common friend
PYMK Service
Feature
Generation
Scoring Sumit and Felix likely know each other
Venice
Gaia

PYMK: Today
Venice
PYMK
Service
Gaia
1. Ingest in
Gaia & Venice
2. Candidate gen
& graph features
from Gaia
4. Final scoring
by PYMK Service
3. Member features
& partial scoring
from Venice
Staleness
• Seconds to minutes

Key Learnings
• Pre-computation is viable for many products
• Scaling RT computation requires moving compute close to data
• Infra aware Machine Learning

• Further scale Gaia & Venice
• More candidates
• More features
• Larger features
• More complex computations
ML-Aware Infra

• Continue democratizing access
• Easier onboarding to Venice & Gaia
• Multi-tenancy for Venice Compute
• Integration with other frameworksProductive ML

Contributors
Amol Ghoting Gaojie Liu Kevinjeet Gill Peter Chng Min Huang
Yao Chen Hema Raghavan Many othersAshish Singhai

[QCon.ai 2019] People You May Know: Fast Recommendations Over Massive Data

[QCon.ai 2019] People You May Know: Fast Recommendations Over Massive Data

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to [QCon.ai 2019] People You May Know: Fast Recommendations Over Massive Data

Similar to [QCon.ai 2019] People You May Know: Fast Recommendations Over Massive Data (20)

Recently uploaded

Recently uploaded (20)

[QCon.ai 2019] People You May Know: Fast Recommendations Over Massive Data