VELOX: MODELS IN ACTION

VELOX: MODELS IN ACTION
Dan Crankshaw
UC Berkeley AMPLab
crankshaw@cs.berkeley.edu
Marin Software
2015

Algorithms, Machines, and People

“deep questions over dirty and heterogenous data”

GraphX
“deep questions over dirty and heterogenous data”

BERKELEY DATA  
ANALYTICS STACK (BDAS)
Spark
Spark
Streaming Spark SQL
BlinkDB
GraphX
MLlib
MLBase
HDFS, S3, …
Tachyon
Mesos HadoopYarn

MODELINGTASK
Ratings
Songs
Prediction

Catify: Music for Cats
CatID Song Score
1 16 2.1
1 14 3.7
3 273 4.2
4 14 1.9

Pipeline
CatID Song Score
1 16 2.1
1 14 3.7
3 273 4.2
4 14 1.9

Tachyon + HDFS
Pipeline
CatID Song Score
1 16 2.1
1 14 3.7
3 273 4.2
4 14 1.9

Pipeline
Tachyon + HDFS
Node.js App Server
Apache Web Server

Songs
Users

Songs
Users
O(users * songs)

Pipeline
Tachyon + HDFS
Node.js App Server
Apache Web Server
Precomputed
Ratings

Pipeline
Tachyon + HDFS
Node.js App Server
Apache Web Server
Precomputed
Ratings
Black box

Pipeline
Tachyon + HDFS
Node.js App Server
Apache Web Server
Training Data
Precomputed
Ratings
Black box

1. Serving system: low-latency but
high staleness
What’s wrong?

high staleness
2. Batch training: slow incremental
maintenance, no serving
What’s wrong?

high staleness
2. Batch training: slow incremental
maintenance, no serving
3. Ad-hoc model management
What’s wrong?

VELOX GOALS
1. Low latency and fresh predictions

VELOX GOALS
2. Break the abstraction: model-
speciﬁc optimizations

VELOX GOALS
2. Break the abstraction: model-
speciﬁc optimizations
3. Uniﬁed system eases operation

Spark
Spark
Streaming Spark SQL
BlinkDB
GraphX
MLlib
MLBase
HDFS, S3, …
Tachyon
THE MISSING PIECE IN BDAS
Mesos

Spark
Streaming Spark
SQL
Graph
X ML
library
BlinkDB MLbase
Training
Spark
HDFS, S3, …
Tachyon
Mesos

Spark
Streaming Spark
SQL
Graph
X ML
library
BlinkDB MLbase
Training Management + Serving
Spark
HDFS, S3, …
Tachyon
Mesos

Spark
Streaming Spark
SQL
Graph
X ML
library
BlinkDB MLbase
Velox
Spark
HDFS, S3, …
Tachyon
Mesos

Spark
Streaming Spark
SQL
Graph
X ML
library
BlinkDB MLbase
Velox
Spark
HDFS, S3, …
Tachyon
Model
Manager
Mesos

Spark
Streaming Spark
SQL
Graph
X ML
library
BlinkDB MLbase
Velox
Spark
HDFS, S3, …
Tachyon
Model
Manager
Prediction
Service
Mesos

VELOX ARCHITECTURE
Standalone Scala
Service

VELOX ARCHITECTURE
Standalone Scala
Service
Automatic Integration
with Spark

VELOX ARCHITECTURE
Standalone Scala
Service
Personalized Predictions
as a Service
with Spark

VELOX ARCHITECTURE
Standalone Scala
Service
Shared-Nothing
Serving Cluster
Personalized Predictions
as a Service
with Spark

SYSTEM ARCHITECTURE
uuid: 01-10
uuid: 11-20
uuid: 20-30

SYSTEM ARCHITECTURE
frontend.js
uuid: 01-10
uuid: 11-20
uuid: 20-30

SYSTEM ARCHITECTURE
frontend.js
uuid: 01-10
uuid: 11-20
uuid: 20-30
uuid: 4

SYSTEM ARCHITECTURE
Predictions via REST
frontend.js
uuid: 01-10
uuid: 11-20
uuid: 20-30
uuid: 4

SYSTEM ARCHITECTURE
Predictions via REST
frontend.js
Returns score
uuid: 01-10
uuid: 11-20
uuid: 20-30
uuid: 4

SYSTEM ARCHITECTURE
frontend.js
uuid: 01-10
uuid: 11-20
uuid: 20-30
Feedback via REST
uuid: 4

SYSTEM ARCHITECTURE
frontend.js
uuid: 01-10
uuid: 11-20
uuid: 20-30
Feedback via REST
Model updated
in realtime
uuid: 4

SYSTEM ARCHITECTURE
master
workerworker
worker
frontend.js
uuid: 01-10
uuid: 11-20
uuid: 20-30

SYSTEM ARCHITECTURE
master
workerworker
worker
Batch train RPC
frontend.js
uuid: 01-10
uuid: 11-20
uuid: 20-30

SYSTEM ARCHITECTURE
master
workerworker
worker
Batch train RPC
frontend.js
uuid: 01-10
uuid: 11-20
uuid: 20-30
Returns batch
trained model

Mesos Mesos
HDFS, S3, …
Tachyon
HadoopYarn
Spark
Straming Shark
SQL
Graph
X ML
library
BlinkDB MLbase
Spark
Velox
PREDICTION SERVICE
Model
Manager
Prediction
Service

PREDICTION API
GET
/velox/catify/predict?userid=22&song=27632
Simple point queries:

PREDICTION API
GET
/velox/catify/predict_top_k?userid=22&k=100
GET
More complex ordering queries:

PREDICTION API
GET
GET
Low-latency and
scalable partitioning
Personalized
Predictions

PREDICTION API
GET
GET
Low-latency and
scalable partitioning
Personalized
Predictions
Intelligent
Caching
Sharing and re-use of
model partial-state

PREDICTION EXECUTION
def
predict(
u:
UUID,
x:
Context
)
uuid model

def
predict(
u:
UUID,
x:
Context
)
uuid model
Look up user
model
Read

def
predict(
u:
UUID,
x:
Context
)
uuid model
Look up user
model
Primary key lookup
Read

def
predict(
u:
UUID,
x:
Context
)
uuid model
Look up user
model
Primary key lookup
Partition queries by user:
always local
Read

Compute
Features
def
predict(
u:
UUID,
x:
Context
)
user independent
}f( )

Compute
Features
def
predict(
u:
UUID,
x:
Context
)
Feature computation  
could be costly
user independent
}f( )

Compute
Features
def
predict(
u:
UUID,
x:
Context
)
Feature computation  
could be costly
user independent
}
Cache features for
reuse across users
f( )

TOP-K QUERIES
Query predicate to pre-ﬁlter candidate set
All Songs

TOP-K QUERIES
All Songs Playlist Keywords

TOP-K QUERIES
Candidate
Songs

TOP-K QUERIES
Candidate
Songs
Score and
rank all
candidates

TOP-K QUERIES
Candidate
Songs
By exploiting split model design we can leverage:
Score and
rank all
candidates

TOP-K QUERIES
Candidate
Songs
Score and
rank all
candidates
A. Shrivastava, P. Li. “Asymmetric LSH (ALSH) for Sublinear Time Maximum Inner Product
Search (MIPS).” NIPS’14 Best Paper

TOP-K QUERIES
Candidate
Songs
Score and
rank all
candidates
A. Shrivastava, P. Li. “Asymmetric LSH (ALSH) for Sublinear Time Maximum Inner Product
Search (MIPS).” NIPS’14 Best Paper
Y. Low and A. X. Zheng. “Fast Top-K Similarity Queries Via Matrix Compression.” CIKM 2012

SYSTEM ARCHITECTURE
frontend.js
Returns score
uuid: 01-10
uuid: 11-20
uuid: 20-30
uuid: 4

Mesos Mesos
HDFS, S3, …
Tachyon
HadoopYarn
Spark
Straming Shark
SQL
Graph
X ML
library
BlinkDB MLbase
Spark
Velox
Model
Manager
Prediction
Service
MODEL MANAGER

PERSONALIZED MODELING
A Separate Model for Each User?

Computationally Inefﬁcient
many complex models

Statistically Inefﬁcient
not enough data per user
Computationally Inefﬁcient
many complex models

PERSONALIZED SPLIT MODEL
Input
(Song)

Input
(Song)
Shared Basis Feature Model

Input
(Song)
Big Data

Input
(Song)
Big Data
Changes Slowly

Input
(Song)
Big Data
Changes Slowly
Train in Batch!

Input
(Song)
Big Data
Changes Slowly
Train in Batch!
Personalized
User Model

Input
(Song)
Big Data
Changes Slowly
Train in Batch!
Small Data
Personalized
User Model

Input
(Song)
Big Data
Changes Slowly
Train in Batch!
Small Data
Changes Quickly
Personalized
User Model

Input
(Song)
Big Data
Changes Slowly
Train in Batch!
Small Data
Changes Quickly
Train Online!
Personalized
User Model

Input
(Song)
Personalized
User Model

Input
(Song)
Personalized
User Model
Input
(Song)

Input
(Song)
Personalized
User Model
Meow
Input
(Song)

Personalized
User Model
Meow
Input
(Song)
Input
(Song)

Personalized
User Model
Meow
Terrible
Input
(Song)
Input
(Song)

MATHEMATICAL FORMULATION
Input
(Song)

Input
(Song)
x

Shared Basis
Feature Models
Changes
slowly
Input
(Song)
x

Shared Basis
Feature Models
Changes
slowly
Input
(Song)
f(x; ✓)
x

Shared Basis
Feature Models
Personalized
User Model
Changes
slowly
Highly
dynamic
Input
(Song)
f(x; ✓)
x

Shared Basis
Feature Models
Personalized
User Model
Changes
slowly
Highly
dynamic
Input
(Song)
f(x; ✓) · wu
x

Shared Basis
Feature Models
Personalized
User Model
Changes
slowly
Highly
dynamic
= Rating
Input
(Song)
f(x; ✓) · wu
x
Meow

FEEDBACK API
POST
/velox/catify/observe?userid=22&song=27&score=3.7
Simple direct value feedback:

FEEDBACK API
POST
Continuously update  
user models inVelox
Online Learning

FEEDBACK API
POST
user models inVelox
Online Learning Ofﬂine Learning
Logged toTachyon for
feature learning in Spark

FEEDBACK API
POST
user models inVelox
Online Learning Ofﬂine Learning
Logged toTachyon for
feature learning in Spark
Evaluation
Continuously assess 
model performance

ONLINE LEARNING
velox.jar
user model
def
observe(u:
UUID,
x:
Context,
y:
Score)

ONLINE LEARNING
velox.jar
user model
def
observe(u:
UUID,
x:
Context,
y:
Score)
Update user
model with new
training data
Write

ONLINE LEARNING
velox.jar
user model
def
observe(u:
UUID,
x:
Context,
y:
Score)
Stochastic gradient descent
Update user
model with new
training data
Write

ONLINE LEARNING
velox.jar
user model
def
observe(u:
UUID,
x:
Context,
y:
Score)
Stochastic gradient descent
Incremental linear algebra
Update user
model with new
training data
Write

OFFLINE OR NEARLINE
LEARNING
def
retrain(trainingData:
RDD)
Spark Based
Training Algs.wu · f(x; ✓)
Automated retraining policies
Efﬁcient batch training using Spark
Incremental learning using Spark Streaming

Data Model
Sample Bias: model affects the training data.

ALWAYS SERVETHE BEST SONG?
Songs
Predicted
Rating

VELOX SOLUTION
Predicted
Rating
Songs
With prob. 1- ϵ serve the best predicted song

Predicted
Rating
Songs
With prob. ϵ pick a random song
VELOX SOLUTION

Predicted
Rating
Songs
Epsilon Greedy
VELOX SOLUTION

Predicted
Rating
Songs
Epsilon Greedy
Active Learning
Opportunity to explore new systems for
this emerging analytics workload
VELOX SOLUTION

1. Spam and anomaly detection
BEYOND RECOMMENDER
SYSTEMS

2. Device/location speciﬁc modeling
BEYOND RECOMMENDER
SYSTEMS

2. Device/location speciﬁc modeling
3. YOUR machine learning application
BEYOND RECOMMENDER
SYSTEMS

Today: model training and serving relies on ad-hoc,
manual processes spread across multiple systems
SUMMARY

TheVelox system automatically maintains multiple
models while providing low latency, fresh, and
personalized predictions
SUMMARY

Velox will be open-source: coming soon to BDAS
SUMMARY

https://amplab.cs.berkeley.edu/projects/velox/
SUMMARY

https://amplab.cs.berkeley.edu/projects/velox/
crankshaw@cs.berkeley.edu
SUMMARY

VELOX: MODELS IN ACTION

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (16)

Similar to VELOX: MODELS IN ACTION

Similar to VELOX: MODELS IN ACTION (20)

Recently uploaded

Recently uploaded (20)

VELOX: MODELS IN ACTION