1. The document describes Velox, a unified machine learning platform that aims to provide low latency predictions while models are continuously trained.
2. Velox uses a split model approach, where a shared basis feature model is trained in batch while personal user models are trained continuously online to provide personalized recommendations.
3. The system architecture includes a model manager that trains models, a prediction service that serves real-time queries from a frontend, and integrates with Spark for batch training.
59. Mesos Mesos
HDFS, S3, …
Tachyon
HadoopYarn
Spark
Straming Shark
SQL
Graph
X ML
library
BlinkDB MLbase
Spark
Velox
Training Management + Serving
PREDICTION SERVICE
Model
Manager
Prediction
Service
63. PREDICTION API
GET
/velox/catify/predict_top_k?userid=22&k=100
GET
/velox/catify/predict?userid=22&song=27632
Simple point queries:
More complex ordering queries:
Low-latency and
scalable partitioning
Personalized
Predictions
Intelligent
Caching
Sharing and re-use of
model partial-state
68. PREDICTION EXECUTION
def
predict(
u:
UUID,
x:
Context
)
uuid model
Look up user
model
Primary key lookup
Partition queries by user:
always local
Read
75. TOP-K QUERIES
Query predicate to pre-filter candidate set
All Songs Playlist Keywords
Candidate
Songs
Score and
rank all
candidates
76. TOP-K QUERIES
Query predicate to pre-filter candidate set
All Songs Playlist Keywords
Candidate
Songs
By exploiting split model design we can leverage:
Score and
rank all
candidates
77. TOP-K QUERIES
Query predicate to pre-filter candidate set
All Songs Playlist Keywords
Candidate
Songs
By exploiting split model design we can leverage:
Score and
rank all
candidates
A. Shrivastava, P. Li. “Asymmetric LSH (ALSH) for Sublinear Time Maximum Inner Product
Search (MIPS).” NIPS’14 Best Paper
78. TOP-K QUERIES
Query predicate to pre-filter candidate set
All Songs Playlist Keywords
Candidate
Songs
By exploiting split model design we can leverage:
Score and
rank all
candidates
A. Shrivastava, P. Li. “Asymmetric LSH (ALSH) for Sublinear Time Maximum Inner Product
Search (MIPS).” NIPS’14 Best Paper
Y. Low and A. X. Zheng. “Fast Top-K Similarity Queries Via Matrix Compression.” CIKM 2012
81. Mesos Mesos
HDFS, S3, …
Tachyon
HadoopYarn
Spark
Straming Shark
SQL
Graph
X ML
library
BlinkDB MLbase
Spark
Velox
Training Management + Serving
Model
Manager
Prediction
Service
MODEL MANAGER
121. ONLINE LEARNING
velox.jar
user model
def
observe(u:
UUID,
x:
Context,
y:
Score)
Stochastic gradient descent
Incremental linear algebra
Update user
model with new
training data
Write
123. OFFLINE OR NEARLINE
LEARNING
def
retrain(trainingData:
RDD)
Spark Based
Training Algs.wu · f(x; ✓)
Automated retraining policies
Efficient batch training using Spark
Incremental learning using Spark Streaming
133. Predicted
Rating
Songs
With prob. 1- ϵ serve the best predicted song
With prob. ϵ pick a random song
Epsilon Greedy
Active Learning
Opportunity to explore new systems for
this emerging analytics workload
VELOX SOLUTION
140. Today: model training and serving relies on ad-hoc,
manual processes spread across multiple systems
SUMMARY
141. Today: model training and serving relies on ad-hoc,
manual processes spread across multiple systems
TheVelox system automatically maintains multiple
models while providing low latency, fresh, and
personalized predictions
SUMMARY
142. Today: model training and serving relies on ad-hoc,
manual processes spread across multiple systems
TheVelox system automatically maintains multiple
models while providing low latency, fresh, and
personalized predictions
Velox will be open-source: coming soon to BDAS
SUMMARY
143. Today: model training and serving relies on ad-hoc,
manual processes spread across multiple systems
TheVelox system automatically maintains multiple
models while providing low latency, fresh, and
personalized predictions
Velox will be open-source: coming soon to BDAS
https://amplab.cs.berkeley.edu/projects/velox/
SUMMARY
144. Today: model training and serving relies on ad-hoc,
manual processes spread across multiple systems
TheVelox system automatically maintains multiple
models while providing low latency, fresh, and
personalized predictions
Velox will be open-source: coming soon to BDAS
https://amplab.cs.berkeley.edu/projects/velox/
crankshaw@cs.berkeley.edu
SUMMARY