Yu Liu

42 Followers

• 8+ years of industry experience in software development, including large-scale data processing systems with Hadoop/Spark and high-volume web services
• 8+ years of OOD/OOP experience with Java
• 2+ years of experience applying machine learning algorithms to recommendation systems and anomaly detection systems
• 1+ years of contributions to the open source community; currently an Apache committer
• Strong software architecture design skills for high-volume data processing

Skills
• Languages: Java, Python, Scala, C/C++, SQL, UML, JSON
• Frameworks and DBs: Hadoop, Spark, Spark MLlib, Spark Streaming, Mahout, Cassandra, HBase, HDFS, Kafka, YARN, Hibernate, Spring, Oracle, MySQL
• OS: C

Presentations

Collaborative Filtering at Spotify

Music Recommendations at Scale with Spark

Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17

Scala Data Pipelines @ Spotify

Four Things to Know About Reliable Spark Streaming with Typesafe and Databricks

A real time architecture using Hadoop and Storm @ FOSDEM 2013

Lessons from Running Large Scale Spark Workloads

G1 Garbage Collector - Big Heaps and Low Pauses?

Spark Internals - Hadoop Source Code Reading #16 in Japan

Cassandra Summit 2014: Performance Tuning Cassandra in AWS

Anomaly Detection - New York Machine Learning

Spotify's Music Recommendations Lambda Architecture

Spark's Role in the Big Data Ecosystem (Spark Summit 2014)