Personal Information
Organization / Workplace
Greater Seattle Area United States
Occupation
Member of Technical Staff
Industry
Technology / Software / Internet
About
• 8+ years industrial experience in software development including large scale data processing systems with Hadoop/Spark and high volume web services
• 8+ years OOD/OOP experience with Java
• 2+ years experience in applying machine learning algorithms for recommendation systems and anomaly detection systems
• 1+ years contribution to open source community and currently Apache committer
• Good ability in software architecture design for high volume data processing
Skills
• Languages: Java, Python, Scala, C/C++, SQL, UML, Json
• Frameworks and DBs: Hadoop, Spark, Spark MLlib, Spark Streaming, Mahout, Cassandra, Hbase, HDFS, Kafka, YARN, Hibernate, Spring, Oracle, MySQL
• OS: C
- Presentations
- Documents
- Infographics
Collaborative Filtering at Spotify
Erik Bernhardsson
•
11 years ago
Music Recommendations at Scale with Spark
Chris Johnson
•
9 years ago
Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17
spark-project
•
10 years ago
Scala Data Pipelines @ Spotify
Neville Li
•
8 years ago
Four Things to Know About Reliable Spark Streaming with Typesafe and Databricks
Legacy Typesafe (now Lightbend)
•
8 years ago
A real time architecture using Hadoop and Storm @ FOSDEM 2013
Nathan Bijnens
•
11 years ago
Lessons from Running Large Scale Spark Workloads
Databricks
•
9 years ago
G1 Garbage Collector - Big Heaps and Low Pauses?
C2B2 Consulting
•
11 years ago
Spark Internals - Hadoop Source Code Reading #16 in Japan
Taro L. Saito
•
9 years ago
Cassandra Summit 2014: Performance Tuning Cassandra in AWS
DataStax Academy
•
9 years ago
Anomaly Detection - New York Machine Learning
Ted Dunning
•
9 years ago
Spotify's Music Recommendations Lambda Architecture
Esh Vckay
•
8 years ago
Spark's Role in the Big Data Ecosystem (Spark Summit 2014)
Databricks
•
9 years ago