Personal Information
Organization / Workplace
San Francisco Bay Area United States
Occupation
Infrastructure Stuff @ Uber
Industry
Technology / Software / Internet
Website
http://byte-array.blogspot.com
About
Experienced Staff Software Engineer with a demonstrated history of building large scale distributed & data systems.
Tags
big data
analytics
data
spark
voldemort
hadoop
apachepulsar
apachehudi
cdc
uber
nosql
keyvalue
distributed systems
database
wbdb2012
2012
wbdb
See more
- Presentations
- Documents
- Infographics
Tuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital Kedia
Databricks
•
6 years ago
File Format Benchmarks - Avro, JSON, ORC, & Parquet
Owen O'Malley
•
7 years ago
The Columnar Era: Leveraging Parquet, Arrow and Kudu for High-Performance Analytics
DataWorks Summit/Hadoop Summit
•
7 years ago
Engineering fast indexes (Deepdive)
Daniel Lemire
•
7 years ago
Apache Spark 2.0: A Deep Dive Into Structured Streaming - by Tathagata Das
Databricks
•
7 years ago
Lessons from Running Large Scale Spark Workloads
Databricks
•
9 years ago
Bringing OLTP woth OLAP: Lumos on Hadoop
DataWorks Summit
•
9 years ago
The Future of Real-Time in Spark
Databricks
•
8 years ago
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San Jose 2015
Databricks
•
9 years ago
Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17
spark-project
•
10 years ago
Efficient Data Storage for Analytics with Apache Parquet 2.0
Cloudera, Inc.
•
9 years ago
Apache storm vs. Spark Streaming
P. Taylor Goetz
•
9 years ago
Mysteries of the binary log
Mats Kindahl
•
13 years ago
MySQL Applier for Apache Hadoop: Real-Time Event Streaming to HDFS
Mats Kindahl
•
10 years ago
Kafka & Hadoop - for NYC Kafka Meetup
Gwen (Chen) Shapira
•
9 years ago
Introduction to docker
Justyna Ilczuk
•
9 years ago
How To Analyze Geolocation Data with Hive and Hadoop
Hortonworks
•
10 years ago
Gc and-pagescan-attacks-by-linux
Cuong Tran
•
10 years ago
High-Performance Storage Services with HailDB and Java
sunnygleason
•
12 years ago
HailDB: A NoSQL API Direct to InnoDB
stewartsmith
•
12 years ago