Personal Information
Organization / Workplace
San Francisco Bay Area United States
Occupation
Infrastructure Stuff @ Uber
Industry
Technology / Software / Internet
Website
http://byte-array.blogspot.com
About
Experienced Staff Software Engineer with a demonstrated history of building large scale distributed & data systems.
Tags
big data
analytics
data
spark
voldemort
hadoop
apachepulsar
apachehudi
cdc
uber
nosql
keyvalue
distributed systems
database
wbdb2012
2012
wbdb
See more
Presentations
(7)Documents
(3)Likes
(22)Tuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital Kedia
Databricks
•
6 years ago
File Format Benchmarks - Avro, JSON, ORC, & Parquet
Owen O'Malley
•
7 years ago
The Columnar Era: Leveraging Parquet, Arrow and Kudu for High-Performance Analytics
DataWorks Summit/Hadoop Summit
•
7 years ago
Engineering fast indexes (Deepdive)
Daniel Lemire
•
7 years ago
Apache Spark 2.0: A Deep Dive Into Structured Streaming - by Tathagata Das
Databricks
•
7 years ago
Lessons from Running Large Scale Spark Workloads
Databricks
•
9 years ago
Bringing OLTP woth OLAP: Lumos on Hadoop
DataWorks Summit
•
9 years ago
The Future of Real-Time in Spark
Databricks
•
8 years ago
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San Jose 2015
Databricks
•
9 years ago
Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17
spark-project
•
10 years ago
Efficient Data Storage for Analytics with Apache Parquet 2.0
Cloudera, Inc.
•
9 years ago
Apache storm vs. Spark Streaming
P. Taylor Goetz
•
9 years ago
Mysteries of the binary log
Mats Kindahl
•
13 years ago
MySQL Applier for Apache Hadoop: Real-Time Event Streaming to HDFS
Mats Kindahl
•
10 years ago
Kafka & Hadoop - for NYC Kafka Meetup
Gwen (Chen) Shapira
•
9 years ago
Introduction to docker
Justyna Ilczuk
•
9 years ago
How To Analyze Geolocation Data with Hive and Hadoop
Hortonworks
•
10 years ago
Gc and-pagescan-attacks-by-linux
Cuong Tran
•
10 years ago
High-Performance Storage Services with HailDB and Java
sunnygleason
•
13 years ago
HailDB: A NoSQL API Direct to InnoDB
stewartsmith
•
13 years ago
InnoDB Magic
sunnygleason
•
13 years ago
High performance network programming on the jvm oscon 2012
Erik Onnen
•
11 years ago
Personal Information
Organization / Workplace
San Francisco Bay Area United States
Occupation
Infrastructure Stuff @ Uber
Industry
Technology / Software / Internet
Website
http://byte-array.blogspot.com
About
Experienced Staff Software Engineer with a demonstrated history of building large scale distributed & data systems.
Tags
big data
analytics
data
spark
voldemort
hadoop
apachepulsar
apachehudi
cdc
uber
nosql
keyvalue
distributed systems
database
wbdb2012
2012
wbdb
See more