Personal Information
Organization / Workplace
Beijing City, China
Occupation
Senior Software Engineer at 乐视网
Industry
Technology / Software / Internet
About
Technical blog (active writer): http://garyelephant.me
GitHub (active developer): https://github.com/garyelephant
Stack Overflow (952 reputation, top 13% in 2016): http://stackoverflow.com/users/1145750/gary-gauh
Apache Arrow Workshop at VLDB 2019 / BOSS Session
Wes McKinney
•
4 years ago
Exactly once with spark streaming
Quentin Ambard
•
5 years ago
The Volcano/Cascades Optimizer
宇 傅
•
5 years ago
Black friday logs - Scaling Elasticsearch
Sylvain Wallez
•
6 years ago
Data cubes
Mohammed
•
12 years ago
Presto query optimizer: pursuit of performance
DataWorks Summit
•
5 years ago
Spark after Dark by Chris Fregly of Databricks
Data Con LA
•
9 years ago
Deep Dive into Apache Kafka
confluent
•
7 years ago
A Deep Dive into Kafka Controller
confluent
•
5 years ago
Large-Scale Machine Learning with Apache Spark
DB Tsai
•
9 years ago
MLlib and Machine Learning on Spark
Petr Zapletal
•
9 years ago
Spark SQL - 10 Things You Need to Know
Kristian Alexander
•
7 years ago
Spark SQL Deep Dive @ Melbourne Spark Meetup
Databricks
•
8 years ago
Spark sql meetup
Michael Zhang
•
9 years ago
Building a Distributed Message Log from Scratch - SCaLE 16x
Tyler Treat
•
6 years ago
Deep Dive into Spark SQL with Advanced Performance Tuning with Xiao Li & Wenchen Fan
Databricks
•
5 years ago
A Deep Dive into Stateful Stream Processing in Structured Streaming with Tathagata Das
Databricks
•
5 years ago
Top Three Big Data Governance Issues and How Apache ATLAS resolves it for the Enterprise
DataWorks Summit/Hadoop Summit
•
7 years ago
Apache Atlas: Governance for your Data
DataWorks Summit/Hadoop Summit
•
6 years ago
Fifth Elephant Apache Atlas Talk
Vimal Sharma
•
6 years ago