- Presentations
- Documents
- Infographics
Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Rosen, Databricks)
Spark Summit
•
8 years ago
Coral & Transport UDFs: Building Blocks of a Postmodern Data Warehouse
Walaa Eldin Moustafa
•
4 years ago
Cost-based Query Optimization in Apache Phoenix using Apache Calcite
Julian Hyde
•
7 years ago
The Volcano/Cascades Optimizer
宇 傅
•
5 years ago
Data profiling with Apache Calcite
Julian Hyde
•
6 years ago
Accelerating query processing with materialized views in Apache Hive
DataWorks Summit
•
6 years ago
ORC File - Optimizing Your Big Data
DataWorks Summit
•
6 years ago
ORC 2015: Faster, Better, Smaller
DataWorks Summit
•
8 years ago
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Dremio Corporation
•
6 years ago
Hive Bucketing in Apache Spark with Tejas Patil
Databricks
•
6 years ago
An Adaptive Execution Engine for Apache Spark with Carson Wang and Yucai Yu
Databricks
•
6 years ago
Spark 2.x Troubleshooting Guide
IBM
•
8 years ago
Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Databricks
•
5 years ago
Parquet performance tuning: the missing guide
Ryan Blue
•
7 years ago
Efficient Data Storage for Analytics with Parquet 2.0 - Hadoop Summit 2014
Julien Le Dem
•
9 years ago
Data Source API in Spark
Databricks
•
9 years ago
Why you should care about data layout in the file system with Cheng Lian and Vida Ha
Databricks
•
6 years ago
Deep Dive: Memory Management in Apache Spark
Databricks
•
7 years ago
Apache Spark in Depth: Core Concepts, Architecture & Internals
Anton Kirillov
•
8 years ago