- Presentations
- Documents
- Infographics
Oscon keynote: Working hard to keep it simple
Martin Odersky
•
12 years ago
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Parquet
Dremio Corporation
•
6 years ago
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Dremio Corporation
•
6 years ago
Migrating Netflix from Datacenter Oracle to Global Cassandra
Adrian Cockcroft
•
12 years ago
Hadoop Internals
Pietro Michiardi
•
8 years ago
Introduction to Spark Internals
Pietro Michiardi
•
8 years ago
MPP vs Hadoop
Alexey Grishchenko
•
8 years ago
Modern Data Architecture
Alexey Grishchenko
•
8 years ago
Apache Spark Architecture
Alexey Grishchenko
•
8 years ago
Optimizing Hive Queries
DataWorks Summit
•
11 years ago
Apache Hive - Introduction
Muralidharan Deenathayalan
•
9 years ago
RHive tutorials - Basic functions
Aiden Seonghak Hong
•
12 years ago
Faster Batch Processing with Cloudera 5.7: Hive-on-Spark is ready for production
Cloudera, Inc.
•
8 years ago
Apache Kylin: Speed Up Cubing with Apache Spark with Luke Han and Shaofeng Shi
Databricks
•
6 years ago
Apache Kylin Use Cases in China and Japan
Luke Han
•
6 years ago
Apache Hive authorization models
Thejas Nair
•
10 years ago
Making Structured Streaming Ready for Production
Databricks
•
7 years ago
Optimizing Apache Spark SQL Joins
Databricks
•
7 years ago
Extreme Apache Spark: how in 3 months we created a pipeline that can process 2.5 billion rows a day
Josef A. Habdank
•
8 years ago
Spark 2.x Troubleshooting Guide
IBM
•
8 years ago