- Presentations
- Documents
- Infographics
The Rise of ZStandard: Apache Spark/Parquet/ORC/Avro
Databricks
•
2 years ago
Accelerate Your Apache Spark with Intel Optane DC Persistent Memory
Databricks
•
5 years ago
Big data architectures and the data lake
James Serra
•
7 years ago
Understanding Memory Management In Spark For Fun And Profit
Spark Summit
•
7 years ago
Seamless replication and disaster recovery for Apache Hive Warehouse
DataWorks Summit
•
5 years ago
Sharing metadata across the data lake and streams
DataWorks Summit
•
5 years ago
An Overview on Optimization in Apache Hive: Past, Present, Future
DataWorks Summit
•
6 years ago
Apache Hadoop YARN: state of the union
DataWorks Summit
•
5 years ago
A guide on Aws Security Token Service
Blazeclan Technologies Private Limited
•
10 years ago
Apache Spark 2.0: A Deep Dive Into Structured Streaming - by Tathagata Das
Databricks
•
7 years ago
TensorFrames: Google Tensorflow on Apache Spark
Databricks
•
7 years ago
Enhancing Spark SQL Optimizer with Reliable Statistics
Jen Aman
•
7 years ago
Deep Dive Into Catalyst: Apache Spark 2.0’s Optimizer
Databricks
•
7 years ago
Apache Spark MLlib 2.0 Preview: Data Science and Production
Databricks
•
7 years ago
(BDT309) Data Science & Best Practices for Apache Spark on Amazon EMR
Amazon Web Services
•
8 years ago
Graphene – Microsoft SCOPE on Tez
DataWorks Summit
•
5 years ago
Accelerating query processing
DataWorks Summit
•
5 years ago
Presto query optimizer: pursuit of performance
DataWorks Summit
•
5 years ago
Using LLVM to accelerate processing of data in Apache Arrow
DataWorks Summit
•
5 years ago