- Presentations
- Documents
- Infographics
Photon Technical Deep Dive: How to Think Vectorized
Databricks
•
3 years ago
What is New with Apache Spark Performance Monitoring in Spark 3.0
Databricks
•
3 years ago
Project Zen: Improving Apache Spark for Python Users
Databricks
•
3 years ago
Skew Mitigation For Facebook PetabyteScale Joins
Databricks
•
3 years ago
Spark NLP: State of the Art Natural Language Processing at Scale
Databricks
•
3 years ago
Materialized Column: An Efficient Way to Optimize Queries on Nested Columns
Databricks
•
3 years ago
Building a SIMD Supported Vectorized Native Engine for Spark SQL
Databricks
•
3 years ago
Frequently Bought Together Recommendations Based on Embeddings
Databricks
•
3 years ago
Optimising Geospatial Queries with Dynamic File Pruning
Databricks
•
3 years ago
How The Weather Company Uses Apache Spark to Serve Weather Data Fast at Low Cost
Databricks
•
3 years ago
Spark SQL Join Improvement at Facebook
Databricks
•
3 years ago
Deep Dive into the New Features of Apache Spark 3.0
Databricks
•
3 years ago
ClickHouse Query Performance Tips and Tricks, by Robert Hodges, Altinity CEO
Altinity Ltd
•
4 years ago
ClickHouse Data Warehouse 101: The First Billion Rows, by Alexander Zaitsev and Robert Hodges, Altinity
Altinity Ltd
•
4 years ago
Scalable Acceleration of XGBoost Training on Apache Spark GPU Clusters
Databricks
•
3 years ago
On Improving Broadcast Joins in Apache Spark SQL
Databricks
•
3 years ago
Adaptive Query Execution: Speeding Up Spark SQL at Runtime
Databricks
•
3 years ago
Presto on Apache Spark: A Tale of Two Computation Engines
Databricks
•
3 years ago
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a Service
Databricks
•
3 years ago
Flash for Apache Spark Shuffle with Cosco
Databricks
•
3 years ago