昭伟黄

0 Followers

Presentations
Documents
Infographics

Latest Most Popular

Photon Technical Deep Dive: How to Think Vectorized

Databricks • 3 years ago

What is New with Apache Spark Performance Monitoring in Spark 3.0

Databricks • 3 years ago

Project Zen: Improving Apache Spark for Python Users

Databricks • 3 years ago

Skew Mitigation For Facebook PetabyteScale Joins

Databricks • 3 years ago

Spark NLP: State of the Art Natural Language Processing at Scale

Databricks • 3 years ago

Materialized Column: An Efficient Way to Optimize Queries on Nested Columns

Databricks • 3 years ago

Building a SIMD Supported Vectorized Native Engine for Spark SQL

Databricks • 3 years ago

Frequently Bought Together Recommendations Based on Embeddings

Databricks • 3 years ago

Optimising Geospatial Queries with Dynamic File Pruning

Databricks • 3 years ago

How The Weather Company Uses Apache Spark to Serve Weather Data Fast at Low Cost

Databricks • 3 years ago

Spark SQL Join Improvement at Facebook

Databricks • 3 years ago

Deep Dive into the New Features of Apache Spark 3.0

Databricks • 3 years ago

ClickHouse Query Performance Tips and Tricks, by Robert Hodges, Altinity CEO

Altinity Ltd • 4 years ago

ClickHouse Data Warehouse 101: The First Billion Rows, by Alexander Zaitsev and Robert Hodges, Altinity

Altinity Ltd • 4 years ago

Scalable Acceleration of XGBoost Training on Apache Spark GPU Clusters

Databricks • 3 years ago

On Improving Broadcast Joins in Apache Spark SQL

Databricks • 3 years ago

Adaptive Query Execution: Speeding Up Spark SQL at Runtime

Databricks • 3 years ago

Presto on Apache Spark: A Tale of Two Computation Engines

Databricks • 3 years ago

Zeus: Uber’s Highly Scalable and Distributed Shuffle as a Service

Databricks • 3 years ago

Flash for Apache Spark Shuffle with Cosco

Databricks • 3 years ago