Wei Wu

6 Followers 8 Followings

Presentations

The Rise of ZStandard: Apache Spark/Parquet/ORC/Avro

Databricks • 2 years ago

Accelerate Your Apache Spark with Intel Optane DC Persistent Memory

Databricks • 5 years ago

Big data architectures and the data lake

James Serra • 7 years ago

Optimizing Apache Spark Throughput Using Intel Optane and Intel Memory Drive Technology with Ravikanth Durgavajhala

Databricks • 5 years ago

Understanding Memory Management In Spark For Fun And Profit

Spark Summit • 7 years ago

Seamless replication and disaster recovery for Apache Hive Warehouse

DataWorks Summit • 5 years ago

Sharing metadata across the data lake and streams

DataWorks Summit • 5 years ago

An Overview on Optimization in Apache Hive: Past, Present, Future

DataWorks Summit • 6 years ago

Apache Hadoop YARN: state of the union

DataWorks Summit • 5 years ago

A guide on AWS Security Token Service

Blazeclan Technologies Private Limited • 10 years ago

Apache Spark 2.0: A Deep Dive Into Structured Streaming - by Tathagata Das

Databricks • 7 years ago

TensorFrames: Google Tensorflow on Apache Spark

Databricks • 7 years ago

Enhancing Spark SQL Optimizer with Reliable Statistics

Jen Aman • 7 years ago

Deep Dive Into Catalyst: Apache Spark 2.0’s Optimizer

Databricks • 7 years ago

Apache Spark MLlib 2.0 Preview: Data Science and Production

Databricks • 7 years ago

(BDT309) Data Science & Best Practices for Apache Spark on Amazon EMR

Amazon Web Services • 8 years ago

Graphene – Microsoft SCOPE on Tez

DataWorks Summit • 5 years ago

Accelerating query processing

DataWorks Summit • 5 years ago

Presto query optimizer: pursuit of performance

DataWorks Summit • 5 years ago

Using LLVM to accelerate processing of data in Apache Arrow

DataWorks Summit • 5 years ago