- Presentations
- Documents
- Infographics
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and cons
Jose Quesada (hiring)
•
7 years ago
Strata NY 2017 Parquet Arrow roadmap
Julien Le Dem
•
6 years ago
Advanced Apache Spark Meetup Spark SQL + DataFrames + Catalyst Optimizer + Data Sources API
Chris Fregly
•
8 years ago
Parquet performance tuning: the missing guide
Ryan Blue
•
7 years ago
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets with Jules Damji
Databricks
•
6 years ago
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San Jose 2015
Databricks
•
9 years ago
Spark 2.x Troubleshooting Guide
IBM
•
8 years ago
Top 5 Mistakes to Avoid When Writing Apache Spark Applications
Cloudera, Inc.
•
8 years ago
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
Databricks
•
6 years ago