apache spark spark hadoop data analytics big data big data analytics scala data science mapreduce data mining machine learning generating physical plan catalyst optimizer plan optimization & execution rdd recap comparison with pig and hive pipeline dataframes operations architecture of spark sql extensions data cleansing dataframes spark sql library diagram for logical plan container definition of a dataframes api code generation catalyst analyzer dataframes features big data university streaming streaming applications twitter opensource spark streaming fault tolerance architecture apache spark introduction resilient distributed dataset rdd basics rdd deep dive rdd
See more