Enterprise Reports – Your cell phone bill is an example
Dashboard – KPI tracking
Parameterized Reports – What are the hot prospects in my region?
Visualization – Visual exploration of data
Data Mining – Large scale data processing and extraction, usually fed to other tools
The OVER clause is similar to GROUP BY, except that GROUP BY produces a single row for each group, whereas the OVER clause produces a result for each row in the group. You specify which partition you would like to use and how you would like to order it, and then you can give it a window (the range of rows around the current row that the function is computed over).
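To make the difference concrete, here is a minimal sketch in plain Python rather than in a query language. The sample rows and the function names `group_by_sum` and `over_running_sum` are made up for illustration; the second function imitates a running sum over a partition ordered by amount, with a window that runs from the start of the partition to the current row.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical sample rows: (region, amount)
ROWS = [("east", 10), ("east", 30), ("west", 5), ("east", 20), ("west", 15)]


def group_by_sum(rows):
    """Like SUM(amount) ... GROUP BY region: one output row per group."""
    totals = {}
    for region, amount in rows:
        totals[region] = totals.get(region, 0) + amount
    return sorted(totals.items())


def over_running_sum(rows):
    """Like SUM(amount) OVER (PARTITION BY region ORDER BY amount
    ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW):
    one output row per input row."""
    out = []
    # Sorting by (region, amount) groups each partition and orders it.
    ordered = sorted(rows, key=itemgetter(0, 1))
    for region, partition in groupby(ordered, key=itemgetter(0)):
        running = 0
        for _, amount in partition:
            running += amount  # window: partition start up to current row
            out.append((region, amount, running))
    return out


if __name__ == "__main__":
    print(group_by_sum(ROWS))      # 2 rows, one per region
    print(over_running_sum(ROWS))  # 5 rows, one per input row
```

The grouped version collapses five input rows into two, while the windowed version keeps all five and attaches the running total to each.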
Sort Merge Bucket (SMB) join
If both tables are:
 - Sorted the same
 - Bucketed the same
 - Joined on the sort/bucket column
Each process:
 - Reads a bucket from each table
 - Processes the row with the lowest value
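The per-process step above can be sketched as a merge over two sorted buckets. This is an illustrative inner equi-join in plain Python, not Hive's implementation; the names `merge_join_bucket` and `smb_join` are hypothetical, and each bucket is assumed to be a list of `(key, value)` pairs already sorted on the join key.

```python
def merge_join_bucket(left, right):
    """Inner-join two buckets that are both sorted on the join key."""
    out = []
    i = j = 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][0], right[j][0]
        if lk < rk:
            i += 1  # left row has the lowest key and no match: skip it
        elif lk > rk:
            j += 1  # right row has the lowest key and no match: skip it
        else:
            # Find the run of rows sharing this key on each side.
            i_end = i
            while i_end < len(left) and left[i_end][0] == lk:
                i_end += 1
            j_end = j
            while j_end < len(right) and right[j_end][0] == lk:
                j_end += 1
            # Emit every left/right combination for the matching key.
            for _, lv in left[i:i_end]:
                for _, rv in right[j:j_end]:
                    out.append((lk, lv, rv))
            i, j = i_end, j_end
    return out


def smb_join(left_buckets, right_buckets):
    """Pair bucket n of one table with bucket n of the other and merge them."""
    out = []
    for lb, rb in zip(left_buckets, right_buckets):
        out.extend(merge_join_bucket(lb, rb))
    return out


if __name__ == "__main__":
    # Hypothetical data bucketed by key % 2, each bucket sorted by key.
    left = [[(2, "a"), (4, "b")], [(1, "c"), (3, "d"), (3, "e")]]
    right = [[(2, "x"), (6, "y")], [(3, "z")]]
    print(smb_join(left, right))
```

Because both sides are sorted and bucketed identically, each bucket pair is read once from start to end and no hash table or extra sort is needed.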
Community developed frameworks
 - Machine learning / Analytics (MPI, GraphLab, Giraph, Hama, Spark, …)
 - Services inside Hadoop (memcache, HBase, Storm…)
 - Low latency computing (CEP or stream processing)