8. 8.3 Full text search (Apache SOLR)
8.4 Analytics functions (sum(baz) OVER (PARTITION BY foo))
8.4 CTE (WITH foo AS select * from bar)
9.5 GROUPING SETS/CUBE/ ROLLUP
9.6 parallel seq scan/aggregate (by design)
9. Fast
Very fast
Open source
Very specific SQL
Yandex ClickHouse
Horrible joins
Cant delete data(*)
Александр Зайцев. «Переезжаем на Yandex ClickHouse»
15. gpfdist — parallel file distribution program (more than 100GB)
s3 external tables (read/write/gzip)
COPY on master node (less than 100GB)
Don’t forget about VACUUM
Data loading
16. Data loading
No JSON type
pl/python + ujson
Don’t use JSON, please
Make columns from json fields (schema)