Slideshare uses cookies to improve functionality and performance, and to provide you with relevant advertising. If you continue browsing the site, you agree to the use of cookies on this website. See our User Agreement and Privacy Policy.
Slideshare uses cookies to improve functionality and performance, and to provide you with relevant advertising. If you continue browsing the site, you agree to the use of cookies on this website. See our Privacy Policy and User Agreement for details.
Published on
http://flink-forward.org/kb_sessions/evaluating-streaming-framework-performance-for-a-large-scale-aggregation-pipeline/
In this talk I present the results of a set of experiments comparing the performance of several implementations of aggregating time-series data. There are 3 implementations: a baseline implementation not using any streaming frameworks, an implementation using Apache Flink, and an implementation using Apache Spark Streaming*. These implementations all ran against the same Kafka cluster using the same data stream, with the goal to understand the limitations of the different implementations. The limitations were measured at 3 input data rates: 100%, 6000%, and breaking-point load.
Login to see the comments