2. AGENDA (PART 1)
• Background about Totango and our data architecture
• Spark in the Totango Architecture
• Quality: Testing Spark code in production
8. Totango Data Architecture
• ‘Lambda Architecture’
• Hosted on AWS
• AWS and Open-source technologies
• Java with a dash of Python
[Diagram: Pixel, 3rd Party (SFDC) and CSV sources feed a Collection layer, which fans out into Real-time processing and Batch processing paths before a Serving Layer]
9. Totango Data Architecture
• Hosted on AWS
• ‘Lambda Architecture’
• AWS and Open-source technologies
• Java with a dash of Python
[Diagram: the same architecture with the AWS building blocks called out: an ELB in front of collection, Kinesis streams feeding the real-time path, and S3 feeding the batch path]
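To make the collection path concrete, an event can be pushed onto a Kinesis stream roughly like this; the stream name, region, and payload below are illustrative assumptions, not Totango’s actual code:

import json

import boto3

kinesis = boto3.client("kinesis", region_name="us-east-1")

# Hypothetical usage event arriving from the tracking pixel
event = {"service_id": "service-a", "account_id": "acct-42", "action": "login"}

kinesis.put_record(
    StreamName="example-usage-events",       # assumed stream name
    Data=json.dumps(event).encode("utf-8"),
    PartitionKey=event["account_id"],        # keeps one account's events ordered
)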
11. Batch Processing
• Executed once a day (midnight in each customer’s local time)
• Each task calculates a set of account metrics (e.g. Health, Change)
• One Spark cluster runs all tasks for all customers
• Pipeline executed by Pipeline Runner, using Spotify Luigi (sketched below)
[Diagram: raw events feed parallel “calc metrics” tasks and a dependent computation; a merge step combines the results into the final account documents]
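As a rough sketch of how such a Luigi pipeline can be wired together (the task names, metric logic, and S3 paths are hypothetical, not Totango’s actual code):

import luigi
from luigi.contrib.s3 import S3Target


class CalcSomeMetrics(luigi.Task):
    """Hypothetical task: compute one set of account metrics for a service."""
    service_id = luigi.Parameter()
    date = luigi.DateParameter()

    def output(self):
        # Intermediate results land on S3, keyed by service and day
        return S3Target(f"s3://example-bucket/metrics/{self.service_id}/{self.date}/some.csv")

    def run(self):
        with self.output().open("w") as out:
            out.write("account_id,health\n")  # placeholder metric calculation


class MergeResults(luigi.Task):
    """Hypothetical final step: merge metric outputs into account documents."""
    service_id = luigi.Parameter()
    date = luigi.DateParameter()

    def requires(self):
        # Luigi derives the dependency graph from requires(); upstream runs first
        return [CalcSomeMetrics(self.service_id, self.date)]

    def output(self):
        return S3Target(f"s3://example-bucket/documents/{self.service_id}/{self.date}.json")

    def run(self):
        # Combine upstream outputs into the final document (placeholder logic)
        with self.output().open("w") as out:
            for target in self.input():
                with target.open("r") as f:
                    out.write(f.read())

Luigi then runs only the tasks whose outputs are missing, which is what makes daily re-runs of the whole graph cheap.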
12. Environment
• Multi-tenant: shared infrastructure for all Totango customers (Services), fanned out per tenant as sketched below
• Daily, hourly and on-demand schedules
• Standalone Spark cluster on AWS EC2 instances
• Input and output on S3; final results also indexed in Elasticsearch
[Diagram: the same metric pipeline repeated per tenant, from Service A through Service XYZ, each reading raw events and writing account documents]
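One way to express the per-tenant fan-out in the same Luigi style is a wrapper task; the service list and task names here are assumptions for illustration:

import luigi


class ServicePipeline(luigi.Task):
    """Stand-in for the full per-tenant metric pipeline sketched above."""
    service_id = luigi.Parameter()
    date = luigi.DateParameter()

    def output(self):
        return luigi.LocalTarget(f"out/{self.service_id}/{self.date}.done")

    def run(self):
        with self.output().open("w") as out:
            out.write("done\n")


class AllServices(luigi.WrapperTask):
    """Hypothetical fan-out: run the pipeline once per tenant (Service)."""
    date = luigi.DateParameter()

    def requires(self):
        # In production the tenant list would come from a registry, not a literal
        return [ServicePipeline(service_id=s, date=self.date)
                for s in ("service-a", "service-b", "service-xyz")]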
14. Requirements from infrastructure:
• Reliability: Calculate metrics accurately at all times
• Velocity: Frequent release of new data processing code
Challenge:
High quality and highly automated regression testing
[Diagram: the same pipeline with one “calc metrics” task replaced by a NEW VERSION]
How do we make sure the new version didn’t break anything?
15. Testing In Production: How
• Before deployment, run the release candidate ‘side by side’ with the older version
• The new version runs in Shadow mode and does not propagate its results
• Compare old and new version results; output unexpected diffs (see the sketch below)
• Deploy to production only if there are no diffs across all customer data sets
[Diagram: the OLD VERSION serves production while the NEW VERSION runs as a SHADOW; the CSV outputs of both are compared]
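The comparison step could look roughly like this in PySpark; the paths, the header layout, and the use of subtract() are assumptions for illustration, not the actual comparison code:

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("shadow-diff").getOrCreate()

# Hypothetical S3 locations of the two runs' CSV outputs
old = spark.read.csv("s3://example-bucket/run-old/", header=True)
new = spark.read.csv("s3://example-bucket/run-new/", header=True)

# Rows present in one output but not the other are unexpected diffs
diffs = old.subtract(new).union(new.subtract(old))

if diffs.count() > 0:
    diffs.show(truncate=False)  # surface the differences for inspection
    raise SystemExit("Shadow run diverged from the old version; blocking deploy")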
16. Deployment Flow
1. Unit testing
2. Test environment: Integration testing
3. Side-by-side testing in production of new code
4. New code rolled out, old version kept side by side as backup
5. Rollout complete!
• We know the new version works correctly
• We do not need to think of all the corner test cases
• We do not need to write lots of regression tests