Zsolt Várnai, Principal Software Engineer at Skyscanner - "The advantages of real-time monitoring in apps development"

Zsolt Várnai, Principal Software Engineer at Skyscanner, presented "The advantages of real-time monitoring in apps development" as part of the Big Data, Budapest v 3.0 meetup organised on the 19th of May 2016 at Skyscanner's headquarters.

The advantages of real-time monitoring in app development
Zsolt Varnai
Principal Software Engineer
zsolt.varnai@skyscanner.net

01 The past
02 Issues
03 Possible solutions
04 The present
05 The future
06 Examples

The past (1-2 years ago)
• Monthly or less frequent releases
• Big release test cycle and bug-fixing period
• Analytics data is used occasionally
• Minimal information about what is happening in the production app

The past (1-2 years ago)
• "If it worked before release then it will work later as well" => NOT TRUE
• What can change
  • OS update
  • New devices
  • Server side behavior (any 3rd party tool + internal servers)
  • Higher diversity, lots of use cases on real devices

The past (6 months ago)
• Bi-weekly release trains
• Feature flags control feature visibility
• Checking GA and MP data through the API on a daily basis (with a daily summary)
• Reacting to issues within 1-2 days
• Monitoring app reviews regularly (Slack channel feed)
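
The feature-flag bullet on this slide can be illustrated with a minimal sketch. It assumes an in-memory flag store whose values can be overridden by a remotely fetched configuration; the class name, flag names, and the override mechanism are hypothetical and are not taken from the talk.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class FeatureFlags:
    """Hypothetical in-memory flag store with remote overrides."""
    defaults: Dict[str, bool]
    remote_overrides: Dict[str, bool] = field(default_factory=dict)

    def apply_remote_config(self, overrides: Dict[str, bool]) -> None:
        # Only accept overrides for flags this app build already knows about.
        for name, enabled in overrides.items():
            if name in self.defaults:
                self.remote_overrides[name] = enabled

    def is_enabled(self, name: str) -> bool:
        # A remote value wins over the default; unknown flags are disabled.
        if name in self.remote_overrides:
            return self.remote_overrides[name]
        return self.defaults.get(name, False)


def visible_features(flags: FeatureFlags, features: List[str]) -> List[str]:
    """Return only the features whose flag is currently enabled."""
    return [name for name in features if flags.is_enabled(name)]


flags = FeatureFlags(defaults={"new_search": True, "price_alerts": False})
print(visible_features(flags, ["new_search", "price_alerts"]))
# A remote config update can hide a misbehaving feature without a new release.
flags.apply_remote_config({"new_search": False})
print(visible_features(flags, ["new_search", "price_alerts"]))
```

The point of the design is that visibility is decided at runtime, so a broken feature can be switched off between releases.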

Issues
• What could possibly go wrong?
  • Failing network requests
    • Looks OK on the server side, so it remains unnoticed
    • Client fails when it tries to process the response
  • 3rd party tools causing crashes
  • There is no failure, but the app doesn’t show the relevant content
  • Invalid state causing permanent crash/error loops
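
One way to picture the "looks OK on the server side" case is a client that receives a successful status code but cannot use the body. The sketch below is an assumption-laden illustration: the `results` field, the event type names, and the `report` callback are all hypothetical. It shows the client reporting each failure mode as an event instead of failing silently.

```python
import json
from typing import Callable, Dict, Optional


def parse_search_response(
    status: int, body: str, report: Callable[[Dict], None]
) -> Optional[Dict]:
    """Return the parsed payload, or None after reporting why it was unusable."""
    if status != 200:
        # Visible on the server side as well.
        report({"type": "network_error", "status": status})
        return None
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        # The server logged a success, but the client cannot read the body.
        report({"type": "parse_error", "status": status})
        return None
    if not isinstance(payload, dict) or not payload.get("results"):
        # No failure anywhere, yet there is nothing relevant to show.
        report({"type": "empty_content", "status": status})
        return None
    return payload


events = []
parse_search_response(200, "<html>maintenance page</html>", events.append)
parse_search_response(200, '{"results": []}', events.append)
print([event["type"] for event in events])
```

Both calls in the usage example would count as successes in server logs, which is why the client has to emit its own events.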

Solutions
• Collect, process and monitor as much data as possible from various sources
  • Analytics data (conversion metrics and other metrics for core functionality)
  • Monitor store reviews (manual, but a good source of direct information)
  • Low level application logs, visible and silent errors/warnings

Today
• Deep instrumentation throughout the application code
• Stream-based real-time metrics from production apps (Kafka)
• Aggregating relevant metrics from the event stream (OpenTSDB, a time series database)
• Alerting on metrics (Bosun)
• Incident management system (VictorOps)
• Dashboards (Grafana)
• Drilling down on detailed events in case of an incident (Elasticsearch)
• Good chance of fixing big issues remotely before a new release (feature flag coverage)
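
The flow on this slide (event stream, aggregation into a time series, alerting on the metric) can be sketched in plain Python. This is a toy stand-in under stated assumptions, not the Kafka, OpenTSDB, or Bosun APIs: the event shape (`ts`, `type`), the one-minute bucket size, and the threshold values are all hypothetical.

```python
from collections import defaultdict
from typing import Any, Dict, Iterable, List, Tuple

Event = Dict[str, Any]


def aggregate_per_minute(events: Iterable[Event]) -> Dict[int, Tuple[int, int]]:
    """Map each minute bucket (epoch seconds floored to 60) to (total, errors)."""
    totals: Dict[int, int] = defaultdict(int)
    errors: Dict[int, int] = defaultdict(int)
    for event in events:
        bucket = int(event["ts"]) // 60 * 60
        totals[bucket] += 1
        if event["type"] == "error":
            errors[bucket] += 1
    return {bucket: (totals[bucket], errors[bucket]) for bucket in sorted(totals)}


def find_alerts(
    buckets: Dict[int, Tuple[int, int]],
    threshold: float = 0.05,
    min_events: int = 20,
) -> List[int]:
    """Return buckets whose error rate exceeds the threshold.

    Buckets with fewer than min_events are skipped so that a handful of
    errors in a quiet minute does not page anyone.
    """
    alerts = []
    for bucket, (total, error_count) in buckets.items():
        if total >= min_events and error_count / total > threshold:
            alerts.append(bucket)
    return alerts


# Simulated stream: a healthy minute followed by a minute with 20% errors.
stream = [{"ts": i % 60, "type": "error" if i < 2 else "search"} for i in range(100)]
stream += [{"ts": 60 + i, "type": "error" if i < 10 else "search"} for i in range(50)]
print(find_alerts(aggregate_per_minute(stream)))
```

In a real pipeline each stage would be a separate system; keeping them as two small functions here only makes the data flow visible.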

Future
• Smarter alerting capabilities
  • General error/crash rates are misleading
  • Ability to alert on big changes within a specific dimension (app version, running experiments, different error types/services)
• Proper green flag system to alert relevant people without a dedicated squad to supervise (“You build it, you run it” model)
• Automated staged rollout progression based on real-time metrics
• Automated review analysis
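
To see why a general error rate can mislead, consider a new app version with few users and many errors. The sketch below is illustrative only: the event fields, the `factor` of 3, and the minimum sample size are assumptions, not values from the talk. It flags dimension values whose error rate is far above the overall rate.

```python
from collections import defaultdict
from typing import Any, Dict, List

Event = Dict[str, Any]


def overall_error_rate(events: List[Event]) -> float:
    """Share of events that are errors, across all dimension values."""
    if not events:
        return 0.0
    return sum(1 for event in events if event["type"] == "error") / len(events)


def counts_by(events: List[Event], dimension: str) -> Dict[str, List[int]]:
    """Map each value of the dimension to [total, errors]."""
    counts: Dict[str, List[int]] = defaultdict(lambda: [0, 0])
    for event in events:
        key = str(event.get(dimension, "unknown"))
        counts[key][0] += 1
        if event["type"] == "error":
            counts[key][1] += 1
    return dict(counts)


def outlier_values(
    events: List[Event], dimension: str, factor: float = 3.0, min_events: int = 20
) -> List[str]:
    """Values whose error rate is more than factor times the overall rate."""
    baseline = overall_error_rate(events)
    flagged = []
    for value, (total, error_count) in counts_by(events, dimension).items():
        if total >= min_events and error_count / total > factor * baseline:
            flagged.append(value)
    return sorted(flagged)


# 1000 events on an established version (1% errors), 50 on a new one (30% errors).
sample = [{"app_version": "5.0", "type": "error" if i < 10 else "search"} for i in range(1000)]
sample += [{"app_version": "5.1", "type": "error" if i < 15 else "search"} for i in range(50)]
print(round(overall_error_rate(sample), 4), outlier_values(sample, "app_version"))
```

The overall rate in this sample stays below 3%, so a single global threshold of, say, 5% would stay quiet while one version fails for almost a third of its events.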

Thank you!
Questions?
zsolt.varnai@skyscanner.net
@CodeVoyagers
http://codevoyagers.com/
