SlideShare a Scribd company logo
1 of 54
Angelo Fausti & Frossie Economou
Vera C Rubin Observatory
How InfluxDB is helping us in
our quest to make the deepest,
widest image of the universe
Influx for
hardware
telemetry
Influx for
devops-
type
metrics
Influx for
capturing
scientific
insight
… but how did we get here?
Space
Space
Space is in a
state of flux
• Comets and asteroids
vary in position
• (Super)novae, variable
stars vary in brightness
• Galaxies vary in age
• Dark energy varies in,
uh, spacetime?
maybe?
Subaru HSC colour composite of COSMOS field, NAOJ
How to understand the
changing universe in 5
[not very] easy steps
xkcd
1522
Step 0:
Find Funding
Step 1:
Build a
3200 Megapixel
Camera
LSST Camera
Media: Rubin Observatory
Step 2:
Build a large
but nimble
telescope
Media: Rubin Observatory
<- 8.4 meter continuous
surface primary-tertiary mirror
Step 3:
Haul everything
up a mountain
Media: Rubin Observatory
Yes there’s Internet
No you can’t count on it
Step 4:
Observe the Sky
Relentlessly
for 10 years;
Issue 10M Alerts
Every Night
Media: Rubin Observatory
• “All” sky 2x per week
• 60 seconds to produce
alerts
• 10-year images: 0.5 EB
• Final DB size: 15 PB
Legacy Survey of Space & Time (LSST)
observing cadence simulation
Step 5:
Get People
(also a data centre or three)
Write Software
Wait for 2022
Media: Rubin Observatory
And get yourself a data
centre or three…
All our own code is
💯% open source
github.com/lsst
github.com/lsst-sqre
photo: Wil O’Mullane
← ~ Oct 2019
We’ll hang
out on
#influxdays-
virtual
for more
Q&A
(@frossie
@afausti)
Over to
Angelo
How InfluxDB Helps Vera C. Rubin Observatory
Make the Deepest, Widest Image of the Universe
15
InfluxDays North America
November 2020
Frossie Economou
Technical Manager for Data Management,
Vera C. Rubin Observatory
Angelo Fausti
Software Engineer
Vera C. Rubin Observatory
HSC COSMOS Ultra Deep Field (1.77 deg2) ~ Rubin 10yr depth
Data processing in Astronomy
https://pipelines.lsst.io
17
Data Management team
~70 FTEs (105 members)
18
I - Application Monitoring
Science Requirements and Performance Metrics
19
Rubin Science Requirements
https://ls.st/lpm-17
Example: Astrometric Performance
Better astrometry
Minimum goalDesign goalStretch goal
20
What a metric definition looks like
Verification Framework https://sqr-019.lsst.io
21
What a specification looks like
https://sqr-019.lsst.io
22
23
Problems with our in-house solution
● A relational DB is not optimized for time series data
● Stuck with predefined dashboards and visualizations
● Limited exploratory analysis capabilities
● Our in-house development didn’t scale
● Use time more wisely: adopt an existing solution instead of
(re)inventing our own
24
Time (Years)
Adopting a TSDB, which one?
https://db-engines.com/en/ranking
25+
25
30+
log(Score)
“If it takes more than three days to get it
working it is not the right solution for you.”
Frossie Economou
26
Why InfluxDB?
● It is more than a TSDB, it is an innovative solution
● Open source software and community
● InfluxDB: efficient store for time series + InfluxQL and
Flux language
● Chronograf: postdefined visualizations
● Kapacitor: foster collaborative conversation (Slack)
27
InfluxDB schema design
FieldsTags
Results from the Data Release Production pipeline
● Measurement groups the results of the pipeline
● Timestamp is the time when the pipeline run finishes
● Tags are metadata associated to the pipeline run
● Fields are the metrics measured by the pipeline
Timestamp
28
First the Tags, then the Series
29
filter is the name of the optical filter used
at the telescope at a given time
drp,dataset=HSC,tract=509,filter=g {fields} timestamp
For each combination of tag values, there’s a new series.
A tract identifies a region in the
sky*
(*) https://pipelines.lsst.io/modules/lsst.skymap
Example of a Series
AM1: 6.42357
AM2: 6.48177
AM3: 4.62033
Time (run ID)
{field-set}i
Each point in a series contains the set of metrics measured by
the pipeline run and the results are grouped by the pipeline
name.
30
drp,dataset=HSC,tract=509,filter=g
Tracking application metrics with InfluxDB
https://squash.lsst.codes
31
Notifications going to Slack
32
Why that metric value change?
Make an annotation!
33
“Annotations are more important than
the data itself.”
Frossie Economou
34
II - Engineering and Facilities Database
Real-Time Monitoring of the Observatory Data
35
36
All subsystems of the Observatory coexist in a state of active interplay.
Observatory Data
https://ts-xml.lsst.io
37
● 60+ subsystems
● Total of 1148 DDS topics
○ 350 commands
○ 531 events
○ 267 telemetry topics
● Total throughput ~21GB/h → real-time monitoring
○ ~15TB per month → offline analysis
○ ~1.5PB for the 10yr of operations → trend analysis
The M1M3 mirror cell subsystem
38
M1M3 mirror cell data
39
● 156 force actuators and sensors producing data at 50Hz
● Can we record and analyze the M1M3 data in real-time?
Kafka + InfluxDB architecture
https://sqr-029.lsst.io
40
Stream Reactor
(OSS)
End-to-end latency characterization
Latency = (WriteTimestamp - SndTimestamp)
41
SndTimestamp
WriteTimestamp
Median latency ~60ms writing ~100k ppm
Executing queries
while writing
42
43
Aux Telescope and Weather Station tower
Aux Telescope Camera
Tucson Teststand - Aug 2019
44
Weather Station
Summit - September 2019
45
46
M2 mirror cell functional testing
Summit - March 2020
M2 mirror cell functional testing
Summit - March 2020
47
48
The beginnings of the Telescope control room
Summit - March 2020
49
US Data Facility
Urbana, IL
Project staff access
RP 10yr
TestStand
Tucson, AZ
Summit
Cerro Pachon, Chile
Restricted access
RP ~30 days
TestStand
Chilean Data Facility
La Serena, Chile
<10MB/s
raw stream
A preview of
operations
Data Replication and Aggregation
https://sqr-034.lsst.codes
50
Data Aggregation in Kafka with Faust
https://kafka-aggregator.lsst.io
51
Faust agents compute summary statistics on non-
overlapping windows of N seconds.
Data Reduction factor R~10
What’s next
52
● Migration to InfluxDB 2.0
○ Conversation with InfluxData design team about Annotations in 2.0
○ Flux training for the Observatory Staff
○ Flux Tasks for downsampling and trend analysis
● Rubin Observatory Interim Data Facility on Google Cloud
● Project transition from Construction to Operations is happening
○ New opportunities for using InfluxDB
● Self-monitoring
● Scalability as we load more data, RPs, etc.
Learn more…
53
● Vera C. Rubin Observatory
● Data Processing
● Verification Framework
● Engineering and Facilities Database
● Kafka Aggregator
● Rubin Science Platform
● Rubin Technical Documentation
Thank you!
54

More Related Content

What's hot

What's hot (20)

Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...
Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...
Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...
 
Why Architecting for Disaster Recovery is Important for Your Time Series Data...
Why Architecting for Disaster Recovery is Important for Your Time Series Data...Why Architecting for Disaster Recovery is Important for Your Time Series Data...
Why Architecting for Disaster Recovery is Important for Your Time Series Data...
 
Container Monitoring Best Practices Using AWS and InfluxData by Gunnar Aasen
Container Monitoring Best Practices Using AWS and InfluxData by Gunnar AasenContainer Monitoring Best Practices Using AWS and InfluxData by Gunnar Aasen
Container Monitoring Best Practices Using AWS and InfluxData by Gunnar Aasen
 
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy FarkasVirtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
 
Optimizing InfluxDB Performance in the Real World by Dean Sheehan, Senior Dir...
Optimizing InfluxDB Performance in the Real World by Dean Sheehan, Senior Dir...Optimizing InfluxDB Performance in the Real World by Dean Sheehan, Senior Dir...
Optimizing InfluxDB Performance in the Real World by Dean Sheehan, Senior Dir...
 
Databricks clusters in autopilot mode
Databricks clusters in autopilot modeDatabricks clusters in autopilot mode
Databricks clusters in autopilot mode
 
OPTIMIZING THE TICK STACK
OPTIMIZING THE TICK STACKOPTIMIZING THE TICK STACK
OPTIMIZING THE TICK STACK
 
Principles in Data Stream Processing | Matthias J Sax, Confluent
Principles in Data Stream Processing | Matthias J Sax, ConfluentPrinciples in Data Stream Processing | Matthias J Sax, Confluent
Principles in Data Stream Processing | Matthias J Sax, Confluent
 
Flink Forward SF 2017: James Malone - Make The Cloud Work For You
Flink Forward SF 2017: James Malone - Make The Cloud Work For YouFlink Forward SF 2017: James Malone - Make The Cloud Work For You
Flink Forward SF 2017: James Malone - Make The Cloud Work For You
 
Processing 70Tb Of Genomics Data With ADAM And Toil
Processing 70Tb Of Genomics Data With ADAM And ToilProcessing 70Tb Of Genomics Data With ADAM And Toil
Processing 70Tb Of Genomics Data With ADAM And Toil
 
FlinkDTW: Time-series Pattern Search at Scale Using Dynamic Time Warping - Ch...
FlinkDTW: Time-series Pattern Search at Scale Using Dynamic Time Warping - Ch...FlinkDTW: Time-series Pattern Search at Scale Using Dynamic Time Warping - Ch...
FlinkDTW: Time-series Pattern Search at Scale Using Dynamic Time Warping - Ch...
 
Espresso: LinkedIn's Distributed Data Serving Platform (Talk)
Espresso: LinkedIn's Distributed Data Serving Platform (Talk)Espresso: LinkedIn's Distributed Data Serving Platform (Talk)
Espresso: LinkedIn's Distributed Data Serving Platform (Talk)
 
Data Engineer’s Lunch #41: PygramETL
Data Engineer’s Lunch #41: PygramETLData Engineer’s Lunch #41: PygramETL
Data Engineer’s Lunch #41: PygramETL
 
Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...
Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...
Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...
 
And Then There Are Algorithms
And Then There Are AlgorithmsAnd Then There Are Algorithms
And Then There Are Algorithms
 
Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...
Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...
Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...
 
InfluxDB Live Product Training
InfluxDB Live Product TrainingInfluxDB Live Product Training
InfluxDB Live Product Training
 
Intro to open source observability with grafana, prometheus, loki, and tempo(...
Intro to open source observability with grafana, prometheus, loki, and tempo(...Intro to open source observability with grafana, prometheus, loki, and tempo(...
Intro to open source observability with grafana, prometheus, loki, and tempo(...
 
Spark Streaming into context
Spark Streaming into contextSpark Streaming into context
Spark Streaming into context
 
Stateful stream processing with Apache Flink
Stateful stream processing with Apache FlinkStateful stream processing with Apache Flink
Stateful stream processing with Apache Flink
 

Similar to Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB Helps Vera C. Rubin Observatory Make the Deepest, Widest Image of the Universe | InfluxDays Virtual Experience NA 2020

Accelerating Astronomical Discoveries with Apache Spark
Accelerating Astronomical Discoveries with Apache SparkAccelerating Astronomical Discoveries with Apache Spark
Accelerating Astronomical Discoveries with Apache Spark
Databricks
 

Similar to Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB Helps Vera C. Rubin Observatory Make the Deepest, Widest Image of the Universe | InfluxDays Virtual Experience NA 2020 (20)

Burst data retrieval after 50k GPU Cloud run
Burst data retrieval after 50k GPU Cloud runBurst data retrieval after 50k GPU Cloud run
Burst data retrieval after 50k GPU Cloud run
 
Accelerating Astronomical Discoveries with Apache Spark
Accelerating Astronomical Discoveries with Apache SparkAccelerating Astronomical Discoveries with Apache Spark
Accelerating Astronomical Discoveries with Apache Spark
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
 
Bring Satellite and Drone Imagery into your Data Science Workflows
Bring Satellite and Drone Imagery into your Data Science WorkflowsBring Satellite and Drone Imagery into your Data Science Workflows
Bring Satellite and Drone Imagery into your Data Science Workflows
 
NRP Engagement webinar - Running a 51k GPU multi-cloud burst for MMA with Ic...
 NRP Engagement webinar - Running a 51k GPU multi-cloud burst for MMA with Ic... NRP Engagement webinar - Running a 51k GPU multi-cloud burst for MMA with Ic...
NRP Engagement webinar - Running a 51k GPU multi-cloud burst for MMA with Ic...
 
Near Exascale Computing in the Cloud
Near Exascale Computing in the CloudNear Exascale Computing in the Cloud
Near Exascale Computing in the Cloud
 
HPC Cluster Computing from 64 to 156,000 Cores 
HPC Cluster Computing from 64 to 156,000 Cores HPC Cluster Computing from 64 to 156,000 Cores 
HPC Cluster Computing from 64 to 156,000 Cores 
 
Crash course on data streaming (with examples using Apache Flink)
Crash course on data streaming (with examples using Apache Flink)Crash course on data streaming (with examples using Apache Flink)
Crash course on data streaming (with examples using Apache Flink)
 
19th Session.pptx
19th Session.pptx19th Session.pptx
19th Session.pptx
 
"Building and running the cloud GPU vacuum cleaner"
"Building and running the cloud GPU vacuum cleaner""Building and running the cloud GPU vacuum cleaner"
"Building and running the cloud GPU vacuum cleaner"
 
OSDC 2016 - Chronix - A fast and efficient time series storage based on Apach...
OSDC 2016 - Chronix - A fast and efficient time series storage based on Apach...OSDC 2016 - Chronix - A fast and efficient time series storage based on Apach...
OSDC 2016 - Chronix - A fast and efficient time series storage based on Apach...
 
A Fast and Efficient Time Series Storage Based on Apache Solr
A Fast and Efficient Time Series Storage Based on Apache SolrA Fast and Efficient Time Series Storage Based on Apache Solr
A Fast and Efficient Time Series Storage Based on Apache Solr
 
Chronix: A fast and efficient time series storage based on Apache Solr
Chronix: A fast and efficient time series storage based on Apache SolrChronix: A fast and efficient time series storage based on Apache Solr
Chronix: A fast and efficient time series storage based on Apache Solr
 
Astronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache SparkAstronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache Spark
 
Solar System Processing with LSST: A Status Update
Solar System Processing with LSST: A Status UpdateSolar System Processing with LSST: A Status Update
Solar System Processing with LSST: A Status Update
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
 
Round Table Introduction: Analytics on 100 TB+ catalogs
Round Table Introduction: Analytics on 100 TB+ catalogsRound Table Introduction: Analytics on 100 TB+ catalogs
Round Table Introduction: Analytics on 100 TB+ catalogs
 
The Matsu Project - Open Source Software for Processing Satellite Imagery Data
The Matsu Project - Open Source Software for Processing Satellite Imagery DataThe Matsu Project - Open Source Software for Processing Satellite Imagery Data
The Matsu Project - Open Source Software for Processing Satellite Imagery Data
 
Chronix Time Series Database - The New Time Series Kid on the Block
Chronix Time Series Database - The New Time Series Kid on the BlockChronix Time Series Database - The New Time Series Kid on the Block
Chronix Time Series Database - The New Time Series Kid on the Block
 
The new time series kid on the block
The new time series kid on the blockThe new time series kid on the block
The new time series kid on the block
 

More from InfluxData

How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
InfluxData
 
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
How Delft University's Engineering Students Make Their EV Formula-Style Race ...How Delft University's Engineering Students Make Their EV Formula-Style Race ...
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
InfluxData
 
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
InfluxData
 
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
InfluxData
 

More from InfluxData (20)

Announcing InfluxDB Clustered
Announcing InfluxDB ClusteredAnnouncing InfluxDB Clustered
Announcing InfluxDB Clustered
 
Best Practices for Leveraging the Apache Arrow Ecosystem
Best Practices for Leveraging the Apache Arrow EcosystemBest Practices for Leveraging the Apache Arrow Ecosystem
Best Practices for Leveraging the Apache Arrow Ecosystem
 
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
 
Power Your Predictive Analytics with InfluxDB
Power Your Predictive Analytics with InfluxDBPower Your Predictive Analytics with InfluxDB
Power Your Predictive Analytics with InfluxDB
 
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
 
Build an Edge-to-Cloud Solution with the MING Stack
Build an Edge-to-Cloud Solution with the MING StackBuild an Edge-to-Cloud Solution with the MING Stack
Build an Edge-to-Cloud Solution with the MING Stack
 
Meet the Founders: An Open Discussion About Rewriting Using Rust
Meet the Founders: An Open Discussion About Rewriting Using RustMeet the Founders: An Open Discussion About Rewriting Using Rust
Meet the Founders: An Open Discussion About Rewriting Using Rust
 
Introducing InfluxDB Cloud Dedicated
Introducing InfluxDB Cloud DedicatedIntroducing InfluxDB Cloud Dedicated
Introducing InfluxDB Cloud Dedicated
 
Gain Better Observability with OpenTelemetry and InfluxDB
Gain Better Observability with OpenTelemetry and InfluxDB Gain Better Observability with OpenTelemetry and InfluxDB
Gain Better Observability with OpenTelemetry and InfluxDB
 
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
 
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
How Delft University's Engineering Students Make Their EV Formula-Style Race ...How Delft University's Engineering Students Make Their EV Formula-Style Race ...
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
 
Introducing InfluxDB’s New Time Series Database Storage Engine
Introducing InfluxDB’s New Time Series Database Storage EngineIntroducing InfluxDB’s New Time Series Database Storage Engine
Introducing InfluxDB’s New Time Series Database Storage Engine
 
Start Automating InfluxDB Deployments at the Edge with balena
Start Automating InfluxDB Deployments at the Edge with balena Start Automating InfluxDB Deployments at the Edge with balena
Start Automating InfluxDB Deployments at the Edge with balena
 
Understanding InfluxDB’s New Storage Engine
Understanding InfluxDB’s New Storage EngineUnderstanding InfluxDB’s New Storage Engine
Understanding InfluxDB’s New Storage Engine
 
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDBStreamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
 
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
 
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
 
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
 
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
 
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Recently uploaded (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 

Frossie Economou & Angelo Fausti [Vera C. Rubin Observatory] | How InfluxDB Helps Vera C. Rubin Observatory Make the Deepest, Widest Image of the Universe | InfluxDays Virtual Experience NA 2020

  • 1. Angelo Fausti & Frossie Economou Vera C Rubin Observatory How InfluxDB is helping us in our quest to make the deepest, widest image of the universe
  • 2. Influx for hardware telemetry Influx for devops- type metrics Influx for capturing scientific insight … but how did we get here?
  • 5. Space is in a state of flux • Comets and asteroids vary in position • (Super)novae, variable stars vary in brightness • Galaxies vary in age • Dark energy varies in, uh, spacetime? maybe? Subaru HSC colour composite of COSMOS field, NAOJ
  • 6. How to understand the changing universe in 5 [not very] easy steps xkcd 1522
  • 8. Step 1: Build a 3200 Megapixel Camera LSST Camera Media: Rubin Observatory
  • 9. Step 2: Build a large but nimble telescope Media: Rubin Observatory <- 8.4 meter continuous surface primary-tertiary mirror
  • 10. Step 3: Haul everything up a mountain Media: Rubin Observatory Yes there’s Internet No you can’t count on it
  • 11.
  • 12. Step 4: Observe the Sky Relentlessly for 10 years; Issue 10M Alerts Every Night Media: Rubin Observatory • “All” sky 2x per week • 60 seconds to produce alerts • 10-year images: 0.5 EB • Final DB size: 15 PB Legacy Survey of Space & Time (LSST) observing cadence simulation
  • 13. Step 5: Get People (also a data centre or three) Write Software Wait for 2022 Media: Rubin Observatory And get yourself a data centre or three… All our own code is 💯% open source github.com/lsst github.com/lsst-sqre
  • 14. photo: Wil O’Mullane ← ~ Oct 2019 We’ll hang out on #influxdays- virtual for more Q&A (@frossie @afausti) Over to Angelo
  • 15. How InfluxDB Helps Vera C. Rubin Observatory Make the Deepest, Widest Image of the Universe 15 InfluxDays North America November 2020 Frossie Economou Technical Manager for Data Management, Vera C. Rubin Observatory Angelo Fausti Software Engineer Vera C. Rubin Observatory
  • 16. HSC COSMOS Ultra Deep Field (1.77 deg2) ~ Rubin 10yr depth
  • 17. Data processing in Astronomy https://pipelines.lsst.io 17
  • 18. Data Management team ~70 FTEs (105 members) 18
  • 19. I - Application Monitoring Science Requirements and Performance Metrics 19
  • 20. Rubin Science Requirements https://ls.st/lpm-17 Example: Astrometric Performance Better astrometry Minimum goalDesign goalStretch goal 20
  • 21. What a metric definition looks like Verification Framework https://sqr-019.lsst.io 21
  • 22. What a specification looks like https://sqr-019.lsst.io 22
  • 23. 23
  • 24. Problems with our in-house solution ● A relational DB is not optimized for time series data ● Stuck with predefined dashboards and visualizations ● Limited exploratory analysis capabilities ● Our in-house development didn’t scale ● Use time more wisely: adopt an existing solution instead of (re)inventing our own 24
  • 25. Time (Years) Adopting a TSDB, which one? https://db-engines.com/en/ranking 25+ 25 30+ log(Score)
  • 26. “If it takes more than three days to get it working it is not the right solution for you.” Frossie Economou 26
  • 27. Why InfluxDB? ● It is more than a TSDB, it is an innovative solution ● Open source software and community ● InfluxDB: efficient store for time series + InfluxQL and Flux language ● Chronograf: postdefined visualizations ● Kapacitor: foster collaborative conversation (Slack) 27
  • 28. InfluxDB schema design FieldsTags Results from the Data Release Production pipeline ● Measurement groups the results of the pipeline ● Timestamp is the time when the pipeline run finishes ● Tags are metadata associated to the pipeline run ● Fields are the metrics measured by the pipeline Timestamp 28
  • 29. First the Tags, then the Series 29 filter is the name of the optical filter used at the telescope at a given time drp,dataset=HSC,tract=509,filter=g {fields} timestamp For each combination of tag values, there’s a new series. A tract identifies a region in the sky* (*) https://pipelines.lsst.io/modules/lsst.skymap
  • 30. Example of a Series AM1: 6.42357 AM2: 6.48177 AM3: 4.62033 Time (run ID) {field-set}i Each point in a series contains the set of metrics measured by the pipeline run and the results are grouped by the pipeline name. 30 drp,dataset=HSC,tract=509,filter=g
  • 31. Tracking application metrics with InfluxDB https://squash.lsst.codes 31
  • 33. Why that metric value change? Make an annotation! 33
  • 34. “Annotations are more important than the data itself.” Frossie Economou 34
  • 35. II - Engineering and Facilities Database Real-Time Monitoring of the Observatory Data 35
  • 36. 36 All subsystems of the Observatory coexist in a state of active interplay.
  • 37. Observatory Data https://ts-xml.lsst.io 37 ● 60+ subsystems ● Total of 1148 DDS topics ○ 350 commands ○ 531 events ○ 267 telemetry topics ● Total throughput ~21GB/h → real-time monitoring ○ ~15TB per month → offline analysis ○ ~1.5PB for the 10yr of operations → trend analysis
  • 38. The M1M3 mirror cell subsystem 38
  • 39. M1M3 mirror cell data 39 ● 156 force actuators and sensors producing data at 50Hz ● Can we record and analyze the M1M3 data in real-time?
  • 40. Kafka + InfluxDB architecture https://sqr-029.lsst.io 40 Stream Reactor (OSS)
  • 41. End-to-end latency characterization Latency = (WriteTimestamp - SndTimestamp) 41 SndTimestamp WriteTimestamp
  • 42. Median latency ~60ms writing ~100k ppm Executing queries while writing 42
  • 43. 43 Aux Telescope and Weather Station tower
  • 44. Aux Telescope Camera Tucson Teststand - Aug 2019 44
  • 45. Weather Station Summit - September 2019 45
  • 46. 46 M2 mirror cell functional testing Summit - March 2020
  • 47. M2 mirror cell functional testing Summit - March 2020 47
  • 48. 48 The beginnings of the Telescope control room Summit - March 2020
  • 49. 49 US Data Facility Urbana, IL Project staff access RP 10yr TestStand Tucson, AZ Summit Cerro Pachon, Chile Restricted access RP ~30 days TestStand Chilean Data Facility La Serena, Chile <10MB/s raw stream A preview of operations
  • 50. Data Replication and Aggregation https://sqr-034.lsst.codes 50
  • 51. Data Aggregation in Kafka with Faust https://kafka-aggregator.lsst.io 51 Faust agents compute summary statistics on non- overlapping windows of N seconds. Data Reduction factor R~10
  • 52. What’s next 52 ● Migration to InfluxDB 2.0 ○ Conversation with InfluxData design team about Annotations in 2.0 ○ Flux training for the Observatory Staff ○ Flux Tasks for downsampling and trend analysis ● Rubin Observatory Interim Data Facility on Google Cloud ● Project transition from Construction to Operations is happening ○ New opportunities for using InfluxDB ● Self-monitoring ● Scalability as we load more data, RPs, etc.
  • 53. Learn more… 53 ● Vera C. Rubin Observatory ● Data Processing ● Verification Framework ● Engineering and Facilities Database ● Kafka Aggregator ● Rubin Science Platform ● Rubin Technical Documentation

Editor's Notes

  1. Cerro Pachon, mirror is in the building, TMA is in the building, dome nearing completion