Self-driving PelotonDB automates hybrid workloads

•Download as PPTX, PDF•

0 likes•224 views

宇

PelotonDB is a self-driving database that uses a hybrid storage layout to support both OLTP and OLAP workloads. It uses logical tiles to decouple storage management from query execution. Tuples can be stored in either a narrow storage model, dense storage model, or flexible storage model depending on how "hot" the data is. The system continuously monitors queries and reorganizes the physical data layout in the background to optimize for the workload over time using k-means clustering. It employs MVCC for concurrency control and minimizes overhead during data reorganization by only modifying versioning metadata. An evaluation using the ADAPT benchmark shows PelotonDB can adapt the storage layout to improve performance.

Software

PelotonDB
A self-driving database for hybrid workloads

Agenda
• Overview of Peloton
• Hybrid Storage Layout
• Concurrency Control
• Layout Reorganization
• Evaluation
3

How to be Self-driving?
• Understand the workload – OLTP or OLAP
• Forecast resource utilization trends
• Identify potential actions that tune and optimize the database
5

Problems
• Many existing systems: OLTP + OLAP
• Takes minutes or even hours to propagate changes
• Administrative overhead
• Developer needs to write query for multiple systems
Transactional
Database
(e.g. MySQL)
Analytical
Database
(e.g. HiStore)
ETL
8

HTAP
• Classic solution – 2 separated engines
• OLTP engine with row-oriented data
• OLAP engine with column-oriented data
• Use a synchronization method (e.g. 2PC) to combine the results
• Well, this looks better but still too complex
• Limited types of queries
• Performance overhead
9

Hybrid Storage in Peloton
• A unified architecture for 'hot' and 'cold' tuples, based on a logical
abstraction over these different layouts
• A novel online reorganization technique that continuously enhances
the physical design
10

Storage Models
• NSM is good for OLTP
• DSM is good for OLAP
• FSM: adaptive as data get cooler
• NSM/DSM is special case of FSM
11

Physical Tiles
• A tuple can be stored in
different layouts over time.
• Stores new tuples in NSM
• Reorganizes as they become
colder
13

Logical Tiles
Composed of
• Underlying physical tiles
• Mapping of attributes
• Offsets of values
14

Logical Tile Algebra
• Decoupling storage managing and execution engine
• Intermediate results can be represented as passthrough logical tile
15

Bridge Operator
Pipeline Breaker
Metadata Operator
Mutator
16

Benefits of Logical Tiles
• Layout Transparency
• Vectorized Processing
• Flexible Materialization
• Caching Behavior
17

MVCC
• HTAP workloads are comprised of short-duration transactions
alongside long-running analytical queries.
• Every transaction holds
• A unique transaction Id
• A unique commit timestamp (assigned on committing)
• Timestamp of last committed transaction
19

Versioning Metadata
• For each tuple
• TxnId: The transaction id that currently holds a latch
• BeginCTS: The commit timestamp from which it becomes visible
• EndCTS: The commit timestamp after which it ceases to be visible
• PreV: Reference to the previous version
20

Operations under MVCC
• Insert / Delete / Update
• TableScan / IndexScan
21

Approach
• Track the recent query workloads
• Periodically compute a workload-optimized storage layout in the
background
23

Query Monitoring
• Collects information about the attributes in queries
24

Find Optimized Partition
• Naive algorithm takes . Infeasible!
• Heuristic approach
1. Clustering similar queries by k-means
• distance(q, p) = #attributes appears only in one side / #attributes
• Prioritizes each query based on its plan cost to avoid partial to TP queries
• Prioritizes the older samples with a weight w
2. Generate a layout in greedy way
• Iterates over the clusters in the weight-descending order
• For each cluster, groups the attributes accessed by that its representative
query together into a tile
25

Data Layout Reorganization
• Copies over the data to the new layout then atomically swaps in
• Concurrent DML operation only modifies the versioning metadata
• Old data is reclaimed if not referenced by any logical tile
27

ADAPT benchmark
Simulating enterprise workloads
29
# Attributes Size of tuple # Tuples PK
Narrow table 50 200B
10m a0
Wide table 500 2KB

What's hot

HNSciCloud Info Day, 7 Sept 2016, Functional Requirements by Helge MeinhardHelix Nebula The Science Cloud

Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...Flink Forward

Designing for operability and managabilityGaurav Bahrani

Towards Apache Flink 2.0 - Unified Data Processing and Beyond, Bowen LiBowen Li

From Batch to Streaming with Apache Apex Dataworks Summit 2017Apache Apex

Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...Flink Forward

Integrating Flink with Hive, Seattle Flink Meetup, Feb 2019Bowen Li

AthenaX - Unified Stream & Batch Processing using SQL at Uber, Zhenqiu Huang,...Bowen Li

Concourse CIMatteo Gazzetta

Flink Forward Berlin 2017: Stephan Ewen - The State of Flink and how to adopt...Flink Forward

Flink Connector Development Tips & TricksEron Wright

OSGi Community Event 2010 - Modular Applications on a Data Grid - A Case Stud...mfrancis

Scaling stream data pipelines with Pravega and Apache FlinkTill Rohrmann

"What's New With Globus" Webinar: Spring 2018Globus

Unify Enterprise Data Processing System Platform Level Integration of Flink a...Flink Forward

Updating the Globus Connect Architecture - ARCC Workshop at PEARC17Mary Bass

Centralised logging with ELK stackSimon Hanmer

Grafana 7.0Juraj Hantak

Disaster Recovery for Multi-Region Apache Kafka Ecosystems at Uberconfluent

What's hot (19)

HNSciCloud Info Day, 7 Sept 2016, Functional Requirements by Helge Meinhard

Flink Forward Berlin 2017: Jörg Schad, Till Rohrmann - Apache Flink meets Apa...

Designing for operability and managability

Towards Apache Flink 2.0 - Unified Data Processing and Beyond, Bowen Li

From Batch to Streaming with Apache Apex Dataworks Summit 2017

Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...

Integrating Flink with Hive, Seattle Flink Meetup, Feb 2019

AthenaX - Unified Stream & Batch Processing using SQL at Uber, Zhenqiu Huang,...

Concourse CI

Flink Forward Berlin 2017: Stephan Ewen - The State of Flink and how to adopt...

Flink Connector Development Tips & Tricks

OSGi Community Event 2010 - Modular Applications on a Data Grid - A Case Stud...

Scaling stream data pipelines with Pravega and Apache Flink

"What's New With Globus" Webinar: Spring 2018

Unify Enterprise Data Processing System Platform Level Integration of Flink a...

Updating the Globus Connect Architecture - ARCC Workshop at PEARC17

Centralised logging with ELK stack

Grafana 7.0

Disaster Recovery for Multi-Region Apache Kafka Ecosystems at Uber

Similar to Self-driving PelotonDB automates hybrid workloads

Operational-AnalyticsNiloy Mukherjee

Preso-v0.1Muhammad Arif

Kubernetes Walk Through from Technical ViewLei (Harry) Zhang

Using PostgreSQL With Docker & Kubernetes - July 2018Jonathan Katz

LISA2017 Kubernetes: Hit the Ground RunningChris McEniry

SQL Server 2014 In-Memory OLTPTony Rogerson

Membase Intro from Membase Meetup San FranciscoMembase

Linux kernel development ch4huangachou

Rackspace: Email's Solution for Indexing 50K Documents per Second: Presented ...Lucidworks

A Closer Look at Apache KuduAndriy Zabavskyy

What's new in JBoss ON 3.2Thomas Segismont

AWS re:Invent 2016: Streaming ETL for RDS and DynamoDB (DAT315)Amazon Web Services

Apache Kylin: OLAP Engine on Hadoop - Tech Deep DiveXu Jiang

273CC03851E778670A (1).pptGayathriSanthosh11

Work with hundred of hot terabytes in JVMsMalin Weiss

Presto at Facebook - Presto Meetup @ Boston (10/6/2015)Martin Traverso

What you need to know for postgresql operationAnton Bushmelev

Data all over the place! How SQL and Apache Calcite bring sanity to streaming...Julian Hyde

Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...Edgar Alejandro Villegas

Swift at Scale: The IBM SoftLayer StoryBrian Cline

Similar to Self-driving PelotonDB automates hybrid workloads (20)

Operational-Analytics

Preso-v0.1

Kubernetes Walk Through from Technical View

Using PostgreSQL With Docker & Kubernetes - July 2018

LISA2017 Kubernetes: Hit the Ground Running

SQL Server 2014 In-Memory OLTP

Membase Intro from Membase Meetup San Francisco

Linux kernel development ch4

Rackspace: Email's Solution for Indexing 50K Documents per Second: Presented ...

A Closer Look at Apache Kudu

What's new in JBoss ON 3.2

AWS re:Invent 2016: Streaming ETL for RDS and DynamoDB (DAT315)

Apache Kylin: OLAP Engine on Hadoop - Tech Deep Dive

273CC03851E778670A (1).ppt

Work with hundred of hot terabytes in JVMs

Presto at Facebook - Presto Meetup @ Boston (10/6/2015)

What you need to know for postgresql operation

Data all over the place! How SQL and Apache Calcite bring sanity to streaming...

Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...

Swift at Scale: The IBM SoftLayer Story

Recently uploaded

Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab

Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy

Exploring iOS App Development: Simplifying the ProcessEvangelist Apps https://twitter.com/EvangelistSW/

The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171

DNT_Corporate presentation know about usDynamic Netsoft

Right Money Management App For Your Financial GoalsJhone kinadey

Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.

Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531

CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE9953056974 Low Rate Call Girls In Saket, Delhi NCR

Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave

Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq

Clustering techniques data mining book ....ShaimaaMohamedGalal

Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

Optimizing AI for immediate response in Smart CCTVshikhaohhpro

TECUNIQUE: Success Stories: IT Service providermohitmore19

Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171

Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.

Diamond Application Development Crafting Solutions with PrecisionSolGuruz

Professional Resume Template for Software DevelopersVinodh Ram

Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions

Recently uploaded (20)

Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...

Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications

Exploring iOS App Development: Simplifying the Process

The Ultimate Test Automation Guide_ Best Practices and Tips.pdf

DNT_Corporate presentation know about us

Right Money Management App For Your Financial Goals

Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...

Hand gesture recognition PROJECT PPT.pptx

CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE

Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...

Salesforce Certified Field Service Consultant

Clustering techniques data mining book ....

Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...

Optimizing AI for immediate response in Smart CCTV

TECUNIQUE: Success Stories: IT Service provider

Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf

Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...

Diamond Application Development Crafting Solutions with Precision

Professional Resume Template for Software Developers

Advancing Engineering with AI through the Next Generation of Strategic Projec...

Self-driving PelotonDB automates hybrid workloads

1. PelotonDB A self-driving database for hybrid workloads

2. 2

3. Agenda • Overview of Peloton • Hybrid Storage Layout • Concurrency Control • Layout Reorganization • Evaluation 3

4. Overview of Peloton 4

5. How to be Self-driving? • Understand the workload – OLTP or OLAP • Forecast resource utilization trends • Identify potential actions that tune and optimize the database 5

6. Architecture 6

7. Hybrid Storage Layout 7

8. Problems • Many existing systems: OLTP + OLAP • Takes minutes or even hours to propagate changes • Administrative overhead • Developer needs to write query for multiple systems Transactional Database (e.g. MySQL) Analytical Database (e.g. HiStore) ETL 8

9. HTAP • Classic solution – 2 separated engines • OLTP engine with row-oriented data • OLAP engine with column-oriented data • Use a synchronization method (e.g. 2PC) to combine the results • Well, this looks better but still too complex • Limited types of queries • Performance overhead 9

10. Hybrid Storage in Peloton • A unified architecture for 'hot' and 'cold' tuples, based on a logical abstraction over these different layouts • A novel online reorganization technique that continuously enhances the physical design 10

11. Storage Models • NSM is good for OLTP • DSM is good for OLAP • FSM: adaptive as data get cooler • NSM/DSM is special case of FSM 11

12. 12

13. Physical Tiles • A tuple can be stored in different layouts over time. • Stores new tuples in NSM • Reorganizes as they become colder 13

14. Logical Tiles Composed of • Underlying physical tiles • Mapping of attributes • Offsets of values 14

15. Logical Tile Algebra • Decoupling storage managing and execution engine • Intermediate results can be represented as passthrough logical tile 15

16. Bridge Operator Pipeline Breaker Metadata Operator Mutator 16

17. Benefits of Logical Tiles • Layout Transparency • Vectorized Processing • Flexible Materialization • Caching Behavior 17

18. Concurrency Control 18

19. MVCC • HTAP workloads are comprised of short-duration transactions alongside long-running analytical queries. • Every transaction holds • A unique transaction Id • A unique commit timestamp (assigned on committing) • Timestamp of last committed transaction 19

20. Versioning Metadata • For each tuple • TxnId: The transaction id that currently holds a latch • BeginCTS: The commit timestamp from which it becomes visible • EndCTS: The commit timestamp after which it ceases to be visible • PreV: Reference to the previous version 20

21. Operations under MVCC • Insert / Delete / Update • TableScan / IndexScan 21

22. Layout Reorganization 22

23. Approach • Track the recent query workloads • Periodically compute a workload-optimized storage layout in the background 23

24. Query Monitoring • Collects information about the attributes in queries 24

25. Find Optimized Partition • Naive algorithm takes . Infeasible! • Heuristic approach 1. Clustering similar queries by k-means • distance(q, p) = #attributes appears only in one side / #attributes • Prioritizes each query based on its plan cost to avoid partial to TP queries • Prioritizes the older samples with a weight w 2. Generate a layout in greedy way • Iterates over the clusters in the weight-descending order • For each cluster, groups the attributes accessed by that its representative query together into a tile 25

26. K-means 26

27. Data Layout Reorganization • Copies over the data to the new layout then atomically swaps in • Concurrent DML operation only modifies the versioning metadata • Old data is reclaimed if not referenced by any logical tile 27

28. Evaluation 28

29. ADAPT benchmark Simulating enterprise workloads 29 # Attributes Size of tuple # Tuples PK Narrow table 50 200B 10m a0 Wide table 500 2KB

30. 30 Performance Impact of Storage Models

31. 31Workload-Aware Adaptation

32. Thanks! Q&A 32

Editor's Notes

Then, over time they become colder and thus are less likely to be updated again. For instance, more than half of the content that Facebook users access and interact with are shared by their friends in the past two days, and then there is a rapid decline in content popularity over the following days
(Q1) an insert query that adds a single tuple into the table (Q2) a scan query that projects a subset of attributes of the tuples that satisfy a predicate (Q3) an aggregate query that computes the maximum value for a set of attributes over the selected tuples (Q4) an arithmetic query that sums up a subset of attributes of the selected tuples (Q5) a join query that combines the tuples from two tables based on a predicate defined over the attributes in the tables

Self-driving PelotonDB automates hybrid workloads

Recommended

Recommended

More Related Content

What's hot

What's hot (19)

Similar to Self-driving PelotonDB automates hybrid workloads

Similar to Self-driving PelotonDB automates hybrid workloads (20)

More from 宇傅

More from 宇傅 (12)

Recently uploaded

Recently uploaded (20)

Self-driving PelotonDB automates hybrid workloads

Editor's Notes

Self-driving PelotonDB automates hybrid workloads

Recommended

Recommended

More Related Content

What's hot

What's hot (19)

Similar to Self-driving PelotonDB automates hybrid workloads

Similar to Self-driving PelotonDB automates hybrid workloads (20)

More from 宇 傅

More from 宇 傅 (12)

Recently uploaded

Recently uploaded (20)

Self-driving PelotonDB automates hybrid workloads

Editor's Notes

More from 宇傅

More from 宇傅 (12)