Adapting Swift for Tape Storage or other high-latency media
October 27, 2015
Harald Seipp (IBM Systems – Presenter)
Slavisa Sarafijanovic (IBM Research)
Goal
Augment cloud object storage with a low-cost, cold storage tier for archive/backup use cases
Reduced cost
● significantly lower than disk
Reduced availability
● time to data on the order of minutes
[Figure: a client application uses the standard API (REST) of an OpenStack Swift cluster that combines highly available primary storage (HDD) with low-cost archival storage (high-latency media); objects move between the two through archive and restore.]
Main Idea
Single Object Storage name space for Objects on
● Tape or
● Optical Disc or
● SMR or MAID Disk
integrated with a standard disk-based OpenStack Swift installation
[Figure: a client application uses the standard API (REST) of an OpenStack Swift cluster that combines highly available primary storage (HDD) with low-cost archival storage (high-latency media); objects move between the two through archive and restore.]
Facts about Tape
Tape is 5x-10x cheaper than disk
Tape density scaling and cost are projected to be advantageous over disk for the next 10 years (see 220 TB cartridge demo)
Tape is a mature technology
Tape is already used in today’s cloud offerings
LTFS is a widely adopted standard
[Figure: a client application uses the standard API (REST) of an OpenStack Swift cluster that combines highly available primary storage (HDD) with low-cost archival storage (LTFS tape); objects move between the two through archive and restore.]
Shortcomings to be solved
Time-to-data
● Up to (single-digit) minutes
→ Does not play well with the time-out assumptions of the Swift infrastructure (application/load balancer)
Resource availability
● Few drives per hundreds of cartridges
→ Random access (mounts/seeks) can lead to resource congestion
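The congestion argument can be illustrated with a small Python sketch. It assumes a single drive and a hypothetical request format of `(object_name, cartridge_id, position_on_tape)`; none of these names come from the slides, and a real library scheduler would be considerably more involved.

```python
from collections import defaultdict


def plan_recalls(requests):
    """Group requests by cartridge and order each group by tape position.

    `requests` is an iterable of (object_name, cartridge_id, position)
    tuples. The tuple layout is an assumption made for this sketch.
    """
    by_cartridge = defaultdict(list)
    for name, cartridge, position in requests:
        by_cartridge[cartridge].append((position, name))
    # Sorting by position means the drive only seeks forward on each tape.
    return {c: [n for _, n in sorted(items)] for c, items in by_cartridge.items()}


def count_mounts(cartridge_sequence):
    """Count mounts for one drive: one mount per change of cartridge."""
    mounts = 0
    current = None
    for cartridge in cartridge_sequence:
        if cartridge != current:
            mounts += 1
            current = cartridge
    return mounts


requests = [
    ("a", "T1", 40),
    ("b", "T2", 10),
    ("c", "T1", 5),
    ("d", "T2", 90),
    ("e", "T1", 20),
]

# Serving requests in arrival order alternates between the two cartridges.
naive_mounts = count_mounts([cartridge for _, cartridge, _ in requests])

# Serving them cartridge by cartridge needs one mount per cartridge.
plan = plan_recalls(requests)
batched_mounts = count_mounts([c for c, names in plan.items() for _ in names])
```

With few drives shared by many cartridges, every avoided mount frees a drive for other requests, which is the motivation for bulk operations.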
Addressing shortcomings
Swift API for archiving operations
● Support explicit bulk operations (to minimize tape mounts and seeks)
● Store/provide object state (“offline bit”) in a standardized way
● Provide additional error code (“in transit”) upon access of migrated object
Improved timeout management
Configurable Data Ring Auditing
● Support asynchronous tape data verification
Policy based global cluster object distribution
● Assumption: related data (e.g. container) is likely to be accessed together
These proposals were discussed at the Vancouver Summit.
Reference: https://etherpad.openstack.org/p/liberty-swift-tape-storage
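A client could react to the proposed “in transit” error roughly as sketched below in Python. The slides do not define the actual error code, so the `InTransit` exception, the `fetch` callable, and the backoff values are all hypothetical placeholders.

```python
import time


class InTransit(Exception):
    """Hypothetical signal: the object is offline and a recall is running."""


def get_with_recall(fetch, max_wait=600.0, first_delay=5.0, sleep=time.sleep):
    """Call `fetch()` until it stops raising InTransit or `max_wait` elapses.

    `fetch` is any callable that returns the object data or raises
    InTransit. Delays double after each attempt, capped at 60 seconds.
    """
    waited = 0.0
    delay = first_delay
    while True:
        try:
            return fetch()
        except InTransit:
            if waited >= max_wait:
                raise TimeoutError("object still in transit after %.0f s" % waited)
            sleep(delay)
            waited += delay
            delay = min(delay * 2, 60.0)
```

Waiting on the client side keeps each individual request short, so application and load balancer time-outs are not tied to tape mount times.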
Swift archiving API through SwiftILM
Swift API ILM* extensions:
• Migrate (Disk → high-latency media)
• Recall (High-latency media → Disk)
• Query status
Implementation proposal:
• SwiftILM middleware
• Control path to ILM capable backend:
  • (1) Swift EA ←→ file attribute (async)
  • (2) Backend executable (sync/async)
[Figure: Swift exposes the Swift API and the Swift API ILM extensions. The SwiftILM middleware controls the ILM capable backend, a POSIX file system with a disk cache in front of tape, optical disc, or MAID/SMR, through path (1) or by calling an executable through path (2).]
*Information Lifecycle Management
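The middleware idea can be sketched as a plain WSGI wrapper in Python. This is not the SwiftILM implementation: the class name, the `backend` callable (standing in for control path (2)), and the `202 Accepted` response are assumptions made for illustration.

```python
from urllib.parse import parse_qs

ILM_OPS = ("MIGRATE", "RECALL", "STATUS")


class SwiftILMSketch:
    """WSGI wrapper that intercepts ILM query parameters.

    `backend` is a hypothetical callable(op, path) -> str that would
    trigger the ILM capable backend and return a request ID or status.
    """

    def __init__(self, app, backend):
        self.app = app
        self.backend = backend

    def __call__(self, environ, start_response):
        # keep_blank_values makes a bare "?MIGRATE" show up as a key.
        query = parse_qs(environ.get("QUERY_STRING", ""), keep_blank_values=True)
        op = next((o for o in ILM_OPS if o in query), None)
        if op is None:
            # Not an ILM request: pass it on to the rest of the pipeline.
            return self.app(environ, start_response)
        body = self.backend(op, environ.get("PATH_INFO", "")).encode("utf-8")
        headers = [
            ("Content-Type", "text/plain"),
            ("Content-Length", str(len(body))),
        ]
        start_response("202 Accepted", headers)
        return [body]
```

Because ordinary requests fall through untouched, a wrapper of this shape can sit in front of an unmodified Swift proxy pipeline.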
SwiftILM API proposal
To migrate a single object, issue the following HTTP POST:
http://SWIFT-URL/ACCT/CONT/OBJ?MIGRATE
● Similar GET/HEAD requests for RECALL and STATUS
Bulk operations on container level:
http://SWIFT-URL/ACCT/CONT?MIGRATE
...or through regular expressions on the Swift namespace
● Get back a request ID for efficient status tracking
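As a rough illustration, the proposed URLs could be built in Python as follows. The mapping of RECALL to GET and STATUS to HEAD is one reading of the slide and therefore an assumption, as are the function name and the example host; the requests are only constructed here, not sent.

```python
from urllib.parse import quote
from urllib.request import Request

# Assumed mapping: the slide only names POST for MIGRATE explicitly.
METHODS = {"MIGRATE": "POST", "RECALL": "GET", "STATUS": "HEAD"}


def ilm_request(base_url, account, container, obj=None, op="MIGRATE"):
    """Build a request for one object, or for a whole container if obj is None."""
    parts = [account, container] + ([obj] if obj else [])
    path = "/".join(quote(p, safe="") for p in parts)
    url = "%s/%s?%s" % (base_url.rstrip("/"), path, op)
    return Request(url, method=METHODS[op])


# Single object migration and a container-level bulk migration.
single = ilm_request("http://swift.example.com/v1", "AUTH_test", "backups", "db.tar")
bulk = ilm_request("http://swift.example.com/v1", "AUTH_test", "backups")
```

Passing either object to `urllib.request.urlopen` would send it; an authentication token header would normally be added first.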
SwiftILM API proposal – advanced
(Optional) Setting ILM operations through SwiftILM API
● Migration/recall based on object age/size/type etc.
(Optional) Backend-specific additions
● e.g. to control placement to specific library/medium/pool
(Optional) Co-existence with Swift3
● enabling ILM for S3 protocol as well
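A policy that selects objects by age, size, and type might look like the Python sketch below. The slides do not specify a policy format, so `ObjectInfo`, `MigrationPolicy`, and their fields are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class ObjectInfo:
    name: str
    size: int  # bytes
    content_type: str
    last_modified: datetime


@dataclass
class MigrationPolicy:
    min_age: timedelta
    min_size: int = 0
    content_types: tuple = ()  # empty tuple means "any type"

    def matches(self, obj, now):
        """Return True if the object is old, large, and typed as required."""
        if now - obj.last_modified < self.min_age:
            return False
        if obj.size < self.min_size:
            return False
        if self.content_types and obj.content_type not in self.content_types:
            return False
        return True


def select_for_migration(objects, policy, now):
    """Names of the objects the policy would move to high-latency media."""
    return [o.name for o in objects if policy.matches(o, now)]
```

The selected names could then be submitted as one bulk migrate request, so that the backend can write them to tape together.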
Add ILM to your existing Swift cluster
● Take unmodified Swift
● Configure ILM-based Data Ring
● Add SwiftILM middleware
● Add ILM-capable backend
[Figure: a client application talks to OpenStack Swift through the standard Swift API with SwiftILM extensions (REST). Swift holds a standard disk Data Ring (replication or erasure code, scale-out) and an ILM-based Data Ring (replication across nodes, scale-out) served through the SwiftILM middleware. Each storage node in the ILM-based ring runs an ILM capable backend with a disk cache in front of tape, optical disc, or MAID/SMR.]
Join us at the Design Summit or IBM booth for further discussions!
seipp@de.ibm.com
IRC: hseipp
Twitter: @HaraldSeipp
http://www.research.ibm.com/labs/zurich/sto/tier_icetier.html
