SlideShare a Scribd company logo
1 of 33
Download to read offline
Seagate Kinetic Open Storage Platform
James Hughes
…and many others
Storage is a Price Elastic Market
http://en.wikipedia.org/wiki/Alfred_Marshall

Price elasticity of demand
• Alfred Marshall (1890)

As the price of Storage approaches $0
• Demands for storage will approach infinity

If the price of a Cisco router
approaches $0
• Demands for routers will not approach
infinity - Storage is different

"3
Areal Density Growth

• 

100000

Perpendicular expected to extend
to 0.5-1 Tb/in2

10000

Additional innovations required
at that point

•  heat-assisted
recording
•  bit patterned
media recording

Single particle
superparamagnetic limit
(estimated)

40%

HAMR
+BPM

1000

HAMR
Perpendicular Writing & GMR

100

Seagate Confidential: Subject to NDA
100%
10 Charap’s limiteffective Jan. 18, 2009,
No. 77103,
(broken)
and all applicable supplements GMR reading
•  Inductive Writing/
1
0.1

29%

•  Inductive Writing/ MR reading
•  Inductive Writing & Reading

19
89
19
91
19
93
19
95
19
97
19
99
20
01
20
03
20
05
20
07
20
09
20
11
20
13
20
15
20
17
20
19

• 

Late 1990s – super paramagnetic
limit demonstrated through
modeling

gigabit / in2

• 

•  Areal Density CAGR 40%
•  Transfer Rate CAGR 20%

year

"4
Cloud Computing will increase this trend
•Jevons Paradox
• Cloud Computing increases the efficiency of computing....

Seagate Confidential: Subject to NDA
No. 77103, effective Jan. 18, 2009,
and all applicable supplements

http://en.wikipedia.org/wiki/Jevons_paradox

"5
Cloud Computing will increase this trend
•Jevons Paradox
• Cloud Computing increases the efficiency of computing....

Improved technology doubles
Seagate Confidential: Subject to NDA
the amount of Information
No. 77103, effective Jan. produced
18,
with a given amount of Storage 2009,
!
and all applicable supplements
Demand for Storage rises

http://en.wikipedia.org/wiki/Jevons_paradox

"5
Technology TrendsNDA
Seagate Confidential: Subject to
No. 77103, effective Jan. 18, 2009,
and all applicable supplements

"6
Shingled Disks
•Write head larger than read
head
• Turns Disk into a sequentially
written media
Seagate Confidential: Subject to NDA
No. 77103, effective Jan. 18, 2009,
and all applicable supplements

•All updates to data and
metadata are written
sequentially to a continuous
stream, called a log
•Disk API of sectors is no
longer “natural”

http://www.ssrc.ucsc.edu/Papers/amer-ieeetm11.pdf

"7
Log Structured Storage
How much is erased on a reposition?
• Tape - the remainder of the tape
• Shingled disk - the remainder of the track group
• Flash - the entire page

All persistent Storage systems do/will implement log structure
• e.g. “NoSQL Database of sectors”

Does it make sense to layer a database on top of a
database?
• Could we use the log structure of the media to provide a more
natural storage systems, not mimicking an antique paradigm?

"8
Single System Performance Trend
Leading to disaggregation of servers

http://web.eecs.umich.edu/~twenisch/papers/isca09-disaggregate.pdf

"9
Scaling Storage
Distributed Hash Table
• Key/Value Store

RAM
Flash
Disk

Memcached
FAWN
Riak

http://en.wikipedia.org/wiki/Distributed_hash_table

"10
Metadata and Metadata Servers are Evil
Required by traditional file systems (POSIX) to translate
names to sectors
• Hard to scale, heavy HA requirements, expensive

Can we use a name as a key?
• Place the data into a scaled key value store?
• Eliminate costly metadata servers?

"11
Cumulative operations ordered by length
100%

operations
92% of the operations

80%

60%

40%

32KB

Cumulative percentage

data

20%

0.5% of the data
0%
1.00

10.00

100.00

1000.00

10000.00

100000.00

Length (KB)
"12
512KB
Length

Map of Operations

0

0
1

3

Tim
e

(m

2
inu
tes

)

3
4

0

1
oc
L

2
B)
(T
on
ati

"13
Seagate Kinetic
Seagate Confidential: Subject to NDA
Open Storage Platform
No. 77103, effective Jan. 18, 2009,
and all applicable supplements

"14
Seagate Kinetic Open Storage Platform
Dis-intermediates applications to drive
–Goes around file systems, volume managers, drivers

Enable ecosystem of value added software
–Partners (like Basho) can create their own system value

Lower TCO
–Eliminates complexity

"15
"16
"16
D

A
S

"17
App

Proprietary
to System Vendor

• Application
• Clustering
• Management

App
D

LibKinetic

C++, Java, Python, Erlang, DIY
GPL
Standard

ProtoBuf
TCP/IP/GbE

• Interconnect

Proprietary
to Seagate

A

• Storage

S

"17
App

App

App

App

App

Proprietary
to System Vendor

• Application
• Clustering
• Management

App

D

LibKinetic

LibKinetic

LibKinetic
GPL
Standard

ProtoBuf
TCP/IP/GbE

• Interconnect

Proprietary
to Seagate

A

• Storage

S

"17
App

App

App

App

App

D

LibKinetic

LibKinetic

LibKinetic
GPL
Standard

ProtoBuf
TCP/IP/GbE

• Interconnect

D

A

D

A
S

D

A
S

D

A

A
S

S

D

A

S

D

A
S

D

A
S

A
S

S

"17

Proprietary
to Seagate

D

• Storage

App

Proprietary
to System Vendor

• Application
• Clustering
• Management
System Hardware
Typical JBOD architecture
• Does not require a server, just JBODs to the ToR Switch
• 10 JBODS × 60 drives × 4TB = 2.4PB/Rack

"18
Features
Provides RPC to Key/Value database
• Data is pre-indexed
• Compression and other value is easy and transparent

P2P (Drive to Drive) copy of key ranges
Communicate using existing Data Center Plumbing (TCP/IP)
Multiple masters - Data sharing between machines
Configurable caching per command
• Async, Sync, Flush

Local space management

"19
Kinetic Systems
Clustering (performance, reliability, management)
Compatibility with large scale applications (S3, etc.)
Centralized Management
• Reliability, availability, durability

"20
Lower TCO
Elimination of server layers
Less Human requirements
Reduced mistakes
Disaggregate storage from
servers
Power management

"21
Lower TCO
Elimination of server layers
Less Human requirements
Reduced mistakes
Disaggregate storage from
servers
Power management

"21
Goals of API
Data movement
• Get/put/delete/getnext/getprevious
• Versioned (== for success), options

Range operations
Multiple masters
• Authentication/Integrity/Authorization

Cluster-able
• Simple cluster configuration version enforcement

3rd party copy
Management
"22
Management (System Vendor)
Configures the drive
• Network
• authorized clients

Monitors
• Health
• Statistics
• Logs

Initiates recovery
• Change cluster version
• 3rdPartyCopy
"23
Data formats
Key Structure
• Variable number of octets (0-4KB)

Data Structure (Serialized to a byte stream)
• KeyOf
• Version
• E2E Data Integrity
–Algorithm name

• Data Variable length (0-xMB)

"24
Performance Metrics
Same normal performance expectations
•
•
•
•

Sequential Write
Random Write
Sequential Read
Sequential Write

Iometer for key/value

"25
Demo Time!

Seagate Confidential: Subject to NDA
No. 77103, effective Jan. 18, 2009,
and all applicable supplements

"26
Performance Results

MB/s

Puts/s

120

1000

90

750

60

500

30

250

0

0
0

2

4

6

8

1MB values put rate (MB/s)

0

2

4

6

1KB values put rate

"27

8
Conclusion
Deliver more value to Seagate, Partners and Customers
• Dis-intermediates cloud applications to drive
• Enable innovation in hardware and software ecosystem
• Lower TCO

OpenSource Software
–Basho Riak, Swift, HDFS

More information
• http://seagate.com/www/kinetic
• https://developers.seagate.com
• http://guthub.com/Seagate
"28

More Related Content

What's hot

Disk health prediction for Ceph
Disk health prediction for CephDisk health prediction for Ceph
Disk health prediction for CephCeph Community
 
CEPH DAY BERLIN - 5 REASONS TO USE ARM-BASED MICRO-SERVER ARCHITECTURE FOR CE...
CEPH DAY BERLIN - 5 REASONS TO USE ARM-BASED MICRO-SERVER ARCHITECTURE FOR CE...CEPH DAY BERLIN - 5 REASONS TO USE ARM-BASED MICRO-SERVER ARCHITECTURE FOR CE...
CEPH DAY BERLIN - 5 REASONS TO USE ARM-BASED MICRO-SERVER ARCHITECTURE FOR CE...Ceph Community
 
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red_Hat_Storage
 
August 2013 HUG: Removing the NameNode's memory limitation
August 2013 HUG: Removing the NameNode's memory limitation August 2013 HUG: Removing the NameNode's memory limitation
August 2013 HUG: Removing the NameNode's memory limitation Yahoo Developer Network
 
GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS
 
Performance tuning in BlueStore & RocksDB - Li Xiaoyan
Performance tuning in BlueStore & RocksDB - Li XiaoyanPerformance tuning in BlueStore & RocksDB - Li Xiaoyan
Performance tuning in BlueStore & RocksDB - Li XiaoyanCeph Community
 
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA ArchitectureCeph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA ArchitectureDanielle Womboldt
 
Ndb cluster 80_ycsb_disk
Ndb cluster 80_ycsb_diskNdb cluster 80_ycsb_disk
Ndb cluster 80_ycsb_diskmikaelronstrom
 
Red Hat Gluster Storage Performance
Red Hat Gluster Storage PerformanceRed Hat Gluster Storage Performance
Red Hat Gluster Storage PerformanceRed_Hat_Storage
 
BlueStore: a new, faster storage backend for Ceph
BlueStore: a new, faster storage backend for CephBlueStore: a new, faster storage backend for Ceph
BlueStore: a new, faster storage backend for CephSage Weil
 
Red Hat Storage Server Administration Deep Dive
Red Hat Storage Server Administration Deep DiveRed Hat Storage Server Administration Deep Dive
Red Hat Storage Server Administration Deep DiveRed_Hat_Storage
 
Gluster Webinar: Introduction to GlusterFS v3.3
Gluster Webinar: Introduction to GlusterFS v3.3Gluster Webinar: Introduction to GlusterFS v3.3
Gluster Webinar: Introduction to GlusterFS v3.3GlusterFS
 
Modern network servers
Modern network serversModern network servers
Modern network serversAPNIC
 
CEPH DAY BERLIN - MASTERING CEPH OPERATIONS: UPMAP AND THE MGR BALANCER
CEPH DAY BERLIN - MASTERING CEPH OPERATIONS: UPMAP AND THE MGR BALANCERCEPH DAY BERLIN - MASTERING CEPH OPERATIONS: UPMAP AND THE MGR BALANCER
CEPH DAY BERLIN - MASTERING CEPH OPERATIONS: UPMAP AND THE MGR BALANCERCeph Community
 
Ceph Deployment at Target: Customer Spotlight
Ceph Deployment at Target: Customer SpotlightCeph Deployment at Target: Customer Spotlight
Ceph Deployment at Target: Customer SpotlightColleen Corrice
 
Red Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference ArchitecturesRed Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference ArchitecturesRed_Hat_Storage
 
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio Ceph Community
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitecturePatrick McGarry
 

What's hot (20)

Disk health prediction for Ceph
Disk health prediction for CephDisk health prediction for Ceph
Disk health prediction for Ceph
 
CEPH DAY BERLIN - 5 REASONS TO USE ARM-BASED MICRO-SERVER ARCHITECTURE FOR CE...
CEPH DAY BERLIN - 5 REASONS TO USE ARM-BASED MICRO-SERVER ARCHITECTURE FOR CE...CEPH DAY BERLIN - 5 REASONS TO USE ARM-BASED MICRO-SERVER ARCHITECTURE FOR CE...
CEPH DAY BERLIN - 5 REASONS TO USE ARM-BASED MICRO-SERVER ARCHITECTURE FOR CE...
 
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology Red Hat Ceph Storage Acceleration Utilizing Flash Technology
Red Hat Ceph Storage Acceleration Utilizing Flash Technology
 
August 2013 HUG: Removing the NameNode's memory limitation
August 2013 HUG: Removing the NameNode's memory limitation August 2013 HUG: Removing the NameNode's memory limitation
August 2013 HUG: Removing the NameNode's memory limitation
 
GlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 MeetupGlusterFS Architecture - June 30, 2011 Meetup
GlusterFS Architecture - June 30, 2011 Meetup
 
Performance tuning in BlueStore & RocksDB - Li Xiaoyan
Performance tuning in BlueStore & RocksDB - Li XiaoyanPerformance tuning in BlueStore & RocksDB - Li Xiaoyan
Performance tuning in BlueStore & RocksDB - Li Xiaoyan
 
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA ArchitectureCeph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
 
Ndb cluster 80_ycsb_disk
Ndb cluster 80_ycsb_diskNdb cluster 80_ycsb_disk
Ndb cluster 80_ycsb_disk
 
Red Hat Gluster Storage Performance
Red Hat Gluster Storage PerformanceRed Hat Gluster Storage Performance
Red Hat Gluster Storage Performance
 
BlueStore: a new, faster storage backend for Ceph
BlueStore: a new, faster storage backend for CephBlueStore: a new, faster storage backend for Ceph
BlueStore: a new, faster storage backend for Ceph
 
Red Hat Storage Server Administration Deep Dive
Red Hat Storage Server Administration Deep DiveRed Hat Storage Server Administration Deep Dive
Red Hat Storage Server Administration Deep Dive
 
Red Hat Storage Roadmap
Red Hat Storage RoadmapRed Hat Storage Roadmap
Red Hat Storage Roadmap
 
Gluster Webinar: Introduction to GlusterFS v3.3
Gluster Webinar: Introduction to GlusterFS v3.3Gluster Webinar: Introduction to GlusterFS v3.3
Gluster Webinar: Introduction to GlusterFS v3.3
 
Modern network servers
Modern network serversModern network servers
Modern network servers
 
CEPH DAY BERLIN - MASTERING CEPH OPERATIONS: UPMAP AND THE MGR BALANCER
CEPH DAY BERLIN - MASTERING CEPH OPERATIONS: UPMAP AND THE MGR BALANCERCEPH DAY BERLIN - MASTERING CEPH OPERATIONS: UPMAP AND THE MGR BALANCER
CEPH DAY BERLIN - MASTERING CEPH OPERATIONS: UPMAP AND THE MGR BALANCER
 
MySQL Head-to-Head
MySQL Head-to-HeadMySQL Head-to-Head
MySQL Head-to-Head
 
Ceph Deployment at Target: Customer Spotlight
Ceph Deployment at Target: Customer SpotlightCeph Deployment at Target: Customer Spotlight
Ceph Deployment at Target: Customer Spotlight
 
Red Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference ArchitecturesRed Hat Storage Day New York - New Reference Architectures
Red Hat Storage Day New York - New Reference Architectures
 
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
 
QCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference ArchitectureQCT Ceph Solution - Design Consideration and Reference Architecture
QCT Ceph Solution - Design Consideration and Reference Architecture
 

Similar to Kinetic basho public

Evoluzione dello storage
Evoluzione dello storageEvoluzione dello storage
Evoluzione dello storageAndrea Mauro
 
In-memory Data Management Trends & Techniques
In-memory Data Management Trends & TechniquesIn-memory Data Management Trends & Techniques
In-memory Data Management Trends & TechniquesHazelcast
 
Best practices for Data warehousing with Amazon Redshift - AWS PS Summit Canb...
Best practices for Data warehousing with Amazon Redshift - AWS PS Summit Canb...Best practices for Data warehousing with Amazon Redshift - AWS PS Summit Canb...
Best practices for Data warehousing with Amazon Redshift - AWS PS Summit Canb...Amazon Web Services
 
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters Ceph Community
 
Deploying ssd in the data center 2014
Deploying ssd in the data center 2014Deploying ssd in the data center 2014
Deploying ssd in the data center 2014Howard Marks
 
High Performance Hardware for Data Analysis
High Performance Hardware for Data AnalysisHigh Performance Hardware for Data Analysis
High Performance Hardware for Data AnalysisMike Pittaro
 
Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis PyData
 
Dissecting Scalable Database Architectures
Dissecting Scalable Database ArchitecturesDissecting Scalable Database Architectures
Dissecting Scalable Database Architectureshypertable
 
Tuning Linux for your database FLOSSUK 2016
Tuning Linux for your database FLOSSUK 2016Tuning Linux for your database FLOSSUK 2016
Tuning Linux for your database FLOSSUK 2016Colin Charles
 
Memory, Big Data, NoSQL and Virtualization
Memory, Big Data, NoSQL and VirtualizationMemory, Big Data, NoSQL and Virtualization
Memory, Big Data, NoSQL and VirtualizationBigstep
 
Solid State Drive Technology - MIT Lincoln Labs
Solid State Drive Technology - MIT Lincoln LabsSolid State Drive Technology - MIT Lincoln Labs
Solid State Drive Technology - MIT Lincoln LabsMatt Simmons
 
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]Speedment, Inc.
 
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]Malin Weiss
 
Collier exadata technical overview presentation 4 14-10
Collier exadata technical overview presentation 4 14-10Collier exadata technical overview presentation 4 14-10
Collier exadata technical overview presentation 4 14-10xKinAnx
 
Crossbar ARM TechCon 2016 presentation
Crossbar ARM TechCon 2016 presentation        Crossbar ARM TechCon 2016 presentation
Crossbar ARM TechCon 2016 presentation Crossbarinc
 
S016827 pendulum-swings-nola-v1710d
S016827 pendulum-swings-nola-v1710dS016827 pendulum-swings-nola-v1710d
S016827 pendulum-swings-nola-v1710dTony Pearson
 
LCA13: Jason Taylor Keynote - ARM & Disaggregated Rack - LCA13-Hong - 6 March...
LCA13: Jason Taylor Keynote - ARM & Disaggregated Rack - LCA13-Hong - 6 March...LCA13: Jason Taylor Keynote - ARM & Disaggregated Rack - LCA13-Hong - 6 March...
LCA13: Jason Taylor Keynote - ARM & Disaggregated Rack - LCA13-Hong - 6 March...Linaro
 
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)Alex Rasmussen
 

Similar to Kinetic basho public (20)

Evoluzione dello storage
Evoluzione dello storageEvoluzione dello storage
Evoluzione dello storage
 
In-memory Data Management Trends & Techniques
In-memory Data Management Trends & TechniquesIn-memory Data Management Trends & Techniques
In-memory Data Management Trends & Techniques
 
Best practices for Data warehousing with Amazon Redshift - AWS PS Summit Canb...
Best practices for Data warehousing with Amazon Redshift - AWS PS Summit Canb...Best practices for Data warehousing with Amazon Redshift - AWS PS Summit Canb...
Best practices for Data warehousing with Amazon Redshift - AWS PS Summit Canb...
 
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
Ceph Day Amsterdam 2015: Measuring and predicting performance of Ceph clusters
 
Deploying ssd in the data center 2014
Deploying ssd in the data center 2014Deploying ssd in the data center 2014
Deploying ssd in the data center 2014
 
High Performance Hardware for Data Analysis
High Performance Hardware for Data AnalysisHigh Performance Hardware for Data Analysis
High Performance Hardware for Data Analysis
 
Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis
 
Dissecting Scalable Database Architectures
Dissecting Scalable Database ArchitecturesDissecting Scalable Database Architectures
Dissecting Scalable Database Architectures
 
CPU Caches
CPU CachesCPU Caches
CPU Caches
 
Tuning Linux for your database FLOSSUK 2016
Tuning Linux for your database FLOSSUK 2016Tuning Linux for your database FLOSSUK 2016
Tuning Linux for your database FLOSSUK 2016
 
Memory, Big Data, NoSQL and Virtualization
Memory, Big Data, NoSQL and VirtualizationMemory, Big Data, NoSQL and Virtualization
Memory, Big Data, NoSQL and Virtualization
 
Solid State Drive Technology - MIT Lincoln Labs
Solid State Drive Technology - MIT Lincoln LabsSolid State Drive Technology - MIT Lincoln Labs
Solid State Drive Technology - MIT Lincoln Labs
 
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]
 
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]
JavaOne2016 - Microservices: Terabytes in Microseconds [CON4516]
 
Collier exadata technical overview presentation 4 14-10
Collier exadata technical overview presentation 4 14-10Collier exadata technical overview presentation 4 14-10
Collier exadata technical overview presentation 4 14-10
 
Wolfgang Lehner Technische Universitat Dresden
Wolfgang Lehner Technische Universitat DresdenWolfgang Lehner Technische Universitat Dresden
Wolfgang Lehner Technische Universitat Dresden
 
Crossbar ARM TechCon 2016 presentation
Crossbar ARM TechCon 2016 presentation        Crossbar ARM TechCon 2016 presentation
Crossbar ARM TechCon 2016 presentation
 
S016827 pendulum-swings-nola-v1710d
S016827 pendulum-swings-nola-v1710dS016827 pendulum-swings-nola-v1710d
S016827 pendulum-swings-nola-v1710d
 
LCA13: Jason Taylor Keynote - ARM & Disaggregated Rack - LCA13-Hong - 6 March...
LCA13: Jason Taylor Keynote - ARM & Disaggregated Rack - LCA13-Hong - 6 March...LCA13: Jason Taylor Keynote - ARM & Disaggregated Rack - LCA13-Hong - 6 March...
LCA13: Jason Taylor Keynote - ARM & Disaggregated Rack - LCA13-Hong - 6 March...
 
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
 

Recently uploaded

TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 

Recently uploaded (20)

TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 

Kinetic basho public

  • 1. Seagate Kinetic Open Storage Platform James Hughes …and many others
  • 2. Storage is a Price Elastic Market http://en.wikipedia.org/wiki/Alfred_Marshall Price elasticity of demand • Alfred Marshall (1890) As the price of Storage approaches $0 • Demands for storage will approach infinity If the price of a Cisco router approaches $0 • Demands for routers will not approach infinity - Storage is different "3
  • 3. Areal Density Growth •  100000 Perpendicular expected to extend to 0.5-1 Tb/in2 10000 Additional innovations required at that point •  heat-assisted recording •  bit patterned media recording Single particle superparamagnetic limit (estimated) 40% HAMR +BPM 1000 HAMR Perpendicular Writing & GMR 100 Seagate Confidential: Subject to NDA 100% 10 Charap’s limiteffective Jan. 18, 2009, No. 77103, (broken) and all applicable supplements GMR reading •  Inductive Writing/ 1 0.1 29% •  Inductive Writing/ MR reading •  Inductive Writing & Reading 19 89 19 91 19 93 19 95 19 97 19 99 20 01 20 03 20 05 20 07 20 09 20 11 20 13 20 15 20 17 20 19 •  Late 1990s – super paramagnetic limit demonstrated through modeling gigabit / in2 •  •  Areal Density CAGR 40% •  Transfer Rate CAGR 20% year "4
  • 4. Cloud Computing will increase this trend •Jevons Paradox • Cloud Computing increases the efficiency of computing.... Seagate Confidential: Subject to NDA No. 77103, effective Jan. 18, 2009, and all applicable supplements http://en.wikipedia.org/wiki/Jevons_paradox "5
  • 5. Cloud Computing will increase this trend •Jevons Paradox • Cloud Computing increases the efficiency of computing.... Improved technology doubles Seagate Confidential: Subject to NDA the amount of Information No. 77103, effective Jan. produced 18, with a given amount of Storage 2009, ! and all applicable supplements Demand for Storage rises http://en.wikipedia.org/wiki/Jevons_paradox "5
  • 6. Technology TrendsNDA Seagate Confidential: Subject to No. 77103, effective Jan. 18, 2009, and all applicable supplements "6
  • 7. Shingled Disks •Write head larger than read head • Turns Disk into a sequentially written media Seagate Confidential: Subject to NDA No. 77103, effective Jan. 18, 2009, and all applicable supplements •All updates to data and metadata are written sequentially to a continuous stream, called a log •Disk API of sectors is no longer “natural” http://www.ssrc.ucsc.edu/Papers/amer-ieeetm11.pdf "7
  • 8. Log Structured Storage How much is erased on a reposition? • Tape - the remainder of the tape • Shingled disk - the remainder of the track group • Flash - the entire page All persistent Storage systems do/will implement log structure • e.g. “NoSQL Database of sectors” Does it make sense to layer a database on top of a database? • Could we use the log structure of the media to provide a more natural storage systems, not mimicking an antique paradigm? "8
  • 9. Single System Performance Trend Leading to disaggregation of servers http://web.eecs.umich.edu/~twenisch/papers/isca09-disaggregate.pdf "9
  • 10. Scaling Storage Distributed Hash Table • Key/Value Store RAM Flash Disk Memcached FAWN Riak http://en.wikipedia.org/wiki/Distributed_hash_table "10
  • 11. Metadata and Metadata Servers are Evil Required by traditional file systems (POSIX) to translate names to sectors • Hard to scale, heavy HA requirements, expensive Can we use a name as a key? • Place the data into a scaled key value store? • Eliminate costly metadata servers? "11
  • 12. Cumulative operations ordered by length 100% operations 92% of the operations 80% 60% 40% 32KB Cumulative percentage data 20% 0.5% of the data 0% 1.00 10.00 100.00 1000.00 10000.00 100000.00 Length (KB) "12
  • 14. Seagate Kinetic Seagate Confidential: Subject to NDA Open Storage Platform No. 77103, effective Jan. 18, 2009, and all applicable supplements "14
  • 15. Seagate Kinetic Open Storage Platform Dis-intermediates applications to drive –Goes around file systems, volume managers, drivers Enable ecosystem of value added software –Partners (like Basho) can create their own system value Lower TCO –Eliminates complexity "15
  • 16. "16
  • 17. "16
  • 19. App Proprietary to System Vendor • Application • Clustering • Management App D LibKinetic C++, Java, Python, Erlang, DIY GPL Standard ProtoBuf TCP/IP/GbE • Interconnect Proprietary to Seagate A • Storage S "17
  • 20. App App App App App Proprietary to System Vendor • Application • Clustering • Management App D LibKinetic LibKinetic LibKinetic GPL Standard ProtoBuf TCP/IP/GbE • Interconnect Proprietary to Seagate A • Storage S "17
  • 22. System Hardware Typical JBOD architecture • Does not require a server, just JBODs to the ToR Switch • 10 JBODS × 60 drives × 4TB = 2.4PB/Rack "18
  • 23. Features Provides RPC to Key/Value database • Data is pre-indexed • Compression and other value is easy and transparent P2P (Drive to Drive) copy of key ranges Communicate using existing Data Center Plumbing (TCP/IP) Multiple masters - Data sharing between machines Configurable caching per command • Async, Sync, Flush Local space management "19
  • 24. Kinetic Systems Clustering (performance, reliability, management) Compatibility with large scale applications (S3, etc.) Centralized Management • Reliability, availability, durability "20
  • 25. Lower TCO Elimination of server layers Less Human requirements Reduced mistakes Disaggregate storage from servers Power management "21
  • 26. Lower TCO Elimination of server layers Less Human requirements Reduced mistakes Disaggregate storage from servers Power management "21
  • 27. Goals of API Data movement • Get/put/delete/getnext/getprevious • Versioned (== for success), options Range operations Multiple masters • Authentication/Integrity/Authorization Cluster-able • Simple cluster configuration version enforcement 3rd party copy Management "22
  • 28. Management (System Vendor) Configures the drive • Network • authorized clients Monitors • Health • Statistics • Logs Initiates recovery • Change cluster version • 3rdPartyCopy "23
  • 29. Data formats Key Structure • Variable number of octets (0-4KB) Data Structure (Serialized to a byte stream) • KeyOf • Version • E2E Data Integrity –Algorithm name • Data Variable length (0-xMB) "24
  • 30. Performance Metrics Same normal performance expectations • • • • Sequential Write Random Write Sequential Read Sequential Write Iometer for key/value "25
  • 31. Demo Time! Seagate Confidential: Subject to NDA No. 77103, effective Jan. 18, 2009, and all applicable supplements "26
  • 33. Conclusion Deliver more value to Seagate, Partners and Customers • Dis-intermediates cloud applications to drive • Enable innovation in hardware and software ecosystem • Lower TCO OpenSource Software –Basho Riak, Swift, HDFS More information • http://seagate.com/www/kinetic • https://developers.seagate.com • http://guthub.com/Seagate "28