Evolving HDFS to a
Generalized Storage
Subsystem
Sanjay Radia
Chief Architect, Founder,
Hortonworks
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
© Hortonworks Inc. 2013 - Confidential
Hello, my name is Sanjay Radia
Chief Architect, Founder, Hortonworks
Part of the original Hadoop team at Yahoo! since 2007
–Chief Architect of Hadoop Core at Yahoo!
–Apache Hadoop PMC and Committer
Prior
–Data center automation, virtualization, Java, HA, OSs, File Systems
– Startup, Sun Microsystems, Inria …
–Ph.D., University of Waterloo
Architecting the Future of Big Data
Overview
HDFS – Evolution in past and motivations for the future
Scaling HDFS
• Where we do well (# of clients/cluster size, raw storage)
• Where we have challenges (Small files and blocks)
• Solution
• Partial namespace (Briefly)
• Block Containers - But we are generalizing the storage layer to support this
Storage Containers to Generalize the Storage Layer
Background: HDFS Layering
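[Figure: HDFS layering. Namespace layer: NameNodes NN-1 ... NN-k ... NN-n serve namespaces NS1 ... NS k ... and foreign NS n. Block storage layer: a Block Management Layer with Block Pool 1 ... Block Pool k ... Block Pool n, on top of common storage provided by DataNodes DN 1, DN 2 ... DN m.]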
HDFS Dimensions
[Figure: the dimensions of HDFS, with the following labels]
• Security in virtualized compute env
• Large # of compute clients: 100K cores
• Reliability, Disk/DN FT, HA, DR, Snapshots ….
• PBs of Data (Big Data)
• Horizontal Scaling
• Bad Apps
• Multi-tenancy
• Resource Mgt/Isolation, Audit
• Large number of files and blocks
• Beyond files: optimized storage
• Heterogeneous storage
• Erasure codes (In Beta)
• Performance
• File co-location
• Fat DataNodes BRs
• Transparent Encryption
Scalability
The Problems and the Solutions
Scalability – What HDFS Does Well
• HDFS NN stores all namespace metadata in memory (as per GFS)
• Scales to large clusters (5K nodes) since all metadata is in memory
– 60K-100K tasks can share the Namenode
– Low latency
• Handles large data volumes if files are large
• Proof points of large data and large clusters
– Single Organizations have over 600PB in HDFS
– Single clusters with over 200PB using federation
– Large clusters over 4K multi-core nodes bombarding a single NN
Metadata in memory is the strength of the original GFS and HDFS design,
but also its weakness in scaling the number of files and blocks
Scalability - The Challenges
• Challenges
• Large number of files (> 350 million)
• NN’s strength has become a limitation
• Number of File operations
• Need to improve concurrency: move to multiple name servers
HDFS Federation is the current solution
• Add NameNodes to scale number of files & operations
• Deployed at Twitter
• Three NameNodes on a cluster of more than 5,000 nodes (plans to grow to 10,000 nodes)
• Back ported and used at Facebook to scale HDFS
Scaling Files and Blocks
1. Scale Namespace
• Keep only partial namespace in memory - the workingSet
• Of the last 3-5 years of data, only a small portion is actively used
– the working set metadata fits in memory
– Do not want to page the working set => the NN still needs large memory to scale to 100K tasks
2. Scale Block Management
• Keeping only part of the BlockMap in memory does not work
• Solution: Containers of blocks (2GB-16GB+)
• Will reduce BlockMap
• Reduce Number of Block/Container reports
But extend DN to support generalized Storage Container
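A back-of-the-envelope sketch of why containers shrink the central map. Only the 2GB-16GB container range comes from the slide; the 128 MB block size, the 1 PB data volume, and the helper name `map_entries` are illustrative assumptions. It also assumes full-size blocks, so clusters with many small blocks would see a larger reduction.

```python
MB = 1024 ** 2
GB = 1024 ** 3
PB = 1024 ** 5

def map_entries(total_bytes: int, unit_bytes: int) -> int:
    """Entries a central location map needs when it tracks storage in
    units of `unit_bytes` (one entry per block or per container)."""
    return (total_bytes + unit_bytes - 1) // unit_bytes  # integer ceiling

total = 1 * PB          # assumed amount of stored data
block_size = 128 * MB   # assumed block size (not from the talk)

block_map = map_entries(total, block_size)
for container_size in (2 * GB, 16 * GB):
    container_map = map_entries(total, container_size)
    print(f"{container_size // GB:>2} GB containers: "
          f"{block_map:,} block entries -> {container_map:,} container entries "
          f"({block_map // container_map}x fewer)")
```

The same ratio applies to the number of items listed in each report a DataNode sends, since it reports containers instead of individual blocks.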
Big Picture
A Brief Interlude on
Partial Namespace + Volumes
Partial Namespace in Memory is not the focus of this talk
Partial Namespace - Briefly
• Has been prototyped
• Benchmarks show that the model works well
• Most file systems keep only partial namespace in memory but not at this scale
– Hence cache replacement policies for the working set are important
• Work in progress to get it into HDFS
• Namespace Volumes – a better way to Federate the Namespace service
• Partial Namespace in Memory will allow multiple namespace volumes
• Scale both namespace and number of operations using multiple servers
• BTW Nameservers can run on DataNodes if you prefer …
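A minimal sketch of keeping only a working set of the namespace in memory. The slide says only that the replacement policy matters; LRU is an assumption chosen for illustration, and `PartialNamespace`, `backing_store`, and the paths are hypothetical names, not part of HDFS.

```python
from collections import OrderedDict

class PartialNamespace:
    """Keeps a bounded working set of path -> metadata in memory.
    `backing_store` stands in for the full namespace kept outside memory."""

    def __init__(self, backing_store: dict, capacity: int):
        self.backing_store = backing_store
        self.capacity = capacity
        self.cache = OrderedDict()  # least recently used entry first
        self.misses = 0

    def lookup(self, path: str):
        if path in self.cache:
            self.cache.move_to_end(path)    # mark as most recently used
            return self.cache[path]
        self.misses += 1
        meta = self.backing_store[path]     # slow path: load the entry
        self.cache[path] = meta
        if len(self.cache) > self.capacity:
            self.cache.popitem(last=False)  # evict least recently used
        return meta

full = {f"/data/f{i}": {"blocks": [i]} for i in range(1000)}
ns = PartialNamespace(full, capacity=2)
for p in ("/data/f1", "/data/f2", "/data/f1", "/data/f3"):
    ns.lookup(p)
print(list(ns.cache), ns.misses)
```

With a capacity of two, the fourth lookup evicts `/data/f2`, because `/data/f1` was touched more recently.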
Big Picture on HDFS Namespace + Volumes ..
• Only WorkingSet of namespace in memory
  – Scale beyond memory of NN
• NameServer – Containers for namespaces
  – More namespace volumes
    – Chosen per user/tenant/DBs
    – Management policies (quota, backup, DR …)
    – Mount tables for unified namespace
      • Can be managed by a central volume server
• Number of NameServers =
  – Sum of (Namespace working set) +
  – Sum of (Namespace throughput)
  – Move namespace for balancing
  – N+K Failover amongst NameServers
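One way to picture a mount table that stitches namespace volumes into a unified namespace is longest-prefix matching on the path. This is a sketch under that assumption; the `MountTable` class and the volume names are hypothetical and do not reflect an actual HDFS API.

```python
class MountTable:
    """Maps path prefixes to namespace volumes; the longest prefix wins."""

    def __init__(self):
        self.mounts = {}  # mount point -> volume name

    def add(self, mount_point: str, volume: str) -> None:
        self.mounts[mount_point.rstrip("/") or "/"] = volume

    def resolve(self, path: str):
        """Return (volume, path relative to that volume's mount point)."""
        best = None
        for mount_point in self.mounts:
            prefix = mount_point if mount_point.endswith("/") else mount_point + "/"
            if path == mount_point or path.startswith(prefix):
                if best is None or len(mount_point) > len(best):
                    best = mount_point
        if best is None:
            raise KeyError(f"no volume mounted for {path}")
        relative = "/" + path[len(best):].lstrip("/")
        return self.mounts[best], relative

table = MountTable()
table.add("/", "vol-root")               # hypothetical volume names
table.add("/user", "vol-users")
table.add("/user/alice", "vol-alice")    # a volume chosen per user
print(table.resolve("/user/alice/a.txt"))
print(table.resolve("/user/bob/b.txt"))
```

Because clients resolve paths through the table, a volume can move to another NameServer for balancing without its paths changing.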
[Figure: NameServers as containers of namespaces, on top of a storage layer of DataNodes]
Storage Containers:
Better HDFS and Beyond
Big Picture
Support multiple data layout structures
• Indexing
• Caching
• Use cases
• HDFS Block Container (scale blocks) + Co-location
• Object Store Container
• Local replica + S3 replica
• HBase
• Block Store (e.g. Cinder for OpenStack)
Common Shared Infrastructure for
• Replication
• Consistency
• Cluster membership
• Container location
Other Container Benefits
• Place to put in protocol enhancements
• Smaller riskier features
[Figure: storage container architecture. Applications: HDFS, HBase, Object Store Metadata. Containers: Block Container, Object Store Container, HBase Container, Table Container. Container Management Services (runs on DataNodes): Cluster Membership, Replication Management, Container Location Service. Physical Storage - Shared.]
Current vs New World (Storage Containers)
Current
• Namespace (in NameNode)
• File=BlockIds[]
• BlockManager (In NameNode)
• BlockMap: BlockId->locations
• PipeLine repair
• Replication management
• BlockData in DataNode
• BlockId->Data
• Other
• Generation Id (note BlockId=Gen#+Number)
• File/Block Completion coordination
New World
• Namespace (in NameNode)
• File=BlockIds[] (but BlockId=ContainerId+LocalBid)
• ContainerManager (logically central)
• ContainerMap: ContainerId->locations
• Replication management
• Cluster membership
• Containers (in DataNode)
• Container’s BlockMetadata + Data
• BlockId->Data
• PipeLine repair
• Block Completion
• GenerationId equivalent? (Epoch of Raft?)
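The composite id BlockId=ContainerId+LocalBid can be sketched as two fields packed into one integer. The talk does not specify an encoding; the 24-bit local id width and the `BlockId` class below are assumptions for illustration only.

```python
from typing import NamedTuple

LOCAL_BITS = 24  # assumed width of the local block id; not from the talk

class BlockId(NamedTuple):
    container_id: int
    local_bid: int

    def encode(self) -> int:
        """Pack both parts into a single integer."""
        if not 0 <= self.local_bid < (1 << LOCAL_BITS):
            raise ValueError("local block id out of range")
        return (self.container_id << LOCAL_BITS) | self.local_bid

    @classmethod
    def decode(cls, packed: int) -> "BlockId":
        """Split a packed id back into (container id, local block id)."""
        return cls(packed >> LOCAL_BITS, packed & ((1 << LOCAL_BITS) - 1))

bid = BlockId(container_id=7, local_bid=42)
packed = bid.encode()
print(packed, BlockId.decode(packed))
```

The point of the split is that the ContainerManager only needs the `container_id` part to find locations, while the `local_bid` part is resolved inside the container on the DataNode.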
Storage Container
• Contains data for many blocks with different block ids
• Recall how the client will perform the mapping:
–file -> blockId[] (NN)
–blockId -> ContainerLocation (Container Manager)
–Container maps the blockId to data (DataNode)
• A container can be viewed as a local key-value store.
–Block Id is the key and Block data is the value
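The three-step mapping above can be sketched with plain dictionaries standing in for the NameNode, the Container Manager, and the DataNodes. All names, paths, and data are hypothetical, and replica selection is reduced to "take the first location".

```python
# Step 1 (NameNode): file -> list of block ids, each (container id, local block id).
namespace = {"/logs/app.log": [(1, 0), (1, 1), (2, 0)]}

# Step 2 (Container Manager): container id -> DataNodes holding a replica.
container_map = {1: ["dn1", "dn2"], 2: ["dn2", "dn3"]}

# Step 3 (DataNode): each container is a local key-value store, block id -> data.
container_1 = {0: b"hello ", 1: b"storage "}
container_2 = {0: b"containers"}
datanodes = {
    "dn1": {1: container_1},
    "dn2": {1: container_1, 2: container_2},
    "dn3": {2: container_2},
}

def read_file(path: str) -> bytes:
    """Resolve a file to its data by following the three mappings in order."""
    data = b""
    for container_id, local_bid in namespace[path]:
        datanode = container_map[container_id][0]          # pick a replica
        data += datanodes[datanode][container_id][local_bid]
    return data

print(read_file("/logs/app.log"))
```

Note that the central map is keyed by container, not by block, so it has two entries here even though the file has three blocks.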
Container Structure (Using LevelDB/RocksDB)
[Figure: a container consists of an index and several chunk data files. The index is an LSM key-value store (LevelDB/RocksDB) holding Key 1 ... Key N; each entry records Chunk Data File Name, Offset, and Length.]
• An embeddable key-value store
• BlockId is the key and filename of local chunk file is value
• Optimizations
– Small blocks (< 1MB) can be stored directly in RocksDB
– Compaction for block data to avoid lots of files
• But this can be evolved over time
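A minimal sketch of the container layout described above, with a Python dict standing in for the LevelDB/RocksDB index (no real key-value store is used). Blocks under 1 MB are stored directly in the index, as the slide suggests; larger blocks go to a chunk file and the index records file name, offset, and length. The `Container` class and the file naming are hypothetical.

```python
import os
import tempfile

SMALL_BLOCK_LIMIT = 1024 * 1024  # blocks under 1 MB are stored inline

class Container:
    def __init__(self, directory: str):
        self.directory = directory
        self.index = {}  # stand-in for LevelDB/RocksDB: block id -> entry

    def put_block(self, block_id: int, data: bytes) -> None:
        if len(data) < SMALL_BLOCK_LIMIT:
            self.index[block_id] = ("inline", data)
            return
        name = f"chunk_{block_id}.data"  # hypothetical naming scheme
        with open(os.path.join(self.directory, name), "wb") as f:
            f.write(data)
        self.index[block_id] = ("file", name, 0, len(data))

    def get_block(self, block_id: int) -> bytes:
        entry = self.index[block_id]
        if entry[0] == "inline":
            return entry[1]
        _, name, offset, length = entry
        with open(os.path.join(self.directory, name), "rb") as f:
            f.seek(offset)
            return f.read(length)

with tempfile.TemporaryDirectory() as d:
    c = Container(d)
    c.put_block(1, b"tiny block")
    c.put_block(2, b"x" * (2 * SMALL_BLOCK_LIMIT))
    print(c.index[1][0], c.index[2][:2], len(c.get_block(2)))
```

This version writes one chunk file per large block; the compaction mentioned on the slide would instead pack many blocks into fewer files, which is why the index keeps an offset and length.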
Container Structure Can Support Random Writes
• 4KB Chunks can be atomically updated in K-V store
• Chunk Data can be added at end of Chunk file (Log structured FSs)
[Figure: a container consists of an index and several chunk data files. The index is an LSM key-value store (e.g. LevelDB/RocksDB) holding Key 1 ... Key N; each entry records Chunk Data File Name, Offset, and Length.]
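A sketch of how such a structure could support random writes in a log-structured way: an overwrite appends the new chunk to the end of the chunk file and then repoints the index entry. The dict again stands in for the K-V store, where the single-key update would be the atomic step; the class and file names are hypothetical.

```python
import os
import tempfile

CHUNK_SIZE = 4096  # 4 KB chunks, as on the slide

class LogStructuredContainer:
    def __init__(self, directory: str):
        self.data_path = os.path.join(directory, "chunks.data")
        open(self.data_path, "ab").close()  # create the chunk file if missing
        self.index = {}  # (block id, chunk no) -> (offset, length)

    def write_chunk(self, block_id: int, chunk_no: int, data: bytes) -> None:
        if len(data) > CHUNK_SIZE:
            raise ValueError("chunk larger than 4 KB")
        with open(self.data_path, "ab") as f:
            f.seek(0, os.SEEK_END)
            offset = f.tell()
            f.write(data)  # never overwrite in place, always append
        # One key update; a real K-V store would make this put atomic.
        self.index[(block_id, chunk_no)] = (offset, len(data))

    def read_chunk(self, block_id: int, chunk_no: int) -> bytes:
        offset, length = self.index[(block_id, chunk_no)]
        with open(self.data_path, "rb") as f:
            f.seek(offset)
            return f.read(length)

with tempfile.TemporaryDirectory() as d:
    c = LogStructuredContainer(d)
    c.write_chunk(1, 0, b"old")
    c.write_chunk(1, 1, b"next")
    c.write_chunk(1, 0, b"new!")  # random write to chunk 0 of block 1
    print(c.read_chunk(1, 0), c.read_chunk(1, 1))
```

The superseded bytes stay in the file until some compaction pass reclaims them, which this sketch does not implement.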
Replication: Possible Approaches
• Data pipeline
–Data pipeline as a form of chain-replication has been successfully used for data
–However, its correctness depended on central coordinator
–Needs to be extended for block metadata, but hard to get it right given no central coordinator
• Use RAFT replication instead of data pipeline, for both data and metadata
–Proven to be correct
–Has been primarily used for small updates and transactions, fits well for metadata
–Performance concerns for large streaming writes, needs prototyping
• Hybrid: RAFT + Pipeline
–Hybrid approach: It can be viewed as if central coordinator is replaced by RAFT
–Data pipeline approach for the data + the raft protocol -- under discussion
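For readers unfamiliar with the data pipeline, here is a toy sketch of the chain-replication idea: each replica stores the data, forwards it to the next replica, and acknowledgements flow back from the tail to the head. It deliberately omits failure handling and pipeline repair, which is exactly the part that needs a coordinator (or RAFT in the hybrid approach). The `Replica` class and node names are hypothetical.

```python
class Replica:
    """One node in a write pipeline; `next_replica` is its downstream neighbour."""

    def __init__(self, name: str, next_replica: "Replica | None" = None):
        self.name = name
        self.next_replica = next_replica
        self.store = {}

    def write(self, block_id: int, data: bytes) -> list:
        """Store locally, forward downstream, and return acks in the
        order they arrive back at the caller (tail first)."""
        self.store[block_id] = data
        if self.next_replica is None:
            return [self.name]               # tail acknowledges first
        acks = self.next_replica.write(block_id, data)
        return acks + [self.name]            # then each upstream node

dn3 = Replica("dn3")
dn2 = Replica("dn2", dn3)
dn1 = Replica("dn1", dn2)

acks = dn1.write(block_id=1, data=b"payload")  # the client writes to the head
print(acks)
```

The client treats the write as complete only once the head's ack arrives, at which point every replica in the chain holds the data.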
Next steps
• Remove Block management layer’s locking with Namespace
• Reduce lock contention, remove the tight coupling (immediate benefit)
• Allows us to implement a cleanly separated Container Management layer
• Block container (to support tens of billions of blocks)
• 2-4GB block containers initially => 40-80x reduction in BRs and block map
• Reduce BR pressure on NN
• Early release:
– Single Replica Containers for a Cloud Storage Caching FS (Similar to HDFS-9806)
• Partial Namespace (to billions of files per volume)
• Will take us to 2B files initially and then more as we gain experience on file-working-set management
• Volumes + N+K failover
• Scale both ops and namespace + operational improvement for HA
• Other containers
• Local Replica & Cloud storage (e.g. S3) replica (Caching Mount)
• Object store, HBase …..
Summary
• HDFS scale proven in real production systems
• 4K+ node clusters
• Raw Storage >200PB in single federated NN cluster and >30PB in non-federated clusters
• But very large number of small files is a challenge
• Important Area of Current Focus: Scaling # Files and Blocks
• Partial Namespace: initially scale to 2B files, later 5-10B files per volume + multiple volumes
• Block containers: initially scale to 6B-12B blocks, later to 100B+ blocks
– However we are implementing this to extend the storage layer
• Restructuring storage layer to support generalized storage containers
• Support storage needs beyond HDFS: Object Store, better HBase support, etc.
Q&A
Thank You

More Related Content

What's hot

Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesScaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesDataWorks Summit
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureDataWorks Summit
 
Managing Hadoop, HBase and Storm Clusters at Yahoo Scale
Managing Hadoop, HBase and Storm Clusters at Yahoo ScaleManaging Hadoop, HBase and Storm Clusters at Yahoo Scale
Managing Hadoop, HBase and Storm Clusters at Yahoo ScaleDataWorks Summit/Hadoop Summit
 
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...DataWorks Summit
 
Schema Registry - Set Your Data Free
Schema Registry - Set Your Data FreeSchema Registry - Set Your Data Free
Schema Registry - Set Your Data FreeDataWorks Summit
 
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...DataWorks Summit
 
Scaling Hadoop at LinkedIn
Scaling Hadoop at LinkedInScaling Hadoop at LinkedIn
Scaling Hadoop at LinkedInDataWorks Summit
 
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARN
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARNDeathStar: Easy, Dynamic, Multi-Tenant HBase via YARN
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARNDataWorks Summit
 
Hortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices WorkshopHortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices WorkshopHortonworks
 
Operating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and ImprovementsOperating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and ImprovementsDataWorks Summit/Hadoop Summit
 
The Unbearable Lightness of Ephemeral Processing
The Unbearable Lightness of Ephemeral ProcessingThe Unbearable Lightness of Ephemeral Processing
The Unbearable Lightness of Ephemeral ProcessingDataWorks Summit
 
End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...
End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...
End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...DataWorks Summit/Hadoop Summit
 

What's hot (20)

Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesScaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
 
Apache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and FutureApache Hadoop YARN: Present and Future
Apache Hadoop YARN: Present and Future
 
Managing Hadoop, HBase and Storm Clusters at Yahoo Scale
Managing Hadoop, HBase and Storm Clusters at Yahoo ScaleManaging Hadoop, HBase and Storm Clusters at Yahoo Scale
Managing Hadoop, HBase and Storm Clusters at Yahoo Scale
 
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, ScaleApache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
 
The state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the CloudThe state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the Cloud
 
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
 
Apache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduceApache Hadoop 3.0 What's new in YARN and MapReduce
Apache Hadoop 3.0 What's new in YARN and MapReduce
 
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in HiveLLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
 
Schema Registry - Set Your Data Free
Schema Registry - Set Your Data FreeSchema Registry - Set Your Data Free
Schema Registry - Set Your Data Free
 
What's new in Ambari
What's new in AmbariWhat's new in Ambari
What's new in Ambari
 
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
Dancing Elephants - Efficiently Working with Object Stores from Apache Spark ...
 
Scaling Hadoop at LinkedIn
Scaling Hadoop at LinkedInScaling Hadoop at LinkedIn
Scaling Hadoop at LinkedIn
 
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARN
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARNDeathStar: Easy, Dynamic, Multi-Tenant HBase via YARN
DeathStar: Easy, Dynamic, Multi-Tenant HBase via YARN
 
Hortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices WorkshopHortonworks Technical Workshop - Operational Best Practices Workshop
Hortonworks Technical Workshop - Operational Best Practices Workshop
 
Ingest and Stream Processing - What will you choose?
Ingest and Stream Processing - What will you choose?Ingest and Stream Processing - What will you choose?
Ingest and Stream Processing - What will you choose?
 
Empower Data-Driven Organizations with HPE and Hadoop
Empower Data-Driven Organizations with HPE and HadoopEmpower Data-Driven Organizations with HPE and Hadoop
Empower Data-Driven Organizations with HPE and Hadoop
 
Operating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and ImprovementsOperating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and Improvements
 
Curb your insecurity with HDP
Curb your insecurity with HDPCurb your insecurity with HDP
Curb your insecurity with HDP
 
The Unbearable Lightness of Ephemeral Processing
The Unbearable Lightness of Ephemeral ProcessingThe Unbearable Lightness of Ephemeral Processing
The Unbearable Lightness of Ephemeral Processing
 
End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...
End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...
End to End Processing of 3.7 Million Telemetry Events per Second using Lambda...
 

Viewers also liked

Big Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyondBig Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyondDataWorks Summit/Hadoop Summit
 
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data AnalysisApache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data AnalysisDataWorks Summit/Hadoop Summit
 
Faster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on Hive
Faster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on HiveFaster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on Hive
Faster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on HiveDataWorks Summit/Hadoop Summit
 
Webinar: Solr 6 Deep Dive - SQL and Graph
Webinar: Solr 6 Deep Dive - SQL and GraphWebinar: Solr 6 Deep Dive - SQL and Graph
Webinar: Solr 6 Deep Dive - SQL and GraphLucidworks
 
August 2013 HUG: Removing the NameNode's memory limitation
August 2013 HUG: Removing the NameNode's memory limitation August 2013 HUG: Removing the NameNode's memory limitation
August 2013 HUG: Removing the NameNode's memory limitation Yahoo Developer Network
 
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...DataWorks Summit/Hadoop Summit
 
Ozone- Object store for Apache Hadoop
Ozone- Object store for Apache HadoopOzone- Object store for Apache Hadoop
Ozone- Object store for Apache HadoopHortonworks
 
Ozone: An Object Store in HDFS
Ozone: An Object Store in HDFSOzone: An Object Store in HDFS
Ozone: An Object Store in HDFSDataWorks Summit
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionDataWorks Summit/Hadoop Summit
 
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Precisely
 

Viewers also liked (20)

LEGO: Data Driven Growth Hacking Powered by Big Data
LEGO: Data Driven Growth Hacking Powered by Big Data LEGO: Data Driven Growth Hacking Powered by Big Data
LEGO: Data Driven Growth Hacking Powered by Big Data
 
YARN Federation
YARN Federation YARN Federation
YARN Federation
 
Building a Data Lake on AWS
Building a Data Lake on AWSBuilding a Data Lake on AWS
Building a Data Lake on AWS
 
Big Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyondBig Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyond
 
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data AnalysisApache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
Apache Zeppelin + LIvy: Bringing Multi Tenancy to Interactive Data Analysis
 
Faster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on Hive
Faster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on HiveFaster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on Hive
Faster, Faster, Faster: The True Story of a Mobile Analytics Data Mart on Hive
 
Webinar: Solr 6 Deep Dive - SQL and Graph
Webinar: Solr 6 Deep Dive - SQL and GraphWebinar: Solr 6 Deep Dive - SQL and Graph
Webinar: Solr 6 Deep Dive - SQL and Graph
 
Apache NiFi Crash Course Intro
Apache NiFi Crash Course IntroApache NiFi Crash Course Intro
Apache NiFi Crash Course Intro
 
August 2013 HUG: Removing the NameNode's memory limitation
August 2013 HUG: Removing the NameNode's memory limitation August 2013 HUG: Removing the NameNode's memory limitation
August 2013 HUG: Removing the NameNode's memory limitation
 
NLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-TextNLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-Text
 
Enterprise Grade Streaming under 2ms on Hadoop
Enterprise Grade Streaming under 2ms on HadoopEnterprise Grade Streaming under 2ms on Hadoop
Enterprise Grade Streaming under 2ms on Hadoop
 
Data Preparation of Data Science
Data Preparation of Data ScienceData Preparation of Data Science
Data Preparation of Data Science
 
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
 
Lambda-less Stream Processing @Scale in LinkedIn
Lambda-less Stream Processing @Scale in LinkedIn Lambda-less Stream Processing @Scale in LinkedIn
Lambda-less Stream Processing @Scale in LinkedIn
 
Ozone- Object store for Apache Hadoop
Ozone- Object store for Apache HadoopOzone- Object store for Apache Hadoop
Ozone- Object store for Apache Hadoop
 
Ozone: An Object Store in HDFS
Ozone: An Object Store in HDFSOzone: An Object Store in HDFS
Ozone: An Object Store in HDFS
 
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, ScaleApache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in Production
 
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
 
Big Data Ready Enterprise
Big Data Ready Enterprise Big Data Ready Enterprise
Big Data Ready Enterprise
 

Similar to Evolving HDFS to a Generalized Storage Subsystem

Evolving HDFS to a Generalized Distributed Storage Subsystem
Evolving HDFS to a Generalized Distributed Storage SubsystemEvolving HDFS to a Generalized Distributed Storage Subsystem
Evolving HDFS to a Generalized Distributed Storage SubsystemDataWorks Summit/Hadoop Summit
 
Ozone and HDFS’s evolution
Ozone and HDFS’s evolutionOzone and HDFS’s evolution
Ozone and HDFS’s evolutionDataWorks Summit
 
Ozone and HDFS's Evolution
Ozone and HDFS's EvolutionOzone and HDFS's Evolution
Ozone and HDFS's EvolutionDataWorks Summit
 
Ozone and HDFS’s evolution
Ozone and HDFS’s evolutionOzone and HDFS’s evolution
Ozone and HDFS’s evolutionDataWorks Summit
 
Ozone: scaling HDFS to trillions of objects
Ozone: scaling HDFS to trillions of objectsOzone: scaling HDFS to trillions of objects
Ozone: scaling HDFS to trillions of objectsDataWorks Summit
 
HDFS Federation++
HDFS Federation++HDFS Federation++
HDFS Federation++Hortonworks
 
Dancing elephants - efficiently working with object stores from Apache Spark ...
Dancing elephants - efficiently working with object stores from Apache Spark ...Dancing elephants - efficiently working with object stores from Apache Spark ...
Dancing elephants - efficiently working with object stores from Apache Spark ...DataWorks Summit
 
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBase
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBaseHBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBase
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBaseCloudera, Inc.
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseenissoz
 
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesScaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesDataWorks Summit/Hadoop Summit
 
HDFS- What is New and Future
HDFS- What is New and FutureHDFS- What is New and Future
HDFS- What is New and FutureDataWorks Summit
 
Apache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateApache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateDataWorks Summit
 
Moving towards enterprise ready Hadoop clusters on the cloud
Moving towards enterprise ready Hadoop clusters on the cloudMoving towards enterprise ready Hadoop clusters on the cloud
Moving towards enterprise ready Hadoop clusters on the cloudDataWorks Summit/Hadoop Summit
 
Hadoop operations-2014-strata-new-york-v5
Hadoop operations-2014-strata-new-york-v5Hadoop operations-2014-strata-new-york-v5
Hadoop operations-2014-strata-new-york-v5Chris Nauroth
 
Democratizing Memory Storage
Democratizing Memory StorageDemocratizing Memory Storage
Democratizing Memory StorageDataWorks Summit
 
Big data spain keynote nov 2016
Big data spain keynote nov 2016Big data spain keynote nov 2016
Big data spain keynote nov 2016alanfgates
 
Hadoop & cloud storage object store integration in production (final)
Hadoop & cloud storage  object store integration in production (final)Hadoop & cloud storage  object store integration in production (final)
Hadoop & cloud storage object store integration in production (final)Chris Nauroth
 

Similar to Evolving HDFS to a Generalized Storage Subsystem (20)

Evolving HDFS to a Generalized Distributed Storage Subsystem
Evolving HDFS to a Generalized Distributed Storage SubsystemEvolving HDFS to a Generalized Distributed Storage Subsystem
Evolving HDFS to a Generalized Distributed Storage Subsystem
 
Evolving HDFS to Generalized Storage Subsystem
Evolving HDFS to Generalized Storage SubsystemEvolving HDFS to Generalized Storage Subsystem
Evolving HDFS to Generalized Storage Subsystem
 
Ozone and HDFS’s evolution
Ozone and HDFS’s evolutionOzone and HDFS’s evolution
Ozone and HDFS’s evolution
 
Ozone and HDFS's Evolution
Ozone and HDFS's EvolutionOzone and HDFS's Evolution
Ozone and HDFS's Evolution
 
Ozone and HDFS’s evolution
Ozone and HDFS’s evolutionOzone and HDFS’s evolution
Ozone and HDFS’s evolution
 
Ozone: scaling HDFS to trillions of objects
Ozone: scaling HDFS to trillions of objectsOzone: scaling HDFS to trillions of objects
Ozone: scaling HDFS to trillions of objects
 
HDFS Federation++
HDFS Federation++HDFS Federation++
HDFS Federation++
 
Hadoop 3 in a Nutshell
Hadoop 3 in a NutshellHadoop 3 in a Nutshell
Hadoop 3 in a Nutshell
 
Dancing elephants - efficiently working with object stores from Apache Spark ...
Dancing elephants - efficiently working with object stores from Apache Spark ...Dancing elephants - efficiently working with object stores from Apache Spark ...
Dancing elephants - efficiently working with object stores from Apache Spark ...
 
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBase
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBaseHBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBase
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBase
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBase
 
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesScaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
 
HDFS- What is New and Future
HDFS- What is New and FutureHDFS- What is New and Future
HDFS- What is New and Future
 
Apache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateApache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community Update
 
Moving towards enterprise ready Hadoop clusters on the cloud
Moving towards enterprise ready Hadoop clusters on the cloudMoving towards enterprise ready Hadoop clusters on the cloud
Moving towards enterprise ready Hadoop clusters on the cloud
 
Hadoop operations-2014-strata-new-york-v5
Hadoop operations-2014-strata-new-york-v5Hadoop operations-2014-strata-new-york-v5
Hadoop operations-2014-strata-new-york-v5
 
Democratizing Memory Storage
Democratizing Memory StorageDemocratizing Memory Storage
Democratizing Memory Storage
 
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in HiveLLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
 
Big data spain keynote nov 2016
Big data spain keynote nov 2016Big data spain keynote nov 2016
Big data spain keynote nov 2016
 
Hadoop & cloud storage object store integration in production (final)
Hadoop & cloud storage  object store integration in production (final)Hadoop & cloud storage  object store integration in production (final)
Hadoop & cloud storage object store integration in production (final)
 


Evolving HDFS to a Generalized Storage Subsystem

  • 1. Evolving HDFS to a Generalized Storage Subsystem
      Sanjay Radia, Chief Architect, Founder, Hortonworks
  • 2. Hello, my name is Sanjay Radia
      Chief Architect, Founder, Hortonworks
      Part of the original Hadoop team at Yahoo! since 2007
        – Chief Architect of Hadoop Core at Yahoo!
        – Apache Hadoop PMC and Committer
      Prior
        – Data center automation, virtualization, Java, HA, OSs, File Systems
        – Startup, Sun Microsystems, Inria …
        – Ph.D., University of Waterloo
  • 3. Overview
      HDFS – Evolution in past and motivations for the future
      Scaling HDFS
        • Where we do well (# of clients/cluster size, raw storage)
        • Where we have challenges (small files and blocks)
        • Solution
            • Partial namespace (briefly)
            • Block Containers - but we are generalizing the storage layer to support this
      Storage Containers to Generalize the Storage Layer
  • 4. Background: HDFS Layering
      [Diagram: a Namespace layer of NameNodes (NN-1 … NN-k … NN-n) serving namespaces NS1 … NS k … Foreign NS n, above a Block Storage layer consisting of a Block Management Layer with Block Pool 1 … Block Pool k … Block Pool n, over common storage on DataNodes DN 1, DN 2, … DN m]
  • 5. HDFS Dimensions
      [Diagram labels:]
        • Security in virtualized compute env
        • Large # of compute clients: 100K cores
        • Reliability: Disk/DN FT, HA, DR, Snapshots …
        • PBs of Data (Big Data), Horizontal Scaling
        • Bad Apps, Multi-tenancy, Resource Mgt/Isolation, Audit
        • Large number of files and blocks
        • Beyond files: optimized storage
        • Heterogeneous storage
        • Erasure codes (In Beta)
        • Performance, File co-location
        • Fat DataNodes BRs
        • Transparent Encryption
  • 6. Scalability: The Problems and the Solutions
  • 7. Scalability – What HDFS Does Well
      • HDFS NN stores all namespace metadata in memory (as per GFS)
      • Scales to large clusters (5K) since all metadata is in memory
          – 60K-100K tasks can share the NameNode
          – Low latency
      • Large data if files are large
      • Proof points of large data and large clusters
          – Single organizations have over 600PB in HDFS
          – Single clusters with over 200PB using federation
          – Large clusters of over 4K multi-core nodes bombarding a single NN
      Metadata in memory is the strength of the original GFS and HDFS design, but also its weakness in scaling the number of files and blocks
  • 8. Scalability - The Challenges
      • Challenges
          • Large number of files (> 350 million)
              • NN's strength has become a limitation
          • Number of file operations
              • Need to improve concurrency; move to multiple name servers
      HDFS Federation is the current solution
        • Add NameNodes to scale number of files & operations
        • Deployed at Twitter
            • Cluster with three NameNodes, > 5000 node cluster (plans to grow to 10,000 nodes)
        • Back ported and used at Facebook to scale HDFS
  • 9. Scaling Files and Blocks
      1. Scale Namespace
          • Keep only a partial namespace in memory - the working set
          • Of the last 3-5 years of data, only a small portion is actively used – the working set metadata fits in memory
          • Do not want to page the working set => still need large NN memory to scale to 100K tasks
      2. Scale Block Management
          • Keeping only part of the BlockMap in memory does not work
          • Solution: containers of blocks (2GB-16GB+)
              • Will reduce the BlockMap
              • Reduce number of Block/Container reports
      But extend DN to support generalized Storage Container
  • 10. Big Picture: A Brief Interlude on Partial Namespace + Volumes
      Partial Namespace in Memory is not the focus of this talk
  • 11. Partial Namespace - Briefly
      • Has been prototyped
          • Benchmarks show that the model works well
          • Most file systems keep only a partial namespace in memory, but not at this scale
              – Hence cache replacement policies for the working set are important
      • Work in progress to get it into HDFS
      • Namespace Volumes – a better way to federate the namespace service
          • Partial Namespace in Memory will allow multiple namespace volumes
          • Scale both namespace and number of operations using multiple servers
      • BTW NameServers can run on DataNodes if you prefer …
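The slide stresses that cache replacement for the working set matters but does not name a policy. As a minimal sketch, assuming a plain LRU policy (an illustrative choice, not something the talk specifies), a partial in-memory namespace over a slower backing store could look like this. The class name `WorkingSetCache` and the dictionary used as backing store are hypothetical.

```python
from collections import OrderedDict


class WorkingSetCache:
    """LRU cache standing in for the in-memory part of a namespace (assumed policy)."""

    def __init__(self, capacity, backing_store):
        self.capacity = capacity
        self.backing_store = backing_store  # full namespace, e.g. persisted on disk
        self.cache = OrderedDict()          # path -> metadata, least recently used first
        self.misses = 0

    def lookup(self, path):
        if path in self.cache:
            self.cache.move_to_end(path)    # mark as most recently used
            return self.cache[path]
        self.misses += 1
        meta = self.backing_store[path]     # slow path: load from the backing store
        self.cache[path] = meta
        if len(self.cache) > self.capacity:
            self.cache.popitem(last=False)  # evict the least recently used entry
        return meta


# Hypothetical namespace with five files; only two entries fit in memory.
namespace = {f"/data/part-{i}": {"blocks": [i]} for i in range(5)}
ws = WorkingSetCache(capacity=2, backing_store=namespace)
for p in ["/data/part-0", "/data/part-1", "/data/part-0", "/data/part-2"]:
    ws.lookup(p)
```

After these four lookups the cache holds `/data/part-0` and `/data/part-2`; `/data/part-1` was evicted because it was the least recently used entry.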
  • 12. Big Picture on HDFS Namespace + Volumes
      • Only working set of namespace in memory
          › Scale beyond memory of NN
      • NameServer – containers for namespaces
          › More namespace volumes
              – Chosen per user/tenant/DBs
              – Management policies (quota, backup, DR …)
              – Mount tables for unified namespace
                  • Can be managed by a central volume server
      • Number of NameServers =
          › Sum of (Namespace working set) +
          › Sum of (Namespace throughput)
          › Move namespace for balancing
          › N+K failover amongst NameServers
      [Diagram: NameServers as Containers of Namespaces, above a Storage Layer of DataNodes]
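The mount-table idea on this slide can be illustrated with a small longest-prefix resolver. This is only a sketch under assumptions: the function name `resolve`, the volume names, and the mount points are hypothetical, and the talk does not describe how its mount tables are actually implemented.

```python
def resolve(mount_table, path):
    """Return (volume, path inside that volume) using longest-prefix match."""
    best_key, best_prefix = None, None
    for key in mount_table:
        prefix = key.rstrip("/")
        # Match whole path components only, so "/user" does not cover "/username".
        if path == prefix or path.startswith(prefix + "/"):
            if best_prefix is None or len(prefix) > len(best_prefix):
                best_key, best_prefix = key, prefix
    if best_key is None:
        raise KeyError(f"no mount point covers {path}")
    return mount_table[best_key], path[len(best_prefix):] or "/"


# Hypothetical mount table: one namespace volume per tenant or database.
mounts = {
    "/user": "volume-users",
    "/user/alice": "volume-alice",
    "/warehouse": "volume-hive",
}

alice = resolve(mounts, "/user/alice/logs/a.txt")
bob = resolve(mounts, "/user/bob/x")
root_of_hive = resolve(mounts, "/warehouse")
```

Because the most specific mount point wins, a volume can be carved out for a single tenant (here `/user/alice`) without changing the paths clients use.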
  • 13. Storage Containers: Better HDFS and Beyond
  • 14. DataNodes Big Picture
      Support multiple data layout structures
        • Indexing
        • Caching
      • Use cases
          • HDFS Block Container (scale blocks) + co-location
          • Object Store Container
          • Local replica + S3 replica
          • HBase
          • Block Store (e.g. Cinder for OpenStack)
      Common shared infrastructure for
        • Replication
        • Consistency
        • Cluster membership
        • Container location
      Other container benefits
        • Place to put in protocol enhancements
        • Smaller riskier features
      [Diagram: applications (HDFS, HBase, Object Store Metadata) above Container Management Services that run on DataNodes (Cluster Membership, Replication Management, Container Location Service), with Block Container, Object Store Container, HBase Container and Table Container on shared physical storage]
  • 15. Current vs New World (Storage Containers)
      Current
        • Namespace (in NameNode)
            • File = BlockIds[]
        • BlockManager (in NameNode)
            • BlockMap: BlockId -> locations
            • Pipeline repair
            • Replication management
        • BlockData in DataNode
            • BlockId -> Data
        • Other
            • Generation Id (note BlockId = Gen# + Number)
            • File/Block completion coordination
      New World
        • Namespace (in NameNode)
            • File = BlockIds[] (but BlockId = ContainerId + LocalBid)
        • ContainerManager (logically central)
            • ContainerMap: ContainerId -> locations
            • Replication management
            • Cluster membership
        • Containers (in DataNode)
            • Container's BlockMetadata + Data
            • BlockId -> Data
            • Pipeline repair
            • Block completion
            • GenerationId equivalent? (Epoch of Raft?)
  • 16. Storage Container
      Contains data for many blocks with different block ids
      Recall how the client will perform the mapping:
        – file -> blockId[] (NN)
        – blockId -> ContainerLocation (Container Manager)
        – Container maps the blockId to data (DataNode)
      A container can be viewed as a local key-value store.
        – Block Id is the key and block data is the value
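The three-step mapping on this slide can be traced with plain dictionaries. This is a toy sketch, not HDFS code: the block id is modelled as a `(container_id, local_block_id)` pair, following the "BlockId = ContainerId + LocalBid" note on the previous slide, and all paths, ids, DataNode names, and the "read from the first replica" rule are assumptions made for illustration.

```python
# Step 1 (NameNode): file -> list of block ids.
# A block id is modelled as (container_id, local_block_id); the encoding is assumed.
namenode = {"/logs/app.log": [(7, 0), (7, 1), (9, 0)]}

# Step 2 (Container Manager): container id -> DataNodes holding a replica.
container_manager = {7: ["dn1", "dn2", "dn3"], 9: ["dn2", "dn4", "dn5"]}

# Step 3 (DataNode): each container is a local key-value store, block id -> data.
containers = {
    7: {(7, 0): b"hello ", (7, 1): b"storage "},
    9: {(9, 0): b"containers"},
}
datanodes = {}
for cid, nodes in container_manager.items():
    for dn in nodes:
        datanodes.setdefault(dn, {})[cid] = dict(containers[cid])


def read_file(path):
    """Follow file -> block ids -> container location -> block data."""
    data = b""
    for block_id in namenode[path]:
        container_id, _local_id = block_id
        dn = container_manager[container_id][0]        # pick the first replica
        data += datanodes[dn][container_id][block_id]  # key-value lookup in the container
    return data


content = read_file("/logs/app.log")
```

Note that the Container Manager only tracks two entries here (one per container), while the NameNode still lists three blocks; that gap is what lets the location map shrink as containers hold more blocks.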
  • 17. Container Structure (Using LevelDB/RocksDB)
      [Diagram: a Container Index (LSM, LevelDB/RocksDB) mapping Key 1 … Key N to (Chunk Data File Name, Offset, Length), pointing into several chunk data files]
      • An embeddable key-value store
      • BlockId is the key and the filename of the local chunk file is the value
      • Optimizations
          – Small blocks (< 1MB) can be stored directly in RocksDB
          – Compaction for block data to avoid lots of files
              • But this can be evolved over time
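A rough sketch of this container layout follows, with a Python dictionary standing in for the LevelDB/RocksDB index and a single append-only chunk file. The class and file names are hypothetical, and storing small blocks inline in the index mirrors the slide's "< 1MB" optimization; nothing here reflects the actual on-disk format.

```python
import os
import tempfile

SMALL_BLOCK_LIMIT = 1024 * 1024  # the slide's "< 1MB" threshold for inline storage


class Container:
    def __init__(self, directory):
        self.index = {}  # stand-in for LevelDB/RocksDB: block_id -> index entry
        self.chunk_path = os.path.join(directory, "chunk-0000.data")  # hypothetical name
        open(self.chunk_path, "ab").close()  # create the chunk file if missing

    def put(self, block_id, data):
        if len(data) < SMALL_BLOCK_LIMIT:
            self.index[block_id] = ("inline", data)  # small block lives in the index
            return
        offset = os.path.getsize(self.chunk_path)    # new data goes at the end
        with open(self.chunk_path, "ab") as f:
            f.write(data)
        # Index entry: (chunk data file name, offset, length), as in the diagram.
        self.index[block_id] = ("chunk", self.chunk_path, offset, len(data))

    def get(self, block_id):
        entry = self.index[block_id]
        if entry[0] == "inline":
            return entry[1]
        _, path, offset, length = entry
        with open(path, "rb") as f:
            f.seek(offset)
            return f.read(length)


def demo():
    with tempfile.TemporaryDirectory() as d:
        c = Container(d)
        big_a = b"a" * SMALL_BLOCK_LIMIT
        big_b = b"b" * (SMALL_BLOCK_LIMIT + 5)
        c.put("blk_1", b"tiny")  # stored inline
        c.put("blk_2", big_a)    # stored in the chunk file at offset 0
        c.put("blk_3", big_b)    # stored right after blk_2
        return c.get("blk_1"), c.index["blk_3"][2], c.get("blk_3") == big_b


result = demo()
```

Packing many blocks into one chunk file is what keeps the number of local files low; a real implementation would also roll over to new chunk files and compact them, which this sketch omits.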
  • 18. Container Structure Can Support Random Writes
      • 4KB chunks can be atomically updated in the K-V store
      • Chunk data can be added at the end of the chunk file (log-structured FSs)
      [Diagram: Container Index (LSM, e.g. LevelDB/RocksDB) mapping Key 1 … Key N to (Chunk Data File Name, Offset, Length), pointing into several chunk data files]
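One way to read this slide is that a random write never overwrites data in place: the new 4KB chunk is appended to the log, and a single index update then redirects readers to it. The sketch below assumes that reading; the `(block_id, chunk_no)` key layout and the in-memory `bytearray` log are simplifications chosen for illustration.

```python
CHUNK_SIZE = 4096  # the slide's 4KB chunk


class LogStructuredContainer:
    def __init__(self):
        self.log = bytearray()  # stand-in for an append-only chunk data file
        self.index = {}         # stand-in for the K-V store: (block_id, chunk_no) -> (offset, length)

    def write_chunk(self, block_id, chunk_no, data):
        if len(data) > CHUNK_SIZE:
            raise ValueError("chunk larger than CHUNK_SIZE")
        offset = len(self.log)
        self.log += data  # 1. append the new data at the end of the log
        # 2. a single-key index update makes the new version visible;
        #    the old bytes stay in the log until some later compaction.
        self.index[(block_id, chunk_no)] = (offset, len(data))

    def read_chunk(self, block_id, chunk_no):
        offset, length = self.index[(block_id, chunk_no)]
        return bytes(self.log[offset:offset + length])


c = LogStructuredContainer()
c.write_chunk("blk_1", 0, b"A" * CHUNK_SIZE)
c.write_chunk("blk_1", 1, b"B" * CHUNK_SIZE)
c.write_chunk("blk_1", 0, b"C" * CHUNK_SIZE)  # random write: replaces chunk 0
```

After the overwrite the log holds three chunks, but the index points chunk 0 at the newest copy, so readers see either the old chunk or the new one and never a partial mix.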
  • 19. Replication: Possible Approaches
      • Data pipeline
          – Data pipeline as a form of chain replication has been successfully used for data
          – However, its correctness depended on a central coordinator
          – Needs to be extended for block metadata, but hard to get it right given no central coordinator
      • Use RAFT replication instead of data pipeline, for both data and metadata
          – Proven to be correct
          – Has been primarily used for small updates and transactions; fits well for metadata
          – Performance concerns for large streaming writes; needs prototyping
      • Hybrid: RAFT + Pipeline
          – Hybrid approach: it can be viewed as if the central coordinator is replaced by RAFT
          – Data pipeline approach for the data + the RAFT protocol -- under discussion
  • 20. Next steps
      • Remove Block management layer's locking with Namespace
          • Reduce lock contention, remove the tight coupling (immediate benefit)
          • Allows us to implement a cleanly separated Container Management layer
      • Block container (to support tens of billions of blocks)
          • 2-4 GB block containers initially => reduction of 40-80x in block reports (BR) and block map
          • Reduce BR pressure on NN
          • Early release:
              – Single Replica Containers for a Cloud Storage Caching FS (similar to HDFS-9806)
      • Partial Namespace (to billions of files per volume)
          • Will take us to 2B files initially and then more as we gain experience on file-working-set management
      • Volumes + N+K failover
          • Scale both ops and namespace + operational improvement for HA
      • Other containers
          • Local replica & cloud storage (e.g. S3) replica (Caching Mount)
          • Object store, HBase …
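The "40-80" reduction quoted for 2-4 GB containers can be sanity-checked with simple arithmetic: if the NameNode tracks containers instead of blocks, the number of tracked entries shrinks by roughly container size divided by average block size. The slides do not state the average block size; the 50 MB figure below is an assumption chosen because it reproduces the quoted range.

```python
GB = 1024 ** 3
MB = 1024 ** 2

AVG_BLOCK_BYTES = 50 * MB  # assumption, not from the slides


def reduction_factor(container_bytes, avg_block_bytes):
    """Entries tracked per block divided by entries tracked per container."""
    return container_bytes / avg_block_bytes


for size_gb in (2, 4):
    factor = reduction_factor(size_gb * GB, AVG_BLOCK_BYTES)
    print(f"{size_gb} GB containers: about {round(factor)}x fewer entries")
```

With this assumed block size the factors come out at about 41 and 82, in line with the slide's range; a cluster with larger average blocks would see a smaller reduction.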
  • 21. Summary
      • HDFS scale proven in real production systems
          • 4K+ node clusters
          • Raw storage >200PB in a single federated NN cluster and >30PB in non-federated clusters
          • But a very large number of small files is a challenge
      • Important area of current focus: scaling # of files and blocks
          • Partial Namespace: initially scale to 2B files, later 5-10B files per volume + multiple volumes
          • Block containers: initially scale to 6B-12B blocks, later to 100B+ blocks
              – However we are implementing this to extend the storage layer
      • Restructuring storage layer to support generalized storage containers
          • Support storage needs beyond HDFS: Object Store, better HBase support, etc.
  • 22. Q&A. Thank You