SlideShare a Scribd company logo
1 of 28
1© Cloudera, Inc. All rights reserved.
Multi-Tenant Operations with
Cloudera Enterprise
A look inside British Telecommunications
Phill Radley | Chief Data Architect | BT
Matt Schumpert | Director Product Management | Cloudera
2© Cloudera, Inc. All rights reserved.
What is Multi-Tenant Hadoop
• Single General Purpose Hadoop Cluster
• Multiple distinct user groups with code & data that need to be separated
• Sharing storage (HDFS) & processing resources (cores & RAM)
• Storage allocated with HDFS Quota
• Compute managed with Fair Share Scheduler (at run time)
• Mixed work loads storage only, batch & interactive processing
• Typically On-Premise run by an in-house data centre team
3© Cloudera, Inc. All rights reserved.
Why Implement Multi-Tenant Hadoop
• A single place for all raw enterprise data kept for as long as needed
Universally popular concept in the business except for in Finance
Target data sets the business will be interested in
• Highly efficient use of Infrastructure
• Allows small tenants access to big resources
• Self-Service fast provisioning enabling fast project spin up
• New Low unit cost makes old businesses cases viable (e.g. active archives)
• Start small, with one or two small tenants, but plan for many more
• E.g. find a struggling old batch applications & re-platform as an internal IT project
• Once platform up and running go after a high profile flagship tenant
4© Cloudera, Inc. All rights reserved.
Platform as a Service – Hadoop as a Service
Target Users
• Application developers, testers & production
• Business Analysts/Data Scientists wanting access to live data
Service specification
• HaaS Version 1.0, change control & roadmap
• Features (e.g. HDFS(httpFS/NFS/API)  Map/Reduce  HUE  PIG Hive  Hbase  Search  )
Service Management
• Ordering form & process, Helpdesk
• Service Manager, Capacity Manager
5© Cloudera, Inc. All rights reserved.
Security & Governance
• Tenant data privacy
• Microsoft Active Directory integration with Kerberos
• All user groups & accounts managed in AD
• HDFS Encryption Zones
• Data governance to control data sharing
• Identified data stewards who approve creation of shared views and grants
• Security Logging & Reporting
6© Cloudera, Inc. All rights reserved.
The genesis of HaaS
Research & Innovation
Adastral Park
Business HQ
London
7© Cloudera, Inc. All rights reserved.
From Hadoop to HaaS
• Standing up a cluster is straightforward
• Buy Hadoop optimized servers (lots of local disk)
The unit cost is a fraction of a typical private cloud
• Install Linux (integrate with Active Directory/Kerberos)
• Use Cloudera Manager to create cluster
• Decide what services to offer based on the pipeline of tenant
workloads.
• Feb 2014 HaaS R1: was a “minimum viable product”
• Storage + Batch Compute (M/R) + UI (Hue) + Kerberos
• Oct 2015 HaaS R2: Added interactive SQL use
• Impala + Sqoop + Sentry
• Aug 2016 HaaS R3: In Memory
• Spark + Second site + Search…
8© Cloudera, Inc. All rights reserved.
HAAS A AP 00307_12126
Microsoft Active Directory Groups
What is a HaaS Tenant?
• A tenant is synonymous with a HaaS Service instance
1. An identifying Group in Active Directory
2. A set of Hadoop resources owned by the Group
• HDFS Quota
• YARN Resource Pool
• Hive database
• ( + other options e.g. Flume port/agent, + data wrangling tool)
• All services are accessed through common access points
Service ID: HAAS A AP 00307_12126
  
  
DFLT QUOTA
500GB
 
Pig Hql java
Hive Database
HAASA AP 00307_12126
Table 1 View 1
Q
Table 2 View 2
YARN Resource Pool
HAASA AP 00307_12126
HDFS Storage
/user/HAASA AP 00307_12126
HaaS Service Instance Admin
(e.g. developer, data scientist)
Hadoop
Platform Admin
service request
Provisioning script
“Welcome to HaaS”
CLUSTER SERVICE
TYPE
SERVICE
NO.
BUS. APP.
ID
9© Cloudera, Inc. All rights reserved.
HaaS Tenant Reporting
BT has developed a range of supporting tools & training materials to help
on-board tenants and monitor the service
For example the provisioning script and weekly HDFS capacity reports:
One Project: NAD
multiple services
Service 123=prod
Service 153=test
P for
Production
T for Test
D for Dev
10© Cloudera, Inc. All rights reserved.
e.g. HAASAAP0067_05038: CMF Customer Master File
1 Pre-Load
CSS
COSMOSS
DISE
BTC
C2B
Antillia
Glossi
Cyclone
Phoenix
Radianz
Siebel OV
Siebel OS
“Customer Master file (CMF) ”
• A 10 year old batch app needing to re-platform (2014)
• Data from 12 Source systems merged with D&B Legal
Entities used as Reference Data
• Existing SQL modules ported to HQL+PIG
Benefits
• Business able to do multiple runs in a day (15x faster)
• Adding new sources is quicker (schema on read)
• Data available for Self-Service Teams (DQ/Data Science)













HAASAAP0067_05038
OLD CMF
DBStaging
Source Systems
2 Load
3 Match / De-Dupe
4 Key Gen
5 Business Rule
6 Publish
7 Post Load
CMF
Reference Data
11© Cloudera, Inc. All rights reserved.
HAASA AP 00101_2029
Faults
4369
Orders
3531
CRM
2029
 Three existing business applications (CRM, Orders, Faults) extended into HaaS 
RDBMS
Customer
Table
RDBMS
Orders
Table
RDBMS
Faults
Table
T_CustomerHive DB
HAASA
AP 00101_2029
sqoop
V_Customer
HAASA AP 00202_3531
T_OrdersHive DB
HAASA
AP 0202_3531
sqoop
V_Orders
HAASA AP 00303_4369
T_FaultsHive DB
HAASA
AP 0303_4369
sqoop
V_Faults
Business
Data
Stewards
Business Analysts / Data Scientists

CRM

Orders

Faults
Target for Self-Service Data Access using HaaS
1. Browse & select data
2. Get Steward Approval
3. Create VIEWs & GRANTs
4. Select/join Views
Data
Catalogue
• Self-service workflow-driven access to any table on any
system (contrast with design/develop legacy warehouse
approach)
• Option to add homomorphic encryption to any table to
anonymize PII data to further reduce risk
12© Cloudera, Inc. All rights reserved.
Cloudera Manager 5.7
Easier Multi-Tenant Operations
13© Cloudera, Inc. All rights reserved.
Major Enablers of Multi-Tenancy in Cloudera Manager
• Dynamic Resource Pools
• Cluster Utilization Reporting
• HDFS Usage Reports
14© Cloudera, Inc. All rights reserved.
Dynamic Resource Pools Define Tenants!
• Hierarchical buckets that
• Express prioritization
• Protect fixed capacity
• Create sensible guardrails
15© Cloudera, Inc. All rights reserved.
Dynamic Resource Pools Define Tenants!
• Hierarchical buckets that
• Express prioritization
• Protect fixed capacity
• Create sensible guardrails
• Make an admins’ life easy with
• User/group-based creation
• ACLs
• Automatic preemption
• Rotating service windows
16© Cloudera, Inc. All rights reserved.
Dynamic Resource Pools Configuration
17© Cloudera, Inc. All rights reserved.
Roadmap: Dynamic Resource Pools
• Automatic user/group-based job placement under a tenant’s pool
18© Cloudera, Inc. All rights reserved.
Cluster Utilization Reporting
BI Marketing Engineering
19© Cloudera, Inc. All rights reserved.
Cluster Utilization Reporting
Usage Data
Resource Allocations
BI Marketing Engineering
20© Cloudera, Inc. All rights reserved.
Cluster Utilization Reporting
Usage Data
Resource Allocations
Report
BI Marketing Engineering
• Configurable Time Window
• Tenant Aggregation View
• User Aggregation View
21© Cloudera, Inc. All rights reserved.
Cluster Utilization Reporting
Usage Data
Resource Allocations
Report
BI Marketing Engineering
• “How much CPU & memory did each tenant use?”
• “I set up fair scheduler. Did each of my tenants get their fair share?”
• “Which tenants had to wait the longest for their applications to get resources?
• “Which tenants asked for the most memory but used the least?”
• “When do I need to add nodes to my cluster?”
• Configurable Time Window
• Tenant Aggregation View
• User Aggregation View
22© Cloudera, Inc. All rights reserved.
Cluster Utilization Reporting
23© Cloudera, Inc. All rights reserved.
Cluster Utilization Reporting
24© Cloudera, Inc. All rights reserved.
Cluster Utilization Reporting
25© Cloudera, Inc. All rights reserved.
Roadmap: Cluster Utilization Reporting
• Container Allocation Latency
• A definitive wait metric for each bit of YARN workload
• Support for more components
• HDFS, HBase, Search, etc
• Support additional metrics
• Disk I/O, Network I/O
• Add additional tools to existing metrics:
• Showback/chargeback: associate $$ with resource usage
• Capacity planning: trend lines
• DBA tools: identify/flag rogue queries (Hive, Impala, HBase)
• Workload management: tag critical apps with SLAs
26© Cloudera, Inc. All rights reserved.
HDFS Usage Reports
• Recently revamped based on known HaaS implementations
• Drill-down by user/tenant to do housecleaning
27© Cloudera, Inc. All rights reserved.
More Information & Next Steps
Get Started
• Download C5.7:
www.cloudera.com/downloads
Release Notes
• www.cloudera.com/documentation/
enterprise/latest/topics/rg_release_
notes.html
Training Classes
• university.cloudera.com
Check out Cloudera Manager Demo
Videos at go.cloudera.com/hadoop-
demo-cm1
28© Cloudera, Inc. All rights reserved.
Questions?

More Related Content

What's hot

Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
 Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac... Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...Cloudera, Inc.
 
大数据数据治理及数据安全
大数据数据治理及数据安全大数据数据治理及数据安全
大数据数据治理及数据安全Jianwei Li
 
Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?Cloudera, Inc.
 
Spark One Platform Webinar
Spark One Platform WebinarSpark One Platform Webinar
Spark One Platform WebinarCloudera, Inc.
 
Security implementation on hadoop
Security implementation on hadoopSecurity implementation on hadoop
Security implementation on hadoopWei-Chiu Chuang
 
One Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data MeetupOne Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data MeetupAndrei Savu
 
Data Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the EnterpriseData Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the EnterpriseCloudera, Inc.
 
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the CloudPart 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the CloudCloudera, Inc.
 
sql on hadoop
sql on hadoop sql on hadoop
sql on hadoop Jianwei Li
 
大数据数据安全
大数据数据安全大数据数据安全
大数据数据安全Jianwei Li
 
Intel and Cloudera: Accelerating Enterprise Big Data Success
Intel and Cloudera: Accelerating Enterprise Big Data SuccessIntel and Cloudera: Accelerating Enterprise Big Data Success
Intel and Cloudera: Accelerating Enterprise Big Data SuccessCloudera, Inc.
 
Five Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWSFive Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWSCloudera, Inc.
 
What the Enterprise Requires - Business Continuity and Visibility
What the Enterprise Requires - Business Continuity and VisibilityWhat the Enterprise Requires - Business Continuity and Visibility
What the Enterprise Requires - Business Continuity and VisibilityCloudera, Inc.
 
Part 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache KuduPart 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache KuduCloudera, Inc.
 
Extreme Sports & Beyond: Exploring a new frontier in data with GoPro
Extreme Sports & Beyond: Exploring a new frontier in data with GoProExtreme Sports & Beyond: Exploring a new frontier in data with GoPro
Extreme Sports & Beyond: Exploring a new frontier in data with GoProCloudera, Inc.
 
快速数据快速分析引擎-Kudu
快速数据快速分析引擎-Kudu快速数据快速分析引擎-Kudu
快速数据快速分析引擎-KuduJianwei Li
 
Road to Cloudera certification
Road to Cloudera certificationRoad to Cloudera certification
Road to Cloudera certificationCloudera, Inc.
 
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18Cloudera, Inc.
 

What's hot (20)

Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
 Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac... Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
 
大数据数据治理及数据安全
大数据数据治理及数据安全大数据数据治理及数据安全
大数据数据治理及数据安全
 
Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?
 
Spark One Platform Webinar
Spark One Platform WebinarSpark One Platform Webinar
Spark One Platform Webinar
 
Big Data Fundamentals
Big Data FundamentalsBig Data Fundamentals
Big Data Fundamentals
 
Security implementation on hadoop
Security implementation on hadoopSecurity implementation on hadoop
Security implementation on hadoop
 
One Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data MeetupOne Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data Meetup
 
Data Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the EnterpriseData Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the Enterprise
 
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the CloudPart 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
 
sql on hadoop
sql on hadoop sql on hadoop
sql on hadoop
 
大数据数据安全
大数据数据安全大数据数据安全
大数据数据安全
 
Intel and Cloudera: Accelerating Enterprise Big Data Success
Intel and Cloudera: Accelerating Enterprise Big Data SuccessIntel and Cloudera: Accelerating Enterprise Big Data Success
Intel and Cloudera: Accelerating Enterprise Big Data Success
 
Five Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWSFive Tips for Running Cloudera on AWS
Five Tips for Running Cloudera on AWS
 
What the Enterprise Requires - Business Continuity and Visibility
What the Enterprise Requires - Business Continuity and VisibilityWhat the Enterprise Requires - Business Continuity and Visibility
What the Enterprise Requires - Business Continuity and Visibility
 
Part 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache KuduPart 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache Kudu
 
Extreme Sports & Beyond: Exploring a new frontier in data with GoPro
Extreme Sports & Beyond: Exploring a new frontier in data with GoProExtreme Sports & Beyond: Exploring a new frontier in data with GoPro
Extreme Sports & Beyond: Exploring a new frontier in data with GoPro
 
快速数据快速分析引擎-Kudu
快速数据快速分析引擎-Kudu快速数据快速分析引擎-Kudu
快速数据快速分析引擎-Kudu
 
Road to Cloudera certification
Road to Cloudera certificationRoad to Cloudera certification
Road to Cloudera certification
 
Cloudbreak - Technical Deep Dive
Cloudbreak - Technical Deep DiveCloudbreak - Technical Deep Dive
Cloudbreak - Technical Deep Dive
 
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
 

Viewers also liked

The Stream is the Database - Revolutionizing Healthcare Data Architecture
The Stream is the Database - Revolutionizing Healthcare Data ArchitectureThe Stream is the Database - Revolutionizing Healthcare Data Architecture
The Stream is the Database - Revolutionizing Healthcare Data ArchitectureDataWorks Summit/Hadoop Summit
 
Multi-tenant, Multi-cluster and Multi-container Apache HBase Deployments
Multi-tenant, Multi-cluster and Multi-container Apache HBase DeploymentsMulti-tenant, Multi-cluster and Multi-container Apache HBase Deployments
Multi-tenant, Multi-cluster and Multi-container Apache HBase DeploymentsDataWorks Summit
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lakeJames Serra
 
Hadoop meets Cloud with Multi-Tenancy
Hadoop meets Cloud with Multi-TenancyHadoop meets Cloud with Multi-Tenancy
Hadoop meets Cloud with Multi-TenancyTreasure Data, Inc.
 
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...StampedeCon
 
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...Hortonworks
 
Rigorous and Multi-tenant HBase Performance Measurement
Rigorous and Multi-tenant HBase Performance MeasurementRigorous and Multi-tenant HBase Performance Measurement
Rigorous and Multi-tenant HBase Performance MeasurementDataWorks Summit
 
Yahoo! - Arun Murthy - Hadoop World 2010
Yahoo! - Arun Murthy - Hadoop World 2010Yahoo! - Arun Murthy - Hadoop World 2010
Yahoo! - Arun Murthy - Hadoop World 2010Cloudera, Inc.
 
Cloudera Enterprise_Data Hub in Telecom
Cloudera Enterprise_Data Hub in TelecomCloudera Enterprise_Data Hub in Telecom
Cloudera Enterprise_Data Hub in TelecomEinsny Phionesgo
 
6.ปกหลัง
6.ปกหลัง6.ปกหลัง
6.ปกหลังkai kk
 
Apache hadoop yarn 勉強会 8. capacity scheduler in yarn
Apache hadoop yarn 勉強会 8. capacity scheduler in yarnApache hadoop yarn 勉強会 8. capacity scheduler in yarn
Apache hadoop yarn 勉強会 8. capacity scheduler in yarnShuya Tsukamoto
 
Scale 12 x Efficient Multi-tenant Hadoop 2 Workloads with Yarn
Scale 12 x   Efficient Multi-tenant Hadoop 2 Workloads with YarnScale 12 x   Efficient Multi-tenant Hadoop 2 Workloads with Yarn
Scale 12 x Efficient Multi-tenant Hadoop 2 Workloads with YarnDavid Kaiser
 
Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Cloudera, Inc.
 
STAC Summit 2014 - Building a multitenant Big Data infrastructure
STAC Summit 2014 - Building a multitenant Big Data infrastructureSTAC Summit 2014 - Building a multitenant Big Data infrastructure
STAC Summit 2014 - Building a multitenant Big Data infrastructureGord Sissons
 
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)Cloudera, Inc.
 
Hadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural PatternsHadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural PatternsDataWorks Summit
 

Viewers also liked (20)

Managing a Multi-Tenant Data Lake
Managing a Multi-Tenant Data LakeManaging a Multi-Tenant Data Lake
Managing a Multi-Tenant Data Lake
 
The Stream is the Database - Revolutionizing Healthcare Data Architecture
The Stream is the Database - Revolutionizing Healthcare Data ArchitectureThe Stream is the Database - Revolutionizing Healthcare Data Architecture
The Stream is the Database - Revolutionizing Healthcare Data Architecture
 
Effective Spark on Multi-Tenant Clusters
Effective Spark on Multi-Tenant ClustersEffective Spark on Multi-Tenant Clusters
Effective Spark on Multi-Tenant Clusters
 
Multi-tenant, Multi-cluster and Multi-container Apache HBase Deployments
Multi-tenant, Multi-cluster and Multi-container Apache HBase DeploymentsMulti-tenant, Multi-cluster and Multi-container Apache HBase Deployments
Multi-tenant, Multi-cluster and Multi-container Apache HBase Deployments
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
Hadoop meets Cloud with Multi-Tenancy
Hadoop meets Cloud with Multi-TenancyHadoop meets Cloud with Multi-Tenancy
Hadoop meets Cloud with Multi-Tenancy
 
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
 
How to build a successful Data Lake
How to build a successful Data LakeHow to build a successful Data Lake
How to build a successful Data Lake
 
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
 
Rigorous and Multi-tenant HBase Performance Measurement
Rigorous and Multi-tenant HBase Performance MeasurementRigorous and Multi-tenant HBase Performance Measurement
Rigorous and Multi-tenant HBase Performance Measurement
 
Yahoo! - Arun Murthy - Hadoop World 2010
Yahoo! - Arun Murthy - Hadoop World 2010Yahoo! - Arun Murthy - Hadoop World 2010
Yahoo! - Arun Murthy - Hadoop World 2010
 
Cloudera Enterprise_Data Hub in Telecom
Cloudera Enterprise_Data Hub in TelecomCloudera Enterprise_Data Hub in Telecom
Cloudera Enterprise_Data Hub in Telecom
 
6.ปกหลัง
6.ปกหลัง6.ปกหลัง
6.ปกหลัง
 
Apache hadoop yarn 勉強会 8. capacity scheduler in yarn
Apache hadoop yarn 勉強会 8. capacity scheduler in yarnApache hadoop yarn 勉強会 8. capacity scheduler in yarn
Apache hadoop yarn 勉強会 8. capacity scheduler in yarn
 
Scale 12 x Efficient Multi-tenant Hadoop 2 Workloads with Yarn
Scale 12 x   Efficient Multi-tenant Hadoop 2 Workloads with YarnScale 12 x   Efficient Multi-tenant Hadoop 2 Workloads with Yarn
Scale 12 x Efficient Multi-tenant Hadoop 2 Workloads with Yarn
 
Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive


 
STAC Summit 2014 - Building a multitenant Big Data infrastructure
STAC Summit 2014 - Building a multitenant Big Data infrastructureSTAC Summit 2014 - Building a multitenant Big Data infrastructure
STAC Summit 2014 - Building a multitenant Big Data infrastructure
 
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
 
Hadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural PatternsHadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural Patterns
 
Architecting a multi-tenanted platform
Architecting a multi-tenanted platform Architecting a multi-tenanted platform
Architecting a multi-tenanted platform
 

Similar to Multi-Tenant Operations with Cloudera 5.7 & BT

Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the CloudCloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the CloudCloudera, Inc.
 
Hadoop Essentials -- The What, Why and How to Meet Agency Objectives
Hadoop Essentials -- The What, Why and How to Meet Agency ObjectivesHadoop Essentials -- The What, Why and How to Meet Agency Objectives
Hadoop Essentials -- The What, Why and How to Meet Agency ObjectivesCloudera, Inc.
 
Vmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps IronfanVmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps IronfanJim Kaskade
 
Cloudera GoDataFest Deploying Cloudera in the Cloud
Cloudera GoDataFest Deploying Cloudera in the CloudCloudera GoDataFest Deploying Cloudera in the Cloud
Cloudera GoDataFest Deploying Cloudera in the CloudGoDataDriven
 
Hadoop security @ Philly Hadoop Meetup May 2015
Hadoop security @ Philly Hadoop Meetup May 2015Hadoop security @ Philly Hadoop Meetup May 2015
Hadoop security @ Philly Hadoop Meetup May 2015Shravan (Sean) Pabba
 
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ... Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...Cloudera, Inc.
 
Big Data with KNIME is as easy as 1, 2, 3, ...4!
Big Data with KNIME is as easy as 1, 2, 3, ...4!Big Data with KNIME is as easy as 1, 2, 3, ...4!
Big Data with KNIME is as easy as 1, 2, 3, ...4!KNIMESlides
 
Big Data as easy as 1, 2, 3, ... 4 ... with KNIME
Big Data as easy as 1, 2, 3, ... 4 ... with KNIMEBig Data as easy as 1, 2, 3, ... 4 ... with KNIME
Big Data as easy as 1, 2, 3, ... 4 ... with KNIMERosaria Silipo
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Vantara
 
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Cloudera, Inc.
 
Hadoop and Hive in Enterprises
Hadoop and Hive in EnterprisesHadoop and Hive in Enterprises
Hadoop and Hive in Enterprisesmarkgrover
 
Turning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformTurning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformCloudera, Inc.
 
Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18Cloudera, Inc.
 
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?Christopher Foot
 
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017Stefan Lipp
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Cloudera, Inc.
 
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse Optimisation
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse OptimisationBigDataBx #1 - Atelier 1 Cloudera Datawarehouse Optimisation
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse OptimisationExcelerate Systems
 
How to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data PlatformHow to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data PlatformCloudera, Inc.
 
Cloud's Hidden Impact on IT Shops
Cloud's Hidden Impact on IT ShopsCloud's Hidden Impact on IT Shops
Cloud's Hidden Impact on IT ShopsChristopher Foot
 

Similar to Multi-Tenant Operations with Cloudera 5.7 & BT (20)

Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the CloudCloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
 
Hadoop Essentials -- The What, Why and How to Meet Agency Objectives
Hadoop Essentials -- The What, Why and How to Meet Agency ObjectivesHadoop Essentials -- The What, Why and How to Meet Agency Objectives
Hadoop Essentials -- The What, Why and How to Meet Agency Objectives
 
Vmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps IronfanVmware Serengeti - Based on Infochimps Ironfan
Vmware Serengeti - Based on Infochimps Ironfan
 
Cloudera GoDataFest Deploying Cloudera in the Cloud
Cloudera GoDataFest Deploying Cloudera in the CloudCloudera GoDataFest Deploying Cloudera in the Cloud
Cloudera GoDataFest Deploying Cloudera in the Cloud
 
Hadoop security @ Philly Hadoop Meetup May 2015
Hadoop security @ Philly Hadoop Meetup May 2015Hadoop security @ Philly Hadoop Meetup May 2015
Hadoop security @ Philly Hadoop Meetup May 2015
 
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ... Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 
Big Data with KNIME is as easy as 1, 2, 3, ...4!
Big Data with KNIME is as easy as 1, 2, 3, ...4!Big Data with KNIME is as easy as 1, 2, 3, ...4!
Big Data with KNIME is as easy as 1, 2, 3, ...4!
 
Big Data as easy as 1, 2, 3, ... 4 ... with KNIME
Big Data as easy as 1, 2, 3, ... 4 ... with KNIMEBig Data as easy as 1, 2, 3, ... 4 ... with KNIME
Big Data as easy as 1, 2, 3, ... 4 ... with KNIME
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop Solution
 
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
 
Hadoop and Hive in Enterprises
Hadoop and Hive in EnterprisesHadoop and Hive in Enterprises
Hadoop and Hive in Enterprises
 
Turning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformTurning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data Platform
 
Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18
 
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?
 
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
Cloudera Big Data Integration Speedpitch at TDWI Munich June 2017
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse Optimisation
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse OptimisationBigDataBx #1 - Atelier 1 Cloudera Datawarehouse Optimisation
BigDataBx #1 - Atelier 1 Cloudera Datawarehouse Optimisation
 
How to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data PlatformHow to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
 
Cloud's Hidden Impact on IT Shops
Cloud's Hidden Impact on IT ShopsCloud's Hidden Impact on IT Shops
Cloud's Hidden Impact on IT Shops
 

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

More from Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 

Recently uploaded

Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)jennyeacort
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commercemanigoyal112
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...Technogeeks
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Mater
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Rob Geurden
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfDrew Moseley
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 

Recently uploaded (20)

Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commerce
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdf
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 

Multi-Tenant Operations with Cloudera 5.7 & BT

  • 1. 1© Cloudera, Inc. All rights reserved. Multi-Tenant Operations with Cloudera Enterprise A look inside British Telecommunications Phill Radley | Chief Data Architect | BT Matt Schumpert | Director Product Management | Cloudera
  • 2. 2© Cloudera, Inc. All rights reserved. What is Multi-Tenant Hadoop • Single General Purpose Hadoop Cluster • Multiple distinct user groups with code & data that need to be separated • Sharing storage (HDFS) & processing resources (cores & RAM) • Storage allocated with HDFS Quota • Compute managed with Fair Share Scheduler (at run time) • Mixed work loads storage only, batch & interactive processing • Typically On-Premise run by an in-house data centre team
  • 3. 3© Cloudera, Inc. All rights reserved. Why Implement Multi-Tenant Hadoop • A single place for all raw enterprise data kept for as long as needed Universally popular concept in the business except for in Finance Target data sets the business will be interested in • Highly efficient use of Infrastructure • Allows small tenants access to big resources • Self-Service fast provisioning enabling fast project spin up • New Low unit cost makes old businesses cases viable (e.g. active archives) • Start small, with one or two small tenants, but plan for many more • E.g. find a struggling old batch applications & re-platform as an internal IT project • Once platform up and running go after a high profile flagship tenant
  • 4. 4© Cloudera, Inc. All rights reserved. Platform as a Service – Hadoop as a Service Target Users • Application developers, testers & production • Business Analysts/Data Scientists wanting access to live data Service specification • HaaS Version 1.0, change control & roadmap • Features (e.g. HDFS(httpFS/NFS/API)  Map/Reduce  HUE  PIG Hive  Hbase  Search  ) Service Management • Ordering form & process, Helpdesk • Service Manager, Capacity Manager
  • 5. 5© Cloudera, Inc. All rights reserved. Security & Governance • Tenant data privacy • Microsoft Active Directory integration with Kerberos • All user groups & accounts managed in AD • HDFS Encryption Zones • Data governance to control data sharing • Identified data stewards who approve creation of shared views and grants • Security Logging & Reporting
  • 6. 6© Cloudera, Inc. All rights reserved. The genesis of HaaS Research & Innovation Adastral Park Business HQ London
  • 7. 7© Cloudera, Inc. All rights reserved. From Hadoop to HaaS • Standing up a cluster is straightforward • Buy Hadoop optimized servers (lots of local disk) The unit cost is a fraction of a typical private cloud • Install Linux (integrate with Active Directory/Kerberos) • Use Cloudera Manager to create cluster • Decide what services to offer based on the pipeline of tenant workloads. • Feb 2014 HaaS R1: was a “minimum viable product” • Storage + Batch Compute (M/R) + UI (Hue) + Kerberos • Oct 2015 HaaS R2: Added interactive SQL use • Impala + Sqoop + Sentry • Aug 2016 HaaS R3: In Memory • Spark + Second site + Search…
  • 8. 8© Cloudera, Inc. All rights reserved. HAAS A AP 00307_12126 Microsoft Active Directory Groups What is a HaaS Tenant? • A tenant is synonymous with a HaaS Service instance 1. An identifying Group in Active Directory 2. A set of Hadoop resources owned by the Group • HDFS Quota • YARN Resource Pool • Hive database • ( + other options e.g. Flume port/agent, + data wrangling tool) • All services are accessed through common access points Service ID: HAAS A AP 00307_12126       DFLT QUOTA 500GB   Pig Hql java Hive Database HAASA AP 00307_12126 Table 1 View 1 Q Table 2 View 2 YARN Resource Pool HAASA AP 00307_12126 HDFS Storage /user/HAASA AP 00307_12126 HaaS Service Instance Admin (e.g. developer, data scientist) Hadoop Platform Admin service request Provisioning script “Welcome to HaaS” CLUSTER SERVICE TYPE SERVICE NO. BUS. APP. ID
  • 9. 9© Cloudera, Inc. All rights reserved. HaaS Tenant Reporting BT has developed a range of supporting tools & training materials to help on-board tenants and monitor the service For example the provisioning script and weekly HDFS capacity reports: One Project: NAD multiple services Service 123=prod Service 153=test P for Production T for Test D for Dev
  • 10. 10© Cloudera, Inc. All rights reserved. e.g. HAASAAP0067_05038: CMF Customer Master File 1 Pre-Load CSS COSMOSS DISE BTC C2B Antillia Glossi Cyclone Phoenix Radianz Siebel OV Siebel OS “Customer Master file (CMF) ” • A 10 year old batch app needing to re-platform (2014) • Data from 12 Source systems merged with D&B Legal Entities used as Reference Data • Existing SQL modules ported to HQL+PIG Benefits • Business able to do multiple runs in a day (15x faster) • Adding new sources is quicker (schema on read) • Data available for Self-Service Teams (DQ/Data Science)              HAASAAP0067_05038 OLD CMF DBStaging Source Systems 2 Load 3 Match / De-Dupe 4 Key Gen 5 Business Rule 6 Publish 7 Post Load CMF Reference Data
  • 11. 11© Cloudera, Inc. All rights reserved. HAASA AP 00101_2029 Faults 4369 Orders 3531 CRM 2029  Three existing business applications (CRM, Orders, Faults) extended into HaaS  RDBMS Customer Table RDBMS Orders Table RDBMS Faults Table T_CustomerHive DB HAASA AP 00101_2029 sqoop V_Customer HAASA AP 00202_3531 T_OrdersHive DB HAASA AP 0202_3531 sqoop V_Orders HAASA AP 00303_4369 T_FaultsHive DB HAASA AP 0303_4369 sqoop V_Faults Business Data Stewards Business Analysts / Data Scientists  CRM  Orders  Faults Target for Self-Service Data Access using HaaS 1. Browse & select data 2. Get Steward Approval 3. Create VIEWs & GRANTs 4. Select/join Views Data Catalogue • Self-service workflow-driven access to any table on any system (contrast with design/develop legacy warehouse approach) • Option to add homomorphic encryption to any table to anonymize PII data to further reduce risk
  • 12. 12© Cloudera, Inc. All rights reserved. Cloudera Manager 5.7 Easier Multi-Tenant Operations
  • 13. 13© Cloudera, Inc. All rights reserved. Major Enablers of Multi-Tenancy in Cloudera Manager • Dynamic Resource Pools • Cluster Utilization Reporting • HDFS Usage Reports
  • 14. 14© Cloudera, Inc. All rights reserved. Dynamic Resource Pools Define Tenants! • Hierarchical buckets that • Express prioritization • Protect fixed capacity • Create sensible guardrails
  • 15. 15© Cloudera, Inc. All rights reserved. Dynamic Resource Pools Define Tenants! • Hierarchical buckets that • Express prioritization • Protect fixed capacity • Create sensible guardrails • Make an admins’ life easy with • User/group-based creation • ACLs • Automatic preemption • Rotating service windows
  • 16. 16© Cloudera, Inc. All rights reserved. Dynamic Resource Pools Configuration
  • 17. 17© Cloudera, Inc. All rights reserved. Roadmap: Dynamic Resource Pools • Automatic user/group-based job placement under a tenant’s pool
  • 18. 18© Cloudera, Inc. All rights reserved. Cluster Utilization Reporting BI Marketing Engineering
  • 19. 19© Cloudera, Inc. All rights reserved. Cluster Utilization Reporting Usage Data Resource Allocations BI Marketing Engineering
  • 20. 20© Cloudera, Inc. All rights reserved. Cluster Utilization Reporting Usage Data Resource Allocations Report BI Marketing Engineering • Configurable Time Window • Tenant Aggregation View • User Aggregation View
  • 21. 21© Cloudera, Inc. All rights reserved. Cluster Utilization Reporting Usage Data Resource Allocations Report BI Marketing Engineering • “How much CPU & memory did each tenant use?” • “I set up fair scheduler. Did each of my tenants get their fair share?” • “Which tenants had to wait the longest for their applications to get resources? • “Which tenants asked for the most memory but used the least?” • “When do I need to add nodes to my cluster?” • Configurable Time Window • Tenant Aggregation View • User Aggregation View
  • 22. 22© Cloudera, Inc. All rights reserved. Cluster Utilization Reporting
  • 23. 23© Cloudera, Inc. All rights reserved. Cluster Utilization Reporting
  • 24. 24© Cloudera, Inc. All rights reserved. Cluster Utilization Reporting
  • 25. 25© Cloudera, Inc. All rights reserved. Roadmap: Cluster Utilization Reporting • Container Allocation Latency • A definitive wait metric for each bit of YARN workload • Support for more components • HDFS, HBase, Search, etc • Support additional metrics • Disk I/O, Network I/O • Add additional tools to existing metrics: • Showback/chargeback: associate $$ with resource usage • Capacity planning: trend lines • DBA tools: identify/flag rogue queries (Hive, Impala, HBase) • Workload management: tag critical apps with SLAs
  • 26. 26© Cloudera, Inc. All rights reserved. HDFS Usage Reports • Recently revamped based on known HaaS implementations • Drill-down by user/tenant to do housecleaning
  • 27. 27© Cloudera, Inc. All rights reserved. More Information & Next Steps Get Started • Download C5.7: www.cloudera.com/downloads Release Notes • www.cloudera.com/documentation/ enterprise/latest/topics/rg_release_ notes.html Training Classes • university.cloudera.com Check out Cloudera Manager Demo Videos at go.cloudera.com/hadoop- demo-cm1
  • 28. 28© Cloudera, Inc. All rights reserved. Questions?

Editor's Notes

  1. To set some context I thought I’d take a slide to give you the backstory to HaaS. As a business BT has always invested in R&D, our UK research campus Adastral Park was opened 40 years ago. Ever since we have invested in R&D, last year BT spent over £500 million. In addition to our in-house research work we have technology scouts in silicon valley and researchers at MIT. In 2010/11 our customer experience research team were working social media sentiment analysis when they came across Hadoop. They were working on small data samples on laptops in R-studio. Hadoops scale out architecture and schema on read made it easy for them to ingest millions of tweets so they built a research cluster. Pretty soon they were using Hadoop to answer different business questions like “What proportion of UK phone lines could support 50MB internet ?” “What would the fault rates be if 80% of customers had 50MB broadband ? How many additional engineers might we need” ? The business was catching onto big data spurred on by articles like the HBR Oct 2012 and the torrent of analyst waves and hype cycles. They started to rely on the research hadoop capability as they found they could get answers to big ad-hoc questions much faster from research and hadoop than they could from traditional data warehouses that weren’t setup to quickly ingest new data sets and run statistical models. Research now had a problem because they’re not set up to offer a production service with support and SLA. They came to the Chief architects Office for help in getting Hadoop out of Research and into BAU data centres ASAP. Within CAO we saw the lots of opportunities with Hadoop. The most significant being the ability to build a single enterprise data hub that we could use to deliver data democratisation, i.e. giving the data back to the business owners There were other short benefits such as the ability to re-platform old batch apps that needed to be kept running and provide low cost storage & archive. -oOo-
  2. Design Write Service description based on customer needs. MVP ! Sign Offs (Data centre Operations, Info Security) Try it out, use Cloudera Manager to setup & monitor services Reuse what the business already had Order Gateway, Active Directory Automate Provisioning Market & Communicate