1
Hadoop as a Service
Kumar Ramamurthy
Vice President and Global Practice Head | Enterprise Information Management (EIM)
2
Agenda
1 Hadoop and adoption maturity
2 Challenges faced while scaling up Hadoop
3 Characteristics of a well-run Hadoop environment
4 Hadoop on the cloud wagon
5 Market overview
6 Business benefits
3
Hadoop and adoption maturity
Organizations have bought into the benefits of Hadoop and are now looking to deploy and
maintain a “well-run Hadoop environment”
Taking the first Hadoop step
Deploying small Hadoop clusters
Hadoop on multiple business use cases
Scaling up Hadoop to enterprise-wide operations
4
Business pain points while scaling up on Hadoop
As enterprises seek effective and efficient ways to leverage Hadoop for direct and instant access to actionable business
insights, one of the questions frequently asked is “Can we run Hadoop in the cloud?”
Deploying, configuring and managing Hadoop data clusters can be difficult, expensive and time consuming
Configuration & tuning ???
Infrastructure management ???
Capital expenditure ???
Provisioning & availability ???
5
Characteristics of a well-run Hadoop environment
Satisfy the needs of data scientists and data administrators
Provide a run-it-yourself environment: full support service for job monitoring and tuning
Store data at rest in always-on HDFS
Provide elasticity and non-stop operation
Provide self-configuration options
6
Spectrum of Hadoop deployment options
On-premise full custom
Hadoop appliance
Hadoop hosting
Hadoop on the cloud
Hadoop as a service
Axis: Bare metal to Cloud
7
Getting Hadoop on the cloud-wagon
Hadoop as a Service (HaaS) is a cloud computing solution that makes medium and large-scale data
processing accessible, easy, fast and cost-effective.
Diagram labels: Data Sources, Hadoop Tools, Data Visualization and BI
8
Key drivers to consider before deploying a Hadoop cluster in the cloud
Analyze Security Criteria: Is your Hadoop cluster secure in the public cloud?
Evaluate Hadoop Distributions: Does your Hadoop distribution support the operating system standards of your enterprise?
Understand Vendor Specific Dependencies: Is your business performance impacted while adopting any vendor specific platform/tools?
Consider the entire Hadoop Ecosystem: Are your analytical tools and platforms supported by the cloud platform?
Analyze all your Data Sources: Do you load data from your internal systems or from the cloud?
9
Hadoop as a Service market landscape
HaaS market expected to reach $16.1 Bn by 2020
Key deployment methods in demand: Run it Yourself & Pure Play
North America: highest revenue-generating region throughout 2020, with a value of $11.6 Bn
Source: Allied Market Research
Global HaaS market to grow at a CAGR of 84.81% over the period 2014-2018
Global Hadoop as a Service market by end user: Manufacturing, Financial services, Retail, Telecom, Healthcare,
Media & Entertainment
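To give a feel for what the CAGR quoted on this slide implies, the sketch below computes the cumulative growth multiple from compounding 84.81% per year. It assumes four compounding periods (2014 to 2018) and a hypothetical base value of 1.0, since the slide does not state the 2014 market size; it is an illustration of the arithmetic, not a reproduction of the analysts' model.

```python
def growth_multiple(cagr: float, periods: int) -> float:
    """Cumulative growth factor after compounding `cagr` for `periods` years."""
    return (1.0 + cagr) ** periods


def implied_cagr(start_value: float, end_value: float, periods: int) -> float:
    """Inverse calculation: the CAGR that takes start_value to end_value."""
    return (end_value / start_value) ** (1.0 / periods) - 1.0


# CAGR quoted on the slide; the period count is an assumption (2014 -> 2018).
CAGR = 0.8481
PERIODS = 4

multiple = growth_multiple(CAGR, PERIODS)

# Hypothetical base of 1.0 unit in 2014; the slide gives no absolute 2014 figure.
base_2014 = 1.0
value_2018 = base_2014 * multiple

print(f"Growth multiple over {PERIODS} years: {multiple:.2f}x")
print(f"Round-trip CAGR check: {implied_cagr(base_2014, value_2018, PERIODS):.4f}")
```

Under these assumptions, a market growing at that rate would be more than eleven times its starting size after four years.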
10
Leading providers of Hadoop as a Service
Altiscale: Purpose-built, petabyte-scale infrastructure that delivers Apache Hadoop as a cloud service
Amazon EMR: Managed Hadoop framework to distribute and process vast amounts of data across dynamically scalable
Amazon EC2 (Elastic Compute Cloud) instances
Google: Google Cloud Storage connector for Hadoop to perform MapReduce jobs directly on data in Google Cloud Storage
HP Cloud: Elastic cloud computing and cloud storage platform to analyze and index large data volumes, up to hundreds
of petabytes in size
IBM BigInsights: Provides Hadoop-as-a-service on IBM’s SoftLayer global cloud infrastructure, a bare-metal design
Microsoft: Scales to petabytes on demand; processes unstructured and semi-structured data; deploys on Windows or
Linux; integrates with on-premises Hadoop clusters (if needed); and supports multiple development languages
Rackspace: Offers several options for running Apache Hadoop, including deploying Hadoop on Rackspace managed
dedicated servers and spinning up Hadoop on Rackspace’s public cloud via virtual servers or on dedicated bare-metal
cloud servers
Verizon: Its enterprise business inked a Cloudera partnership in 2013, and the company now offers Cloudera atop its
cloud infrastructure
11
Enterprises realize multifold business benefits through HaaS
With HaaS, businesses can eliminate the operational challenges of running Hadoop and focus on business growth
Paying only for the compute without a large hardware acquisition cost
Scaling up and down as business needs change through elastic Hadoop clusters
Deploying and launching Hadoop environments in minutes
Focusing on building applications and answering business questions rather than managing complex Hadoop clusters
Storing, processing and analyzing large volumes of both relational and non-relational data
Creating integrated backup and disaster recovery
Providing a distributed, fault-tolerant computing framework and resource management
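The first benefit on this slide, paying only for compute rather than acquiring hardware up front, can be made concrete with a simple break-even calculation. The sketch below is illustrative only: the class names, hourly rate, hardware cost, and usage figures are hypothetical placeholders rather than vendor prices, and the model ignores storage, staffing, and data transfer costs.

```python
from dataclasses import dataclass


@dataclass
class OnPremCluster:
    hardware_cost: float          # upfront capital expenditure (hypothetical)
    yearly_operating_cost: float  # power, space, administration (hypothetical)

    def total_cost(self, years: float) -> float:
        return self.hardware_cost + self.yearly_operating_cost * years


@dataclass
class PayPerUseCluster:
    node_hour_rate: float  # price per node per hour (hypothetical)

    def total_cost(self, nodes: int, hours_per_year: float, years: float) -> float:
        return self.node_hour_rate * nodes * hours_per_year * years


def break_even_hours_per_year(on_prem: OnPremCluster, cloud: PayPerUseCluster,
                              nodes: int, years: float) -> float:
    """Yearly usage (hours) at which both options cost the same over `years`."""
    return on_prem.total_cost(years) / (cloud.node_hour_rate * nodes * years)


# All figures below are made up for illustration.
on_prem = OnPremCluster(hardware_cost=200_000.0, yearly_operating_cost=50_000.0)
cloud = PayPerUseCluster(node_hour_rate=1.0)
NODES, YEARS = 20, 3

break_even = break_even_hours_per_year(on_prem, cloud, NODES, YEARS)
print(f"On-premise cost over {YEARS} years: {on_prem.total_cost(YEARS):,.0f}")
print(f"Pay-per-use cost at 2,000 h/year:  {cloud.total_cost(NODES, 2000, YEARS):,.0f}")
print(f"Break-even usage: {break_even:,.0f} cluster hours per year")
```

With these placeholder numbers, pay-per-use is cheaper as long as the cluster runs fewer hours per year than the break-even figure; a cluster that runs around the clock would cross it.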
12
In summary …why HaaS?
Reducing the cost of innovation and focusing on critical business areas
Providing instant access to hardware resources to scale up or scale down as per business needs
Effectively optimizing and handling batch workloads
Managing variable resource requirements for different types of machines and workloads
Running closer to the data
Simplifying Hadoop operations
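Scaling up or down as business needs change amounts to recomputing a target cluster size from the current workload. The following is a minimal sketch of such a rule, assuming a hypothetical `target_node_count` helper, a fixed per-node task capacity, and arbitrary minimum and maximum cluster sizes; it does not represent any provider's autoscaling API.

```python
import math


def target_node_count(pending_tasks: int, tasks_per_node: int,
                      min_nodes: int = 2, max_nodes: int = 50) -> int:
    """Nodes needed for the pending work, clamped to [min_nodes, max_nodes]."""
    if tasks_per_node <= 0:
        raise ValueError("tasks_per_node must be positive")
    needed = math.ceil(pending_tasks / tasks_per_node)
    return max(min_nodes, min(max_nodes, needed))


# Hypothetical workloads: idle, moderate, and a burst beyond the size limit.
for pending in (0, 120, 10_000):
    print(pending, "pending tasks ->", target_node_count(pending, tasks_per_node=8), "nodes")
```

The lower bound keeps a small cluster available when idle, and the upper bound caps spend during bursts; both limits are assumptions chosen for the example.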
13
© 2015 Virtusa Corporation. All rights reserved. Virtusa and all other related logos are either
registered trademarks or trademarks of Virtusa Corporation in the United States, the European
Union, and/or India. All other company and service names are the property of their respective holders
and may be registered trademarks or trademarks in the United States and/or other countries.
Thanks
Virtusa Corporation
2000 West Park Drive,
Westborough, MA 01581 USA
