This presentation discusses the following features of Hadoop:
Open source
Fault tolerance
Distributed processing
Scalability
Reliability
High availability
Economic
Flexibility
Easy to use
Data locality
Conclusion
3. Hadoop is an open-source, Java-based programming framework.
Open source means it is freely available, and you can even change its source code to suit your requirements.
4. Hadoop handles faults through the process of replica creation.
When a client stores a file in HDFS, the Hadoop framework divides the file into blocks.
The data blocks are then distributed across different machines in the HDFS cluster, and a replica of each block is created on other machines in the cluster.
By default, HDFS creates 3 copies of each block on other machines in the cluster.
So if any machine in the cluster goes down or fails due to unfavorable conditions, the user can still easily access that data from the other machines.
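The default of 3 copies comes from the dfs.replication property, and the replication factor can also be changed per file. A minimal sketch using the HDFS Java API (the path /user/demo/data.txt is a hypothetical example):

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class ReplicationExample {
      public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // dfs.replication defaults to 3; newly written files inherit this value
        conf.set("dfs.replication", "3");
        FileSystem fs = FileSystem.get(conf);
        // Raise the replication factor of one existing file to 4;
        // the NameNode schedules the extra copy in the background
        Path file = new Path("/user/demo/data.txt"); // hypothetical path
        fs.setReplication(file, (short) 4);
        fs.close();
      }
    }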
5. Hadoop stores huge amounts of data in a distributed manner in HDFS and processes the data in parallel on a cluster of nodes.
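The classic WordCount job illustrates this model: the framework splits the input across the cluster, runs the map function in parallel on each split, and aggregates the results in the reduce phase. A standard sketch against the Hadoop MapReduce Java API:

    import java.io.IOException;
    import java.util.StringTokenizer;

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.Mapper;
    import org.apache.hadoop.mapreduce.Reducer;
    import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
    import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

    public class WordCount {

      // Mapper: emits (word, 1) for every word in its input split
      public static class TokenizerMapper
          extends Mapper<Object, Text, Text, IntWritable> {
        private static final IntWritable ONE = new IntWritable(1);
        private final Text word = new Text();

        public void map(Object key, Text value, Context context)
            throws IOException, InterruptedException {
          StringTokenizer itr = new StringTokenizer(value.toString());
          while (itr.hasMoreTokens()) {
            word.set(itr.nextToken());
            context.write(word, ONE);
          }
        }
      }

      // Reducer: sums the counts collected for each word
      public static class IntSumReducer
          extends Reducer<Text, IntWritable, Text, IntWritable> {
        private final IntWritable result = new IntWritable();

        public void reduce(Text key, Iterable<IntWritable> values,
            Context context) throws IOException, InterruptedException {
          int sum = 0;
          for (IntWritable val : values) {
            sum += val.get();
          }
          result.set(sum);
          context.write(key, result);
        }
      }

      public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "word count");
        job.setJarByClass(WordCount.class);
        job.setMapperClass(TokenizerMapper.class);
        job.setCombinerClass(IntSumReducer.class);
        job.setReducerClass(IntSumReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
      }
    }

Note that the code only describes the per-record map and reduce logic; splitting the input, scheduling tasks across nodes, and shuffling intermediate data are all handled by the framework.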
6. Hadoop is an open-source platform, and this makes it an extremely scalable platform.
New nodes can be easily added without any downtime.
Hadoop provides horizontal scalability, so new nodes can be added to the system on the fly.
In Apache Hadoop, applications can run on clusters of more than a thousand nodes.
7. Data is reliably stored on the cluster of machines despite machine failure, thanks to the replication of data.
So even if some nodes fail, the data is still stored reliably.
8. Due to the multiple copies of the data, the data is highly available and accessible despite hardware failure.
So if any machine goes down, the data can be retrieved from another path.
9. Hadoop is not very expensive, as it runs on a cluster of commodity hardware.
Since it uses low-cost commodity hardware, scaling out a Hadoop cluster does not require spending a huge amount of money.
10. Hadoop is very flexible in terms of its ability to deal with all kinds of data.
It handles structured, semi-structured, and unstructured data.
11. The client does not need to deal with distributed computing; the framework takes care of all of that.
So Hadoop is easy to use.
12. Data locality refers to the ability to move the computation close to where the actual data resides on a node, instead of moving the data to the computation.
This minimizes network congestion and increases the overall throughput of the system.
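The scheduler can place computation near the data because the NameNode knows which hosts store each block of a file. A minimal sketch that queries those block locations through the HDFS Java API (the path /user/demo/data.txt is a hypothetical example):

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.BlockLocation;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class BlockLocations {
      public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);
        Path file = new Path("/user/demo/data.txt"); // hypothetical path
        FileStatus status = fs.getFileStatus(file);
        // Ask the NameNode which hosts hold each block of the file;
        // MapReduce uses the same information to schedule map tasks
        // on (or near) the nodes that already store the block
        BlockLocation[] blocks =
            fs.getFileBlockLocations(status, 0, status.getLen());
        for (BlockLocation block : blocks) {
          System.out.println("Block at offset " + block.getOffset()
              + " hosted on: " + String.join(", ", block.getHosts()));
        }
        fs.close();
      }
    }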
13. Hadoop is highly fault-tolerant.
It reliably stores huge amounts of data despite hardware failure.
It provides high scalability and high availability.
Hadoop is cost-efficient, as it runs on a cluster of commodity hardware.
Hadoop works on data locality, since moving computation is cheaper than moving data.
All these features make Hadoop powerful for Big Data processing.