www.edureka.co/big-data-and-hadoop 
When not to use Hadoop 
View Big Data and Hadoop Course at: http://www.edureka.co/big-data-and-hadoop 
For more details please contact us: 
US : 1800 275 9730 (toll free) 
INDIA : +91 88808 62004 
Email Us : sales@edureka.co 
For Queries: 
Post on Twitter @edurekaIN: #askEdureka 
Post on Facebook /edurekaIN
Slide2 
Objectives 
At the end of this module, you will be able to… 
Understand When not to use Hadoop 
»Real Time Analytics 
»Not a Replacement 
»Dataset Size 
»Complexity 
»Security 
Understand When to use Hadoop 
»Huge Unstructured Datasets 
»Response Time is Not an Issue 
»Future Planning 
»Multiple Frameworks for Big Data 
»Lifetime Data Availability
Slide3 
Hadoop Mania
Slide4 
When Not To Use Hadoop
Slide5
Real Time Analytics
If you need real-time analytics, where results are expected quickly, Hadoop should not be used directly
Hadoop works on batch processing, hence the response time is high
[Figure: Day 1 to Day n timeline, with labels Input Data, Processing Data using MR, and Time Lag]
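The time lag described on this slide can be illustrated with a toy comparison. This is not Hadoop code, only a minimal sketch: a batch-style function answers once the whole input has been collected, while an incremental (streaming-style) function has an updated answer after every record. The function names and the sample data are hypothetical.

```python
from typing import Iterable, Iterator, List


def batch_total(records: Iterable[int]) -> int:
    """Batch style: consume the complete input, then return one answer."""
    collected: List[int] = list(records)  # nothing is reported until all data is in
    return sum(collected)


def running_totals(records: Iterable[int]) -> Iterator[int]:
    """Streaming style: emit an updated answer after every record."""
    total = 0
    for value in records:
        total += value
        yield total


# Hypothetical event counts for Day 1 .. Day 4
daily_counts = [5, 3, 8, 2]

print(batch_total(daily_counts))           # a single result, available after Day 4
print(list(running_totals(daily_counts)))  # one intermediate result per day
```

The batch version matches the slide's picture: the answer for Day 1 is only visible after Day n has been loaded and processed.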
Slide6
Real Time Analytics – Accepted Way
[Figure labels: Streaming Data, Storing]
Slide7
Real Time Analytics – Accepted Way (Contd.)
[Figure labels: 14 sec, 0.6 sec]
Slide8
Not a Replacement for Existing Infrastructure
Hadoop is not a replacement for your existing data processing infrastructure
After processing the data in Hadoop, you still need to send the output to relational database technologies for BI, decision support, reporting, etc.
Hadoop is not going to replace your database, but your database isn’t likely to replace Hadoop either
Different tools for different jobs
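The hand-off from Hadoop to a relational database can be sketched as follows. This is a minimal illustration under stated assumptions: the job output is taken to be tab-separated `key<TAB>count` text lines (the layout of MapReduce's default text output), an in-memory SQLite database stands in for the reporting database, and the table name `word_counts`, the helper names, and the sample rows are all hypothetical.

```python
import sqlite3
from typing import Iterable, List, Tuple


def parse_mr_output(lines: Iterable[str]) -> List[Tuple[str, int]]:
    """Parse 'key<TAB>count' lines into (key, count) rows, skipping blank lines."""
    rows = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        key, value = line.split("\t", 1)
        rows.append((key, int(value)))
    return rows


def load_into_db(conn: sqlite3.Connection, rows: List[Tuple[str, int]]) -> None:
    """Create the (hypothetical) reporting table if needed and load the rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS word_counts (word TEXT PRIMARY KEY, total INTEGER)"
    )
    conn.executemany("INSERT OR REPLACE INTO word_counts VALUES (?, ?)", rows)
    conn.commit()


# Hypothetical contents of one job output file
sample_output = ["hadoop\t42\n", "hive\t17\n", "pig\t9\n"]

conn = sqlite3.connect(":memory:")  # stand-in for the real relational database
load_into_db(conn, parse_mr_output(sample_output))

# A BI or reporting tool would now query the table with ordinary SQL
top = conn.execute(
    "SELECT word, total FROM word_counts ORDER BY total DESC LIMIT 2"
).fetchall()
print(top)
```

The division of labour is the point: Hadoop produces the aggregated result in bulk, and the database serves the fast, ad hoc queries that reporting needs.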
Slide9
Multiple Smaller Datasets – Accepted Way
Hadoop is not recommended for small structured datasets, because other tools such as MS Excel or an RDBMS can do this work more easily and faster
For small-data analytics, Hadoop can be costlier than other tools
Merge all the small files into one
Slide10
Multiple Smaller Datasets – Accepted Way
Each file of x MB: Slow Execution – 10400 ms
All the above files merged into one file (9x MB): Fast Execution – 6140 ms
Value shown on the slide for both runs: 4225284
Same Input, Same Output
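Merging the small files before the job runs can be sketched as below. With the default file-based input handling, each small file becomes at least one map task, so per-task overhead adds up. This is a minimal local sketch, not the procedure used for the slide's measurement: the helper `merge_files` and the file names are hypothetical, and the inputs are assumed to be line-oriented text.

```python
import tempfile
from pathlib import Path
from typing import Iterable


def merge_files(sources: Iterable[Path], target: Path) -> int:
    """Concatenate the source files into target; return the number of bytes written."""
    written = 0
    with target.open("wb") as out:
        for src in sources:
            data = src.read_bytes()
            if data and not data.endswith(b"\n"):
                data += b"\n"  # keep records of adjacent files on separate lines
            out.write(data)
            written += len(data)
    return written


with tempfile.TemporaryDirectory() as tmp:
    folder = Path(tmp)
    # Nine hypothetical small input files, mirroring the nine files on the slide
    for i in range(9):
        (folder / f"part{i}.txt").write_text(f"record from file {i}\n")

    merged = folder / "merged.txt"
    size = merge_files(sorted(folder.glob("part*.txt")), merged)
    line_count = len(merged.read_text().splitlines())

print(size, line_count)
```

The merged file carries the same records as the nine originals, so the job sees the same input and should produce the same output with fewer tasks to schedule.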
Slide11
Novice Hadoopers
Unless you have a good understanding of the Hadoop framework, it’s not advisable to use Hadoop in production
Learning Hadoop and its ecosystem tools, and deciding which technology suits your needs, adds another level of complexity
Slide12
Where Security is the Primary Concern
Many enterprises, especially in highly regulated industries dealing with sensitive data, aren’t able to move as quickly as they would like towards implementing Big Data projects and Hadoop
Example: health-care data used by insurance companies to calculate premiums
They don’t have to hesitate, though: many of the security and compliance challenges are being worked on continuously and are surmountable (for example, by using Apache Accumulo on top of Hadoop)
Slide13
Where Security is the Primary Concern – Accepted Way
[Figure labels: Healthcare Data, Hadoop Analytic Integration]
Slide14 
When To Use Hadoop
Slide15
Data Size and Data Diversity
You have different types of data: structured, semi-structured and unstructured
The data set is huge, i.e. several terabytes or petabytes
You are not in a hurry for answers
Slide16
Future Planning
Before implementing Hadoop on your data, first understand how complex the data is and the rate at which it is going to grow
This calls for cluster planning: you may begin by building a small or medium cluster in your organization, sized for the data available at present (GBs or a few TBs), and scale it up later as your data grows
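The sizing arithmetic behind this kind of planning can be sketched roughly as follows. The figures are assumptions, not recommendations from the slides: a replication factor of 3 (the HDFS default), a hypothetical 25% headroom for temporary and intermediate data, 8 TB of usable disk per node, and 10% monthly data growth.

```python
import math


def projected_data_tb(current_tb: float, monthly_growth: float, months: int) -> float:
    """Compound growth: data volume after the given number of months."""
    return current_tb * (1 + monthly_growth) ** months


def raw_storage_tb(data_tb: float, replication: int = 3, overhead: float = 0.25) -> float:
    """Raw disk needed: each block is stored `replication` times, plus assumed headroom."""
    return data_tb * replication * (1 + overhead)


def nodes_needed(data_tb: float, disk_per_node_tb: float,
                 replication: int = 3, overhead: float = 0.25) -> int:
    """Number of data nodes required to hold the raw storage, rounded up."""
    return math.ceil(raw_storage_tb(data_tb, replication, overhead) / disk_per_node_tb)


current = 2.0  # TB available today (hypothetical)
in_a_year = projected_data_tb(current, monthly_growth=0.10, months=12)

print(nodes_needed(current, disk_per_node_tb=8))    # storage-only estimate for today
print(nodes_needed(in_a_year, disk_per_node_tb=8))  # storage-only estimate after 12 months
```

This only covers storage. A real plan would also weigh CPU, memory, and the minimum node count needed for replication and fault tolerance.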
Slide17
Multiple Frameworks for Big Data
Hadoop can be integrated with multiple analytic tools to get the best out of it, such as machine learning tools, R, Python, Spark, MongoDB, etc.
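As one illustration of the Python integration, Hadoop Streaming lets a job use any program that reads lines from standard input and writes tab-separated key/value lines to standard output. The sketch below is a local simulation of a word count in that style, not a job submission: in a real streaming job the mapper and reducer would be separate scripts working on stdin and stdout, and the sample text is hypothetical.

```python
from itertools import groupby
from typing import Iterable, Iterator


def mapper(lines: Iterable[str]) -> Iterator[str]:
    """Emit 'word<TAB>1' for every word in the input lines."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def reducer(sorted_pairs: Iterable[str]) -> Iterator[str]:
    """Sum the counts per word; the input must already be sorted by key."""
    parsed = (pair.split("\t", 1) for pair in sorted_pairs)
    for word, group in groupby(parsed, key=lambda kv: kv[0]):
        yield f"{word}\t{sum(int(count) for _, count in group)}"


# Local stand-in for map -> sort -> reduce (the framework does the sort in a real job)
text = ["Hadoop stores data", "Hadoop processes data"]
result = list(reducer(sorted(mapper(text))))
print(result)
```

Writing the two phases as plain generator functions keeps them easy to test locally before they are wrapped in scripts for a cluster run.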
Slide18
Lifetime Data Availability
When you want your data to stay live and available indefinitely, Hadoop’s scalability makes this achievable
Slide19 
Slide20
How it Works?
LIVE Online Class
Class Recording in LMS
24/7 Post Class Support
Module Wise Quiz
Project Work
Verifiable Certificate
Slide21
Course Topics
Module 1
»Understanding Big Data and Hadoop
Module 2
»Hadoop Architecture and HDFS
Module 3
»Hadoop MapReduce Framework -I
Module 4
»Hadoop MapReduce Framework -II
Module 5
»Advance MapReduce
Module 6
»PIG
Module 7
»HIVE
Module 8
»Advance HIVE and HBase
Module 9
»Advance HBase
Module 10
»Oozie and Hadoop Project
Slide22 
Questions 
Twitter @edurekaIN, Facebook /edurekaIN, use #askEdureka for Questions
Webinar: Big Data & Hadoop - When not to use Hadoop
