The Evolution of
Web and Big Data
  Edward J. Yoon
Who Am I
• Edward J. Yoon
  – @eddieyoon
• Founder of Apache Hama
• PMC member of Apache Bigtop
• Oracle Employee
Early era of Web
Google
• 2003: GFS
• 2004: MapReduce
• 2005: Sawzall
• 2006: BigTable
OSS
• 2005: Hadoop (HDFS, MapReduce)
• 2006: Pig
• 2007: Hive, HBase
Google?
• World's best “full-text search engine”
• In 2003,
  – 10,000+ Servers
  – 4+ billion Documents
  – 300+ Million Images
Hadoop 1.0
• HDFS + MapReduce
  – And Pig, Hive, HBase, Mahout
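
A minimal sketch of the MapReduce programming model that Hadoop 1.0 pairs with HDFS, using word count as the example. This is plain in-memory Python, not Hadoop's API: the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen only for illustration, and a real cluster would distribute each phase across many machines.

```python
from collections import defaultdict


def map_phase(document):
    """Emit a (word, 1) pair for every word in one input document."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group emitted values by key, standing in for the framework's shuffle step."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values collected for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce over a list of documents (illustrative only)."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "big web"]))
```

The user writes only the map and reduce functions; grouping by key between the two phases is the framework's job, which is why it is kept as a separate step here.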
The New era of Web

Google
• 2010: Pregel, Dremel
• 2012: Spanner
OSS
• 2010: Hama, Twitter Storm
• 2011: YARN, Giraph
• 2012: Drill
MR vs. Alternatives
YARN?
• Job scheduling and cluster resource
  management
Future of CDH4 and Hadoop
• CDH4 will be based on Hadoop 0.23.x or later
• Hadoop 0.23.0 doesn’t include MapReduce 1.0
  – Storm, Giraph, Hama, Spark, MPI, GraphLab
