SlideShare a Scribd company logo
1 of 11
BIG Data
4 Layers Everyone Must
Know
There is still so much confusion
surrounding Big Data.
I thought it might help to clarify the 4
key layers of a big data system - i.e.
the different stages the data itself has
to pass through on its journey from raw
statistic or snippet of unstructured data
(for example, social media post) to
actionable insight.
The whole point of a big data strategy
is to develop a system which moves
data along this path – raw data to
actionable insights.
Here, I will attempt to define the basic
layers you will need to have in place, if
you are getting to grips with how big
data could help your business.
Although people have
come up with different
names for these layers,
as we’re charting a brave
new world where little is
set in stone, I think this is
the simplest and most
accurate breakdown:
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer
Data sources layer
This is where the data arrives at your
organization. It includes everything from
your sales records, customer database,
feedback, social media channels, marketing
list, email archives and any data gleaned
from monitoring or measuring aspects of
your operations. One of the first steps in
setting up a data strategy is assessing what
you have here, and measuring it against
what you need to answer the critical
questions you want help with. You might
have everything you need already, or you
might need to establish new sources.
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer
Data storage layer
This is where your Big Data lives, once it is
gathered from your sources. As the volume of
data generated and stored by companies has
started to explode, sophisticated but accessible
systems and tools have been developed – such
as Apache Hadoop DFS (distributed file system),
or Google File System, to help with this task. As
well as a system for storing data that your
computer system will understand (the file
system) you will need a system for organizing
and categorizing it in a way that people will
understand – the database. Hadoop has its own,
known as HBase, but others including Amazon’s
DynamoDB, MongoDB and Cassandra (used by
Facebook), all based on the NoSQL architecture,
are popular too.
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer
Data processing/ analysis layer
When you want to use the data you have
stored to find out something useful, you will
need to process and analyze it. A common
method is by using a MapReduce tool.
Essentially, this is used to select the elements
of the data that you want to analyze, and
putting it into a format from which insights
can be gleaned. If you are a large organization
which has invested in its own data analytics
team, they will form a part of this layer, too.
They will employ tools such as Apache PIG or
HIVE to query the data, and might use
automated pattern recognition tools to
determine trends, as well as drawing their
conclusions from manual analysis.
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer
Data output layer
This is how the insights gleaned through the
analysis is passed on to the people who can
take action to benefit from them. Clear and
concise communication (particularly if your
decision-makers don’t have a background in
statistics) is essential, and this output can
take the form of reports, charts, figures and
key recommendations. Ultimately, your Big
Data system’s main task is to show, at this
stage of the process, how measurable
improvement in at least one KPI that can be
achieved by taking action based on the
analysis you have carried out.
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer
If you set up a system which works
through all those stages to arrive at this
destination, then congratulations!
You’re in Big Data.
And hopefully, ready to start reaping the
benefits!
4 Layers of Big Data
4 Layers of Big Data

More Related Content

What's hot (20)

Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Big Data
Big DataBig Data
Big Data
 
Big data-ppt
Big data-pptBig data-ppt
Big data-ppt
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big Data Analytics for Banking, a Point of View
Big Data Analytics for Banking, a Point of ViewBig Data Analytics for Banking, a Point of View
Big Data Analytics for Banking, a Point of View
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big data
Big dataBig data
Big data
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data Modeling for Big Data
Data Modeling for Big DataData Modeling for Big Data
Data Modeling for Big Data
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big data
Big dataBig data
Big data
 
Big Data Presentation
Big  Data PresentationBig  Data Presentation
Big Data Presentation
 
Big Data - The 5 Vs Everyone Must Know
Big Data - The 5 Vs Everyone Must KnowBig Data - The 5 Vs Everyone Must Know
Big Data - The 5 Vs Everyone Must Know
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 
Data Science
Data ScienceData Science
Data Science
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake Overview
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big data
 

Similar to 4 Layers of Big Data

6 data-understanding-v2
6 data-understanding-v26 data-understanding-v2
6 data-understanding-v2ArdianDwiPraba
 
Big dataplatform operationalstrategy
Big dataplatform operationalstrategyBig dataplatform operationalstrategy
Big dataplatform operationalstrategyHimanshu Bari
 
Data warehousing interview questions
Data warehousing interview questionsData warehousing interview questions
Data warehousing interview questionsSatyam Jaiswal
 
data analytics lecture 3.2.ppt
data analytics lecture 3.2.pptdata analytics lecture 3.2.ppt
data analytics lecture 3.2.pptRutujaPatil247341
 
data wrangling (1).pptx kjhiukjhknjbnkjh
data wrangling (1).pptx kjhiukjhknjbnkjhdata wrangling (1).pptx kjhiukjhknjbnkjh
data wrangling (1).pptx kjhiukjhknjbnkjhVISHALMARWADE1
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysNEWYORKSYS-IT SOLUTIONS
 
Big data
Big dataBig data
Big data26Nia
 
Foundational Methodology for Data Science
Foundational Methodology for Data ScienceFoundational Methodology for Data Science
Foundational Methodology for Data ScienceJohn B. Rollins, Ph.D.
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceMahir Haque
 
Building a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperBuilding a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperImpetus Technologies
 
About Streaming Data Solutions for Hadoop
About Streaming Data Solutions for HadoopAbout Streaming Data Solutions for Hadoop
About Streaming Data Solutions for HadoopLynn Langit
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business IntelligenceSukirti Garg
 
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docxlorainedeserre
 
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docxBHANU281672
 
big data and machine learning ppt.pptx
big data and machine learning ppt.pptxbig data and machine learning ppt.pptx
big data and machine learning ppt.pptxNATASHABANO
 

Similar to 4 Layers of Big Data (20)

6 data-understanding-v2
6 data-understanding-v26 data-understanding-v2
6 data-understanding-v2
 
Big dataplatform operationalstrategy
Big dataplatform operationalstrategyBig dataplatform operationalstrategy
Big dataplatform operationalstrategy
 
Data warehousing interview questions
Data warehousing interview questionsData warehousing interview questions
Data warehousing interview questions
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
data analytics lecture 3.2.ppt
data analytics lecture 3.2.pptdata analytics lecture 3.2.ppt
data analytics lecture 3.2.ppt
 
data wrangling (1).pptx kjhiukjhknjbnkjh
data wrangling (1).pptx kjhiukjhknjbnkjhdata wrangling (1).pptx kjhiukjhknjbnkjh
data wrangling (1).pptx kjhiukjhknjbnkjh
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
 
Big data
Big dataBig data
Big data
 
Unit 5
Unit 5 Unit 5
Unit 5
 
Foundational Methodology for Data Science
Foundational Methodology for Data ScienceFoundational Methodology for Data Science
Foundational Methodology for Data Science
 
Unlocking big data
Unlocking big dataUnlocking big data
Unlocking big data
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Building a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperBuilding a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White Paper
 
About Streaming Data Solutions for Hadoop
About Streaming Data Solutions for HadoopAbout Streaming Data Solutions for Hadoop
About Streaming Data Solutions for Hadoop
 
BD1.pptx
BD1.pptxBD1.pptx
BD1.pptx
 
1 UNIT-DSP.pptx
1 UNIT-DSP.pptx1 UNIT-DSP.pptx
1 UNIT-DSP.pptx
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx
 
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx
2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx
 
big data and machine learning ppt.pptx
big data and machine learning ppt.pptxbig data and machine learning ppt.pptx
big data and machine learning ppt.pptx
 

More from Bernard Marr

The Top 4 Telecom Trends In 2023
The Top 4 Telecom Trends In 2023The Top 4 Telecom Trends In 2023
The Top 4 Telecom Trends In 2023Bernard Marr
 
How To Use Meta’s Horizon Workrooms For Business
How To Use Meta’s Horizon Workrooms For BusinessHow To Use Meta’s Horizon Workrooms For Business
How To Use Meta’s Horizon Workrooms For BusinessBernard Marr
 
The Top 5 Healthcare Trends In 2023
The Top 5 Healthcare Trends In 2023The Top 5 Healthcare Trends In 2023
The Top 5 Healthcare Trends In 2023Bernard Marr
 
The Top 5 In-Demand Tech Skills For Jobs In 2023
The Top 5 In-Demand Tech Skills For Jobs In 2023The Top 5 In-Demand Tech Skills For Jobs In 2023
The Top 5 In-Demand Tech Skills For Jobs In 2023Bernard Marr
 
Policing In The Metaverse: What’s Happening Now
Policing In The Metaverse: What’s Happening NowPolicing In The Metaverse: What’s Happening Now
Policing In The Metaverse: What’s Happening NowBernard Marr
 
Banking In The Metaverse – The Next Frontier For Financial Services
Banking In The Metaverse – The Next Frontier For Financial Services Banking In The Metaverse – The Next Frontier For Financial Services
Banking In The Metaverse – The Next Frontier For Financial Services Bernard Marr
 
The 7 Biggest Business Challenges Every Company Is Facing In 2023
The 7 Biggest Business Challenges Every Company Is Facing In 2023The 7 Biggest Business Challenges Every Company Is Facing In 2023
The 7 Biggest Business Challenges Every Company Is Facing In 2023Bernard Marr
 
Is This The Downfall Of Meta And Social Media As We Know It?
Is This The Downfall Of Meta And Social Media As We Know It?Is This The Downfall Of Meta And Social Media As We Know It?
Is This The Downfall Of Meta And Social Media As We Know It?Bernard Marr
 
The Top Five Cybersecurity Trends In 2023
The Top Five Cybersecurity Trends In 2023The Top Five Cybersecurity Trends In 2023
The Top Five Cybersecurity Trends In 2023Bernard Marr
 
The Top 5 Technology Challenges In 2023
The Top 5 Technology Challenges In 2023The Top 5 Technology Challenges In 2023
The Top 5 Technology Challenges In 2023Bernard Marr
 
How To Build A Positive Hybrid And Remote Working Culture In 2023
How To Build A Positive Hybrid And Remote Working Culture In 2023How To Build A Positive Hybrid And Remote Working Culture In 2023
How To Build A Positive Hybrid And Remote Working Culture In 2023Bernard Marr
 
Beyond Dashboards: The Future Of Analytics And Business Intelligence?
Beyond Dashboards: The Future Of Analytics And Business Intelligence? Beyond Dashboards: The Future Of Analytics And Business Intelligence?
Beyond Dashboards: The Future Of Analytics And Business Intelligence? Bernard Marr
 
The Top 5 Data Science And Analytics Trends In 2023
The Top 5 Data Science And Analytics Trends In 2023The Top 5 Data Science And Analytics Trends In 2023
The Top 5 Data Science And Analytics Trends In 2023Bernard Marr
 
The 5 Biggest Business Trends For 2023
The 5 Biggest Business Trends For 2023The 5 Biggest Business Trends For 2023
The 5 Biggest Business Trends For 2023Bernard Marr
 
12 Practical Steps To Handle Change At Work
12 Practical Steps To Handle Change At Work 12 Practical Steps To Handle Change At Work
12 Practical Steps To Handle Change At Work Bernard Marr
 
The Top 12 Virtual Networking Tips To Boost Your Career
The Top 12 Virtual Networking Tips To Boost Your CareerThe Top 12 Virtual Networking Tips To Boost Your Career
The Top 12 Virtual Networking Tips To Boost Your CareerBernard Marr
 
How AI And Machine Learning Will Impact The Future Of Healthcare
How AI And Machine Learning Will Impact The Future Of HealthcareHow AI And Machine Learning Will Impact The Future Of Healthcare
How AI And Machine Learning Will Impact The Future Of HealthcareBernard Marr
 
Top 16 Essential Soft Skills For The Future of Work
Top 16 Essential Soft Skills For The Future of WorkTop 16 Essential Soft Skills For The Future of Work
Top 16 Essential Soft Skills For The Future of WorkBernard Marr
 
Artificial Intelligence And The Future Of Marketing
Artificial Intelligence And The Future Of MarketingArtificial Intelligence And The Future Of Marketing
Artificial Intelligence And The Future Of MarketingBernard Marr
 
Is AI Really a Job Killer? These Experts Say No
Is AI Really a Job Killer? These Experts Say NoIs AI Really a Job Killer? These Experts Say No
Is AI Really a Job Killer? These Experts Say NoBernard Marr
 

More from Bernard Marr (20)

The Top 4 Telecom Trends In 2023
The Top 4 Telecom Trends In 2023The Top 4 Telecom Trends In 2023
The Top 4 Telecom Trends In 2023
 
How To Use Meta’s Horizon Workrooms For Business
How To Use Meta’s Horizon Workrooms For BusinessHow To Use Meta’s Horizon Workrooms For Business
How To Use Meta’s Horizon Workrooms For Business
 
The Top 5 Healthcare Trends In 2023
The Top 5 Healthcare Trends In 2023The Top 5 Healthcare Trends In 2023
The Top 5 Healthcare Trends In 2023
 
The Top 5 In-Demand Tech Skills For Jobs In 2023
The Top 5 In-Demand Tech Skills For Jobs In 2023The Top 5 In-Demand Tech Skills For Jobs In 2023
The Top 5 In-Demand Tech Skills For Jobs In 2023
 
Policing In The Metaverse: What’s Happening Now
Policing In The Metaverse: What’s Happening NowPolicing In The Metaverse: What’s Happening Now
Policing In The Metaverse: What’s Happening Now
 
Banking In The Metaverse – The Next Frontier For Financial Services
Banking In The Metaverse – The Next Frontier For Financial Services Banking In The Metaverse – The Next Frontier For Financial Services
Banking In The Metaverse – The Next Frontier For Financial Services
 
The 7 Biggest Business Challenges Every Company Is Facing In 2023
The 7 Biggest Business Challenges Every Company Is Facing In 2023The 7 Biggest Business Challenges Every Company Is Facing In 2023
The 7 Biggest Business Challenges Every Company Is Facing In 2023
 
Is This The Downfall Of Meta And Social Media As We Know It?
Is This The Downfall Of Meta And Social Media As We Know It?Is This The Downfall Of Meta And Social Media As We Know It?
Is This The Downfall Of Meta And Social Media As We Know It?
 
The Top Five Cybersecurity Trends In 2023
The Top Five Cybersecurity Trends In 2023The Top Five Cybersecurity Trends In 2023
The Top Five Cybersecurity Trends In 2023
 
The Top 5 Technology Challenges In 2023
The Top 5 Technology Challenges In 2023The Top 5 Technology Challenges In 2023
The Top 5 Technology Challenges In 2023
 
How To Build A Positive Hybrid And Remote Working Culture In 2023
How To Build A Positive Hybrid And Remote Working Culture In 2023How To Build A Positive Hybrid And Remote Working Culture In 2023
How To Build A Positive Hybrid And Remote Working Culture In 2023
 
Beyond Dashboards: The Future Of Analytics And Business Intelligence?
Beyond Dashboards: The Future Of Analytics And Business Intelligence? Beyond Dashboards: The Future Of Analytics And Business Intelligence?
Beyond Dashboards: The Future Of Analytics And Business Intelligence?
 
The Top 5 Data Science And Analytics Trends In 2023
The Top 5 Data Science And Analytics Trends In 2023The Top 5 Data Science And Analytics Trends In 2023
The Top 5 Data Science And Analytics Trends In 2023
 
The 5 Biggest Business Trends For 2023
The 5 Biggest Business Trends For 2023The 5 Biggest Business Trends For 2023
The 5 Biggest Business Trends For 2023
 
12 Practical Steps To Handle Change At Work
12 Practical Steps To Handle Change At Work 12 Practical Steps To Handle Change At Work
12 Practical Steps To Handle Change At Work
 
The Top 12 Virtual Networking Tips To Boost Your Career
The Top 12 Virtual Networking Tips To Boost Your CareerThe Top 12 Virtual Networking Tips To Boost Your Career
The Top 12 Virtual Networking Tips To Boost Your Career
 
How AI And Machine Learning Will Impact The Future Of Healthcare
How AI And Machine Learning Will Impact The Future Of HealthcareHow AI And Machine Learning Will Impact The Future Of Healthcare
How AI And Machine Learning Will Impact The Future Of Healthcare
 
Top 16 Essential Soft Skills For The Future of Work
Top 16 Essential Soft Skills For The Future of WorkTop 16 Essential Soft Skills For The Future of Work
Top 16 Essential Soft Skills For The Future of Work
 
Artificial Intelligence And The Future Of Marketing
Artificial Intelligence And The Future Of MarketingArtificial Intelligence And The Future Of Marketing
Artificial Intelligence And The Future Of Marketing
 
Is AI Really a Job Killer? These Experts Say No
Is AI Really a Job Killer? These Experts Say NoIs AI Really a Job Killer? These Experts Say No
Is AI Really a Job Killer? These Experts Say No
 

Recently uploaded

Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingsocarem879
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 

Recently uploaded (20)

Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 

4 Layers of Big Data

  • 1. BIG Data 4 Layers Everyone Must Know
  • 2. There is still so much confusion surrounding Big Data. I thought it might help to clarify the 4 key layers of a big data system - i.e. the different stages the data itself has to pass through on its journey from raw statistic or snippet of unstructured data (for example, social media post) to actionable insight.
  • 3. The whole point of a big data strategy is to develop a system which moves data along this path – raw data to actionable insights. Here, I will attempt to define the basic layers you will need to have in place, if you are getting to grips with how big data could help your business.
  • 4. Although people have come up with different names for these layers, as we’re charting a brave new world where little is set in stone, I think this is the simplest and most accurate breakdown: 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer
  • 5. Data sources layer This is where the data arrives at your organization. It includes everything from your sales records, customer database, feedback, social media channels, marketing list, email archives and any data gleaned from monitoring or measuring aspects of your operations. One of the first steps in setting up a data strategy is assessing what you have here, and measuring it against what you need to answer the critical questions you want help with. You might have everything you need already, or you might need to establish new sources. 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer
  • 6. Data storage layer This is where your Big Data lives, once it is gathered from your sources. As the volume of data generated and stored by companies has started to explode, sophisticated but accessible systems and tools have been developed – such as Apache Hadoop DFS (distributed file system), or Google File System, to help with this task. As well as a system for storing data that your computer system will understand (the file system) you will need a system for organizing and categorizing it in a way that people will understand – the database. Hadoop has its own, known as HBase, but others including Amazon’s DynamoDB, MongoDB and Cassandra (used by Facebook), all based on the NoSQL architecture, are popular too. 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer
  • 7. Data processing/ analysis layer When you want to use the data you have stored to find out something useful, you will need to process and analyze it. A common method is by using a MapReduce tool. Essentially, this is used to select the elements of the data that you want to analyze, and putting it into a format from which insights can be gleaned. If you are a large organization which has invested in its own data analytics team, they will form a part of this layer, too. They will employ tools such as Apache PIG or HIVE to query the data, and might use automated pattern recognition tools to determine trends, as well as drawing their conclusions from manual analysis. 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer
  • 8. Data output layer This is how the insights gleaned through the analysis is passed on to the people who can take action to benefit from them. Clear and concise communication (particularly if your decision-makers don’t have a background in statistics) is essential, and this output can take the form of reports, charts, figures and key recommendations. Ultimately, your Big Data system’s main task is to show, at this stage of the process, how measurable improvement in at least one KPI that can be achieved by taking action based on the analysis you have carried out. 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer
  • 9. If you set up a system which works through all those stages to arrive at this destination, then congratulations! You’re in Big Data. And hopefully, ready to start reaping the benefits!