4 Layers of Big Data

•Download as PPTX, PDF•

179 likes•124,434 views

Bernard Marr

This slide deck outlines the four key layers of any big data system.

Data & Analytics

There is still so much confusion
surrounding Big Data.
I thought it might help to clarify the 4
key layers of a big data system - i.e.
the different stages the data itself has
to pass through on its journey from raw
statistic or snippet of unstructured data
(for example, social media post) to
actionable insight.

The whole point of a big data strategy
is to develop a system which moves
data along this path – raw data to
actionable insights.
Here, I will attempt to define the basic
layers you will need to have in place, if
you are getting to grips with how big
data could help your business.

Although people have
come up with different
names for these layers,
as we’re charting a brave
new world where little is
set in stone, I think this is
the simplest and most
accurate breakdown:
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer

Data sources layer
This is where the data arrives at your
organization. It includes everything from
your sales records, customer database,
feedback, social media channels, marketing
list, email archives and any data gleaned
from monitoring or measuring aspects of
your operations. One of the first steps in
setting up a data strategy is assessing what
you have here, and measuring it against
what you need to answer the critical
questions you want help with. You might
have everything you need already, or you
might need to establish new sources.
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer

Data storage layer
This is where your Big Data lives, once it is
gathered from your sources. As the volume of
data generated and stored by companies has
started to explode, sophisticated but accessible
systems and tools have been developed – such
as Apache Hadoop DFS (distributed file system),
or Google File System, to help with this task. As
well as a system for storing data that your
computer system will understand (the file
system) you will need a system for organizing
and categorizing it in a way that people will
understand – the database. Hadoop has its own,
known as HBase, but others including Amazon’s
DynamoDB, MongoDB and Cassandra (used by
Facebook), all based on the NoSQL architecture,
are popular too.
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer

Data processing/ analysis layer
When you want to use the data you have
stored to find out something useful, you will
need to process and analyze it. A common
method is by using a MapReduce tool.
Essentially, this is used to select the elements
of the data that you want to analyze, and
putting it into a format from which insights
can be gleaned. If you are a large organization
which has invested in its own data analytics
team, they will form a part of this layer, too.
They will employ tools such as Apache PIG or
HIVE to query the data, and might use
automated pattern recognition tools to
determine trends, as well as drawing their
conclusions from manual analysis.
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer

Data output layer
This is how the insights gleaned through the
analysis is passed on to the people who can
take action to benefit from them. Clear and
concise communication (particularly if your
decision-makers don’t have a background in
statistics) is essential, and this output can
take the form of reports, charts, figures and
key recommendations. Ultimately, your Big
Data system’s main task is to show, at this
stage of the process, how measurable
improvement in at least one KPI that can be
achieved by taking action based on the
analysis you have carried out.
1. Data Source Layer
3. Data Processing /
Analysis Layer
2. Data Storage Layer
4. Data Output Layer

If you set up a system which works
through all those stages to arrive at this
destination, then congratulations!
You’re in Big Data.
And hopefully, ready to start reaping the
benefits!

What's hot

Data science applications and usecasesSreenatha Reddy K R

Data Analytics Life CycleDr. C.V. Suresh Babu

Big data by Mithlesh sadhMithlesh Sadh

Text miningKoshy Geoji

Big data in telecomShubham Bathe

Data miningAhmed Moussa

Data preprocessing using Machine Learning Gopal Sakarkar

Data SciencePrakhyath Rai

Big Data AnalyticsGhulam Imaduddin

Big dataNausheen Hasan

Big Data Analyticshumerashaziya

Three Big Data Case StudiesAtidan Technologies Pvt Ltd (India)

Big Data & Hadoop IntroductionJayant Mukherjee

Big DataPriyanka Tuteja

Big dataPooja Shah

Introduction to Data Mining Sushil Kulkarni

Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...Edureka!

Big Data Platforms: An OverviewC. Scyphers

Presentation on Big DataMaruf Abdullah (Rion)

Presentation on Big DataMd. Salman Ahmed

What's hot (20)

Data science applications and usecases

Data Analytics Life Cycle

Big data by Mithlesh sadh

Text mining

Big data in telecom

Data mining

Data preprocessing using Machine Learning

Data Science

Big Data Analytics

Big data

Big Data Analytics

Three Big Data Case Studies

Big Data & Hadoop Introduction

Big Data

Big data

Introduction to Data Mining

Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...

Big Data Platforms: An Overview

Presentation on Big Data

Similar to 4 Layers of Big Data

6 data-understanding-v2ArdianDwiPraba

Big dataplatform operationalstrategyHimanshu Bari

Data warehousing interview questionsSatyam Jaiswal

Big data analyticsANAND PRAKASH

data analytics lecture 3.2.pptRutujaPatil247341

data wrangling (1).pptx kjhiukjhknjbnkjhVISHALMARWADE1

Big dataMithilesh Joshi - SEO & Digital Marketing Consultant

What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysNEWYORKSYS-IT SOLUTIONS

Big data26Nia

Unit 5 sasisanjeev

Foundational Methodology for Data ScienceJohn B. Rollins, Ph.D.

Unlocking big dataOrchestrate Mortgage and Title Solutions, LLC

Introduction to data scienceMahir Haque

Building a Big Data Analytics Platform- Impetus White PaperImpetus Technologies

About Streaming Data Solutions for HadoopLynn Langit

BD1.pptxKarthik Rohan

1 UNIT-DSP.pptxPothyeswariPothyes

Business IntelligenceSukirti Garg

2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docxlorainedeserre

2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docxBHANU281672

Similar to 4 Layers of Big Data (20)

6 data-understanding-v2

Big dataplatform operationalstrategy

Data warehousing interview questions

Big data analytics

data analytics lecture 3.2.ppt

data wrangling (1).pptx kjhiukjhknjbnkjh

Big data

What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys

Big data

Unit 5

Foundational Methodology for Data Science

Unlocking big data

Introduction to data science

Building a Big Data Analytics Platform- Impetus White Paper

About Streaming Data Solutions for Hadoop

BD1.pptx

1 UNIT-DSP.pptx

Business Intelligence

2Running Head BIG DATA PROCESSING OF SOFTWARE AND TOOLS2BIG.docx

Recently uploaded

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

Mature dropshipping via API with DroFx.pptxolyaivanovalion

100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate

Carero dropshipping via API with DroFx.pptxolyaivanovalion

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat

Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H

Ukraine War presentation: KNOW THE BASICSAishani27

Edukaciniai dropshipping via API with DroFxolyaivanovalion

Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor

Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71

VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls

Midocean dropshipping via API with DroFxolyaivanovalion

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo

Industrialised data - the key to AI success.pdfLars Albertsson

Recently uploaded (20)

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

Mature dropshipping via API with DroFx.pptx

100-Concepts-of-AI by Anupama Kate .pptx

Carero dropshipping via API with DroFx.pptx

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service

Unveiling Insights: The Role of a Data Analyst

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf

Ukraine War presentation: KNOW THE BASICS

Edukaciniai dropshipping via API with DroFx

Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...

Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha

VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...

Midocean dropshipping via API with DroFx

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改

Industrialised data - the key to AI success.pdf

4 Layers of Big Data

1. BIG Data 4 Layers Everyone Must Know

2. There is still so much confusion surrounding Big Data. I thought it might help to clarify the 4 key layers of a big data system - i.e. the different stages the data itself has to pass through on its journey from raw statistic or snippet of unstructured data (for example, social media post) to actionable insight.

3. The whole point of a big data strategy is to develop a system which moves data along this path – raw data to actionable insights. Here, I will attempt to define the basic layers you will need to have in place, if you are getting to grips with how big data could help your business.

4. Although people have come up with different names for these layers, as we’re charting a brave new world where little is set in stone, I think this is the simplest and most accurate breakdown: 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer

5. Data sources layer This is where the data arrives at your organization. It includes everything from your sales records, customer database, feedback, social media channels, marketing list, email archives and any data gleaned from monitoring or measuring aspects of your operations. One of the first steps in setting up a data strategy is assessing what you have here, and measuring it against what you need to answer the critical questions you want help with. You might have everything you need already, or you might need to establish new sources. 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer

6. Data storage layer This is where your Big Data lives, once it is gathered from your sources. As the volume of data generated and stored by companies has started to explode, sophisticated but accessible systems and tools have been developed – such as Apache Hadoop DFS (distributed file system), or Google File System, to help with this task. As well as a system for storing data that your computer system will understand (the file system) you will need a system for organizing and categorizing it in a way that people will understand – the database. Hadoop has its own, known as HBase, but others including Amazon’s DynamoDB, MongoDB and Cassandra (used by Facebook), all based on the NoSQL architecture, are popular too. 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer

7. Data processing/ analysis layer When you want to use the data you have stored to find out something useful, you will need to process and analyze it. A common method is by using a MapReduce tool. Essentially, this is used to select the elements of the data that you want to analyze, and putting it into a format from which insights can be gleaned. If you are a large organization which has invested in its own data analytics team, they will form a part of this layer, too. They will employ tools such as Apache PIG or HIVE to query the data, and might use automated pattern recognition tools to determine trends, as well as drawing their conclusions from manual analysis. 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer

8. Data output layer This is how the insights gleaned through the analysis is passed on to the people who can take action to benefit from them. Clear and concise communication (particularly if your decision-makers don’t have a background in statistics) is essential, and this output can take the form of reports, charts, figures and key recommendations. Ultimately, your Big Data system’s main task is to show, at this stage of the process, how measurable improvement in at least one KPI that can be achieved by taking action based on the analysis you have carried out. 1. Data Source Layer 3. Data Processing / Analysis Layer 2. Data Storage Layer 4. Data Output Layer

9. If you set up a system which works through all those stages to arrive at this destination, then congratulations! You’re in Big Data. And hopefully, ready to start reaping the benefits!

4 Layers of Big Data

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to 4 Layers of Big Data

Similar to 4 Layers of Big Data (20)

More from Bernard Marr

More from Bernard Marr (20)

Recently uploaded

Recently uploaded (20)

4 Layers of Big Data