*********TECHNO INDIA COLLEGE OF TECHNOLOGY**********
RAJARHAT,KOLKATA - 700156
A full powerpoint presentation on big data analytics and Hadoop. This is made by: SK IBRAHIM ANAM , SOUVIK JANA.
SK IBRAHIM ANAM
SK IBRAHIM ANAM.
Data is raw, unorganized facts that need to be
processed. Data can be something simple,
seemingly random and of itself worthless useless
until it is organized.
DIFFERENT TYPES OF DATA
Traditional RDBMS deals with
only Structured Data
Need of a Technology which deals with
Semi – Structured Data ,Unstructured
Data and Structured Data as well
Traditional Concept of Data Storage
Extract Data Transform Data
End Users Generate
Reports & Perform
Drawback of Using Traditional Approach
Expensive Time Consuming Scalability
Storage Size Resource Failure
The Model of Generating or Consuming Data
OLD MODEL - Few companies are generating the data, all
other consuming the data.
NEW MODEL -All of us generating the data, and all of us
consuming the data.
Big data means really a Big Data, it is a
collection of large datasets that cannot be
processed using traditional computing
techniques. It requires new architecture , new
techniques , various tools and frameworks .
WHERE THE BIG DATA IS USED
CHALLENGES IN HANDLING BIG DATA
There are two main challenges in handle BIG DATA
1. How do we store and manage such a huge volume
of DATA, efficiently.
2. How do we process & extract valuable information
from the huge volume of DATA within a given
Hadoop is a open Source Framework. It is designed to
store and Process huge volume of Data, efficiently.
Hadoop is a platform that provides both distributed
storage and computational capabilities.