MapReduce - Hadoop - Big Data

Problem Analysis
• Experiments were run on the ICSI desktop cluster, but
in reality a Big Data system has to handle around 100
petabytes of data.
• Heavy network traffic is not considered.

Problem Analysis
• MapReduce incurs latency because:
• The peak rate of the mapping phase is not high.
• Data needs to be bundled for fast mapping.
• The number of reducers is limited, as each reducer
writes a different output file.
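
The map, shuffle, and reduce steps behind these points can be sketched in memory. This is a minimal illustration, not Hadoop code: the function names, the word-count job, and the character-sum partitioner are all assumptions made for the example. Each reducer ends up with its own separate output, which mirrors the "different output file per reducer" point.

```python
from collections import defaultdict

def map_phase(records):
    """Emit (word, 1) for every word in every input record."""
    for record in records:
        for word in record.split():
            yield word.lower(), 1

def partition(key, num_reducers):
    """Pick a reducer from the key (hypothetical, deterministic partitioner)."""
    return sum(ord(c) for c in key) % num_reducers

def shuffle(pairs, num_reducers):
    """Group the intermediate pairs by reducer, then by key."""
    buckets = [defaultdict(list) for _ in range(num_reducers)]
    for key, value in pairs:
        buckets[partition(key, num_reducers)][key].append(value)
    return buckets

def reduce_phase(buckets):
    """Each reducer sums its own keys and yields a separate output."""
    return [{key: sum(values) for key, values in bucket.items()}
            for bucket in buckets]

def word_count(records, num_reducers=2):
    return reduce_phase(shuffle(map_phase(records), num_reducers))

# One dict per reducer; a key never appears in more than one of them.
outputs = word_count(["big data big cluster", "data node"], num_reducers=2)
```

In a real cluster the intermediate pairs are written to disk between the map and reduce steps, which this in-memory sketch does not model.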

Problem Analysis
• MapReduce incurs latency because:
• Hadoop does not support broadcasting parameter
references to all map nodes, so every map node has to
bundle the same parameters.
• A secondary buffer is needed for swapping.
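
A rough way to see the cost of bundling is to compare the bytes shipped when every map task carries its own copy of the parameters with the bytes shipped if one copy per node could be shared. The sketch below is only an estimate under assumed numbers: the parameter dictionary, the 400 tasks, and the 10 nodes are hypothetical, and pickle stands in for whatever serialization a real job would use.

```python
import pickle

def bundled_bytes(params, num_map_tasks):
    """Bytes shipped when every map task carries its own serialized copy."""
    return len(pickle.dumps(params)) * num_map_tasks

def broadcast_bytes(params, num_nodes):
    """Bytes shipped if a single copy per node could be shared by its tasks."""
    return len(pickle.dumps(params)) * num_nodes

# Hypothetical model parameters and cluster shape, for illustration only.
params = {"weights": [0.0] * 1000}
per_task = bundled_bytes(params, num_map_tasks=400)
per_node = broadcast_bytes(params, num_nodes=10)
ratio = per_task / per_node  # how many times more data bundling ships
```

With these assumed figures the ratio equals the number of tasks per node, so the overhead grows with the number of map tasks rather than with the size of the cluster.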

Problem Analysis
• Hadoop has drawbacks when implementing DFS.
• The MapReduce framework performs very poorly with
slot-based memory (1 slot, 1 task) and on iterative
processing tasks such as graph processing.
• MapReduce does not work when there are
computational dependencies in the data.
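
The iterative case can be illustrated with breadth-first search written as repeated MapReduce-style rounds. This is a simplified in-memory sketch, not a Hadoop job: the graph, the function names, and the record counter are assumptions for the example. The point it shows is that every round scans the whole graph again, even when only one node changes.

```python
from collections import defaultdict

INF = float("inf")

def bfs_round(graph, dist):
    """One round: map over every node, then reduce by taking the minimum."""
    proposals = defaultdict(list)
    records_read = 0
    for node, neighbours in graph.items():   # map: full scan of the graph
        records_read += 1
        if node in dist:
            for n in neighbours:
                proposals[n].append(dist[node] + 1)
    new_dist = dict(dist)
    for n, candidates in proposals.items():  # reduce: shortest distance wins
        new_dist[n] = min(min(candidates), new_dist.get(n, INF))
    return new_dist, records_read

def bfs(graph, source):
    """Repeat rounds until no distance changes; count rounds and records read."""
    dist = {source: 0}
    rounds = total_read = 0
    while True:
        new_dist, read = bfs_round(graph, dist)
        rounds += 1
        total_read += read
        if new_dist == dist:
            return dist, rounds, total_read
        dist = new_dist

# Hypothetical four-node chain a -> b -> c -> d.
graph = {"a": ["b"], "b": ["c"], "c": ["d"], "d": []}
dist, rounds, total_read = bfs(graph, "a")
```

For this chain the search needs four rounds and reads sixteen node records to settle four distances, because each round depends on the result of the previous one.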

Problem Analysis
• Implementing the research suggestions is less intuitive
and more complicated than necessary.
• If new data is added, the jobs need to run over
the entire set again.
• A single failure kills all queued and running
jobs.
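
The cost of rerunning over the entire set can be sketched with a toy batch job that recomputes an aggregate from scratch. The dataset and the sum aggregate are hypothetical; the sketch only counts how many records are read when a single new record arrives.

```python
def batch_total(dataset):
    """Recompute the aggregate from scratch, as a batch job would."""
    records_read = 0
    total = 0
    for value in dataset:
        records_read += 1
        total += value
    return total, records_read

# Hypothetical dataset: three records, then one new record arrives.
dataset = [3, 5, 8]
first_total, read_first = batch_total(dataset)
dataset.append(13)
second_total, read_second = batch_total(dataset)
reread = read_second - 1  # old records scanned again for one new record
```

Here absorbing one new record means rereading the three old ones, and the reread count grows with the size of the existing dataset.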

Suggestion
• Augmenting MapReduce with ad hoc support
may solve iterative processing and random access to its
dataset.
• Sampling may also be used to solve the iterative
problem.
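
The sampling idea can be sketched by running the same iterative computation on a random sample instead of the full dataset. Everything here is assumed for illustration: the data, the sample size of 500, the fixed seed, and the choice of a gradient-style mean estimate as the iterative task. The result on the sample is an approximation, not the exact answer.

```python
import random

def iterate_mean(data, iterations=20, lr=0.5):
    """Refine a mean estimate by gradient steps; each step scans `data` once."""
    estimate, scanned = 0.0, 0
    for _ in range(iterations):
        gradient = sum(estimate - x for x in data) / len(data)
        estimate -= lr * gradient
        scanned += len(data)
    return estimate, scanned

# Hypothetical dataset and a 5% random sample drawn with a fixed seed.
full = list(range(10_000))
sample = random.Random(0).sample(full, 500)

full_est, full_scanned = iterate_mean(full)
sample_est, sample_scanned = iterate_mean(sample)
```

The sampled run scans twenty times fewer records over the same number of iterations, at the price of a less precise estimate.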

Review Questions
• Why is the peak rate of the mapping phase not high?
→ It writes to an intermediate data file.
• Why does Hadoop not support broadcasting?
→ Because Java does not support sharing references
during the mapping task.

Review Questions
• Why does MapReduce perform poorly on iterative tasks?
→ An efficient system would merge iterations and
materialize data only when required; MapReduce
materializes data after every job.
• Why does new data cause the whole job to run again?
→ Hadoop does not function well for random
access to its datasets, but YARN promises to
support that.
