Evolving On-demand Infrastructure for Hadoop 2.0


Prime Dimensions' Hadoop 2.0 Big Data infrastructure features YARN, a distributed data operating system and development platform that extends the batch processing functionality of MapReduce by allowing multiple types of applications to be deployed directly across Hadoop clusters. YARN represents a shift in how Big Data is processed, managed and analyzed: its real benefit is that it allows Hadoop clusters to execute workloads beyond MapReduce. With YARN, Hadoop now has a generic resource-management and distributed application framework in which multiple data processing applications can run natively.
YARN provides extensibility and scalability in Hadoop by splitting the roles of the Hadoop JobTracker into two processes: (1) the ResourceManager, which controls access to the cluster's resources (memory, CPU, etc.), and (2) a per-application ApplicationMaster, which controls task execution.
In conjunction with YARN, Prime Dimensions also offers integration support for other Apache projects, such as Spark, an open source, in-memory data analytics platform that is compatible with Hadoop. Together, YARN and Spark make it possible to establish domain-specific enclaves over multi-tenant compute clusters, creating a virtualized data environment and unified analytics platform as enterprises evolve from “systems of record” to “systems of engagement.” That evolution often requires deploying in-memory, high-performance, petascale technologies, and YARN and Spark give organizations new options for obtaining these analytic capabilities in Hadoop.
As Hadoop gains widespread adoption not only as a Big Data technology but also as a data warehouse augmentation strategy, its basic functionality is evolving to meet the demands of increased performance and high scalability. YARN is not simply a new release; it represents a major advancement of Hadoop.
We see tremendous opportunity for the adoption of YARN and Spark as enterprise solutions for generating advanced analytics with reduced time-to-value. We expect significant demand from early adopters to upgrade to Hadoop 2.0. Moreover, the use cases enabled by YARN's advanced features and capabilities span industries. There are advantages to bringing together NoSQL, relational and in-memory solutions, both open source and proprietary, to establish a unified analytics environment.
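
The split of the JobTracker's roles described above can be illustrated with a toy simulation. This is only a sketch under stated assumptions: it is plain Python, not the YARN API, and the class names, method names, and container sizes are hypothetical, chosen to mirror the two roles. One object only tracks and grants cluster capacity, while a separate per-application object requests capacity and drives task execution.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, List


@dataclass
class ResourceManager:
    """Toy cluster-wide role: tracks free capacity and grants it on request."""
    free_memory_mb: int
    free_vcores: int

    def allocate(self, memory_mb: int, vcores: int) -> bool:
        """Reserve capacity for one container; this role never runs tasks."""
        if memory_mb <= self.free_memory_mb and vcores <= self.free_vcores:
            self.free_memory_mb -= memory_mb
            self.free_vcores -= vcores
            return True
        return False

    def release(self, memory_mb: int, vcores: int) -> None:
        """Return a finished container's capacity to the pool."""
        self.free_memory_mb += memory_mb
        self.free_vcores += vcores


@dataclass
class ApplicationMaster:
    """Toy per-application role: requests containers and executes tasks."""
    name: str
    tasks: List[Callable[[], Any]]
    completed: List[Any] = field(default_factory=list)

    def run(self, rm: ResourceManager, memory_mb: int = 1024, vcores: int = 1) -> List[Any]:
        for task in self.tasks:
            if not rm.allocate(memory_mb, vcores):
                raise RuntimeError(f"{self.name}: no capacity for a container")
            try:
                self.completed.append(task())  # task execution lives here
            finally:
                rm.release(memory_mb, vcores)
        return self.completed


# Two different kinds of application share one resource pool.
rm = ResourceManager(free_memory_mb=4096, free_vcores=4)
batch_app = ApplicationMaster("batch", [lambda: sum(range(10))])
text_app = ApplicationMaster("text", [lambda: "event".upper()])
batch_results = batch_app.run(rm)
text_results = text_app.run(rm)
```

Because the resource-granting object knows nothing about what a task does, applications of different kinds can share the same pool, which is the property the description attributes to YARN.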


Evolving On-Demand Infrastructure for Big Data

[Slide diagram; labels recovered from the figure: Analytic Offload; Scale-up/Scale-out; Analytic Applications; NoSQL Database; REST/JSON APIs; Advanced Analytics; YARN; Storm; Tez; Dashboards & Visualizations; Analytic Database; E-L-T; In-memory; Columnar; HCatalog; Data Warehouse Augmentation; OLAP; Data Warehouse; E-T-L; Multi-structured and Stream Source Data; Data Discovery; Structured Source Data]
