The Hortonworks Data Platform therefore addresses all of these capabilities – completely in Open Source.
YARN is the architectural center ogf HDP and Hadoop that not only enables multiple data access engines across batch, interactive and real-time to all work on a single set of data but also extends Hadoop to integrate with the existing systems and tools you already have in your data center.
HDP delivers on the key enterprise requirements of governance, security and operations.
And of course it is supported on the widest possible range of deployment options: from Linux, to Windows (the only hadoop offering on Windows), appliance (from Microsoft or Teradata) or Cloud (Microsoft, Rackspace and more).
HDP is a comprehensive data management platform with one goal in mind: to enable an enterprise architecture with Hadoop.
The Hortonworks Data Platform therefore addresses all of these capabilities – completely in Open Source.
YARN is the architectural center ogf HDP and Hadoop that not only enables multiple data access engines across batch, interactive and real-time to all work on a single set of data but also extends Hadoop to integrate with the existing systems and tools you already have in your data center.
HDP delivers on the key enterprise requirements of governance, security and operations.
And of course it is supported on the widest possible range of deployment options: from Linux, to Windows (the only hadoop offering on Windows), appliance (from Microsoft or Teradata) or Cloud (Microsoft, Rackspace and more).
HDP is a comprehensive data management platform with one goal in mind: to enable an enterprise architecture with Hadoop.
Apache Hadoop (“Hadoop”) is an open source software project of the Apache Software Foundation that allows users to store, process and manage massive amounts of structured and unstructured data quickly and without significant investment.
Founded in 2011 by engineers from the original Yahoo! Hadoop development and operations team, Hortonworks has amassed more Hadoop experience under one roof than any other organization. Hortonworks developers are active participants and leaders in Hadoop open source development -- designing, building and testing the core of the Hadoop platform.
Hortonworks performs all of its development within the processes of the Apache Software Foundation and contributes the software code it writes to the Apache Software Foundation’s Hadoop project. Our code is 100% open source with zero proprietary extensions. This pure open source model is unique to Hortonworks, and we believe it represents the future of enterprise software development. Because of the open source nature of Hadoop and the fact that, along with the Hortonworks development team, thousands of other developers also actively contribute to Hadoop, your company will benefit from the speed and quality of innovation that only open source community development provides.
What is the Hortonworks Data Platform?
The Hortonworks Data Platform (“HDP”) is a 100% open source software Hadoop data platform. It’s a certified, hardened and packaged set of Hadoop projects, all tested with real-world rigor in the world’s largest Hadoop clusters, specifically designed to simplify and accelerate your enterprise Hadoop implementation. HDP is deeply integrated with your strategic data center technologies and through this integration allows you to reuse and leverage existing skills and resources.
You will license HDP directly from the Apache Software Foundation subject to the Apache License, Version 2.0. There are no fees for HDP -- it is available to you free of charge. Hortonworks does not license to customers directly and cannot sign license agreements for HDP.
What are we buying from Hortonworks?
Hortonworks has deep and extensive experience in Hadoop operations and offers unmatched HDP enterprise training, consulting and support services. You may elect to purchase any of these services from Hortonworks.
Training: Only Hortonworks provides training designed by the leaders and committers of Hadoop. We provide valuable real-world, scenario-based training developed by the core architects and developers of Hadoop.
Consulting Services: Hortonworks has a world-class enterprise services organization with vast experience in the largest Hadoop deployments. We are uniquely qualified to help you simplify, accelerate and optimize your HDP implementation process.
Support Subscription: A Hortonworks support subscription provides you with access to a highly skilled team of Hadoop professionals with deep experience in running Hadoop in production, at scale and on the most demanding workloads. We provide you with expert technical assistance and break-fixes under enterprise-level SLAs. Our support services are designed to cover the entire lifecycle: from development and proof-of-concept to QA/test to production and deployment. Also, as the leading contributor to Hadoop, Hortonworks is uniquely positioned to ensure that patches and bug fixes are contributed back to, and included in future versions of, the open source Hadoop project. We listen to, amplify and represent our customers’ voices in the community to help ensure the innovation necessary to meet their needs and requirements.
sc=SparkContext
Finally it is collected into a single value and returned to the driving program.
_ are placeholders for the key and value present in the RDD.