Pivotal is a trusted partner for IT innovation and transformation. From the technology, to the people, to the way people interact with technology, Pivotal is transforming how the world builds software.
At Strata NYC 2015, Pivotal announced it will supercharge the Hadoop ecosystem by contributing the HAWQ advanced SQL-on-Hadoop analytics and MADlib machine learning technologies to The Apache Software Foundation.
Thanks for visiting Pivotal at Strata.
My name is Julia and today I want to tell you about the Pivotal mission.
Pivotal is a trusted partner for IT innovation and transformation.
From the technology, to the people, to the way people interact with technology, Pivotal is transforming how the world builds software.
Today we are going to talk about a major shift in the Hadoop ecosystem -
Read Agenda -
Folks, there is a new data imperative.
Enterprises are leveraging the cloud for applications, but they also leverage it for data. They are really looking at the next generation of cloud databases, and it's Hadoop.
They are also creating an enormous new slate of data-driven applications. And this is really important: great software companies are the best at taking the data from their systems and creating better user experiences.
You're engaging with your users via applications, and in order for your companies to provide a better experience for users, you have to really be able to use and make decisions around data.
Lastly, it's all about open source. If you look at innovative companies, they are leveraging open source. One, they're using it; two, they're creating it; three, they’re improving it. But what’s the advantage to open source?
The reality is if enterprises are really going to keep up with the pace of innovation that's occurring, they're going to have to work with open source. They can't wait for a vendor to add new functionality to the software they’re using.
And they’ll pick based on activity – does it have a vibrant ecosystem? Is there activity in the mailing list? Are there a lot of contributors?
All this will lead to shorter innovation cycles, better software and improved customer experience. That is the advantage of open source software.
So this week here at Strata, Pivotal announced it will supercharge the Hadoop ecosystem by contributing the HAWQ advanced SQL-on-Hadoop analytics and MADlib machine learning technologies to The Apache Software Foundation.
The contribution of the HAWQ analytics engine and MADlib data science tools is the latest example of Pivotal’s commitment to open source technology, following the open sourcing of Apache Geode just a few short months ago.
Pivotal will continue its commitment to the development of open source software models through the ASF. As every organization transforms into a software business, it is imperative they have easy access to the most powerful analytics tools and create new software-driven experiences for people and the world.
Launched in 2013, and built on over a decade’s worth of intellectual property and expertise developed through the creation of the Pivotal Greenplum data warehouse and PostgreSQL, HAWQ has helped define a key enterprise application for Hadoop: advanced SQL analytics. Through these major enhancements and, critically, HAWQ’s contribution to the ASF, Pivotal seeks to ensure Hadoop’s place as the cornerstone of advanced data science, business intelligence, and data warehousing.
In addition to HAWQ, Pivotal is contributing the MADlib machine-learning library to the ASF. Apache MADlib (incubating) is a powerful collection of scale-out, parallel machine learning algorithms seamlessly integrated with HAWQ.
MADlib was developed by Pivotal, in conjunction with researchers from the University of California, Berkeley, Stanford University, the University of Florida and Pivotal’s customers.
We’ve been talking about “Hadoop Native”, but what are we really saying? Well, in order to be Hadoop Native, we’ve identified four key components.
1. The Apache way - Developers plug into the Apache Hadoop release plan and procedures, not the other way around.
2. Hadoop Native means that you don’t have to move your data to another analytics repository. Keep it all in Hadoop.
3. Hadoop Native SQL analytics enable data scientists to tackle datasets of any size, all executed directly in the native Apache Hadoop cluster.
4. Tools that integrate and smoothly interoperate with YARN for resource management and Apache Ambari for installation and operation simplify overall management.
So how does Pivotal deliver its Big Data solution with Hadoop Native technologies? Through the Pivotal Big Data Suite.
What today is about is actually trying to move past the notion of ‘SQL on Hadoop’ to something where SQL is synonymous with Hadoop, SQL is inside Hadoop. SQL is Hadoop.
That is why Hadoop Native SQL matters.
We think these trends have really been a process of discovery for the industry, and Pivotal is excited to participate.
Pivotal Big Data Suite is the World’s First Open Sourced Big Data Portfolio!
Pivotal has open sourced, or will open source, all Pivotal Big Data Suite components, including:
Pivotal Greenplum Database - will be open sourced by the end of this year,
Pivotal HDB - announced today (this week), now part of Apache as Apache HAWQ (incubating), and
Pivotal GemFire - premium in-memory NoSQL database, now part of Apache as Apache Geode.
This powerful suite comprises all the necessary database tools to build any class of data-driven application and deliver unique and intelligent customer experiences.
But how can I be sure that my Hadoop application will run within the current robust Hadoop Native ecosystem?
Well, ODPi, the open ecosystem of big data, was formed to tackle this challenge. Pivotal is proud to be a founding member.
They are accelerating the delivery of business outcomes by driving interoperability on an enterprise-ready core platform.
ODPi is a robust community of ISVs, software vendors and end user companies across a broad spectrum of the Hadoop Native ecosystem, all working to produce commercial products based on a common OSS core.
Learn more about ODPi and think about joining! Help your Big Data solutions flourish.
Thank you for your time today. We welcome you to check out all the Pivotal presentations and encourage you to ask one of our technology experts any questions you might have.
We would love to spend time telling you more about how Hadoop Native Apache HAWQ can help you transform how you build software.