SlideShare a Scribd company logo
1 of 18
Download to read offline
1
Boston Hadoop User Group Meetup, July 7, 2015
Kamil Bajda-Pawlikowski
Matt Fuller
2
โ€ขโ€ฏ History of Teradata Center for Hadoop
โ€“โ€ฏ Formerly Hadapt Founded in July, 2010 by Borgman, Bajda-Pawlikowski, and
Abadi
โ€“โ€ฏ Pioneered SQL-on-Hadoop market
โ€“โ€ฏ Based on work done by database research group in Yale Computer Science
Department
โ€“โ€ฏ Hybrid of Hadoop scalability and DBMS performance
โ€ขโ€ฏ Today
โ€“โ€ฏ Acquired by Teradata in July, 2014, renamed Teradata Center for Hadoop
โ€“โ€ฏ 30 developers with deep Hadoop and database expertise
โ€“โ€ฏ Headquarters in Boston, MA
โ€“โ€ฏ Contributors to open source project Presto
Who are we? - Teradata Center for Hadoop!
3
โ€ขโ€ฏ What is Presto?
โ€ขโ€ฏ What is Teradata doing?
โ€ขโ€ฏ Can I see a Demo?
โ€ขโ€ฏ How can I contribute?
Talk Agenda
4
โ€ขโ€ฏ 100% open source distributed ANSI SQL engine for Big Data
โ€“โ€ฏ Modern code base
โ€“โ€ฏ Proven scalability
โ€“โ€ฏ Optimized for low latency, Interactive querying
โ€ขโ€ฏ Cross platform query capability, not only SQL on Hadoop
โ€ขโ€ฏ Distributed under the Apache license, now supported by Teradata
โ€ขโ€ฏ Used by a community of well known, well respected technology companies
What is Presto?
5
History of Presto
FALL 2012
4 developers
start Presto
development
FALL 2014
88 Releases
41 Contributors
3943 Commits
SPRING 2015
98 Releases
65 Contributors
4587 Commits
---------
Teradata joins
Presto community
& offers support
SPRING 2013
Presto rolled out
within Facebook
FALL 2013
Facebook open
sources Presto
FALL 2008
Facebook
open sources
Hive
Timeline image courtesy of Facebook
6
Presto Architecture
Data stream API
Worker
Data stream API
Worker
Coordinator
Metadata
API
Parser/
analyzer
Planner Scheduler
Worker
Client
Data location
API
Pluggable
https://www.facebook.com/notes/facebook-engineering/presto-interacting-with-petabytes-of-data-at-facebook/10151786197628920
7
Presto Extensibility โ€“ connectors
Parser/
analyzer
Planner
Worker
Data location API
Hive
Cassandra
Kafka
MySQL
โ€ฆ
Metadata API
Hive
Cassandra
Kafka
MySQL
โ€ฆ
Data stream API
Hive
Cassandra
Kafka
MySQL
โ€ฆ
Scheduler
Coordinator
https://www.facebook.com/notes/facebook-engineering/presto-interacting-with-petabytes-of-data-at-facebook/10151786197628920
8
โ€ขโ€ฏ Data stays in memory during execution and is pipelined across nodes MPP-style
โ€ขโ€ฏ Vectorized columnar processing
โ€ขโ€ฏ Presto is written in highly tuned Java
โ€“โ€ฏ Efficient in-memory data structures
โ€“โ€ฏ Very careful coding of inner loops
โ€“โ€ฏ Bytecode generation
โ€ขโ€ฏ Optimized ORC reader
Presto = Performance
9
โ€ขโ€ฏ Facebook
โ€“โ€ฏ Multiple production clusters (100s of nodes total)
-โ€ฏ Including 300PB Hadoop data warehouse
โ€“โ€ฏ 1000s of internal daily active users
โ€“โ€ฏ Millions of queries each month
โ€“โ€ฏ Multiple PBs scanned every day
โ€“โ€ฏ Trillions of rows a day
โ€ขโ€ฏ Netflix
โ€“โ€ฏ Over 200-node production cluster on EC2
โ€“โ€ฏ Over 15 PB in S3 (Parquet format)
โ€“โ€ฏ Over 300 users and 2.5K queries daily
Presto in Production
10
โ€ขโ€ฏ 100% open source contributions to Presto to
increase adoption in the enterprise
โ€ขโ€ฏ A multi-year roadmap commitment to
phased enhancements of the open source
code
โ€ขโ€ฏ The first ever commercial support offering for
Presto
What is Teradata Doing?
Teradata Certified Presto
www.teradata.com/presto
11
โ€ขโ€ฏ Hadoop Distro Agnostic
โ€ขโ€ฏ Modern Code Base
โ€“โ€ฏ Presto is well-designed open source software with proper database
architecture
โ€ขโ€ฏ Strong Like-Minded Community
โ€ขโ€ฏ Push down processing across multiple data platforms
โ€ขโ€ฏ Leverage Teradata expertise to make SQL for Hadoop viable
Why is Teradata Contributing to Presto?
12
Demo Time!
13
Implement Integrate Proliferate
โ€ขโ€ฏ Installer
โ€ขโ€ฏ Documentation
โ€ขโ€ฏ Monitoring & Support
Tools
โ€ขโ€ฏ Management Tool
Integration
โ€ขโ€ฏ YARN Integration
โ€ขโ€ฏ ODBC / JDBC Drivers
โ€ขโ€ฏ BI Certification
โ€ขโ€ฏ Security
โ€ขโ€ฏ Connectors
Commercial Support
Phase 1 Phase 2 Phase 3
June 8, 2015 Q4 2015 2016
Expanding ANSI SQL Coverage
Teradata Contributions to Presto
14
โ€ขโ€ฏ Ease of install and management via Presto-Admin tool
โ€“โ€ฏ www.github.com/prestodb/presto-admin
โ€“โ€ฏ Packaging Presto as an RPM
โ€ขโ€ฏ Testing Framework for Presto
โ€“โ€ฏ www.github.com/prestodb/tempto
โ€“โ€ฏ Added large number of tests
โ€ขโ€ฏ Improvements to JDBC driver
โ€“โ€ฏ To be open sourced on www.github.com/prestodb soon!
โ€ขโ€ฏ Various SQL improvements
Teradataโ€™s Contributions
15
โ€ขโ€ฏ YARN Integration
โ€ขโ€ฏ Ambari Integration
โ€ขโ€ฏ ODBC & JDBC Drivers that actually work
โ€ขโ€ฏ Security โ€“ Authentication & Authorization
โ€ขโ€ฏ Continued SQL Improvements
โ€ขโ€ฏ BI tool certifications โ€“ e.g. Tableau
โ€ขโ€ฏ More Connectors โ€“ e.g. Hbase
โ€ขโ€ฏ Open Source our Docker based Dev Env
โ€ขโ€ฏ Open our Continuous Integration platform to the community
Teradataโ€™s Contribution Product Roadmap
16
www.github.com/facebook/presto
www.github.com/prestodb
Certified Distro: www.teradata.com/presto
Website: www.prestodb.io
Presto Userโ€™s Group: www.groups.google.com/group/presto-users
Facebook Page: www.facebook.com/prestodb
Twitter: #prestodb
How can I contribute?
17
Available for Download
โ€“โ€ฏ Presto 101t Server, CLI, JDBC
โ€“โ€ฏ Presto-Admin 0.1
โ€“โ€ฏ Documentation
โ€“โ€ฏ HDP w/ Presto VM Sandbox
โ€“โ€ฏ CDH w/ Presto VM Sandbox
www.teradata.com/presto
Presto 101t certified by Teradata
18

More Related Content

What's hot

Presto Strata Hadoop SJ 2016 short talk
Presto Strata Hadoop SJ 2016 short talkPresto Strata Hadoop SJ 2016 short talk
Presto Strata Hadoop SJ 2016 short talkkbajda
ย 
Presto
PrestoPresto
PrestoChen Chun
ย 
Presto at Facebook - Presto Meetup @ Boston (10/6/2015)
Presto at Facebook - Presto Meetup @ Boston (10/6/2015)Presto at Facebook - Presto Meetup @ Boston (10/6/2015)
Presto at Facebook - Presto Meetup @ Boston (10/6/2015)Martin Traverso
ย 
Introduction to Presto at Treasure Data
Introduction to Presto at Treasure DataIntroduction to Presto at Treasure Data
Introduction to Presto at Treasure DataTaro L. Saito
ย 
Understanding Presto - Presto meetup @ Tokyo #1
Understanding Presto - Presto meetup @ Tokyo #1Understanding Presto - Presto meetup @ Tokyo #1
Understanding Presto - Presto meetup @ Tokyo #1Sadayuki Furuhashi
ย 
Presto at Twitter
Presto at TwitterPresto at Twitter
Presto at TwitterBill Graham
ย 
Presto in my_use_case
Presto in my_use_casePresto in my_use_case
Presto in my_use_casewyukawa
ย 
Presto meetup 2015-03-19 @Facebook
Presto meetup 2015-03-19 @FacebookPresto meetup 2015-03-19 @Facebook
Presto meetup 2015-03-19 @FacebookTreasure Data, Inc.
ย 
Facebook Presto presentation
Facebook Presto presentationFacebook Presto presentation
Facebook Presto presentationCyanny LIANG
ย 
Prestogres, ODBC & JDBC connectivity for Presto
Prestogres, ODBC & JDBC connectivity for PrestoPrestogres, ODBC & JDBC connectivity for Presto
Prestogres, ODBC & JDBC connectivity for PrestoSadayuki Furuhashi
ย 
Presto+MySQLใงๅˆ†ๆ•ฃSQL
Presto+MySQLใงๅˆ†ๆ•ฃSQLPresto+MySQLใงๅˆ†ๆ•ฃSQL
Presto+MySQLใงๅˆ†ๆ•ฃSQLSadayuki Furuhashi
ย 
Bullet: A Real Time Data Query Engine
Bullet: A Real Time Data Query EngineBullet: A Real Time Data Query Engine
Bullet: A Real Time Data Query EngineDataWorks Summit
ย 
Continuous Processing in Structured Streaming with Jose Torres
 Continuous Processing in Structured Streaming with Jose Torres Continuous Processing in Structured Streaming with Jose Torres
Continuous Processing in Structured Streaming with Jose TorresDatabricks
ย 
03 2014 Apache Spark Serving: Unifying Batch, Streaming, and RESTful Serving
03 2014 Apache Spark Serving: Unifying Batch, Streaming, and RESTful Serving03 2014 Apache Spark Serving: Unifying Batch, Streaming, and RESTful Serving
03 2014 Apache Spark Serving: Unifying Batch, Streaming, and RESTful ServingDatabricks
ย 
Presto as a Service - Tips for operation and monitoring
Presto as a Service - Tips for operation and monitoringPresto as a Service - Tips for operation and monitoring
Presto as a Service - Tips for operation and monitoringTaro L. Saito
ย 
How to ensure Presto scalability โ€จin multi use case
How to ensure Presto scalability โ€จin multi use case How to ensure Presto scalability โ€จin multi use case
How to ensure Presto scalability โ€จin multi use case Kai Sasaki
ย 
HBaseConEast2016: Splice machine open source rdbms
HBaseConEast2016: Splice machine open source rdbmsHBaseConEast2016: Splice machine open source rdbms
HBaseConEast2016: Splice machine open source rdbmsMichael Stack
ย 
Stream All Thingsโ€”Patterns of Modern Data Integration with Gwen Shapira
Stream All Thingsโ€”Patterns of Modern Data Integration with Gwen ShapiraStream All Thingsโ€”Patterns of Modern Data Integration with Gwen Shapira
Stream All Thingsโ€”Patterns of Modern Data Integration with Gwen ShapiraDatabricks
ย 
Migrating from Redshift to Spark at Stitch Fix: Spark Summit East talk by Sky...
Migrating from Redshift to Spark at Stitch Fix: Spark Summit East talk by Sky...Migrating from Redshift to Spark at Stitch Fix: Spark Summit East talk by Sky...
Migrating from Redshift to Spark at Stitch Fix: Spark Summit East talk by Sky...Spark Summit
ย 
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...Databricks
ย 

What's hot (20)

Presto Strata Hadoop SJ 2016 short talk
Presto Strata Hadoop SJ 2016 short talkPresto Strata Hadoop SJ 2016 short talk
Presto Strata Hadoop SJ 2016 short talk
ย 
Presto
PrestoPresto
Presto
ย 
Presto at Facebook - Presto Meetup @ Boston (10/6/2015)
Presto at Facebook - Presto Meetup @ Boston (10/6/2015)Presto at Facebook - Presto Meetup @ Boston (10/6/2015)
Presto at Facebook - Presto Meetup @ Boston (10/6/2015)
ย 
Introduction to Presto at Treasure Data
Introduction to Presto at Treasure DataIntroduction to Presto at Treasure Data
Introduction to Presto at Treasure Data
ย 
Understanding Presto - Presto meetup @ Tokyo #1
Understanding Presto - Presto meetup @ Tokyo #1Understanding Presto - Presto meetup @ Tokyo #1
Understanding Presto - Presto meetup @ Tokyo #1
ย 
Presto at Twitter
Presto at TwitterPresto at Twitter
Presto at Twitter
ย 
Presto in my_use_case
Presto in my_use_casePresto in my_use_case
Presto in my_use_case
ย 
Presto meetup 2015-03-19 @Facebook
Presto meetup 2015-03-19 @FacebookPresto meetup 2015-03-19 @Facebook
Presto meetup 2015-03-19 @Facebook
ย 
Facebook Presto presentation
Facebook Presto presentationFacebook Presto presentation
Facebook Presto presentation
ย 
Prestogres, ODBC & JDBC connectivity for Presto
Prestogres, ODBC & JDBC connectivity for PrestoPrestogres, ODBC & JDBC connectivity for Presto
Prestogres, ODBC & JDBC connectivity for Presto
ย 
Presto+MySQLใงๅˆ†ๆ•ฃSQL
Presto+MySQLใงๅˆ†ๆ•ฃSQLPresto+MySQLใงๅˆ†ๆ•ฃSQL
Presto+MySQLใงๅˆ†ๆ•ฃSQL
ย 
Bullet: A Real Time Data Query Engine
Bullet: A Real Time Data Query EngineBullet: A Real Time Data Query Engine
Bullet: A Real Time Data Query Engine
ย 
Continuous Processing in Structured Streaming with Jose Torres
 Continuous Processing in Structured Streaming with Jose Torres Continuous Processing in Structured Streaming with Jose Torres
Continuous Processing in Structured Streaming with Jose Torres
ย 
03 2014 Apache Spark Serving: Unifying Batch, Streaming, and RESTful Serving
03 2014 Apache Spark Serving: Unifying Batch, Streaming, and RESTful Serving03 2014 Apache Spark Serving: Unifying Batch, Streaming, and RESTful Serving
03 2014 Apache Spark Serving: Unifying Batch, Streaming, and RESTful Serving
ย 
Presto as a Service - Tips for operation and monitoring
Presto as a Service - Tips for operation and monitoringPresto as a Service - Tips for operation and monitoring
Presto as a Service - Tips for operation and monitoring
ย 
How to ensure Presto scalability โ€จin multi use case
How to ensure Presto scalability โ€จin multi use case How to ensure Presto scalability โ€จin multi use case
How to ensure Presto scalability โ€จin multi use case
ย 
HBaseConEast2016: Splice machine open source rdbms
HBaseConEast2016: Splice machine open source rdbmsHBaseConEast2016: Splice machine open source rdbms
HBaseConEast2016: Splice machine open source rdbms
ย 
Stream All Thingsโ€”Patterns of Modern Data Integration with Gwen Shapira
Stream All Thingsโ€”Patterns of Modern Data Integration with Gwen ShapiraStream All Thingsโ€”Patterns of Modern Data Integration with Gwen Shapira
Stream All Thingsโ€”Patterns of Modern Data Integration with Gwen Shapira
ย 
Migrating from Redshift to Spark at Stitch Fix: Spark Summit East talk by Sky...
Migrating from Redshift to Spark at Stitch Fix: Spark Summit East talk by Sky...Migrating from Redshift to Spark at Stitch Fix: Spark Summit East talk by Sky...
Migrating from Redshift to Spark at Stitch Fix: Spark Summit East talk by Sky...
ย 
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...
Building the Petcare Data Platform using Delta Lake and 'Kyte': Our Spark ETL...
ย 

Viewers also liked

What is the best Healthcare Data Warehouse Model for Your Organization?
What is the best Healthcare Data Warehouse Model for Your Organization?What is the best Healthcare Data Warehouse Model for Your Organization?
What is the best Healthcare Data Warehouse Model for Your Organization?Health Catalyst
ย 
A New GIS-driven Approach to Optimize Service Area Boundaries for ACOs
A New GIS-driven Approach to Optimize Service Area Boundaries for ACOsA New GIS-driven Approach to Optimize Service Area Boundaries for ACOs
A New GIS-driven Approach to Optimize Service Area Boundaries for ACOsHealth Catalyst
ย 
From Installed to Stalled: Why Sustaining Outcomes Improvement Requires More ...
From Installed to Stalled: Why Sustaining Outcomes Improvement Requires More ...From Installed to Stalled: Why Sustaining Outcomes Improvement Requires More ...
From Installed to Stalled: Why Sustaining Outcomes Improvement Requires More ...Health Catalyst
ย 
Is Value-Based Healthcare Here to Stay? Looking for Answers in New Policies
Is Value-Based Healthcare Here to Stay? Looking for Answers in New PoliciesIs Value-Based Healthcare Here to Stay? Looking for Answers in New Policies
Is Value-Based Healthcare Here to Stay? Looking for Answers in New PoliciesHealth Catalyst
ย 
INIA- CISA: Anรกlisis de las amenazas en la fauna silvestre
INIA- CISA: Anรกlisis de las amenazas en la fauna silvestreINIA- CISA: Anรกlisis de las amenazas en la fauna silvestre
INIA- CISA: Anรกlisis de las amenazas en la fauna silvestreEsri
ย 
Getting The Most Out of Your Data Analyst - HAS Session 9
Getting The Most Out of Your Data Analyst - HAS Session 9Getting The Most Out of Your Data Analyst - HAS Session 9
Getting The Most Out of Your Data Analyst - HAS Session 9Health Catalyst
ย 
Improving Patient Safety and Quality Through Culture, Clinical Analytics, Evi...
Improving Patient Safety and Quality Through Culture, Clinical Analytics, Evi...Improving Patient Safety and Quality Through Culture, Clinical Analytics, Evi...
Improving Patient Safety and Quality Through Culture, Clinical Analytics, Evi...Health Catalyst
ย 
Five Strategies for Easing the Burden of Clinical Quality Measures
Five Strategies for Easing the Burden of Clinical Quality MeasuresFive Strategies for Easing the Burden of Clinical Quality Measures
Five Strategies for Easing the Burden of Clinical Quality MeasuresHealth Catalyst
ย 
7 Features of Highly Effective Outcomes Improvement Projects
7 Features of Highly Effective Outcomes Improvement Projects7 Features of Highly Effective Outcomes Improvement Projects
7 Features of Highly Effective Outcomes Improvement ProjectsHealth Catalyst
ย 
The Who, What, and How of Health Outcome Measures
The Who, What, and How of Health Outcome MeasuresThe Who, What, and How of Health Outcome Measures
The Who, What, and How of Health Outcome MeasuresHealth Catalyst
ย 
The Top Five Recommendations for Improving the Patient Experience
The Top Five Recommendations for Improving the Patient ExperienceThe Top Five Recommendations for Improving the Patient Experience
The Top Five Recommendations for Improving the Patient ExperienceHealth Catalyst
ย 
Why Most Analytic Applications Will Never Be Able to Significantly Improve He...
Why Most Analytic Applications Will Never Be Able to Significantly Improve He...Why Most Analytic Applications Will Never Be Able to Significantly Improve He...
Why Most Analytic Applications Will Never Be Able to Significantly Improve He...Health Catalyst
ย 
Healthcare Interoperability: New Tactics and Technology
Healthcare Interoperability: New Tactics and TechnologyHealthcare Interoperability: New Tactics and Technology
Healthcare Interoperability: New Tactics and TechnologyHealth Catalyst
ย 
Why You Need to Understand Value-Based Reimbursement and How to Survive It
Why You Need to Understand Value-Based Reimbursement and How to Survive ItWhy You Need to Understand Value-Based Reimbursement and How to Survive It
Why You Need to Understand Value-Based Reimbursement and How to Survive ItHealth Catalyst
ย 
The Top 7 Outcomes Measures and 3 Measurement Essentials
The Top 7 Outcomes Measures and 3 Measurement EssentialsThe Top 7 Outcomes Measures and 3 Measurement Essentials
The Top 7 Outcomes Measures and 3 Measurement EssentialsHealth Catalyst
ย 
Outcomes improvement: what you get when you mix good data with physician enga...
Outcomes improvement: what you get when you mix good data with physician enga...Outcomes improvement: what you get when you mix good data with physician enga...
Outcomes improvement: what you get when you mix good data with physician enga...Health Catalyst
ย 
Patient Flight Path Analytics: From Airline Operations to Healthcare Outcomes
Patient Flight Path Analytics: From Airline Operations to Healthcare OutcomesPatient Flight Path Analytics: From Airline Operations to Healthcare Outcomes
Patient Flight Path Analytics: From Airline Operations to Healthcare OutcomesHealth Catalyst
ย 
Why We Need to Shift Healthcare Quality Measures from Volume to Value
Why We Need to Shift Healthcare Quality Measures from Volume to ValueWhy We Need to Shift Healthcare Quality Measures from Volume to Value
Why We Need to Shift Healthcare Quality Measures from Volume to ValueHealth Catalyst
ย 
Demystifying Healthcare Data Governance
Demystifying Healthcare Data GovernanceDemystifying Healthcare Data Governance
Demystifying Healthcare Data GovernanceHealth Catalyst
ย 
Understanding Risk Stratification, Comorbidities, and the Future of Healthcare
Understanding Risk Stratification, Comorbidities, and the Future of HealthcareUnderstanding Risk Stratification, Comorbidities, and the Future of Healthcare
Understanding Risk Stratification, Comorbidities, and the Future of HealthcareHealth Catalyst
ย 

Viewers also liked (20)

What is the best Healthcare Data Warehouse Model for Your Organization?
What is the best Healthcare Data Warehouse Model for Your Organization?What is the best Healthcare Data Warehouse Model for Your Organization?
What is the best Healthcare Data Warehouse Model for Your Organization?
ย 
A New GIS-driven Approach to Optimize Service Area Boundaries for ACOs
A New GIS-driven Approach to Optimize Service Area Boundaries for ACOsA New GIS-driven Approach to Optimize Service Area Boundaries for ACOs
A New GIS-driven Approach to Optimize Service Area Boundaries for ACOs
ย 
From Installed to Stalled: Why Sustaining Outcomes Improvement Requires More ...
From Installed to Stalled: Why Sustaining Outcomes Improvement Requires More ...From Installed to Stalled: Why Sustaining Outcomes Improvement Requires More ...
From Installed to Stalled: Why Sustaining Outcomes Improvement Requires More ...
ย 
Is Value-Based Healthcare Here to Stay? Looking for Answers in New Policies
Is Value-Based Healthcare Here to Stay? Looking for Answers in New PoliciesIs Value-Based Healthcare Here to Stay? Looking for Answers in New Policies
Is Value-Based Healthcare Here to Stay? Looking for Answers in New Policies
ย 
INIA- CISA: Anรกlisis de las amenazas en la fauna silvestre
INIA- CISA: Anรกlisis de las amenazas en la fauna silvestreINIA- CISA: Anรกlisis de las amenazas en la fauna silvestre
INIA- CISA: Anรกlisis de las amenazas en la fauna silvestre
ย 
Getting The Most Out of Your Data Analyst - HAS Session 9
Getting The Most Out of Your Data Analyst - HAS Session 9Getting The Most Out of Your Data Analyst - HAS Session 9
Getting The Most Out of Your Data Analyst - HAS Session 9
ย 
Improving Patient Safety and Quality Through Culture, Clinical Analytics, Evi...
Improving Patient Safety and Quality Through Culture, Clinical Analytics, Evi...Improving Patient Safety and Quality Through Culture, Clinical Analytics, Evi...
Improving Patient Safety and Quality Through Culture, Clinical Analytics, Evi...
ย 
Five Strategies for Easing the Burden of Clinical Quality Measures
Five Strategies for Easing the Burden of Clinical Quality MeasuresFive Strategies for Easing the Burden of Clinical Quality Measures
Five Strategies for Easing the Burden of Clinical Quality Measures
ย 
7 Features of Highly Effective Outcomes Improvement Projects
7 Features of Highly Effective Outcomes Improvement Projects7 Features of Highly Effective Outcomes Improvement Projects
7 Features of Highly Effective Outcomes Improvement Projects
ย 
The Who, What, and How of Health Outcome Measures
The Who, What, and How of Health Outcome MeasuresThe Who, What, and How of Health Outcome Measures
The Who, What, and How of Health Outcome Measures
ย 
The Top Five Recommendations for Improving the Patient Experience
The Top Five Recommendations for Improving the Patient ExperienceThe Top Five Recommendations for Improving the Patient Experience
The Top Five Recommendations for Improving the Patient Experience
ย 
Why Most Analytic Applications Will Never Be Able to Significantly Improve He...
Why Most Analytic Applications Will Never Be Able to Significantly Improve He...Why Most Analytic Applications Will Never Be Able to Significantly Improve He...
Why Most Analytic Applications Will Never Be Able to Significantly Improve He...
ย 
Healthcare Interoperability: New Tactics and Technology
Healthcare Interoperability: New Tactics and TechnologyHealthcare Interoperability: New Tactics and Technology
Healthcare Interoperability: New Tactics and Technology
ย 
Why You Need to Understand Value-Based Reimbursement and How to Survive It
Why You Need to Understand Value-Based Reimbursement and How to Survive ItWhy You Need to Understand Value-Based Reimbursement and How to Survive It
Why You Need to Understand Value-Based Reimbursement and How to Survive It
ย 
The Top 7 Outcomes Measures and 3 Measurement Essentials
The Top 7 Outcomes Measures and 3 Measurement EssentialsThe Top 7 Outcomes Measures and 3 Measurement Essentials
The Top 7 Outcomes Measures and 3 Measurement Essentials
ย 
Outcomes improvement: what you get when you mix good data with physician enga...
Outcomes improvement: what you get when you mix good data with physician enga...Outcomes improvement: what you get when you mix good data with physician enga...
Outcomes improvement: what you get when you mix good data with physician enga...
ย 
Patient Flight Path Analytics: From Airline Operations to Healthcare Outcomes
Patient Flight Path Analytics: From Airline Operations to Healthcare OutcomesPatient Flight Path Analytics: From Airline Operations to Healthcare Outcomes
Patient Flight Path Analytics: From Airline Operations to Healthcare Outcomes
ย 
Why We Need to Shift Healthcare Quality Measures from Volume to Value
Why We Need to Shift Healthcare Quality Measures from Volume to ValueWhy We Need to Shift Healthcare Quality Measures from Volume to Value
Why We Need to Shift Healthcare Quality Measures from Volume to Value
ย 
Demystifying Healthcare Data Governance
Demystifying Healthcare Data GovernanceDemystifying Healthcare Data Governance
Demystifying Healthcare Data Governance
ย 
Understanding Risk Stratification, Comorbidities, and the Future of Healthcare
Understanding Risk Stratification, Comorbidities, and the Future of HealthcareUnderstanding Risk Stratification, Comorbidities, and the Future of Healthcare
Understanding Risk Stratification, Comorbidities, and the Future of Healthcare
ย 

Similar to Boston Hadoop Meetup: Presto for the Enterprise

Open Source SQL for Hadoop: Where are we and Where are we Going?
Open Source SQL for Hadoop: Where are we and Where are we Going?Open Source SQL for Hadoop: Where are we and Where are we Going?
Open Source SQL for Hadoop: Where are we and Where are we Going?DataWorks Summit
ย 
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...viirya
ย 
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...ssuserd3a367
ย 
Big SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeBig SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeNicolas Morales
ย 
InfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experienceInfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experienceWilfried Hoge
ย 
Twitter with hadoop for oow
Twitter with hadoop for oowTwitter with hadoop for oow
Twitter with hadoop for oowGwen (Chen) Shapira
ย 
Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...
Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...
Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...Data Con LA
ย 
Webinar: What's new in CDAP 3.5?
Webinar: What's new in CDAP 3.5?Webinar: What's new in CDAP 3.5?
Webinar: What's new in CDAP 3.5?Cask Data
ย 
Using Familiar BI Tools and Hadoop to Analyze Enterprise Networks
Using Familiar BI Tools and Hadoop to Analyze Enterprise NetworksUsing Familiar BI Tools and Hadoop to Analyze Enterprise Networks
Using Familiar BI Tools and Hadoop to Analyze Enterprise NetworksDataWorks Summit
ย 
ECS19 - Vesa Juvonen - SharePoint and Office 365 Development PowerClass
ECS19 - Vesa Juvonen - SharePoint and Office 365 Development PowerClassECS19 - Vesa Juvonen - SharePoint and Office 365 Development PowerClass
ECS19 - Vesa Juvonen - SharePoint and Office 365 Development PowerClassEuropean Collaboration Summit
ย 
Interoperability Ms Sap
Interoperability Ms SapInteroperability Ms Sap
Interoperability Ms Sapricharoy
ย 
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Dataconomy Media
ย 
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Mats Uddenfeldt
ย 
Using Familiar BI Tools and Hadoop to Analyze Enterprise Networks
Using Familiar BI Tools and Hadoop to Analyze Enterprise NetworksUsing Familiar BI Tools and Hadoop to Analyze Enterprise Networks
Using Familiar BI Tools and Hadoop to Analyze Enterprise NetworksMapR Technologies
ย 
LeedsSharp May 2023 - Azure Integration Services
LeedsSharp May 2023 - Azure Integration ServicesLeedsSharp May 2023 - Azure Integration Services
LeedsSharp May 2023 - Azure Integration ServicesMichael Stephenson
ย 
Powering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin KimPowering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin KimSpark Summit
ย 
Big Data Introduction - Solix empower
Big Data Introduction - Solix empowerBig Data Introduction - Solix empower
Big Data Introduction - Solix empowerDurga Gadiraju
ย 
E2E Data Pipeline - Apache Spark/Airflow/Livy
E2E Data Pipeline - Apache Spark/Airflow/LivyE2E Data Pipeline - Apache Spark/Airflow/Livy
E2E Data Pipeline - Apache Spark/Airflow/LivyRikin Tanna
ย 
London Oracle Developer Meetup April 18
London Oracle Developer Meetup April 18London Oracle Developer Meetup April 18
London Oracle Developer Meetup April 18Phil Wilkins
ย 

Similar to Boston Hadoop Meetup: Presto for the Enterprise (20)

Open Source SQL for Hadoop: Where are we and Where are we Going?
Open Source SQL for Hadoop: Where are we and Where are we Going?Open Source SQL for Hadoop: Where are we and Where are we Going?
Open Source SQL for Hadoop: Where are we and Where are we Going?
ย 
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
ย 
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
ย 
Big SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor LandscapeBig SQL Competitive Summary - Vendor Landscape
Big SQL Competitive Summary - Vendor Landscape
ย 
InfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experienceInfoSphere BigInsights - Analytics power for Hadoop - field experience
InfoSphere BigInsights - Analytics power for Hadoop - field experience
ย 
Twitter with hadoop for oow
Twitter with hadoop for oowTwitter with hadoop for oow
Twitter with hadoop for oow
ย 
Apache drill
Apache drillApache drill
Apache drill
ย 
Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...
Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...
Stream your Operational Data with Apache Spark & Kafka into Hadoop using Couc...
ย 
Webinar: What's new in CDAP 3.5?
Webinar: What's new in CDAP 3.5?Webinar: What's new in CDAP 3.5?
Webinar: What's new in CDAP 3.5?
ย 
Using Familiar BI Tools and Hadoop to Analyze Enterprise Networks
Using Familiar BI Tools and Hadoop to Analyze Enterprise NetworksUsing Familiar BI Tools and Hadoop to Analyze Enterprise Networks
Using Familiar BI Tools and Hadoop to Analyze Enterprise Networks
ย 
ECS19 - Vesa Juvonen - SharePoint and Office 365 Development PowerClass
ECS19 - Vesa Juvonen - SharePoint and Office 365 Development PowerClassECS19 - Vesa Juvonen - SharePoint and Office 365 Development PowerClass
ECS19 - Vesa Juvonen - SharePoint and Office 365 Development PowerClass
ย 
Interoperability Ms Sap
Interoperability Ms SapInteroperability Ms Sap
Interoperability Ms Sap
ย 
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
ย 
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
ย 
Using Familiar BI Tools and Hadoop to Analyze Enterprise Networks
Using Familiar BI Tools and Hadoop to Analyze Enterprise NetworksUsing Familiar BI Tools and Hadoop to Analyze Enterprise Networks
Using Familiar BI Tools and Hadoop to Analyze Enterprise Networks
ย 
LeedsSharp May 2023 - Azure Integration Services
LeedsSharp May 2023 - Azure Integration ServicesLeedsSharp May 2023 - Azure Integration Services
LeedsSharp May 2023 - Azure Integration Services
ย 
Powering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin KimPowering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin Kim
ย 
Big Data Introduction - Solix empower
Big Data Introduction - Solix empowerBig Data Introduction - Solix empower
Big Data Introduction - Solix empower
ย 
E2E Data Pipeline - Apache Spark/Airflow/Livy
E2E Data Pipeline - Apache Spark/Airflow/LivyE2E Data Pipeline - Apache Spark/Airflow/Livy
E2E Data Pipeline - Apache Spark/Airflow/Livy
ย 
London Oracle Developer Meetup April 18
London Oracle Developer Meetup April 18London Oracle Developer Meetup April 18
London Oracle Developer Meetup April 18
ย 

Recently uploaded

The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
ย 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
ย 
CHEAP Call Girls in Pushp Vihar (-DELHI )๐Ÿ” 9953056974๐Ÿ”(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )๐Ÿ” 9953056974๐Ÿ”(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )๐Ÿ” 9953056974๐Ÿ”(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )๐Ÿ” 9953056974๐Ÿ”(=)/CALL GIRLS SERVICE9953056974 Low Rate Call Girls In Saket, Delhi NCR
ย 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfWilly Marroquin (WillyDevNET)
ย 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
ย 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
ย 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
ย 
Shapes for Sharing between Graph Data Spacesย - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spacesย - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spacesย - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spacesย - and Epistemic Querying of RDF-...Steffen Staab
ย 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
ย 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlanโ€™s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlanโ€™s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlanโ€™s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlanโ€™s ...OnePlan Solutions
ย 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
ย 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
ย 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
ย 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
ย 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
ย 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
ย 
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Kakori Lucknow best sexual service Online โ˜‚๏ธ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Kakori Lucknow best sexual service Online  โ˜‚๏ธCALL ON โžฅ8923113531 ๐Ÿ”Call Girls Kakori Lucknow best sexual service Online  โ˜‚๏ธ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Kakori Lucknow best sexual service Online โ˜‚๏ธanilsa9823
ย 
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Badshah Nagar Lucknow best Female service
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Badshah Nagar Lucknow best Female serviceCALL ON โžฅ8923113531 ๐Ÿ”Call Girls Badshah Nagar Lucknow best Female service
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Badshah Nagar Lucknow best Female serviceanilsa9823
ย 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
ย 

Recently uploaded (20)

The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
ย 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
ย 
CHEAP Call Girls in Pushp Vihar (-DELHI )๐Ÿ” 9953056974๐Ÿ”(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )๐Ÿ” 9953056974๐Ÿ”(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )๐Ÿ” 9953056974๐Ÿ”(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )๐Ÿ” 9953056974๐Ÿ”(=)/CALL GIRLS SERVICE
ย 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
ย 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
ย 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
ย 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
ย 
Shapes for Sharing between Graph Data Spacesย - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spacesย - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spacesย - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spacesย - and Epistemic Querying of RDF-...
ย 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
ย 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlanโ€™s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlanโ€™s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlanโ€™s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlanโ€™s ...
ย 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
ย 
Vip Call Girls Noida โžก๏ธ Delhi โžก๏ธ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida โžก๏ธ Delhi โžก๏ธ 9999965857 No Advance 24HRS LiveVip Call Girls Noida โžก๏ธ Delhi โžก๏ธ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida โžก๏ธ Delhi โžก๏ธ 9999965857 No Advance 24HRS Live
ย 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
ย 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
ย 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
ย 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
ย 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
ย 
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Kakori Lucknow best sexual service Online โ˜‚๏ธ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Kakori Lucknow best sexual service Online  โ˜‚๏ธCALL ON โžฅ8923113531 ๐Ÿ”Call Girls Kakori Lucknow best sexual service Online  โ˜‚๏ธ
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Kakori Lucknow best sexual service Online โ˜‚๏ธ
ย 
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Badshah Nagar Lucknow best Female service
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Badshah Nagar Lucknow best Female serviceCALL ON โžฅ8923113531 ๐Ÿ”Call Girls Badshah Nagar Lucknow best Female service
CALL ON โžฅ8923113531 ๐Ÿ”Call Girls Badshah Nagar Lucknow best Female service
ย 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
ย 

Boston Hadoop Meetup: Presto for the Enterprise

  • 1. 1 Boston Hadoop User Group Meetup, July 7, 2015 Kamil Bajda-Pawlikowski Matt Fuller
  • 2. 2 โ€ขโ€ฏ History of Teradata Center for Hadoop โ€“โ€ฏ Formerly Hadapt Founded in July, 2010 by Borgman, Bajda-Pawlikowski, and Abadi โ€“โ€ฏ Pioneered SQL-on-Hadoop market โ€“โ€ฏ Based on work done by database research group in Yale Computer Science Department โ€“โ€ฏ Hybrid of Hadoop scalability and DBMS performance โ€ขโ€ฏ Today โ€“โ€ฏ Acquired by Teradata in July, 2014, renamed Teradata Center for Hadoop โ€“โ€ฏ 30 developers with deep Hadoop and database expertise โ€“โ€ฏ Headquarters in Boston, MA โ€“โ€ฏ Contributors to open source project Presto Who are we? - Teradata Center for Hadoop!
  • 3. 3 โ€ขโ€ฏ What is Presto? โ€ขโ€ฏ What is Teradata doing? โ€ขโ€ฏ Can I see a Demo? โ€ขโ€ฏ How can I contribute? Talk Agenda
  • 4. 4 โ€ขโ€ฏ 100% open source distributed ANSI SQL engine for Big Data โ€“โ€ฏ Modern code base โ€“โ€ฏ Proven scalability โ€“โ€ฏ Optimized for low latency, Interactive querying โ€ขโ€ฏ Cross platform query capability, not only SQL on Hadoop โ€ขโ€ฏ Distributed under the Apache license, now supported by Teradata โ€ขโ€ฏ Used by a community of well known, well respected technology companies What is Presto?
  • 5. 5 History of Presto FALL 2012 4 developers start Presto development FALL 2014 88 Releases 41 Contributors 3943 Commits SPRING 2015 98 Releases 65 Contributors 4587 Commits --------- Teradata joins Presto community & offers support SPRING 2013 Presto rolled out within Facebook FALL 2013 Facebook open sources Presto FALL 2008 Facebook open sources Hive Timeline image courtesy of Facebook
  • 6. 6 Presto Architecture Data stream API Worker Data stream API Worker Coordinator Metadata API Parser/ analyzer Planner Scheduler Worker Client Data location API Pluggable https://www.facebook.com/notes/facebook-engineering/presto-interacting-with-petabytes-of-data-at-facebook/10151786197628920
  • 7. 7 Presto Extensibility โ€“ connectors Parser/ analyzer Planner Worker Data location API Hive Cassandra Kafka MySQL โ€ฆ Metadata API Hive Cassandra Kafka MySQL โ€ฆ Data stream API Hive Cassandra Kafka MySQL โ€ฆ Scheduler Coordinator https://www.facebook.com/notes/facebook-engineering/presto-interacting-with-petabytes-of-data-at-facebook/10151786197628920
  • 8. 8 โ€ขโ€ฏ Data stays in memory during execution and is pipelined across nodes MPP-style โ€ขโ€ฏ Vectorized columnar processing โ€ขโ€ฏ Presto is written in highly tuned Java โ€“โ€ฏ Efficient in-memory data structures โ€“โ€ฏ Very careful coding of inner loops โ€“โ€ฏ Bytecode generation โ€ขโ€ฏ Optimized ORC reader Presto = Performance
  • 9. 9 โ€ขโ€ฏ Facebook โ€“โ€ฏ Multiple production clusters (100s of nodes total) -โ€ฏ Including 300PB Hadoop data warehouse โ€“โ€ฏ 1000s of internal daily active users โ€“โ€ฏ Millions of queries each month โ€“โ€ฏ Multiple PBs scanned every day โ€“โ€ฏ Trillions of rows a day โ€ขโ€ฏ Netflix โ€“โ€ฏ Over 200-node production cluster on EC2 โ€“โ€ฏ Over 15 PB in S3 (Parquet format) โ€“โ€ฏ Over 300 users and 2.5K queries daily Presto in Production
  • 10. 10 โ€ขโ€ฏ 100% open source contributions to Presto to increase adoption in the enterprise โ€ขโ€ฏ A multi-year roadmap commitment to phased enhancements of the open source code โ€ขโ€ฏ The first ever commercial support offering for Presto What is Teradata Doing? Teradata Certified Presto www.teradata.com/presto
  • 11. 11 โ€ขโ€ฏ Hadoop Distro Agnostic โ€ขโ€ฏ Modern Code Base โ€“โ€ฏ Presto is well-designed open source software with proper database architecture โ€ขโ€ฏ Strong Like-Minded Community โ€ขโ€ฏ Push down processing across multiple data platforms โ€ขโ€ฏ Leverage Teradata expertise to make SQL for Hadoop viable Why is Teradata Contributing to Presto?
  • 13. 13 Implement Integrate Proliferate โ€ขโ€ฏ Installer โ€ขโ€ฏ Documentation โ€ขโ€ฏ Monitoring & Support Tools โ€ขโ€ฏ Management Tool Integration โ€ขโ€ฏ YARN Integration โ€ขโ€ฏ ODBC / JDBC Drivers โ€ขโ€ฏ BI Certification โ€ขโ€ฏ Security โ€ขโ€ฏ Connectors Commercial Support Phase 1 Phase 2 Phase 3 June 8, 2015 Q4 2015 2016 Expanding ANSI SQL Coverage Teradata Contributions to Presto
  • 14. 14 โ€ขโ€ฏ Ease of install and management via Presto-Admin tool โ€“โ€ฏ www.github.com/prestodb/presto-admin โ€“โ€ฏ Packaging Presto as an RPM โ€ขโ€ฏ Testing Framework for Presto โ€“โ€ฏ www.github.com/prestodb/tempto โ€“โ€ฏ Added large number of tests โ€ขโ€ฏ Improvements to JDBC driver โ€“โ€ฏ To be open sourced on www.github.com/prestodb soon! โ€ขโ€ฏ Various SQL improvements Teradataโ€™s Contributions
  • 15. 15 โ€ขโ€ฏ YARN Integration โ€ขโ€ฏ Ambari Integration โ€ขโ€ฏ ODBC & JDBC Drivers that actually work โ€ขโ€ฏ Security โ€“ Authentication & Authorization โ€ขโ€ฏ Continued SQL Improvements โ€ขโ€ฏ BI tool certifications โ€“ e.g. Tableau โ€ขโ€ฏ More Connectors โ€“ e.g. Hbase โ€ขโ€ฏ Open Source our Docker based Dev Env โ€ขโ€ฏ Open our Continuous Integration platform to the community Teradataโ€™s Contribution Product Roadmap
  • 16. 16 www.github.com/facebook/presto www.github.com/prestodb Certified Distro: www.teradata.com/presto Website: www.prestodb.io Presto Userโ€™s Group: www.groups.google.com/group/presto-users Facebook Page: www.facebook.com/prestodb Twitter: #prestodb How can I contribute?
  • 17. 17 Available for Download โ€“โ€ฏ Presto 101t Server, CLI, JDBC โ€“โ€ฏ Presto-Admin 0.1 โ€“โ€ฏ Documentation โ€“โ€ฏ HDP w/ Presto VM Sandbox โ€“โ€ฏ CDH w/ Presto VM Sandbox www.teradata.com/presto Presto 101t certified by Teradata
  • 18. 18