A Continuously Deployed Hadoop Analytics Platform?
DataWorks Summit/Hadoop Summit
1.
A Continuously Deployed Hadoop Analytics Platform?
Graham Gear, Director, Systems Engineering, APJ
2.
3.
Logical Pilot Delivery Pipeline
Operations; Monitor; Provision; Automate; Production; Data Scientists; Hourly; 100% Bugs; Production; Data Science; Pre-Production; Development
4.
Logical Nascent Delivery Pipeline
Production; Workstation; Data Engineers; Monthly; 0% Bugs; Development; User Acceptance Test; Backup Data; Production Workload; Data Science; DS, Analysts, Apps; Monthly - Yearly; 100% Bugs; Operations; Monitor; Provision; Automate; Development; Production; Governance; Audit; Security; Lineage; Pre-Production
5.
Logical Staged Delivery Pipeline
SLA; Workstation; Dev Ops; Weekly; 10% Bugs; System Smoke Test; Data Engineers; Weekly; 0% Bugs; Development; DS, Analysts, Apps; Monthly - Yearly; 90% Bugs; Operations; Monitor; Provision; Automate; Development; Pre-Production; Production; Governance; Audit; Security; Lineage; Production; User Acceptance Test; Backup Data; Production Workload; Data Science
6.
Logical Manual Delivery Pipeline
Non-SLA; SLA; Workstation; Dev Ops; Weekly; 10% Bugs; System Smoke Test; Data Engineers; Weekly; 0% Bugs; Development; User Acceptance Test; Backup Data; Disaster Recovery; Data Science; Data Scientists; Weekly - Monthly; 60% Bugs; Operations; Monitor; Provision; Automate; Development; Pre-Production; Production; Governance; Audit; Security; Lineage; SLA; Analysts, Apps; Monthly - Yearly; 30% Bugs; Production Workload
7.
Logical Continuous Delivery Pipeline
Test; Artifact Repo; Build; Acceptance Test; Release Artifact; Unit Suite Test; Bake Artifact; Deploy Pipeline; Dev Ops; Hourly – Daily; 15% Bugs; System Smoke Test; Workstation; Source Repo; Release Tag; Data Engineers; Hourly; 70% Bugs; Light Unit Test; Development; Non-SLA; User Acceptance Test; Backup Data; Disaster Recovery; Data Science; Data Scientists; Weekly - Monthly; 15% Bugs; Acceptance Test; Operations; Monitor; Provision; Automate; Development; Pre-Production; Production; Governance; Audit; Security; Lineage; SLA; Analysts, Apps; Weekly - Monthly; 0% Bugs; Production Workload
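The "% Bugs" figures on the five logical pipeline slides can be tabulated to compare the maturity levels. The following is a minimal sketch, not something taken from the deck. It assumes each percentage is the share of bugs found by that role (the slides do not define it) and that Dev Ops and Data Engineers work ahead of the production clusters. The dictionary, set and function names are illustrative.

```python
# Share of bugs attributed to each role, copied from slides 3-7.
# Assumption: "% Bugs" means the share of all bugs found by that role.
BUG_SHARE = {
    "Pilot": {"Data Scientists": 100},
    "Nascent": {"Data Engineers": 0, "DS, Analysts, Apps": 100},
    "Staged": {"Dev Ops": 10, "Data Engineers": 0, "DS, Analysts, Apps": 90},
    "Manual": {"Dev Ops": 10, "Data Engineers": 0,
               "Data Scientists": 60, "Analysts, Apps": 30},
    "Continuous": {"Dev Ops": 15, "Data Engineers": 70,
                   "Data Scientists": 15, "Analysts, Apps": 0},
}

# Assumption: these roles act before code reaches a production cluster.
PRE_PRODUCTION_ROLES = {"Dev Ops", "Data Engineers"}


def found_before_production(pipeline: str) -> int:
    """Sum the bug share of the roles that act before production."""
    shares = BUG_SHARE[pipeline]
    return sum(pct for role, pct in shares.items() if role in PRE_PRODUCTION_ROLES)


if __name__ == "__main__":
    for name in BUG_SHARE:
        print(f"{name:<10} {found_before_production(name):>3}% found before production")
```

Under these assumptions only the continuous pipeline moves most bug discovery ahead of production, which appears to be the progression the slides are illustrating.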
8.
Physical Continuous Delivery Pipeline
Source Repo; Git; Gerrit; Serial-tenant; < 10 nodes; Physical, Cloud; Single-tenant; 1; Laptop, Desktop; Physical; Development; Pre-Production; Production; Workstation; Synthetic Data; CDH Single Node; Eclipse, Maven; Linux, OS-X; Test; CDH Cluster; Synthetic Data; Build; Cloudera Director; Jenkins; Artifact Repo; Parcel Repository; Artifactory; Multi-tenant; > 10 nodes; Physical, Cloud; Non-SLA; Production Data; CDH Cluster; DS Workbench; Multi-tenant; > 10 nodes; Physical, Cloud; SLA; Production Data; CDH Cluster; Tableau; JDBC; Operations; Cloudera BDR; Cloudera Manager; Governance; Cloudera Optimizer; Cloudera Navigator
9.
10.
Data Engineer Development Pipeline
[Diagram: physical pipeline components as on slide 8]
1. Create a Maven module from a Maven Archetype, providing a baseline project encoding all corporate standards and targeting a specific production version
2. Develop a dataset ingest and preparation pipeline using Flume, Kafka, Hive and MapReduce using Eclipse and Maven
3. Build a suite of unit tests and synthetic data to exercise the codebase, identifying and resolving bugs
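The third step, unit tests driven by synthetic data, can be sketched as follows. The deck's toolchain is Maven and Eclipse, so the real tests are presumably Java; this Python version is only a stand-in for the idea. The record layout, the `prepare` step and all names are hypothetical.

```python
import random
import unittest


def synthetic_events(count: int, seed: int = 42) -> list[dict]:
    """Generate reproducible fake records, some deliberately malformed."""
    rng = random.Random(seed)
    events = []
    for i in range(count):
        amount = round(rng.uniform(1.0, 500.0), 2)
        # Every fifth record gets a missing amount to exercise error handling.
        events.append({"id": i, "amount": None if i % 5 == 0 else amount})
    return events


def prepare(events: list[dict]) -> list[dict]:
    """Hypothetical preparation step: drop records without an amount."""
    return [e for e in events if e["amount"] is not None]


class PrepareTest(unittest.TestCase):
    def test_drops_malformed_records(self):
        cleaned = prepare(synthetic_events(10))
        # ids 0 and 5 are malformed, so 8 of 10 records survive.
        self.assertEqual(len(cleaned), 8)
        self.assertTrue(all(e["amount"] is not None for e in cleaned))


if __name__ == "__main__":
    unittest.main()
```

Seeding the generator keeps the synthetic dataset identical between the workstation and the build server, so a failure seen in one place can be reproduced in the other.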
11.
Data Engineer Source Pipeline
[Diagram: physical pipeline components as on slide 8]
1. Via Maven, Gerrit and Git interactions, show developer-initiated project source code stages:
• Stage
• Review
• Commit
• Release
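The four source code stages listed above can be modelled as a strictly ordered sequence. This is a minimal sketch of that ordering only. It assumes a change moves through the stages one at a time and never skips one, and it does not call Git, Gerrit or Maven. The `Stage` and `advance` names are illustrative.

```python
from enum import Enum


class Stage(Enum):
    STAGE = 1
    REVIEW = 2
    COMMIT = 3
    RELEASE = 4


def advance(current: Stage) -> Stage:
    """Move a change to the next stage; a released change cannot advance."""
    if current is Stage.RELEASE:
        raise ValueError("change is already released")
    return Stage(current.value + 1)


if __name__ == "__main__":
    stage = Stage.STAGE
    path = [stage.name]
    while stage is not Stage.RELEASE:
        stage = advance(stage)
        path.append(stage.name)
    print(" -> ".join(path))
```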
12.
Automated Bake Pipeline
[Diagram: physical pipeline components as on slide 8]
1. Simulate an automatically triggered Jenkins unit test suite, bake and smoke test pipeline against a Director-provisioned Test cluster served by Artifactory and Parcel repositories
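The unit test, bake and smoke test sequence can be thought of as a fail-fast chain, where a later step runs only if every earlier one passed. The sketch below illustrates that control flow under the assumption that each step reports a simple pass or fail. It is not a Jenkins job definition, and the step actions are placeholders.

```python
from typing import Callable

# Each step is a name plus a callable returning True on success.
Step = tuple[str, Callable[[], bool]]


def run_pipeline(steps: list[Step]) -> list[str]:
    """Run steps in order and stop at the first failure.

    Returns the names of the steps that completed successfully.
    """
    passed = []
    for name, action in steps:
        if not action():
            break
        passed.append(name)
    return passed


if __name__ == "__main__":
    # Placeholder actions; a real job would call the build and test tooling.
    steps = [
        ("unit test suite", lambda: True),
        ("bake artifact", lambda: True),
        ("smoke test", lambda: False),  # simulated failure
    ]
    print(run_pipeline(steps))
```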
13.
Automated Deploy & Test Pipeline
[Diagram: physical pipeline components as on slide 8]
1. Show deploy, smoke and user acceptance test stages, creating the operational Manager dashboards and Navigator meta-data
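One way a post-deploy smoke test could gate promotion is to inspect service health as reported by the cluster manager. The sketch below assumes a service listing shaped like a dictionary with an `items` array whose entries carry `name` and `healthSummary` fields, with `GOOD` as the healthy value. That shape is an assumption to be checked against the Cloudera Manager API documentation linked on the final slide, and the sample data is hand-written.

```python
def unhealthy_services(service_list: dict) -> list[str]:
    """Return names of services whose health summary is not GOOD."""
    return [
        svc["name"]
        for svc in service_list.get("items", [])
        if svc.get("healthSummary") != "GOOD"
    ]


def smoke_test_passed(service_list: dict) -> bool:
    """The deploy passes the smoke test only if every service is healthy."""
    return not unhealthy_services(service_list)


if __name__ == "__main__":
    # Hand-written sample; field names are assumed, not captured from a cluster.
    sample = {
        "items": [
            {"name": "hdfs", "healthSummary": "GOOD"},
            {"name": "impala", "healthSummary": "BAD"},
        ]
    }
    print(unhealthy_services(sample))
    print(smoke_test_passed(sample))
```

Note that an empty listing passes this check, so a real gate would probably also verify that the expected services are present.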
14.
Data Scientist & Analyst Dev Pipeline
[Diagram: physical pipeline components as on slide 8]
1. Query dataset using Impala via Hue, capture SQL logs and feed them back into Optimizer and project, show dependency verification under schema evolution
2. Analyse dataset using Python and Ibis via the DS Workbench application feeding back into project, show dependency checking during dataset evolution
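Dependency verification under schema evolution, as mentioned in both steps above, can be reduced to a set comparison: which columns does each captured query need, and which of those are missing from the evolved schema? The sketch below assumes the column sets have already been extracted from the SQL logs. The query names, column names and function name are hypothetical, and it does not use Optimizer, Impala or Ibis.

```python
def broken_dependencies(
    query_columns: dict[str, set[str]], new_schema: set[str]
) -> dict[str, set[str]]:
    """Map each query to the columns it needs that the new schema lacks."""
    broken = {}
    for query, columns in query_columns.items():
        missing = columns - new_schema
        if missing:
            broken[query] = missing
    return broken


if __name__ == "__main__":
    # Hypothetical columns extracted from captured SQL logs.
    query_columns = {
        "daily_revenue": {"order_id", "amount", "order_date"},
        "top_customers": {"customer_id", "amount"},
    }
    # Hypothetical evolved schema: order_date was renamed to ordered_at.
    new_schema = {"order_id", "customer_id", "amount", "ordered_at"}
    print(broken_dependencies(query_columns, new_schema))
```

Running a check like this in the pipeline would flag the renamed column before the schema change reaches the clusters that analysts query.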
15.
Application Delivery Pipeline
[Diagram: physical pipeline components as on slide 8]
1. Application rev pipeline, show comparison to previous version
16.
Platform Delivery Pipeline
[Diagram: physical pipeline components as on slide 8]
1. Platform rev pipeline, show comparison to previous version
17.
Logical Continuous Delivery Pipeline
[Diagram: logical continuous delivery pipeline, repeated from slide 7]
18.
19.
20.
21.
Questions?
• Cloudera Framework Example: https://github.com/ggear/cloudera-framework
• Cloudera Parcel Maven Plugin: https://github.com/ggear/cloudera-parcel
• Cloudera Manager API: https://cloudera.github.io/cm_api/apidocs/v12/index.html
• Cloudera Navigator API: http://cloudera.github.io/navigator/apidocs/v3
• Cloudera Director: https://director.cloudera.com
• Cloudera Optimizer: https://optimizer.cloudera.com
graham@cloudera.com