SlideShare a Scribd company logo
1 of 80
Progress for big data in Kubernetes
TED DUNNING
Chief Application Architect
2 © 2018 MapR Technologies, Inc. // MapR Confidential
Ted Dunning, PhD
Chief Application Architect, MapR Technologies
Board member, Apache Software Foundation
O’Reilly author
Email tdunning@mapr.com tdunning@apache.org
Twitter @ted_dunning
Contact Information
3 © 2018 MapR Technologies, Inc. // MapR Confidential
kubernetes is coming!
4 © 2018 MapR Technologies, Inc. // MapR Confidential
why?
5 © 2018 MapR Technologies, Inc. // MapR Confidential
Source: Shippable.com http://blog.shippable.com/why-the-adoption-of-kubernetes-will-explode-in-2018
kubernetes = major community support
6 © 2018 MapR Technologies, Inc. // MapR Confidential
every cloud supports kubernetes
https://www.sinax.be/en/aws/
https://www.westconcomstor.com/za/en/vendors/wc-vendors/microsoft-azure-EN-UK.html
https://www.g2crowd.com/products/google-kubernetes-engine-gke/details
7 © 2018 MapR Technologies, Inc. // MapR Confidential
massive customer adoption rate
8 © 2018 MapR Technologies, Inc. // MapR Confidential
9 © 2018 MapR Technologies, Inc. // MapR Confidential
what is kubernetes?
10 © 2018 MapR Technologies, Inc. // MapR Confidential
kubernetes (n.) - greek word for pilot or
helm
11 © 2018 MapR Technologies, Inc. // MapR Confidential
kubernetes started life as a
successor to google’s borg
project...
https://cloud.google.com/security/encryption-in-transit/
12 © 2018 MapR Technologies, Inc. // MapR Confidential
kubernetes is an ecosystem...
Source: Redmonk - http://redmonk.com/sogrady/2017/09/22/cloud-native-license-choices/
13 © 2018 MapR Technologies, Inc. // MapR Confidential
container and resource orchestration engine...
14 © 2018 MapR Technologies, Inc. // MapR Confidential
Source: Shippable.com http://blog.shippable.com/why-the-adoption-of-kubernetes-will-explode-in-2018
kubernetes won the container orchestration war...
15 © 2018 MapR Technologies, Inc. // MapR Confidential
what is kubernetes?
16 © 2018 MapR Technologies, Inc. // MapR Confidential
it runs containers
17 © 2018 MapR Technologies, Inc. // MapR Confidential
what is a container?
18 © 2018 MapR Technologies, Inc. // MapR Confidential
not a vm
19 © 2018 MapR Technologies, Inc. // MapR Confidential
hardware
vm vs container
os
hypervisor
vm
os
libs
app
vm
os
libs
app
hardware
os
container
libs
app
container
libs
app
container
libs
app
20 © 2018 MapR Technologies, Inc. // MapR Confidential
pets vs cattle
https://fwallpapers.com/view/cat-jeans
http://www.clipartpanda.com/clipart_images/free-clip-art-1083418
21 © 2018 MapR Technologies, Inc. // MapR Confidential
pets vs cattle
- long lived
- name them
- care for them
- ephemeral
- brand them with #’s
- well..vets are expensive
22 © 2018 MapR Technologies, Inc. // MapR Confidential
27 © 2018 MapR Technologies, Inc. // MapR Confidential
File File File Container Image
chroot
container = image + isolation
cgroups
● cpu
● memory
● network
● etc.
namespaces
● pids
● mnts
● etc.
28 © 2018 MapR Technologies, Inc. // MapR Confidential
containers are good
29 © 2018 MapR Technologies, Inc. // MapR Confidential
containers are good
30 © 2018 MapR Technologies, Inc. // MapR Confidential
containers have a problem
31 © 2018 MapR Technologies, Inc. // MapR Confidential
you can never get away from
pets unless:
- you handle the problem of
container state
- you need an environment to
support cattle
MapR and kubernetes are the
solution
32 © 2018 MapR Technologies, Inc. // MapR Confidential
solve port mapping hell
monitor running containers
handle dead containers
move containers so utilization improves
autoscale container instances to handle load
Things docker can’t (or won’t) do...
33 © 2018 MapR Technologies, Inc. // MapR Confidential
Magical View of Kubernetes
34 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1
Kubernetes
Magical View of Kubernetes
Kubernetes starts application
containers “somewhere”
35 © 2018 MapR Technologies, Inc. // MapR Confidential
Magical View of Kubernetes
App 1 App 3
Kubernetes
Later containers may be started
elsewhere due to “affinities”
36 © 2018 MapR Technologies, Inc. // MapR Confidential
Magical View of Kubernetes
App 1 App 2 App 3
Kubernetes
Kubernetes provides super fast
naming via DNS so containers
can find each other
37 © 2018 MapR Technologies, Inc. // MapR Confidential
Note that you don’t think about
which machine at all
38 © 2018 MapR Technologies, Inc. // MapR Confidential
You don’t think about which
machine at all
No more names from The Hobbit
Just cattle
39 © 2018 MapR Technologies, Inc. // MapR Confidential
Software engineering can be viewed as freezing bits
Initially, everything is possible, nothing is actual
We freeze the source
Then the binary
Then the package
Then the environment
Ultimately the system
The Impact of Kubernetes
40 © 2018 MapR Technologies, Inc. // MapR Confidential
41 © 2018 MapR Technologies, Inc. // MapR Confidential
42 © 2018 MapR Technologies, Inc. // MapR Confidential
43 © 2018 MapR Technologies, Inc. // MapR Confidential
git cc/ld
java/jar
Build
docker build
resources
config libraries
Package
helm package
Construct
44 © 2018 MapR Technologies, Inc. // MapR Confidential
git cc/ld
java/jar
Build
docker build
resources
config libraries
Package
helm package
Construct
helm install/scale
Load balancer
Deploy
45 © 2018 MapR Technologies, Inc. // MapR Confidential
This is glorious
46 © 2018 MapR Technologies, Inc. // MapR Confidential
but we still have a problem
47 © 2018 MapR Technologies, Inc. // MapR Confidential
state
48 © 2018 MapR Technologies, Inc. // MapR Confidential
Not Done Yet
Load balancer
49 © 2018 MapR Technologies, Inc. // MapR Confidential
Not Done Yet
Load balancer
Here’s the problem
50 © 2018 MapR Technologies, Inc. // MapR Confidential
Not Really Ready at All
State in containers messes things up
Restarts lose the state
Replicating state makes services
complex
Application developers just aren’t
systems developers
State life-cycle doesn’t match app life-
cycle
Load balancer
Here’s the problem
51 © 2018 MapR Technologies, Inc. // MapR Confidential
What is a Service Anyway?
Load balancer
RPC in
52 © 2018 MapR Technologies, Inc. // MapR Confidential
Synchronous RPC-based services only serve one need
In a synchronous service it’s common to do some, defer some
But deferring work is hard in a synchronous world … we have to give up the return call
in some sense
This is the germ of streaming architecture
But … Not Entirely
53 © 2018 MapR Technologies, Inc. // MapR Confidential
Deferred
What is a Service Anyway?
Load balancer
RPC in
54 © 2018 MapR Technologies, Inc. // MapR Confidential
If I can hide details of who and where, I have a service
If I can hide details of deployment, I have a micro-service
If I can hide details of when, I have a streaming micro-service
Isolation is The Defining Characteristic
55 © 2018 MapR Technologies, Inc. // MapR Confidential
Temporal and Geo Isolation
Producer Consumer isn’t even running
56 © 2018 MapR Technologies, Inc. // MapR Confidential
Temporal and Geo Isolation
Producer Consumer
57 © 2018 MapR Technologies, Inc. // MapR Confidential
Producer
Consumer
Temporal and Geo Isolation
Consumer could be an ocean away
58 © 2018 MapR Technologies, Inc. // MapR Confidential
Files are important
• Config files, image files, archival data data
• Legacy applications like machine learning, web
Tables are important
• Critical to have random update for some applications
• Should scale transparently without dedicated cluster
Streams are important
• Should be co-equal form of persistence
We Need Multiple Forms of Persistence
59 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1 App 2 App 3
60 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1 App 2 App 3
stream
LogFile
61 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1 App 2 App 3
stream
LogFile
62 © 2018 MapR Technologies, Inc. // MapR Confidential
App 1 App 2 App 3
63 © 2018 MapR Technologies, Inc. // MapR Confidential
Global namespace across entire Kubernetes cluster
• Between clusters as well if possible
All three forms of primitive persistence
• Files, streams, tables
Inherently scalable
• Performance, cardinality, locality
Uniform access and control
• Path names for all objects, identical permission scheme
What Does This Data Platform Need to Have?
64 © 2018 MapR Technologies, Inc. // MapR Confidential
Global namespace across entire Kubernetes cluster
• Between clusters as well if possible
All three forms of primitive persistence
• Files, streams, tables
Inherently scalable
• Performance, cardinality, locality
Uniform access and control
• Path names for all objects, identical permission scheme
Oh…. got that already. Just need to wire it up to Kubernetes
What Does This Data Platform Need to Have?
65 © 2018 MapR Technologies, Inc. // MapR Confidential
66 © 2018 MapR Technologies, Inc. // MapR Confidential
67 © 2018 MapR Technologies, Inc. // MapR Confidential
kubelet
docker
pod
Normally pods interact
directly with node resources
68 © 2018 MapR Technologies, Inc. // MapR Confidential
kubelet
docker
pod
plugin We can install a volume
plugin (recently introduced)
69 © 2018 MapR Technologies, Inc. // MapR Confidential
mapr-
fs
kubelet
docker
pod
plugin
fuse
This allows uniform access
to files, tables and streams
70 © 2018 MapR Technologies, Inc. // MapR Confidential
Where does that take us?
71 © 2018 MapR Technologies, Inc. // MapR Confidential
Installation of plugin is K8S level operation
• No per-node attention required
Use of plugin is overlay operation
• No change needed for an container
• Any Helm chart can use the plugin for conventional file access
Can share storage/compute or isolate or scale independently
Consequences
72 © 2018 MapR Technologies, Inc. // MapR Confidential
State is no longer a dirty word for Kubernetes
HPC can run on K8S
Boring things can run on K8S without storage appliances
Previously crazy ideas can now be valuable
Complexity is largely not visible
More Consequences
73 © 2018 MapR Technologies, Inc. // MapR Confidential
Container orchestration is awesome
74 © 2018 MapR Technologies, Inc. // MapR Confidential
Container orchestration is awesome
Data orchestration is, too
75 © 2018 MapR Technologies, Inc. // MapR Confidential
Cloud as-is: No unified data access or security concepts
Public Cloud
Application
✓
API
API Connector
Single cloud vendor strategy:
• Vendor lock in
• No failover in case of global outage
• Limited Edge capabilities
AWS Services:
• Kinesis & Elastic MapReduce
• Redshift & DynamoDB
• S3 & Glacier
76 © 2018 MapR Technologies, Inc. // MapR Confidential
Cloud as-is: No unified data access or security concepts
Public Cloud
Application
✓
API
API Connector
AWS Services:
• Kinesis & Elastic MapReduce
• Redshift & DynamoDB
• S3 & Glacier
API
Azure Services:
• HD Insight
• SQL Server & CosmosDB
• Blob & DataLakeStore
Multi cloud strategy:
• Complex data movement between clouds
• On any other cloud:
• Different API‘s: application breaks
• Different Security concept
77 © 2018 MapR Technologies, Inc. // MapR Confidential
Cloud as-is: No unified data access or security concepts
Edge Private Cloud
On Premise
Public Cloud Public Cloud Public Cloud
API
Application
✓
API API API API
API Connector
Multi cloud strategy:
• Complex data movement between clouds
• On any other cloud:
• Different API‘s, application breaks
• Different Security concept
78 © 2018 MapR Technologies, Inc. // MapR Confidential
Open APIs
How a “Media Company” is Unifying Compute Environments
Application
• Unified Security Model
• Data access decoupled from physical
storage location. Globally.
• No lock-in to proprietary APIs
• Full openness
• Data made portable
API Connector
✓
GLOBAL DATA MANAGEMENT
Edge Private Cloud
On Premise
Public Cloud Public Cloud Public Cloud
Uniform computing
environment everywhere
79 © 2018 MapR Technologies, Inc. // MapR Confidential
Open APIs
How “Manufacturing Company” is Orchestrating Data
Application
• Unified Security Model
• Data access decoupled from physical
storage location. Globally.
• No lock-in to proprietary APIs
• Full openness
• Data made portable
API Connector
✓
GLOBAL DATA MANAGEMENT
Edge Private Cloud
On Premise
Public Cloud Public Cloud Public Cloud
Platform level
data replication
80 © 2018 MapR Technologies, Inc. // MapR Confidential
Tier 1 Bank #1 Creating a Global Filesystem
/mapr/edge1
/mapr/edge2
/mapr/edge3
/mapr/newyork
/mapr/amsterdam
/mapr/azure /mapr/gcp
/mapr/aws-eu-west
NFS POSIX HDFSREST
HOT
WARM
COLD
/mapr
Kafka JSON HBASE SQL S3
Application
✓
Global access
to local data
81 © 2018 MapR Technologies, Inc. // MapR Confidential
Tier 1 Bank #2: Creating a Universal Application Platform
Application
GLOBAL DATA MANAGEMENT
Edge Private Cloud
On Premise
Public Cloud Public Cloud Public Cloud
PodPod Pod Image Classification using
Tensorflow in a Docker container
Classic ETL
Scheduling & Scaling
MapR Kubernetes Volume Driver
Single pane of glass to
control jobs anywhere
82 © 2018 MapR Technologies, Inc. // MapR Confidential
Additional Resources
O’Reilly report by Ted Dunning & Ellen Friedman © September 2017
Read free courtesy of MapR:
https://mapr.com/ebook/machine-learning-logistics/
O’Reilly book by Ted Dunning & Ellen Friedman
© March 2016
Read free courtesy of MapR:
https://mapr.com/streaming-architecture-using-
apache-kafka-mapr-streams/
Ted Dunning & Ellen Friedman
Model Management in theReal World
MachineLearning
Logistics
83 © 2018 MapR Technologies, Inc. // MapR Confidential
Additional Resources
O’Reilly book by Ted Dunning & Ellen Friedman
© June 2014
Read free courtesy of MapR:
https://mapr.com/practical-machine-learning-new-look-
anomaly-detection/
O’Reilly book by Ellen Friedman & Ted Dunning
© February 2014
Read free courtesy of MapR:
https://mapr.com/practical-machine-learning/
84 © 2018 MapR Technologies, Inc. // MapR Confidential
AI & Analytics in Production:
How to Make It Work
Free book signing with authors Ted Dunning & Ellen Friedman
MapR stand #145:
Tues 1:00 pm – 1:45 pm
Wed 1:00 pm – 1:45 pm
Download free pdf via MapR:
https://mapr.com/ebook/ai-and-analytics-in-production/

More Related Content

What's hot

Resilient microservices with Kubernetes - Mete Atamel
Resilient microservices with Kubernetes - Mete AtamelResilient microservices with Kubernetes - Mete Atamel
Resilient microservices with Kubernetes - Mete AtamelITCamp
 
In-Memory Computing Essentials for Software Engineers
In-Memory Computing Essentials for Software EngineersIn-Memory Computing Essentials for Software Engineers
In-Memory Computing Essentials for Software EngineersDenis Magda
 
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...Chris Fregly
 
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPL
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPLA Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPL
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPLMario-Leander Reimer
 
Webinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containersWebinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containersMesosphere Inc.
 
In-Memory Computing Essentials
In-Memory Computing EssentialsIn-Memory Computing Essentials
In-Memory Computing EssentialsDenis Magda
 
GKE Tip Series how do i choose between gke standard, autopilot and cloud run
GKE Tip Series   how do i choose between gke standard, autopilot and cloud run GKE Tip Series   how do i choose between gke standard, autopilot and cloud run
GKE Tip Series how do i choose between gke standard, autopilot and cloud run Sreenivas Makam
 
Save 60% of Kubernetes storage costs on AWS & others with OpenEBS
Save 60% of Kubernetes storage costs on AWS & others with OpenEBSSave 60% of Kubernetes storage costs on AWS & others with OpenEBS
Save 60% of Kubernetes storage costs on AWS & others with OpenEBSMayaData Inc
 
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARN
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARNHadoop {Submarine} Project: Running Deep Learning Workloads on YARN
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARNDataWorks Summit
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Technologies
 
MongoDB, Cloudformation and Chef
MongoDB, Cloudformation and ChefMongoDB, Cloudformation and Chef
MongoDB, Cloudformation and ChefMongoDB
 
Webinar: Using Litmus Chaos Engineering and AI for auto incident detection
Webinar: Using Litmus Chaos Engineering and AI for auto incident detectionWebinar: Using Litmus Chaos Engineering and AI for auto incident detection
Webinar: Using Litmus Chaos Engineering and AI for auto incident detectionMayaData Inc
 
Google Cloud - Stand Out Features
Google Cloud - Stand Out FeaturesGoogle Cloud - Stand Out Features
Google Cloud - Stand Out FeaturesGDG Cloud Bengaluru
 
Continuous Machine and Deep Learning with Apache Ignite
Continuous Machine and Deep Learning with Apache IgniteContinuous Machine and Deep Learning with Apache Ignite
Continuous Machine and Deep Learning with Apache IgniteDenis Magda
 
Evaluation of High Availability Performance of Kubernetes and Docker Swarm on...
Evaluation of High Availability Performance of Kubernetes and Docker Swarm on...Evaluation of High Availability Performance of Kubernetes and Docker Swarm on...
Evaluation of High Availability Performance of Kubernetes and Docker Swarm on...Ontico
 
High Availability Performance of Kubernetes and Docker Swarm on a Raspberry P...
High Availability Performance of Kubernetes and Docker Swarm on a Raspberry P...High Availability Performance of Kubernetes and Docker Swarm on a Raspberry P...
High Availability Performance of Kubernetes and Docker Swarm on a Raspberry P...Mathias Renner
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Cloudera DataTalks 2019 Bangalore - YuniKorn A next generation scheduler for ...
Cloudera DataTalks 2019 Bangalore - YuniKorn A next generation scheduler for ...Cloudera DataTalks 2019 Bangalore - YuniKorn A next generation scheduler for ...
Cloudera DataTalks 2019 Bangalore - YuniKorn A next generation scheduler for ...Sunil Govindan
 
A real time architecture using Hadoop and Storm @ FOSDEM 2013
A real time architecture using Hadoop and Storm @ FOSDEM 2013A real time architecture using Hadoop and Storm @ FOSDEM 2013
A real time architecture using Hadoop and Storm @ FOSDEM 2013Nathan Bijnens
 
Google Cloud Networking Deep Dive
Google Cloud Networking Deep DiveGoogle Cloud Networking Deep Dive
Google Cloud Networking Deep DiveMichelle Holley
 

What's hot (20)

Resilient microservices with Kubernetes - Mete Atamel
Resilient microservices with Kubernetes - Mete AtamelResilient microservices with Kubernetes - Mete Atamel
Resilient microservices with Kubernetes - Mete Atamel
 
In-Memory Computing Essentials for Software Engineers
In-Memory Computing Essentials for Software EngineersIn-Memory Computing Essentials for Software Engineers
In-Memory Computing Essentials for Software Engineers
 
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
 
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPL
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPLA Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPL
A Hitchhiker’s Guide to the Cloud Native Stack. #DevoxxPL
 
Webinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containersWebinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containers
 
In-Memory Computing Essentials
In-Memory Computing EssentialsIn-Memory Computing Essentials
In-Memory Computing Essentials
 
GKE Tip Series how do i choose between gke standard, autopilot and cloud run
GKE Tip Series   how do i choose between gke standard, autopilot and cloud run GKE Tip Series   how do i choose between gke standard, autopilot and cloud run
GKE Tip Series how do i choose between gke standard, autopilot and cloud run
 
Save 60% of Kubernetes storage costs on AWS & others with OpenEBS
Save 60% of Kubernetes storage costs on AWS & others with OpenEBSSave 60% of Kubernetes storage costs on AWS & others with OpenEBS
Save 60% of Kubernetes storage costs on AWS & others with OpenEBS
 
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARN
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARNHadoop {Submarine} Project: Running Deep Learning Workloads on YARN
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARN
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
 
MongoDB, Cloudformation and Chef
MongoDB, Cloudformation and ChefMongoDB, Cloudformation and Chef
MongoDB, Cloudformation and Chef
 
Webinar: Using Litmus Chaos Engineering and AI for auto incident detection
Webinar: Using Litmus Chaos Engineering and AI for auto incident detectionWebinar: Using Litmus Chaos Engineering and AI for auto incident detection
Webinar: Using Litmus Chaos Engineering and AI for auto incident detection
 
Google Cloud - Stand Out Features
Google Cloud - Stand Out FeaturesGoogle Cloud - Stand Out Features
Google Cloud - Stand Out Features
 
Continuous Machine and Deep Learning with Apache Ignite
Continuous Machine and Deep Learning with Apache IgniteContinuous Machine and Deep Learning with Apache Ignite
Continuous Machine and Deep Learning with Apache Ignite
 
Evaluation of High Availability Performance of Kubernetes and Docker Swarm on...
Evaluation of High Availability Performance of Kubernetes and Docker Swarm on...Evaluation of High Availability Performance of Kubernetes and Docker Swarm on...
Evaluation of High Availability Performance of Kubernetes and Docker Swarm on...
 
High Availability Performance of Kubernetes and Docker Swarm on a Raspberry P...
High Availability Performance of Kubernetes and Docker Swarm on a Raspberry P...High Availability Performance of Kubernetes and Docker Swarm on a Raspberry P...
High Availability Performance of Kubernetes and Docker Swarm on a Raspberry P...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Cloudera DataTalks 2019 Bangalore - YuniKorn A next generation scheduler for ...
Cloudera DataTalks 2019 Bangalore - YuniKorn A next generation scheduler for ...Cloudera DataTalks 2019 Bangalore - YuniKorn A next generation scheduler for ...
Cloudera DataTalks 2019 Bangalore - YuniKorn A next generation scheduler for ...
 
A real time architecture using Hadoop and Storm @ FOSDEM 2013
A real time architecture using Hadoop and Storm @ FOSDEM 2013A real time architecture using Hadoop and Storm @ FOSDEM 2013
A real time architecture using Hadoop and Storm @ FOSDEM 2013
 
Google Cloud Networking Deep Dive
Google Cloud Networking Deep DiveGoogle Cloud Networking Deep Dive
Google Cloud Networking Deep Dive
 

Similar to Big Data Progress in Kubernetes with MapR

Containers and Kubernetes without limits
Containers and Kubernetes without limitsContainers and Kubernetes without limits
Containers and Kubernetes without limitsAntje Barth
 
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...Matt Stubbs
 
Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018Ellen Friedman
 
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...Matt Stubbs
 
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 20187 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 2018Ellen Friedman
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...MapR Technologies
 
AWS Earth and Space 2018 - Element 84 Processing and Streaming GOES-16 Data...
AWS Earth and Space 2018 -   Element 84 Processing and Streaming GOES-16 Data...AWS Earth and Space 2018 -   Element 84 Processing and Streaming GOES-16 Data...
AWS Earth and Space 2018 - Element 84 Processing and Streaming GOES-16 Data...Dan Pilone
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Carol McDonald
 
Operating Kubernetes at Scale (Australia Presentation)
Operating Kubernetes at Scale (Australia Presentation)Operating Kubernetes at Scale (Australia Presentation)
Operating Kubernetes at Scale (Australia Presentation)Mesosphere Inc.
 
Webinar: Operating Kubernetes at Scale
Webinar: Operating Kubernetes at ScaleWebinar: Operating Kubernetes at Scale
Webinar: Operating Kubernetes at ScaleMesosphere Inc.
 
Hadoop 3 (2017 hadoop taiwan workshop)
Hadoop 3 (2017 hadoop taiwan workshop)Hadoop 3 (2017 hadoop taiwan workshop)
Hadoop 3 (2017 hadoop taiwan workshop)Wei-Chiu Chuang
 
Boston hug-2012-07
Boston hug-2012-07Boston hug-2012-07
Boston hug-2012-07Ted Dunning
 
Overpowered Kubernetes: CI/CD for K8s on Enterprise IaaS
Overpowered Kubernetes: CI/CD for K8s on Enterprise IaaSOverpowered Kubernetes: CI/CD for K8s on Enterprise IaaS
Overpowered Kubernetes: CI/CD for K8s on Enterprise IaaSJ On The Beach
 
Scaling Prometheus Metrics in Kubernetes with Telegraf | Chris Goller | Influ...
Scaling Prometheus Metrics in Kubernetes with Telegraf | Chris Goller | Influ...Scaling Prometheus Metrics in Kubernetes with Telegraf | Chris Goller | Influ...
Scaling Prometheus Metrics in Kubernetes with Telegraf | Chris Goller | Influ...InfluxData
 
A hitchhiker‘s guide to the cloud native stack
A hitchhiker‘s guide to the cloud native stackA hitchhiker‘s guide to the cloud native stack
A hitchhiker‘s guide to the cloud native stackQAware GmbH
 
A Hitchhiker’s Guide to the Cloud Native Stack. #CDS17
A Hitchhiker’s Guide to the Cloud Native Stack. #CDS17A Hitchhiker’s Guide to the Cloud Native Stack. #CDS17
A Hitchhiker’s Guide to the Cloud Native Stack. #CDS17Mario-Leander Reimer
 
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...Mathieu Dumoulin
 
Streaming patterns revolutionary architectures
Streaming patterns revolutionary architectures Streaming patterns revolutionary architectures
Streaming patterns revolutionary architectures Carol McDonald
 
GDG Cloud Southlake #8 Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...
GDG Cloud Southlake #8  Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...GDG Cloud Southlake #8  Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...
GDG Cloud Southlake #8 Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...James Anderson
 

Similar to Big Data Progress in Kubernetes with MapR (20)

Containers and Kubernetes without limits
Containers and Kubernetes without limitsContainers and Kubernetes without limits
Containers and Kubernetes without limits
 
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
Big Data LDN 2018: 7 SUCCESSFUL HABITS FOR DATA-INTENSIVE APPLICATIONS IN PRO...
 
Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018Surprising Advantages of Streaming - ACM March 2018
Surprising Advantages of Streaming - ACM March 2018
 
The rise of microservices
The rise of microservicesThe rise of microservices
The rise of microservices
 
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
Big Data LDN 2018: DATA OPERATIONS PROBLEMS CREATED BY DEEP LEARNING, AND HOW...
 
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 20187 Habits for Big Data in Production - keynote Big Data London Nov 2018
7 Habits for Big Data in Production - keynote Big Data London Nov 2018
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
 
AWS Earth and Space 2018 - Element 84 Processing and Streaming GOES-16 Data...
AWS Earth and Space 2018 -   Element 84 Processing and Streaming GOES-16 Data...AWS Earth and Space 2018 -   Element 84 Processing and Streaming GOES-16 Data...
AWS Earth and Space 2018 - Element 84 Processing and Streaming GOES-16 Data...
 
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
 
Operating Kubernetes at Scale (Australia Presentation)
Operating Kubernetes at Scale (Australia Presentation)Operating Kubernetes at Scale (Australia Presentation)
Operating Kubernetes at Scale (Australia Presentation)
 
Webinar: Operating Kubernetes at Scale
Webinar: Operating Kubernetes at ScaleWebinar: Operating Kubernetes at Scale
Webinar: Operating Kubernetes at Scale
 
Hadoop 3 (2017 hadoop taiwan workshop)
Hadoop 3 (2017 hadoop taiwan workshop)Hadoop 3 (2017 hadoop taiwan workshop)
Hadoop 3 (2017 hadoop taiwan workshop)
 
Boston hug-2012-07
Boston hug-2012-07Boston hug-2012-07
Boston hug-2012-07
 
Overpowered Kubernetes: CI/CD for K8s on Enterprise IaaS
Overpowered Kubernetes: CI/CD for K8s on Enterprise IaaSOverpowered Kubernetes: CI/CD for K8s on Enterprise IaaS
Overpowered Kubernetes: CI/CD for K8s on Enterprise IaaS
 
Scaling Prometheus Metrics in Kubernetes with Telegraf | Chris Goller | Influ...
Scaling Prometheus Metrics in Kubernetes with Telegraf | Chris Goller | Influ...Scaling Prometheus Metrics in Kubernetes with Telegraf | Chris Goller | Influ...
Scaling Prometheus Metrics in Kubernetes with Telegraf | Chris Goller | Influ...
 
A hitchhiker‘s guide to the cloud native stack
A hitchhiker‘s guide to the cloud native stackA hitchhiker‘s guide to the cloud native stack
A hitchhiker‘s guide to the cloud native stack
 
A Hitchhiker’s Guide to the Cloud Native Stack. #CDS17
A Hitchhiker’s Guide to the Cloud Native Stack. #CDS17A Hitchhiker’s Guide to the Cloud Native Stack. #CDS17
A Hitchhiker’s Guide to the Cloud Native Stack. #CDS17
 
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
Converged and Containerized Distributed Deep Learning With TensorFlow and Kub...
 
Streaming patterns revolutionary architectures
Streaming patterns revolutionary architectures Streaming patterns revolutionary architectures
Streaming patterns revolutionary architectures
 
GDG Cloud Southlake #8 Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...
GDG Cloud Southlake #8  Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...GDG Cloud Southlake #8  Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...
GDG Cloud Southlake #8 Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...
 

More from Ted Dunning

Dunning - SIGMOD - Data Economy.pptx
Dunning - SIGMOD - Data Economy.pptxDunning - SIGMOD - Data Economy.pptx
Dunning - SIGMOD - Data Economy.pptxTed Dunning
 
Anomaly Detection: How to find what you didn’t know to look for
Anomaly Detection: How to find what you didn’t know to look forAnomaly Detection: How to find what you didn’t know to look for
Anomaly Detection: How to find what you didn’t know to look forTed Dunning
 
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningStreaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningTed Dunning
 
Machine Learning Logistics
Machine Learning LogisticsMachine Learning Logistics
Machine Learning LogisticsTed Dunning
 
Tensor Abuse - how to reuse machine learning frameworks
Tensor Abuse - how to reuse machine learning frameworksTensor Abuse - how to reuse machine learning frameworks
Tensor Abuse - how to reuse machine learning frameworksTed Dunning
 
Machine Learning logistics
Machine Learning logisticsMachine Learning logistics
Machine Learning logisticsTed Dunning
 
Finding Changes in Real Data
Finding Changes in Real DataFinding Changes in Real Data
Finding Changes in Real DataTed Dunning
 
Where is Data Going? - RMDC Keynote
Where is Data Going? - RMDC KeynoteWhere is Data Going? - RMDC Keynote
Where is Data Going? - RMDC KeynoteTed Dunning
 
Real time-hadoop
Real time-hadoopReal time-hadoop
Real time-hadoopTed Dunning
 
Cheap learning-dunning-9-18-2015
Cheap learning-dunning-9-18-2015Cheap learning-dunning-9-18-2015
Cheap learning-dunning-9-18-2015Ted Dunning
 
Sharing Sensitive Data Securely
Sharing Sensitive Data SecurelySharing Sensitive Data Securely
Sharing Sensitive Data SecurelyTed Dunning
 
Real-time Puppies and Ponies - Evolving Indicator Recommendations in Real-time
Real-time Puppies and Ponies - Evolving Indicator Recommendations in Real-timeReal-time Puppies and Ponies - Evolving Indicator Recommendations in Real-time
Real-time Puppies and Ponies - Evolving Indicator Recommendations in Real-timeTed Dunning
 
How the Internet of Things is Turning the Internet Upside Down
How the Internet of Things is Turning the Internet Upside DownHow the Internet of Things is Turning the Internet Upside Down
How the Internet of Things is Turning the Internet Upside DownTed Dunning
 
Apache Kylin - OLAP Cubes for SQL on Hadoop
Apache Kylin - OLAP Cubes for SQL on HadoopApache Kylin - OLAP Cubes for SQL on Hadoop
Apache Kylin - OLAP Cubes for SQL on HadoopTed Dunning
 
Dunning time-series-2015
Dunning time-series-2015Dunning time-series-2015
Dunning time-series-2015Ted Dunning
 
Doing-the-impossible
Doing-the-impossibleDoing-the-impossible
Doing-the-impossibleTed Dunning
 
Anomaly Detection - New York Machine Learning
Anomaly Detection - New York Machine LearningAnomaly Detection - New York Machine Learning
Anomaly Detection - New York Machine LearningTed Dunning
 
Cognitive computing with big data, high tech and low tech approaches
Cognitive computing with big data, high tech and low tech approachesCognitive computing with big data, high tech and low tech approaches
Cognitive computing with big data, high tech and low tech approachesTed Dunning
 
Recommendation Techn
Recommendation TechnRecommendation Techn
Recommendation TechnTed Dunning
 

More from Ted Dunning (20)

Dunning - SIGMOD - Data Economy.pptx
Dunning - SIGMOD - Data Economy.pptxDunning - SIGMOD - Data Economy.pptx
Dunning - SIGMOD - Data Economy.pptx
 
Anomaly Detection: How to find what you didn’t know to look for
Anomaly Detection: How to find what you didn’t know to look forAnomaly Detection: How to find what you didn’t know to look for
Anomaly Detection: How to find what you didn’t know to look for
 
Streaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine LearningStreaming Architecture including Rendezvous for Machine Learning
Streaming Architecture including Rendezvous for Machine Learning
 
Machine Learning Logistics
Machine Learning LogisticsMachine Learning Logistics
Machine Learning Logistics
 
Tensor Abuse - how to reuse machine learning frameworks
Tensor Abuse - how to reuse machine learning frameworksTensor Abuse - how to reuse machine learning frameworks
Tensor Abuse - how to reuse machine learning frameworks
 
Machine Learning logistics
Machine Learning logisticsMachine Learning logistics
Machine Learning logistics
 
T digest-update
T digest-updateT digest-update
T digest-update
 
Finding Changes in Real Data
Finding Changes in Real DataFinding Changes in Real Data
Finding Changes in Real Data
 
Where is Data Going? - RMDC Keynote
Where is Data Going? - RMDC KeynoteWhere is Data Going? - RMDC Keynote
Where is Data Going? - RMDC Keynote
 
Real time-hadoop
Real time-hadoopReal time-hadoop
Real time-hadoop
 
Cheap learning-dunning-9-18-2015
Cheap learning-dunning-9-18-2015Cheap learning-dunning-9-18-2015
Cheap learning-dunning-9-18-2015
 
Sharing Sensitive Data Securely
Sharing Sensitive Data SecurelySharing Sensitive Data Securely
Sharing Sensitive Data Securely
 
Real-time Puppies and Ponies - Evolving Indicator Recommendations in Real-time
Real-time Puppies and Ponies - Evolving Indicator Recommendations in Real-timeReal-time Puppies and Ponies - Evolving Indicator Recommendations in Real-time
Real-time Puppies and Ponies - Evolving Indicator Recommendations in Real-time
 
How the Internet of Things is Turning the Internet Upside Down
How the Internet of Things is Turning the Internet Upside DownHow the Internet of Things is Turning the Internet Upside Down
How the Internet of Things is Turning the Internet Upside Down
 
Apache Kylin - OLAP Cubes for SQL on Hadoop
Apache Kylin - OLAP Cubes for SQL on HadoopApache Kylin - OLAP Cubes for SQL on Hadoop
Apache Kylin - OLAP Cubes for SQL on Hadoop
 
Dunning time-series-2015
Dunning time-series-2015Dunning time-series-2015
Dunning time-series-2015
 
Doing-the-impossible
Doing-the-impossibleDoing-the-impossible
Doing-the-impossible
 
Anomaly Detection - New York Machine Learning
Anomaly Detection - New York Machine LearningAnomaly Detection - New York Machine Learning
Anomaly Detection - New York Machine Learning
 
Cognitive computing with big data, high tech and low tech approaches
Cognitive computing with big data, high tech and low tech approachesCognitive computing with big data, high tech and low tech approaches
Cognitive computing with big data, high tech and low tech approaches
 
Recommendation Techn
Recommendation TechnRecommendation Techn
Recommendation Techn
 

Recently uploaded

Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?Watsoo Telematics
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...aditisharan08
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyFrank van der Linden
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 

Recently uploaded (20)

Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The Ugly
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 

Big Data Progress in Kubernetes with MapR

  • 1. Progress for big data in Kubernetes TED DUNNING Chief Application Architect
  • 2. 2 © 2018 MapR Technologies, Inc. // MapR Confidential Ted Dunning, PhD Chief Application Architect, MapR Technologies Board member, Apache Software Foundation O’Reilly author Email tdunning@mapr.com tdunning@apache.org Twitter @ted_dunning Contact Information
  • 3. 3 © 2018 MapR Technologies, Inc. // MapR Confidential kubernetes is coming!
  • 4. 4 © 2018 MapR Technologies, Inc. // MapR Confidential why?
  • 5. 5 © 2018 MapR Technologies, Inc. // MapR Confidential Source: Shippable.com http://blog.shippable.com/why-the-adoption-of-kubernetes-will-explode-in-2018 kubernetes = major community support
  • 6. 6 © 2018 MapR Technologies, Inc. // MapR Confidential every cloud supports kubernetes https://www.sinax.be/en/aws/ https://www.westconcomstor.com/za/en/vendors/wc-vendors/microsoft-azure-EN-UK.html https://www.g2crowd.com/products/google-kubernetes-engine-gke/details
  • 7. 7 © 2018 MapR Technologies, Inc. // MapR Confidential massive customer adoption rate
  • 8. 8 © 2018 MapR Technologies, Inc. // MapR Confidential
  • 9. 9 © 2018 MapR Technologies, Inc. // MapR Confidential what is kubernetes?
  • 10. 10 © 2018 MapR Technologies, Inc. // MapR Confidential kubernetes (n.) - greek word for pilot or helm
  • 11. 11 © 2018 MapR Technologies, Inc. // MapR Confidential kubernetes started life as a successor to google’s borg project... https://cloud.google.com/security/encryption-in-transit/
  • 12. 12 © 2018 MapR Technologies, Inc. // MapR Confidential kubernetes is an ecosystem... Source: Redmonk - http://redmonk.com/sogrady/2017/09/22/cloud-native-license-choices/
  • 13. 13 © 2018 MapR Technologies, Inc. // MapR Confidential container and resource orchestration engine...
  • 14. 14 © 2018 MapR Technologies, Inc. // MapR Confidential Source: Shippable.com http://blog.shippable.com/why-the-adoption-of-kubernetes-will-explode-in-2018 kubernetes won the container orchestration war...
  • 15. 15 © 2018 MapR Technologies, Inc. // MapR Confidential what is kubernetes?
  • 16. 16 © 2018 MapR Technologies, Inc. // MapR Confidential it runs containers
  • 17. 17 © 2018 MapR Technologies, Inc. // MapR Confidential what is a container?
  • 18. 18 © 2018 MapR Technologies, Inc. // MapR Confidential not a vm
  • 19. 19 © 2018 MapR Technologies, Inc. // MapR Confidential hardware vm vs container os hypervisor vm os libs app vm os libs app hardware os container libs app container libs app container libs app
  • 20. 20 © 2018 MapR Technologies, Inc. // MapR Confidential pets vs cattle https://fwallpapers.com/view/cat-jeans http://www.clipartpanda.com/clipart_images/free-clip-art-1083418
  • 21. 21 © 2018 MapR Technologies, Inc. // MapR Confidential pets vs cattle - long lived - name them - care for them - ephemeral - brand them with #’s - well..vets are expensive
  • 22. 22 © 2018 MapR Technologies, Inc. // MapR Confidential
  • 23. 27 © 2018 MapR Technologies, Inc. // MapR Confidential File File File Container Image chroot container = image + isolation cgroups ● cpu ● memory ● network ● etc. namespaces ● pids ● mnts ● etc.
  • 24. 28 © 2018 MapR Technologies, Inc. // MapR Confidential containers are good
  • 25. 29 © 2018 MapR Technologies, Inc. // MapR Confidential containers are good
  • 26. 30 © 2018 MapR Technologies, Inc. // MapR Confidential containers have a problem
  • 27. 31 © 2018 MapR Technologies, Inc. // MapR Confidential you can never get away from pets unless: - you handle the problem of container state - you need an environment to support cattle MapR and kubernetes are the solution
  • 28. 32 © 2018 MapR Technologies, Inc. // MapR Confidential solve port mapping hell monitor running containers handle dead containers move containers so utilization improves autoscale container instances to handle load Things docker can’t (or won’t) do...
  • 29. 33 © 2018 MapR Technologies, Inc. // MapR Confidential Magical View of Kubernetes
  • 30. 34 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 Kubernetes Magical View of Kubernetes Kubernetes starts application containers “somewhere”
  • 31. 35 © 2018 MapR Technologies, Inc. // MapR Confidential Magical View of Kubernetes App 1 App 3 Kubernetes Later containers may be started elsewhere due to “affinities”
  • 32. 36 © 2018 MapR Technologies, Inc. // MapR Confidential Magical View of Kubernetes App 1 App 2 App 3 Kubernetes Kubernetes provides super fast naming via DNS so containers can find each other
  • 33. 37 © 2018 MapR Technologies, Inc. // MapR Confidential Note that you don’t think about which machine at all
  • 34. 38 © 2018 MapR Technologies, Inc. // MapR Confidential You don’t think about which machine at all No more names from The Hobbit Just cattle
  • 35. 39 © 2018 MapR Technologies, Inc. // MapR Confidential Software engineering can be viewed as freezing bits Initially, everything is possible, nothing is actual We freeze the source Then the binary Then the package Then the environment Ultimately the system The Impact of Kubernetes
  • 36. 40 © 2018 MapR Technologies, Inc. // MapR Confidential
  • 37. 41 © 2018 MapR Technologies, Inc. // MapR Confidential
  • 38. 42 © 2018 MapR Technologies, Inc. // MapR Confidential
  • 39. 43 © 2018 MapR Technologies, Inc. // MapR Confidential git cc/ld java/jar Build docker build resources config libraries Package helm package Construct
  • 40. 44 © 2018 MapR Technologies, Inc. // MapR Confidential git cc/ld java/jar Build docker build resources config libraries Package helm package Construct helm install/scale Load balancer Deploy
  • 41. 45 © 2018 MapR Technologies, Inc. // MapR Confidential This is glorious
  • 42. 46 © 2018 MapR Technologies, Inc. // MapR Confidential but we still have a problem
  • 43. 47 © 2018 MapR Technologies, Inc. // MapR Confidential state
  • 44. 48 © 2018 MapR Technologies, Inc. // MapR Confidential Not Done Yet Load balancer
  • 45. 49 © 2018 MapR Technologies, Inc. // MapR Confidential Not Done Yet Load balancer Here’s the problem
  • 46. 50 © 2018 MapR Technologies, Inc. // MapR Confidential Not Really Ready at All State in containers messes things up Restarts lose the state Replicating state makes services complex Application developers just aren’t systems developers State life-cycle doesn’t match app life- cycle Load balancer Here’s the problem
  • 47. 51 © 2018 MapR Technologies, Inc. // MapR Confidential What is a Service Anyway? Load balancer RPC in
  • 48. 52 © 2018 MapR Technologies, Inc. // MapR Confidential Synchronous RPC-based services only serve one need In a synchronous service it’s common to do some, defer some But deferring work is hard in a synchronous world … we have to give up the return call in some sense This is the germ of streaming architecture But … Not Entirely
  • 49. 53 © 2018 MapR Technologies, Inc. // MapR Confidential Deferred What is a Service Anyway? Load balancer RPC in
  • 50. 54 © 2018 MapR Technologies, Inc. // MapR Confidential If I can hide details of who and where, I have a service If I can hide details of deployment, I have a micro-service If I can hide details of when, I have a streaming micro-service Isolation is The Defining Characteristic
  • 51. 55 © 2018 MapR Technologies, Inc. // MapR Confidential Temporal and Geo Isolation Producer Consumer isn’t even running
  • 52. 56 © 2018 MapR Technologies, Inc. // MapR Confidential Temporal and Geo Isolation Producer Consumer
  • 53. 57 © 2018 MapR Technologies, Inc. // MapR Confidential Producer Consumer Temporal and Geo Isolation Consumer could be an ocean away
  • 54. 58 © 2018 MapR Technologies, Inc. // MapR Confidential Files are important • Config files, image files, archival data data • Legacy applications like machine learning, web Tables are important • Critical to have random update for some applications • Should scale transparently without dedicated cluster Streams are important • Should be co-equal form of persistence We Need Multiple Forms of Persistence
  • 55. 59 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3
  • 56. 60 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3 stream LogFile
  • 57. 61 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3 stream LogFile
  • 58. 62 © 2018 MapR Technologies, Inc. // MapR Confidential App 1 App 2 App 3
  • 59. 63 © 2018 MapR Technologies, Inc. // MapR Confidential Global namespace across entire Kubernetes cluster • Between clusters as well if possible All three forms of primitive persistence • Files, streams, tables Inherently scalable • Performance, cardinality, locality Uniform access and control • Path names for all objects, identical permission scheme What Does This Data Platform Need to Have?
  • 60. 64 © 2018 MapR Technologies, Inc. // MapR Confidential Global namespace across entire Kubernetes cluster • Between clusters as well if possible All three forms of primitive persistence • Files, streams, tables Inherently scalable • Performance, cardinality, locality Uniform access and control • Path names for all objects, identical permission scheme Oh…. got that already. Just need to wire it up to Kubernetes What Does This Data Platform Need to Have?
  • 61. 65 © 2018 MapR Technologies, Inc. // MapR Confidential
  • 62. 66 © 2018 MapR Technologies, Inc. // MapR Confidential
  • 63. 67 © 2018 MapR Technologies, Inc. // MapR Confidential kubelet docker pod Normally pods interact directly with node resources
  • 64. 68 © 2018 MapR Technologies, Inc. // MapR Confidential kubelet docker pod plugin We can install a volume plugin (recently introduced)
  • 65. 69 © 2018 MapR Technologies, Inc. // MapR Confidential mapr- fs kubelet docker pod plugin fuse This allows uniform access to files, tables and streams
  • 66. 70 © 2018 MapR Technologies, Inc. // MapR Confidential Where does that take us?
  • 67. 71 © 2018 MapR Technologies, Inc. // MapR Confidential Installation of plugin is K8S level operation • No per-node attention required Use of plugin is overlay operation • No change needed for an container • Any Helm chart can use the plugin for conventional file access Can share storage/compute or isolate or scale independently Consequences
  • 68. 72 © 2018 MapR Technologies, Inc. // MapR Confidential State is no longer a dirty word for Kubernetes HPC can run on K8S Boring things can run on K8S without storage appliances Previously crazy ideas can now be valuable Complexity is largely not visible More Consequences
  • 69. 73 © 2018 MapR Technologies, Inc. // MapR Confidential Container orchestration is awesome
  • 70. 74 © 2018 MapR Technologies, Inc. // MapR Confidential Container orchestration is awesome Data orchestration is, too
  • 71. 75 © 2018 MapR Technologies, Inc. // MapR Confidential Cloud as-is: No unified data access or security concepts Public Cloud Application ✓ API API Connector Single cloud vendor strategy: • Vendor lock in • No failover in case of global outage • Limited Edge capabilities AWS Services: • Kinesis & Elastic MapReduce • Redshift & DynamoDB • S3 & Glacier
  • 72. 76 © 2018 MapR Technologies, Inc. // MapR Confidential Cloud as-is: No unified data access or security concepts Public Cloud Application ✓ API API Connector AWS Services: • Kinesis & Elastic MapReduce • Redshift & DynamoDB • S3 & Glacier API Azure Services: • HD Insight • SQL Server & CosmosDB • Blob & DataLakeStore Multi cloud strategy: • Complex data movement between clouds • On any other cloud: • Different API‘s: application breaks • Different Security concept
  • 73. 77 © 2018 MapR Technologies, Inc. // MapR Confidential Cloud as-is: No unified data access or security concepts Edge Private Cloud On Premise Public Cloud Public Cloud Public Cloud API Application ✓ API API API API API Connector Multi cloud strategy: • Complex data movement between clouds • On any other cloud: • Different API‘s, application breaks • Different Security concept
  • 74. 78 © 2018 MapR Technologies, Inc. // MapR Confidential Open APIs How a “Media Company” is Unifying Compute Environments Application • Unified Security Model • Data access decoupled from physical storage location. Globally. • No lock-in to proprietary APIs • Full openness • Data made portable API Connector ✓ GLOBAL DATA MANAGEMENT Edge Private Cloud On Premise Public Cloud Public Cloud Public Cloud Uniform computing environment everywhere
  • 75. 79 © 2018 MapR Technologies, Inc. // MapR Confidential Open APIs How “Manufacturing Company” is Orchestrating Data Application • Unified Security Model • Data access decoupled from physical storage location. Globally. • No lock-in to proprietary APIs • Full openness • Data made portable API Connector ✓ GLOBAL DATA MANAGEMENT Edge Private Cloud On Premise Public Cloud Public Cloud Public Cloud Platform level data replication
  • 76. 80 © 2018 MapR Technologies, Inc. // MapR Confidential Tier 1 Bank #1 Creating a Global Filesystem /mapr/edge1 /mapr/edge2 /mapr/edge3 /mapr/newyork /mapr/amsterdam /mapr/azure /mapr/gcp /mapr/aws-eu-west NFS POSIX HDFSREST HOT WARM COLD /mapr Kafka JSON HBASE SQL S3 Application ✓ Global access to local data
  • 77. 81 © 2018 MapR Technologies, Inc. // MapR Confidential Tier 1 Bank #2: Creating a Universal Application Platform Application GLOBAL DATA MANAGEMENT Edge Private Cloud On Premise Public Cloud Public Cloud Public Cloud PodPod Pod Image Classification using Tensorflow in a Docker container Classic ETL Scheduling & Scaling MapR Kubernetes Volume Driver Single pane of glass to control jobs anywhere
  • 78. 82 © 2018 MapR Technologies, Inc. // MapR Confidential Additional Resources O’Reilly report by Ted Dunning & Ellen Friedman © September 2017 Read free courtesy of MapR: https://mapr.com/ebook/machine-learning-logistics/ O’Reilly book by Ted Dunning & Ellen Friedman © March 2016 Read free courtesy of MapR: https://mapr.com/streaming-architecture-using- apache-kafka-mapr-streams/ Ted Dunning & Ellen Friedman Model Management in theReal World MachineLearning Logistics
  • 79. 83 © 2018 MapR Technologies, Inc. // MapR Confidential Additional Resources O’Reilly book by Ted Dunning & Ellen Friedman © June 2014 Read free courtesy of MapR: https://mapr.com/practical-machine-learning-new-look- anomaly-detection/ O’Reilly book by Ellen Friedman & Ted Dunning © February 2014 Read free courtesy of MapR: https://mapr.com/practical-machine-learning/
  • 80. 84 © 2018 MapR Technologies, Inc. // MapR Confidential AI & Analytics in Production: How to Make It Work Free book signing with authors Ted Dunning & Ellen Friedman MapR stand #145: Tues 1:00 pm – 1:45 pm Wed 1:00 pm – 1:45 pm Download free pdf via MapR: https://mapr.com/ebook/ai-and-analytics-in-production/