SlideShare a Scribd company logo
1 of 41
Download to read offline
Grab some
coffee and
enjoy the
pre-show
banter
before the
top of the
hour! !
The Briefing Room
Time's Up! Getting Value from Big Data Now
Welcome
Host:
Eric Kavanagh
eric.kavanagh@bloorgroup.com
@eric_kavanagh
u  Reveal the essential characteristics of enterprise
software, good and bad
u  Provide a forum for detailed analysis of today s innovative
technologies
u  Give vendors a chance to explain their product to savvy
analysts
u  Allow audience members to pose serious questions... and
get answers!
Mission
Big Integration
u  Old infrastructure
lacking
u  New pipes are
needed
u  Well begun is half
done!
Analyst
Robin Bloor is
Chief Analyst at
The Bloor Group
robin.bloor@bloorgroup.com
@robinbloor
CASK
u  CASK offers a unified integration platform for big data
applications and data lakes
u  Its CDAP architecture provides data containers,
program containers and application containers for data
and applications on Hadoop
u  CASK also offers Hydrator for building and managing
data pipelines and data lakes, and Tracker for data
lake governance
Guest
Jonathan Gray
Jonathan Gray, Founder & CEO of Cask, is an entrepreneur
and software engineer with a background in startups, open
source and all things data. Prior to founding Cask, Jonathan
was a software engineer at Facebook where he drove HBase
engineering efforts, including Facebook Messages and
several other large-scale projects from inception to
production.
An open source evangelist, Jonathan was responsible for
helping build the Facebook engineering brand through
developer outreach and refocusing the open source strategy
of the company. Prior to Facebook, Jonathan founded
Streamy.com, where he became an early adopter of Hadoop
and HBase and is now a core contributor and active
committer in the community.
Jonathan holds a bachelor’s degree in Electrical and
Computer Engineering and Business Administration from
Carnegie Mellon University.
Big Data on Tap
cask.co November 1, 2016
The Briefing Room
Jonathan Gray
Founder & CEO
cask.co
Hadoop Enables New Applications and Architectures
2
ENTERPRISE DATA LAKES BIG DATA ANALYTICS PRODUCTION DATA APPS
Batch and Realtime
Data Ingestion
Any type of data from any
type of source in any volume
Batch and Streaming ETL
Code-free self-service creation
and management of pipelines
SQL Exploration and
Data Science
All data is automatically
accessible via SQL and client SDKs
Data as a Service
Easily expose generic or
custom REST APIs on any data
360o
Customer View
Integrate data from any source
and expose through queries
and APIs
Realtime Dashboards
Perform realtime OLAP
aggregations and serve them
through REST APIs
Time Series Analysis
Store, process and serve massive
volumes of time-series data
Realtime Log Analytics
Ingestion and processing of
high-throughput streaming
log events
Recommendation Engines
Build models in batch using
historical data and serve them
in realtime
Anomaly Detection Systems
Process streaming events and
predictably compare them in
realtime to historical data
NRT Event Monitoring
Reliably monitor large streams of
data and perform defined actions
within a specified time
Internet of Things
Ingestion, storage and processing
of events that is highly-available,
scalable and consistent
ENTERPRISE DATA LAKES BIG DATA ANALYTICS PRODUCTION DATA APPS
Batch and Realtime
Data Ingestion
Any type of data from any
type of source in any volume
Batch and Streaming ETL
Code-free self-service creation
and management of pipelines
SQL Exploration and
Data Science
All data is automatically
accessible via SQL and client SDKs
Data as a Service
Easily expose generic or
custom REST APIs on any data
360o
Customer View
Integrate data from any source
and expose through queries
and APIs
Realtime Dashboards
Perform realtime OLAP
aggregations and serve them
through REST APIs
Time Series Analysis
Store, process and serve massive
volumes of time-series data
Realtime Log Analytics
Ingestion and processing of
high-throughput streaming
log events
Recommendation Engines
Build models in batch using
historical data and serve them
in realtime
Anomaly Detection Systems
Process streaming events and
predictably compare them in
realtime to historical data
NRT Event Monitoring
Reliably monitor large streams of
data and perform defined actions
within a specified time
Internet of Things
Ingestion, storage and processing
of events that is highly-available,
scalable and consistent
ENTERPRISE DATA LAKES BIG DATA ANALYTICS PRODUCTION DATA APPS
Batch and Realtime
Data Ingestion
Any type of data from any
type of source in any volume
Batch and Streaming ETL
Code-free self-service creation
and management of pipelines
SQL Exploration and
Data Science
All data is automatically
accessible via SQL and client SDKs
Data as a Service
Easily expose generic or
custom REST APIs on any data
360o
Customer View
Integrate data from any source
and expose through queries
and APIs
Realtime Dashboards
Perform realtime OLAP
aggregations and serve them
through REST APIs
Time Series Analysis
Store, process and serve massive
volumes of time-series data
Realtime Log Analytics
Ingestion and processing of
high-throughput streaming
log events
Recommendation Engines
Build models in batch using
historical data and serve them
in realtime
Anomaly Detection Systems
Process streaming events and
predictably compare them in
realtime to historical data
NRT Event Monitoring
Reliably monitor large streams of
data and perform defined actions
within a specified time
Internet of Things
Ingestion, storage and processing
of events that is highly-available,
scalable and consistent
Data Applications Drive Meaningful Business Value
cask.co3
But Getting Value from Big Data is Hard
Too much focus on infrastructure and integration, rather than applications and analytics
Divergence of distributions
and technologies
Integration silos created by
narrow point solutions
Proliferation of projects,
services and APIs
Complexity of technologies
and new user learning curve
cask.co4
Without a consistent set of tools, IT will not be an effective data enabler for the business
Developer
Architecture & Programming
Focused on Apps & Solutions
Ops
Configuring & Monitoring
Focused on Infrastructure & SLA’s
LOB / Product
Driving Revenue & Decision Making
Focused on Products & Insights
Data Scientist
Scripting & Machine Learning
Focused on Data & Algorithms
And There Are Many Faces of Hadoop
cask.co5
Enter Cask
AT&T, Cloudera and Ericsson 

Strategic Investors
3.5 Cask Data Application Platform,
Cask Hydrator and Cask Tracker
Latest Release
AT&T, Ericsson, Lotame, Salesforce, Cloudera,
Hortonworks, MapR, Microsoft, IBM, Tableau…
Key Customers & Partners
By early Hadoop engineers from
Facebook and Yahoo!
Founded in 2011
Andreessen Horowitz, Safeguard,
Battery Venture and Ignition Partners
Raised $37+ Million
Featuring Cask Market,

the “big data app store”
NEW: CDAP 4 Preview
A Container Architecture that puts
Big Data on Tap
Why “Cask” ?
cask.co6
Convergence of Big Data Apps and Data Integration
The Evolution of the Cask Platform
Big Data Apps + Data Integration
• Data ingest
• Data pipelines
• Workflows and metadata
“WebLogic Meets Informatica”
CDAP
v3
Big Data App Server
• Abstractions & integrations
• Metrics & logs
• Debugging environment
“WebLogic for Hadoop”
CDAP
v2
Unified Integration for Big Data
• Security & governance
• Self-service environment
• Enterprise integrations
“Unified Big Data Integration”
CDAP
v4
cask.co
Introducing Cask Data Application Platform (CDAP)
7
First Unified Integration Platform for Big Data



Platform for distributed apps, bringing together

application management with data integration


• 100% open source and built for extensibility

• Supports all major Hadoop distributions and clouds

• Integrates the latest open source big data technologies
Data Lake
Fraud
Detection
Recommendation
Engine
Sensor Data
Analytics
Customer
360
Modern Data
Integration
Distributed
Application
Framework
Self-Service
User Experience
Enterprise-grade
Security &
Governance
cask.co8
• Real-time and Batch
• Reliable and Scalable
• Simple and Self-Service
Modern Data Integration
EXPLORE
for analytics and
data science
PROCESS
for ETL and
machine learning
SERVE
any data to any
destination
INGEST
any data from
any source
cask.co9
Distributed Application Framework
DEVELOP
rapidly build
applications
TEST
powerful test and
CI framework
DEPLOY
run any apps in
any environment
SCALE
horizontally scale
apps and data
• Real-time and Batch
• Memory, Local, Distributed
• Analytics and Applications
cask.co10
Security and Governance
CAPTURE
store all metadata
about your data
DISCOVER
easily locate any
of your data
TRACK
every audit plus
lineage graphs
ANALYZE
understand usage
patterns of data
AUTHENTICATE AUTHORIZEENCRYPT
cask.co11
A data discovery tool to explore metadata and usageA code-free framework to build and run data pipelines
Self-Service User Experience
Drag & drop
graphical
interface
Create,
debug,
deploy and
manage
Separation
of logic and
execution
environment
Native to
Hadoop &
Spark —
scales out
Rich app-
level
metadata
Track
lineage and
audits
Analyze
usage of
datasets
MDM
integration
framework
cask.co
The CDAP Architecture
12
Applications
Programs
MapReduce Spark
Tigon Workflow
Service Worker
Metadata
Datasets
Table Avro Parquet
Timeseries OLAP Cube
Geospatial ObjectStore
Metadata
Metadata
• Application Container Architecture
• Reusable Programming Abstractions
• Global User and Machine Metadata
• Highly Extensible Plugin Architecture
cask.co13
Single framework for building and running data apps and data lakes on Hadoop and Spark
Rapid
Development
• Standardization, deep
integrations, tools and docs

• Separation of app logic from
data logic and integration logic

• Conceptual integrity within
applications and consistency
across environments
Production
Operations &
Governance
• Simplified packaging, deployment
and monitoring of apps on Hadoop

• Enhanced security and governance
with centralized metrics and logs

• Tracking and exploration of
metadata, data provenance, audit
trails and usage analytics
CDAP Enables the Full Big Data Application Lifecycle
reduces time to develop and deploy big data apps by 80%
reduces time to insights and accelerates business value
removes barriers to innovation and future-proofs your apps
cask.co14
Customer Success Stories
Customer

Situation
Lack of existing Hadoop expertise
and frustration with hand-coding
and scripting tools
Cask Hydrator for rapid creation of
data pipelines and Cask Tracker for
data discovery
POC in 2 days

Production in 2 months
Cask

Solution
Small team and significant
technical challenges limit pace of
development and solution scale
CDAP for real-time ingestion and
consistent processing with
production operations support
Development in 1 month

Production in 3 months
Hundreds of Users

Thousands of Pipelines
Multiple teams and technologies
with widely varied skillsets and
incompatible design choices
CDAP for data lake management
and orchestration, tightly
integrated into existing systems
Health Insurance Provider

offloading clinical / immunization
reporting from Netezza
Leading SaaS Platform

taking new real-time, massive
scale products to market
Large Telco Enterprise

building a centralized, secured,

multi-tenant Data Lake
cask.co15
Cask was Named a
Gartner Cool
Vendor 2016
Cask was Certified a
Great Place to Work 2016
“ … for the rest of us who lack the technological chips or patience to
make it all work, there’s good news: it will soon get easier, thanks to the
work done by the big data pioneers, as well as vendors like Cask …”

(Alex Woodie, Managing Editor, Datanami)
Awards and Accolades
“ … “Cask has tilted the playing field, earning a massive unfair
advantage over proprietary point products for data integration and
ingest …”

(Nik Rouda, Senior Analyst, Enterprise Strategy Group)
“ … “CDAP is a big win for us … the amount of code we needed to
write was minimal with CDAP, and it was much easier and faster than
we ever expected …”

(Jia-Long Wu, Data Architect, Lotame)
cask.co16
NEW: CDAP 4 — Big Data Apps on Tap!
Available for download now!
Release of CDAP 4 Preview
“Big Data App Store”
Cask Market
Interactive Data Preparation
Cask Wrangler
Interactive Wizards for Common Tasks
Resource Center
Rewrite based on React
Reimagined CDAP UI
cask.co17
The “App Store for Big Data”
Cask Market
• Goal: Time to value in minutes w/ no existing experience
• Application and Library Ecosystem with pre-built Hadoop
solutions, reusable templates, and third-party plugins
• Available from anywhere inside the CDAP UI with a click
• Initially, everything in the Cask Market has been bootstrapped
by Cask based on ongoing work across our customers, is 100%
open source and available on GitHub
• Eventually, developers and ISVs will be able to showcase and
market their own applications and libraries (ex: Graylog)
Cask Market includes Interactive, Guided Wizards for Configuring Pre-Built Templates
NEW: CDAP 4 — Big Data Apps on Tap!
cask.co18
Building Data Pipelines on Hadoop with
Cask Hydrator
Data Lake Webinar
Introduction to Cask Hydrator
CDAP - Containers on Hadoop
CDAP Extensions - Cask Hydrator and
Cask Tracker
ESG Solution Spotlight
CDAP Technical Concepts (video)
Cask / Cloudera Solution Brief
Cask Resources
cask.co
● CDAP provides the first unified integration
platform for big data
● Cask Hydrator and Cask Tracker are visual
extensions of CDAP for self-service access
● CDAP empowers enterprise IT to deliver

faster time to value for Hadoop and Spark, from
prototype to production

● Cask Market is a “big data app store” available in
CDAP 4 with pre-built apps, pipelines, plugins

● CDAP is 100% open source, highly extensible,
enterprise-ready, and commercially supported
Big Data on Tap
Summary
cask.co20
For more information, go to: cask.co
Thanks!
Perceptions & Questions
Analyst:
Robin Bloor
Big Data Foundations?
Robin Bloor, PhD
Neither Hadoop Nor Spark Is a Solution
However, both are useful and
increasingly versatile components for
Big Data applications
The Evolution of the Little Elephant
u  Hortonworks: Apache pure
play. No apparent vision.
u  Cloudera: Some proprietary
components (Cloudera
Manager, Impala, Cloudera
Search). Vision is corporate
data hub(?)
u  MapR: Also some proprietary
components (MapR-FS, MapR
Streams, MapR-DB)
u  And then there’s the cloud.
The Ship of Fools
Until Hadoop’s direction is controlled by
a single “captain” we may have to
tolerate the ship of fools
The “Big Data Hype Cycle” Is Misleading
u  Big Data is an ecosystem,
not a technology – which
distorts this graph
u  Some analytics applications
have experienced “absurd
acceleration”
u  Hadoop is, in many
instances, a laggard - Spark
too
u  Nevertheless, we seem to
be exiting “the trough”
Data lake or Governance Hub?
The System Management Issue
Mobile
Devices
DesktopsServers
IoT
The
Cloud
Archive
Data
Stores
Data
Assaying
Data
Capture
Real-Time
Streaming?
Data
Mgt
Data
Serving
The Prospecting Domain
Apps
Data
Life Cycle
Mgt
Staging
Area
(Hadoop?)
System
Management
The Fundamental Issue
Big Data does not really have a
foundation. Neither, imho, does the
Data Lake.
Luckily, there are third parties…
u  Regarding Hadoop, do you have any “preferred
components?”
u  How do you stay current with the various distros?
Backward compatibility? Can a customer upgrade
at will?
u  How does your technology impact performance (if
at all)?
u  Do you provide a consultancy service?
u  Which companies/services do you regard as
competitive?
u  Do you have any specific partners?
u  What does an implementation look like?
THANK YOU
for your
ATTENTION!
Some images provided courtesy of Wikimedia Commons

More Related Content

What's hot

Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
 
Kappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKai Wähner
 
Integrating Applications and Data (with Oracle PaaS Cloud) - Oracle Cloud Day...
Integrating Applications and Data (with Oracle PaaS Cloud) - Oracle Cloud Day...Integrating Applications and Data (with Oracle PaaS Cloud) - Oracle Cloud Day...
Integrating Applications and Data (with Oracle PaaS Cloud) - Oracle Cloud Day...Lucas Jellema
 
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big DataInfochimps, a CSC Big Data Business
 
How USCIS Powered a Digital Transition to eProcessing with Kafka (Rob Brown &...
How USCIS Powered a Digital Transition to eProcessing with Kafka (Rob Brown &...How USCIS Powered a Digital Transition to eProcessing with Kafka (Rob Brown &...
How USCIS Powered a Digital Transition to eProcessing with Kafka (Rob Brown &...confluent
 
The structured streaming upgrade to Apache Spark and how enterprises can bene...
The structured streaming upgrade to Apache Spark and how enterprises can bene...The structured streaming upgrade to Apache Spark and how enterprises can bene...
The structured streaming upgrade to Apache Spark and how enterprises can bene...Impetus Technologies
 
Time series Analytics - a deep dive into ADX Azure Data Explorer @Data Saturd...
Time series Analytics - a deep dive into ADX Azure Data Explorer @Data Saturd...Time series Analytics - a deep dive into ADX Azure Data Explorer @Data Saturd...
Time series Analytics - a deep dive into ADX Azure Data Explorer @Data Saturd...Riccardo Zamana
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKai Wähner
 
Spark Summit Keynote by Suren Nathan
Spark Summit Keynote by Suren NathanSpark Summit Keynote by Suren Nathan
Spark Summit Keynote by Suren NathanSpark Summit
 
One Kubernetes to rule them all (ZEUS 2019 Keynote)
One Kubernetes to rule them all (ZEUS 2019 Keynote)One Kubernetes to rule them all (ZEUS 2019 Keynote)
One Kubernetes to rule them all (ZEUS 2019 Keynote)Simon Harrer
 
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...Kai Wähner
 
VP of WW Partners by Alan Chhabra
VP of WW Partners by Alan ChhabraVP of WW Partners by Alan Chhabra
VP of WW Partners by Alan ChhabraBig Data Spain
 
Reliable Data Intestion in BigData / IoT
Reliable Data Intestion in BigData / IoTReliable Data Intestion in BigData / IoT
Reliable Data Intestion in BigData / IoTGuido Schmutz
 
2020 Big Data & Analytics Maturity Survey Results
2020 Big Data & Analytics Maturity Survey Results2020 Big Data & Analytics Maturity Survey Results
2020 Big Data & Analytics Maturity Survey ResultsCarole Gunst
 
Apache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryApache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryKai Wähner
 
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsEnabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsStreamsets Inc.
 
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...Kai Wähner
 
Rediscovering the Value of Apache Kafka® in Modern Data Architecture
Rediscovering the Value of Apache Kafka® in Modern Data ArchitectureRediscovering the Value of Apache Kafka® in Modern Data Architecture
Rediscovering the Value of Apache Kafka® in Modern Data Architectureconfluent
 
Life is but a Stream
Life is but a StreamLife is but a Stream
Life is but a StreamDatabricks
 
Security, ETL, BI & Analytics, and Software Integration
Security, ETL, BI & Analytics, and Software IntegrationSecurity, ETL, BI & Analytics, and Software Integration
Security, ETL, BI & Analytics, and Software IntegrationDataWorks Summit
 

What's hot (20)

Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Kappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology Comparison
 
Integrating Applications and Data (with Oracle PaaS Cloud) - Oracle Cloud Day...
Integrating Applications and Data (with Oracle PaaS Cloud) - Oracle Cloud Day...Integrating Applications and Data (with Oracle PaaS Cloud) - Oracle Cloud Day...
Integrating Applications and Data (with Oracle PaaS Cloud) - Oracle Cloud Day...
 
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
 
How USCIS Powered a Digital Transition to eProcessing with Kafka (Rob Brown &...
How USCIS Powered a Digital Transition to eProcessing with Kafka (Rob Brown &...How USCIS Powered a Digital Transition to eProcessing with Kafka (Rob Brown &...
How USCIS Powered a Digital Transition to eProcessing with Kafka (Rob Brown &...
 
The structured streaming upgrade to Apache Spark and how enterprises can bene...
The structured streaming upgrade to Apache Spark and how enterprises can bene...The structured streaming upgrade to Apache Spark and how enterprises can bene...
The structured streaming upgrade to Apache Spark and how enterprises can bene...
 
Time series Analytics - a deep dive into ADX Azure Data Explorer @Data Saturd...
Time series Analytics - a deep dive into ADX Azure Data Explorer @Data Saturd...Time series Analytics - a deep dive into ADX Azure Data Explorer @Data Saturd...
Time series Analytics - a deep dive into ADX Azure Data Explorer @Data Saturd...
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid CloudKafka for Real-Time Replication between Edge and Hybrid Cloud
Kafka for Real-Time Replication between Edge and Hybrid Cloud
 
Spark Summit Keynote by Suren Nathan
Spark Summit Keynote by Suren NathanSpark Summit Keynote by Suren Nathan
Spark Summit Keynote by Suren Nathan
 
One Kubernetes to rule them all (ZEUS 2019 Keynote)
One Kubernetes to rule them all (ZEUS 2019 Keynote)One Kubernetes to rule them all (ZEUS 2019 Keynote)
One Kubernetes to rule them all (ZEUS 2019 Keynote)
 
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
WJAX 2013 Slides online: Big Data beyond Apache Hadoop - How to integrate ALL...
 
VP of WW Partners by Alan Chhabra
VP of WW Partners by Alan ChhabraVP of WW Partners by Alan Chhabra
VP of WW Partners by Alan Chhabra
 
Reliable Data Intestion in BigData / IoT
Reliable Data Intestion in BigData / IoTReliable Data Intestion in BigData / IoT
Reliable Data Intestion in BigData / IoT
 
2020 Big Data & Analytics Maturity Survey Results
2020 Big Data & Analytics Maturity Survey Results2020 Big Data & Analytics Maturity Survey Results
2020 Big Data & Analytics Maturity Survey Results
 
Apache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare IndustryApache Kafka in the Healthcare Industry
Apache Kafka in the Healthcare Industry
 
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsEnabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
 
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
 
Rediscovering the Value of Apache Kafka® in Modern Data Architecture
Rediscovering the Value of Apache Kafka® in Modern Data ArchitectureRediscovering the Value of Apache Kafka® in Modern Data Architecture
Rediscovering the Value of Apache Kafka® in Modern Data Architecture
 
Life is but a Stream
Life is but a StreamLife is but a Stream
Life is but a Stream
 
Security, ETL, BI & Analytics, and Software Integration
Security, ETL, BI & Analytics, and Software IntegrationSecurity, ETL, BI & Analytics, and Software Integration
Security, ETL, BI & Analytics, and Software Integration
 

Viewers also liked

"Sawares ułatwia życie" - artykuł w miesięczniku Kraków, nr 5 maj 2009
"Sawares ułatwia życie" - artykuł w miesięczniku Kraków, nr 5 maj 2009"Sawares ułatwia życie" - artykuł w miesięczniku Kraków, nr 5 maj 2009
"Sawares ułatwia życie" - artykuł w miesięczniku Kraków, nr 5 maj 2009sawares
 
Workshop on Antioxidants
Workshop on AntioxidantsWorkshop on Antioxidants
Workshop on Antioxidantsyenchua
 
Digital booklet trill o.g (deluxe edition).
Digital booklet   trill o.g (deluxe edition).Digital booklet   trill o.g (deluxe edition).
Digital booklet trill o.g (deluxe edition).Zorrge
 
Offical principles of editng
Offical principles of editngOffical principles of editng
Offical principles of editngjs1productionstm
 
Digital booklet how i got over
Digital booklet    how i got overDigital booklet    how i got over
Digital booklet how i got overZorrge
 
Digital booklet monumental
Digital booklet   monumentalDigital booklet   monumental
Digital booklet monumentalZorrge
 
Digital booklet
Digital bookletDigital booklet
Digital bookletZorrge
 
Pusha T - Fear Of God II Let Us Pray
Pusha T - Fear Of God II Let Us PrayPusha T - Fear Of God II Let Us Pray
Pusha T - Fear Of God II Let Us PrayZorrge
 
50 Cent book “Playground”
50 Cent book “Playground”50 Cent book “Playground”
50 Cent book “Playground”Zorrge
 

Viewers also liked (20)

Photography research 2
Photography research 2Photography research 2
Photography research 2
 
57proposal
57proposal57proposal
57proposal
 
"Sawares ułatwia życie" - artykuł w miesięczniku Kraków, nr 5 maj 2009
"Sawares ułatwia życie" - artykuł w miesięczniku Kraków, nr 5 maj 2009"Sawares ułatwia życie" - artykuł w miesięczniku Kraków, nr 5 maj 2009
"Sawares ułatwia życie" - artykuł w miesięczniku Kraków, nr 5 maj 2009
 
Workshop on Antioxidants
Workshop on AntioxidantsWorkshop on Antioxidants
Workshop on Antioxidants
 
Jameel terminology
Jameel terminologyJameel terminology
Jameel terminology
 
Digital booklet trill o.g (deluxe edition).
Digital booklet   trill o.g (deluxe edition).Digital booklet   trill o.g (deluxe edition).
Digital booklet trill o.g (deluxe edition).
 
Jameel terminology
Jameel terminologyJameel terminology
Jameel terminology
 
Offical principles of editng
Offical principles of editngOffical principles of editng
Offical principles of editng
 
Digital booklet how i got over
Digital booklet    how i got overDigital booklet    how i got over
Digital booklet how i got over
 
Photography research 3
Photography research 3Photography research 3
Photography research 3
 
Photography 1
Photography 1Photography 1
Photography 1
 
Digital booklet monumental
Digital booklet   monumentalDigital booklet   monumental
Digital booklet monumental
 
Andy warhol presentation
Andy warhol presentationAndy warhol presentation
Andy warhol presentation
 
Andy warhol presentation
Andy warhol presentationAndy warhol presentation
Andy warhol presentation
 
Light paint presentation
Light paint presentationLight paint presentation
Light paint presentation
 
Digital booklet
Digital bookletDigital booklet
Digital booklet
 
Pusha T - Fear Of God II Let Us Pray
Pusha T - Fear Of God II Let Us PrayPusha T - Fear Of God II Let Us Pray
Pusha T - Fear Of God II Let Us Pray
 
Denison 111
Denison 111Denison 111
Denison 111
 
50 Cent book “Playground”
50 Cent book “Playground”50 Cent book “Playground”
50 Cent book “Playground”
 
Light paint presentation
Light paint presentationLight paint presentation
Light paint presentation
 

Similar to Time's Up! Getting Value from Big Data Now

Streaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache KafkaStreaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache Kafkaconfluent
 
Streaming Visualization
Streaming VisualizationStreaming Visualization
Streaming VisualizationGuido Schmutz
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 editionDavid Talby
 
Confluent kafka meetupseattle jan2017
Confluent kafka meetupseattle jan2017Confluent kafka meetupseattle jan2017
Confluent kafka meetupseattle jan2017Nitin Kumar
 
Financial Services Analytics on AWS
Financial Services Analytics on AWSFinancial Services Analytics on AWS
Financial Services Analytics on AWSAmazon Web Services
 
Querona Presentation 2018
Querona Presentation 2018Querona Presentation 2018
Querona Presentation 2018Synergo!
 
What's New in 6.3 + Data On-Boarding
What's New in 6.3 + Data On-BoardingWhat's New in 6.3 + Data On-Boarding
What's New in 6.3 + Data On-BoardingSplunk
 
Big Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft AzureBig Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft AzureMark Kromer
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise AnalyticsDATAVERSITY
 
StreamAnalytix - Multi-Engine Streaming Analytics Platform
StreamAnalytix - Multi-Engine Streaming Analytics PlatformStreamAnalytix - Multi-Engine Streaming Analytics Platform
StreamAnalytix - Multi-Engine Streaming Analytics PlatformAtul Sharma
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Dataconomy Media
 
Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIAmazon Web Services
 
Oracle Stream Analytics - Simplifying Stream Processing
Oracle Stream Analytics - Simplifying Stream ProcessingOracle Stream Analytics - Simplifying Stream Processing
Oracle Stream Analytics - Simplifying Stream ProcessingGuido Schmutz
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database RoundtableEric Kavanagh
 
Leveraging Mainframe Data for Modern Analytics
Leveraging Mainframe Data for Modern AnalyticsLeveraging Mainframe Data for Modern Analytics
Leveraging Mainframe Data for Modern Analyticsconfluent
 
Tapping the cloud for real time data analytics
 Tapping the cloud for real time data analytics Tapping the cloud for real time data analytics
Tapping the cloud for real time data analyticsAmazon Web Services
 
Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020Riccardo Zamana
 
Achieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and ImplyAchieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and Implyconfluent
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformVMware Tanzu
 
Derfor skal du bruge en DataLake
Derfor skal du bruge en DataLakeDerfor skal du bruge en DataLake
Derfor skal du bruge en DataLakeMicrosoft
 

Similar to Time's Up! Getting Value from Big Data Now (20)

Streaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache KafkaStreaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache Kafka
 
Streaming Visualization
Streaming VisualizationStreaming Visualization
Streaming Visualization
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 edition
 
Confluent kafka meetupseattle jan2017
Confluent kafka meetupseattle jan2017Confluent kafka meetupseattle jan2017
Confluent kafka meetupseattle jan2017
 
Financial Services Analytics on AWS
Financial Services Analytics on AWSFinancial Services Analytics on AWS
Financial Services Analytics on AWS
 
Querona Presentation 2018
Querona Presentation 2018Querona Presentation 2018
Querona Presentation 2018
 
What's New in 6.3 + Data On-Boarding
What's New in 6.3 + Data On-BoardingWhat's New in 6.3 + Data On-Boarding
What's New in 6.3 + Data On-Boarding
 
Big Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft AzureBig Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft Azure
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
StreamAnalytix - Multi-Engine Streaming Analytics Platform
StreamAnalytix - Multi-Engine Streaming Analytics PlatformStreamAnalytix - Multi-Engine Streaming Analytics Platform
StreamAnalytix - Multi-Engine Streaming Analytics Platform
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
 
Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AI
 
Oracle Stream Analytics - Simplifying Stream Processing
Oracle Stream Analytics - Simplifying Stream ProcessingOracle Stream Analytics - Simplifying Stream Processing
Oracle Stream Analytics - Simplifying Stream Processing
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
Leveraging Mainframe Data for Modern Analytics
Leveraging Mainframe Data for Modern AnalyticsLeveraging Mainframe Data for Modern Analytics
Leveraging Mainframe Data for Modern Analytics
 
Tapping the cloud for real time data analytics
 Tapping the cloud for real time data analytics Tapping the cloud for real time data analytics
Tapping the cloud for real time data analytics
 
Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020
 
Achieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and ImplyAchieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
Achieve Sub-Second Analytics on Apache Kafka with Confluent and Imply
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
 
Derfor skal du bruge en DataLake
Derfor skal du bruge en DataLakeDerfor skal du bruge en DataLake
Derfor skal du bruge en DataLake
 

More from Eric Kavanagh

The Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationThe Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationEric Kavanagh
 
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data PipelinesBest Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data PipelinesEric Kavanagh
 
Expediting the Path to Discovery with Multi-Source Analysis
Expediting the Path to Discovery with Multi-Source AnalysisExpediting the Path to Discovery with Multi-Source Analysis
Expediting the Path to Discovery with Multi-Source AnalysisEric Kavanagh
 
Will AI Eliminate Reports and Dashboards
Will AI Eliminate Reports and DashboardsWill AI Eliminate Reports and Dashboards
Will AI Eliminate Reports and DashboardsEric Kavanagh
 
Metadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI ModernizationMetadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI ModernizationEric Kavanagh
 
Database Survival Guide: Exploratory Webcast
Database Survival Guide: Exploratory WebcastDatabase Survival Guide: Exploratory Webcast
Database Survival Guide: Exploratory WebcastEric Kavanagh
 
Better to Ask Permission? Best Practices for Privacy and Security
Better to Ask Permission? Best Practices for Privacy and SecurityBetter to Ask Permission? Best Practices for Privacy and Security
Better to Ask Permission? Best Practices for Privacy and SecurityEric Kavanagh
 
The Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data GovernanceThe Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data GovernanceEric Kavanagh
 
Best Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal ForecastingBest Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal ForecastingEric Kavanagh
 
A Winning Strategy for the Digital Economy
A Winning Strategy for the Digital EconomyA Winning Strategy for the Digital Economy
A Winning Strategy for the Digital EconomyEric Kavanagh
 
Discovering Big Data in the Fog: Why Catalogs Matter
 Discovering Big Data in the Fog: Why Catalogs Matter Discovering Big Data in the Fog: Why Catalogs Matter
Discovering Big Data in the Fog: Why Catalogs MatterEric Kavanagh
 
Health Check: Maintaining Enterprise BI
Health Check: Maintaining Enterprise BIHealth Check: Maintaining Enterprise BI
Health Check: Maintaining Enterprise BIEric Kavanagh
 
Rapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the RescueRapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the RescueEric Kavanagh
 
Solving the Really Big Tech Problems with IoT
 Solving the Really Big Tech Problems with IoT Solving the Really Big Tech Problems with IoT
Solving the Really Big Tech Problems with IoTEric Kavanagh
 
Beyond the Platform: Enabling Fluid Analysis
Beyond the Platform: Enabling Fluid AnalysisBeyond the Platform: Enabling Fluid Analysis
Beyond the Platform: Enabling Fluid AnalysisEric Kavanagh
 
Protect Your Database: High Availability for High Demand Data
 Protect Your Database: High Availability for High Demand Data Protect Your Database: High Availability for High Demand Data
Protect Your Database: High Availability for High Demand DataEric Kavanagh
 
A Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with DataA Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with DataEric Kavanagh
 
The Key to Effective Analytics: Fast-Returning Queries
The Key to Effective Analytics: Fast-Returning QueriesThe Key to Effective Analytics: Fast-Returning Queries
The Key to Effective Analytics: Fast-Returning QueriesEric Kavanagh
 
A Tight Ship: How Containers and SDS Optimize the Enterprise
 A Tight Ship: How Containers and SDS Optimize the Enterprise A Tight Ship: How Containers and SDS Optimize the Enterprise
A Tight Ship: How Containers and SDS Optimize the EnterpriseEric Kavanagh
 
Application Acceleration: Faster Performance for End Users
Application Acceleration: Faster Performance for End Users	Application Acceleration: Faster Performance for End Users
Application Acceleration: Faster Performance for End Users Eric Kavanagh
 

More from Eric Kavanagh (20)

The Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationThe Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data Integration
 
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data PipelinesBest Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
 
Expediting the Path to Discovery with Multi-Source Analysis
Expediting the Path to Discovery with Multi-Source AnalysisExpediting the Path to Discovery with Multi-Source Analysis
Expediting the Path to Discovery with Multi-Source Analysis
 
Will AI Eliminate Reports and Dashboards
Will AI Eliminate Reports and DashboardsWill AI Eliminate Reports and Dashboards
Will AI Eliminate Reports and Dashboards
 
Metadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI ModernizationMetadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI Modernization
 
Database Survival Guide: Exploratory Webcast
Database Survival Guide: Exploratory WebcastDatabase Survival Guide: Exploratory Webcast
Database Survival Guide: Exploratory Webcast
 
Better to Ask Permission? Best Practices for Privacy and Security
Better to Ask Permission? Best Practices for Privacy and SecurityBetter to Ask Permission? Best Practices for Privacy and Security
Better to Ask Permission? Best Practices for Privacy and Security
 
The Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data GovernanceThe Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data Governance
 
Best Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal ForecastingBest Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
 
A Winning Strategy for the Digital Economy
A Winning Strategy for the Digital EconomyA Winning Strategy for the Digital Economy
A Winning Strategy for the Digital Economy
 
Discovering Big Data in the Fog: Why Catalogs Matter
 Discovering Big Data in the Fog: Why Catalogs Matter Discovering Big Data in the Fog: Why Catalogs Matter
Discovering Big Data in the Fog: Why Catalogs Matter
 
Health Check: Maintaining Enterprise BI
Health Check: Maintaining Enterprise BIHealth Check: Maintaining Enterprise BI
Health Check: Maintaining Enterprise BI
 
Rapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the RescueRapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the Rescue
 
Solving the Really Big Tech Problems with IoT
 Solving the Really Big Tech Problems with IoT Solving the Really Big Tech Problems with IoT
Solving the Really Big Tech Problems with IoT
 
Beyond the Platform: Enabling Fluid Analysis
Beyond the Platform: Enabling Fluid AnalysisBeyond the Platform: Enabling Fluid Analysis
Beyond the Platform: Enabling Fluid Analysis
 
Protect Your Database: High Availability for High Demand Data
 Protect Your Database: High Availability for High Demand Data Protect Your Database: High Availability for High Demand Data
Protect Your Database: High Availability for High Demand Data
 
A Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with DataA Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with Data
 
The Key to Effective Analytics: Fast-Returning Queries
The Key to Effective Analytics: Fast-Returning QueriesThe Key to Effective Analytics: Fast-Returning Queries
The Key to Effective Analytics: Fast-Returning Queries
 
A Tight Ship: How Containers and SDS Optimize the Enterprise
 A Tight Ship: How Containers and SDS Optimize the Enterprise A Tight Ship: How Containers and SDS Optimize the Enterprise
A Tight Ship: How Containers and SDS Optimize the Enterprise
 
Application Acceleration: Faster Performance for End Users
Application Acceleration: Faster Performance for End Users	Application Acceleration: Faster Performance for End Users
Application Acceleration: Faster Performance for End Users
 

Recently uploaded

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 

Recently uploaded (20)

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 

Time's Up! Getting Value from Big Data Now

  • 1. Grab some coffee and enjoy the pre-show banter before the top of the hour! !
  • 2. The Briefing Room Time's Up! Getting Value from Big Data Now
  • 4. u  Reveal the essential characteristics of enterprise software, good and bad u  Provide a forum for detailed analysis of today s innovative technologies u  Give vendors a chance to explain their product to savvy analysts u  Allow audience members to pose serious questions... and get answers! Mission
  • 5. Big Integration u  Old infrastructure lacking u  New pipes are needed u  Well begun is half done!
  • 6. Analyst Robin Bloor is Chief Analyst at The Bloor Group robin.bloor@bloorgroup.com @robinbloor
  • 7. CASK u  CASK offers a unified integration platform for big data applications and data lakes u  Its CDAP architecture provides data containers, program containers and application containers for data and applications on Hadoop u  CASK also offers Hydrator for building and managing data pipelines and data lakes, and Tracker for data lake governance
  • 8. Guest Jonathan Gray Jonathan Gray, Founder & CEO of Cask, is an entrepreneur and software engineer with a background in startups, open source and all things data. Prior to founding Cask, Jonathan was a software engineer at Facebook where he drove HBase engineering efforts, including Facebook Messages and several other large-scale projects from inception to production. An open source evangelist, Jonathan was responsible for helping build the Facebook engineering brand through developer outreach and refocusing the open source strategy of the company. Prior to Facebook, Jonathan founded Streamy.com, where he became an early adopter of Hadoop and HBase and is now a core contributor and active committer in the community. Jonathan holds a bachelor’s degree in Electrical and Computer Engineering and Business Administration from Carnegie Mellon University.
  • 9. Big Data on Tap cask.co November 1, 2016 The Briefing Room Jonathan Gray Founder & CEO
  • 10. cask.co Hadoop Enables New Applications and Architectures 2 ENTERPRISE DATA LAKES BIG DATA ANALYTICS PRODUCTION DATA APPS Batch and Realtime Data Ingestion Any type of data from any type of source in any volume Batch and Streaming ETL Code-free self-service creation and management of pipelines SQL Exploration and Data Science All data is automatically accessible via SQL and client SDKs Data as a Service Easily expose generic or custom REST APIs on any data 360o Customer View Integrate data from any source and expose through queries and APIs Realtime Dashboards Perform realtime OLAP aggregations and serve them through REST APIs Time Series Analysis Store, process and serve massive volumes of time-series data Realtime Log Analytics Ingestion and processing of high-throughput streaming log events Recommendation Engines Build models in batch using historical data and serve them in realtime Anomaly Detection Systems Process streaming events and predictably compare them in realtime to historical data NRT Event Monitoring Reliably monitor large streams of data and perform defined actions within a specified time Internet of Things Ingestion, storage and processing of events that is highly-available, scalable and consistent ENTERPRISE DATA LAKES BIG DATA ANALYTICS PRODUCTION DATA APPS Batch and Realtime Data Ingestion Any type of data from any type of source in any volume Batch and Streaming ETL Code-free self-service creation and management of pipelines SQL Exploration and Data Science All data is automatically accessible via SQL and client SDKs Data as a Service Easily expose generic or custom REST APIs on any data 360o Customer View Integrate data from any source and expose through queries and APIs Realtime Dashboards Perform realtime OLAP aggregations and serve them through REST APIs Time Series Analysis Store, process and serve massive volumes of time-series data Realtime Log Analytics Ingestion and processing of high-throughput streaming log events Recommendation Engines Build models in batch using historical data and serve them in realtime Anomaly Detection Systems Process streaming events and predictably compare them in realtime to historical data NRT Event Monitoring Reliably monitor large streams of data and perform defined actions within a specified time Internet of Things Ingestion, storage and processing of events that is highly-available, scalable and consistent ENTERPRISE DATA LAKES BIG DATA ANALYTICS PRODUCTION DATA APPS Batch and Realtime Data Ingestion Any type of data from any type of source in any volume Batch and Streaming ETL Code-free self-service creation and management of pipelines SQL Exploration and Data Science All data is automatically accessible via SQL and client SDKs Data as a Service Easily expose generic or custom REST APIs on any data 360o Customer View Integrate data from any source and expose through queries and APIs Realtime Dashboards Perform realtime OLAP aggregations and serve them through REST APIs Time Series Analysis Store, process and serve massive volumes of time-series data Realtime Log Analytics Ingestion and processing of high-throughput streaming log events Recommendation Engines Build models in batch using historical data and serve them in realtime Anomaly Detection Systems Process streaming events and predictably compare them in realtime to historical data NRT Event Monitoring Reliably monitor large streams of data and perform defined actions within a specified time Internet of Things Ingestion, storage and processing of events that is highly-available, scalable and consistent Data Applications Drive Meaningful Business Value
  • 11. cask.co3 But Getting Value from Big Data is Hard Too much focus on infrastructure and integration, rather than applications and analytics Divergence of distributions and technologies Integration silos created by narrow point solutions Proliferation of projects, services and APIs Complexity of technologies and new user learning curve
  • 12. cask.co4 Without a consistent set of tools, IT will not be an effective data enabler for the business Developer Architecture & Programming Focused on Apps & Solutions Ops Configuring & Monitoring Focused on Infrastructure & SLA’s LOB / Product Driving Revenue & Decision Making Focused on Products & Insights Data Scientist Scripting & Machine Learning Focused on Data & Algorithms And There Are Many Faces of Hadoop
  • 13. cask.co5 Enter Cask AT&T, Cloudera and Ericsson Strategic Investors 3.5 Cask Data Application Platform, Cask Hydrator and Cask Tracker Latest Release AT&T, Ericsson, Lotame, Salesforce, Cloudera, Hortonworks, MapR, Microsoft, IBM, Tableau… Key Customers & Partners By early Hadoop engineers from Facebook and Yahoo! Founded in 2011 Andreessen Horowitz, Safeguard, Battery Venture and Ignition Partners Raised $37+ Million Featuring Cask Market,
 the “big data app store” NEW: CDAP 4 Preview A Container Architecture that puts Big Data on Tap Why “Cask” ?
  • 14. cask.co6 Convergence of Big Data Apps and Data Integration The Evolution of the Cask Platform Big Data Apps + Data Integration • Data ingest • Data pipelines • Workflows and metadata “WebLogic Meets Informatica” CDAP v3 Big Data App Server • Abstractions & integrations • Metrics & logs • Debugging environment “WebLogic for Hadoop” CDAP v2 Unified Integration for Big Data • Security & governance • Self-service environment • Enterprise integrations “Unified Big Data Integration” CDAP v4
  • 15. cask.co Introducing Cask Data Application Platform (CDAP) 7 First Unified Integration Platform for Big Data
 
 Platform for distributed apps, bringing together
 application management with data integration 
 • 100% open source and built for extensibility • Supports all major Hadoop distributions and clouds • Integrates the latest open source big data technologies Data Lake Fraud Detection Recommendation Engine Sensor Data Analytics Customer 360 Modern Data Integration Distributed Application Framework Self-Service User Experience Enterprise-grade Security & Governance
  • 16. cask.co8 • Real-time and Batch • Reliable and Scalable • Simple and Self-Service Modern Data Integration EXPLORE for analytics and data science PROCESS for ETL and machine learning SERVE any data to any destination INGEST any data from any source
  • 17. cask.co9 Distributed Application Framework DEVELOP rapidly build applications TEST powerful test and CI framework DEPLOY run any apps in any environment SCALE horizontally scale apps and data • Real-time and Batch • Memory, Local, Distributed • Analytics and Applications
  • 18. cask.co10 Security and Governance CAPTURE store all metadata about your data DISCOVER easily locate any of your data TRACK every audit plus lineage graphs ANALYZE understand usage patterns of data AUTHENTICATE AUTHORIZEENCRYPT
  • 19. cask.co11 A data discovery tool to explore metadata and usageA code-free framework to build and run data pipelines Self-Service User Experience Drag & drop graphical interface Create, debug, deploy and manage Separation of logic and execution environment Native to Hadoop & Spark — scales out Rich app- level metadata Track lineage and audits Analyze usage of datasets MDM integration framework
  • 20. cask.co The CDAP Architecture 12 Applications Programs MapReduce Spark Tigon Workflow Service Worker Metadata Datasets Table Avro Parquet Timeseries OLAP Cube Geospatial ObjectStore Metadata Metadata • Application Container Architecture • Reusable Programming Abstractions • Global User and Machine Metadata • Highly Extensible Plugin Architecture
  • 21. cask.co13 Single framework for building and running data apps and data lakes on Hadoop and Spark Rapid Development • Standardization, deep integrations, tools and docs • Separation of app logic from data logic and integration logic • Conceptual integrity within applications and consistency across environments Production Operations & Governance • Simplified packaging, deployment and monitoring of apps on Hadoop • Enhanced security and governance with centralized metrics and logs • Tracking and exploration of metadata, data provenance, audit trails and usage analytics CDAP Enables the Full Big Data Application Lifecycle reduces time to develop and deploy big data apps by 80% reduces time to insights and accelerates business value removes barriers to innovation and future-proofs your apps
  • 22. cask.co14 Customer Success Stories Customer
 Situation Lack of existing Hadoop expertise and frustration with hand-coding and scripting tools Cask Hydrator for rapid creation of data pipelines and Cask Tracker for data discovery POC in 2 days
 Production in 2 months Cask
 Solution Small team and significant technical challenges limit pace of development and solution scale CDAP for real-time ingestion and consistent processing with production operations support Development in 1 month
 Production in 3 months Hundreds of Users
 Thousands of Pipelines Multiple teams and technologies with widely varied skillsets and incompatible design choices CDAP for data lake management and orchestration, tightly integrated into existing systems Health Insurance Provider
 offloading clinical / immunization reporting from Netezza Leading SaaS Platform
 taking new real-time, massive scale products to market Large Telco Enterprise
 building a centralized, secured,
 multi-tenant Data Lake
  • 23. cask.co15 Cask was Named a Gartner Cool Vendor 2016 Cask was Certified a Great Place to Work 2016 “ … for the rest of us who lack the technological chips or patience to make it all work, there’s good news: it will soon get easier, thanks to the work done by the big data pioneers, as well as vendors like Cask …” (Alex Woodie, Managing Editor, Datanami) Awards and Accolades “ … “Cask has tilted the playing field, earning a massive unfair advantage over proprietary point products for data integration and ingest …” (Nik Rouda, Senior Analyst, Enterprise Strategy Group) “ … “CDAP is a big win for us … the amount of code we needed to write was minimal with CDAP, and it was much easier and faster than we ever expected …” (Jia-Long Wu, Data Architect, Lotame)
  • 24. cask.co16 NEW: CDAP 4 — Big Data Apps on Tap! Available for download now! Release of CDAP 4 Preview “Big Data App Store” Cask Market Interactive Data Preparation Cask Wrangler Interactive Wizards for Common Tasks Resource Center Rewrite based on React Reimagined CDAP UI
  • 25. cask.co17 The “App Store for Big Data” Cask Market • Goal: Time to value in minutes w/ no existing experience • Application and Library Ecosystem with pre-built Hadoop solutions, reusable templates, and third-party plugins • Available from anywhere inside the CDAP UI with a click • Initially, everything in the Cask Market has been bootstrapped by Cask based on ongoing work across our customers, is 100% open source and available on GitHub • Eventually, developers and ISVs will be able to showcase and market their own applications and libraries (ex: Graylog) Cask Market includes Interactive, Guided Wizards for Configuring Pre-Built Templates NEW: CDAP 4 — Big Data Apps on Tap!
  • 26. cask.co18 Building Data Pipelines on Hadoop with Cask Hydrator Data Lake Webinar Introduction to Cask Hydrator CDAP - Containers on Hadoop CDAP Extensions - Cask Hydrator and Cask Tracker ESG Solution Spotlight CDAP Technical Concepts (video) Cask / Cloudera Solution Brief Cask Resources
  • 27. cask.co ● CDAP provides the first unified integration platform for big data ● Cask Hydrator and Cask Tracker are visual extensions of CDAP for self-service access ● CDAP empowers enterprise IT to deliver
 faster time to value for Hadoop and Spark, from prototype to production ● Cask Market is a “big data app store” available in CDAP 4 with pre-built apps, pipelines, plugins ● CDAP is 100% open source, highly extensible, enterprise-ready, and commercially supported Big Data on Tap Summary
  • 28. cask.co20 For more information, go to: cask.co Thanks!
  • 31. Neither Hadoop Nor Spark Is a Solution However, both are useful and increasingly versatile components for Big Data applications
  • 32. The Evolution of the Little Elephant u  Hortonworks: Apache pure play. No apparent vision. u  Cloudera: Some proprietary components (Cloudera Manager, Impala, Cloudera Search). Vision is corporate data hub(?) u  MapR: Also some proprietary components (MapR-FS, MapR Streams, MapR-DB) u  And then there’s the cloud.
  • 33. The Ship of Fools Until Hadoop’s direction is controlled by a single “captain” we may have to tolerate the ship of fools
  • 34. The “Big Data Hype Cycle” Is Misleading u  Big Data is an ecosystem, not a technology – which distorts this graph u  Some analytics applications have experienced “absurd acceleration” u  Hadoop is, in many instances, a laggard - Spark too u  Nevertheless, we seem to be exiting “the trough”
  • 35. Data lake or Governance Hub?
  • 36. The System Management Issue Mobile Devices DesktopsServers IoT The Cloud Archive Data Stores Data Assaying Data Capture Real-Time Streaming? Data Mgt Data Serving The Prospecting Domain Apps Data Life Cycle Mgt Staging Area (Hadoop?) System Management
  • 37. The Fundamental Issue Big Data does not really have a foundation. Neither, imho, does the Data Lake. Luckily, there are third parties…
  • 38. u  Regarding Hadoop, do you have any “preferred components?” u  How do you stay current with the various distros? Backward compatibility? Can a customer upgrade at will? u  How does your technology impact performance (if at all)? u  Do you provide a consultancy service?
  • 39. u  Which companies/services do you regard as competitive? u  Do you have any specific partners? u  What does an implementation look like?
  • 40.
  • 41. THANK YOU for your ATTENTION! Some images provided courtesy of Wikimedia Commons