SlideShare a Scribd company logo
1 of 31
1
Big Data Management:
What’s New, What’s Different
and What You Need to Know
2
Today’s Featured Presenter
Matt Aslett
Research Director,
Data Platforms and Analytics
451 Research
As Research Director, Matt has overall responsibility for the data platforms and
analytics research coverage, which includes operational and analytic databases,
Hadoop, grid/cache, stream processing, search-based data platforms, data
integration, data quality, data management, analytics, and advanced analytics.
Matt's own primary area of focus includes data management, reporting and
analytics, and exploring how the various data platform and analytics technology
sectors are converging in the form of next-generation data platform
33
Agenda
• Big Data Management
– Matt Aslett, 451 Research
• SnapLogic Overview
• SnapLogic Demonstration
– Ravi Dharnikota, Head of SnapLogic Enterprise Architecture
• Q&A
Copyright (C) 2016 451 Research LLC
Big Data Management
Matt Aslett, Research Director
Copyright (C) 2016 451 Research LLC
451 Research is a leading IT research & advisory company
5
Founded in 2000
250+ employees, including over 100 analysts
1,000+ clients: Technology & Service providers, corporate
advisory, finance, professional services, and IT decision makers
50,000+ IT professionals, business users and consumers in our research
community
Over 52 million data points published each quarter and 4,500+ reports
published each year
2,000+ technology & service providers under coverage
451 Research and its sister company, Uptime Institute, are the two divisions
of The 451 Group
Headquartered in New York City, with offices in London, Boston, San
Francisco, Washington DC, Mexico, Costa Rica, Brazil, Spain, UAE, Russia,
Taiwan, Singapore and Malaysia
Research & Data
Advisory
Events
Go 2 Market
Copyright (C) 2016 451 Research LLC
Big data and beyond
• V is for various things…
but does not define big data
3
Copyright (C) 2016 451 Research LLC
Big data and beyond
• V is for various things…
but does not define big data
• To understand the trends driving
‘big data’ 451 Research focused
beyond the nature of the data on
what enterprises wanted to do
with it
4
Copyright (C) 2016 451 Research LLC
Big data and beyond
8
• V is for various things…
but does not define big data
• To understand the trends driving
‘big data’ 451 Research focused
beyond the nature of the data on
what enterprises wanted to do
with it
• Totality – storing and processing all data (or as much as is economically viable)
• Exploration – schema-free approaches to analyzing data to identify new patterns
• Frequency – more frequent analysis of data to enable real-time decision making
Copyright (C) 2016 451 Research LLC
‘Big data’ is primarily driven by economics, not data
6
• ‘Big Data’ is the realization of competitive advantage based on the fact that it is now
more economically feasible to store and process data that was previously ignored due
to the cost and functional limitations of traditional data management technologies to
handle its volume, velocity and variety
Copyright (C) 2016 451 Research LLC
‘Big data’ is primarily driven by economics, not data
6
“Big data is what happened when the cost of keeping information became less than the cost of throwing
it away.”
George Dyson
• ‘Big Data’ is the realization of competitive advantage based on the fact that it is now
more economically feasible to store and process data that was previously ignored due
to the cost and functional limitations of traditional data management technologies to
handle its volume, velocity and variety
Copyright (C) 2016 451 Research LLC
‘Big data’ is primarily driven by economics, not data
7
“Big data is what happened when the cost of keeping information became less than the cost of throwing
it away.”
George Dyson
• ‘Big Data’ is the realization of competitive advantage based on the fact that it is now
more economically feasible to store and process data that was previously ignored due
to the cost and functional limitations of traditional data management technologies to
handle its volume, velocity and variety
• Moved from storing 1% of data for 60 days in EDW @ $100,000/TB
• To 100% of data for a year in Hadoop @ $900/TB
Copyright (C) 2016 451 Research LLC
Source: 451 Research, Total Data Analytics 2016
The evolution of enterprise analytics
12
REPORTING
- What happened
ANALYSIS
- Why did it happen?
PRESCRIPTIVE
- Influence what happens
STATISTICAL
MODELING
MACHINE
LEARNING
DESCRIPTIVE
- What is happening?
PREDICTIVE
- What will happen?
Complexity
AutomatedUser-drivenIT-driven
VISUALIZATION
Copyright (C) 2016 451 Research LLC
Data sources:
Multi-structured
RDBMS,
Hadoop, NoSQL,
stream processing,
historical and real-time
Source: 451 Research, Total Data Analytics 2016
Data sources:
Structured,
RDBMS,
historical
The evolution of enterprise analytics
13
REPORTING
- What happened
ANALYSIS
- Why did it happen?
PRESCRIPTIVE
- Influence what happens
STATISTICAL
MODELING
MACHINE
LEARNING
DESCRIPTIVE
- What is happening?
PREDICTIVE
- What will happen?
Complexity
AutomatedUser-drivenIT-driven
VISUALIZATION
Copyright (C) 2016 451 Research LLC
EDW vs Hadoop (Schema-on-write vs schema-on-read)
14
Source: https://www.flickr.com/photos/wbaiv/16510090506/ Source: https://www.flickr.com/photos/notbrucelee/5696238930/
Copyright (C) 2016 451 Research LLC
Schema-on-write
15
Source: https://www.flickr.com/photos/wbaiv/16510090506/
• Pre-prepared
• Single-purpose
• Some assembly required
• Inflexible
Copyright (C) 2016 451 Research LLC
Schema-on-read
16
Source: https://www.flickr.com/photos/notbrucelee/5696238930/
• Flexible
• Reusable
• Some imagination required*
• Multi-purpose
• *Instructions available if desired
Copyright (C) 2016 451 Research LLC
Hadoop-based data lakes
• The concept of the data lake
has taken off in recent years,
with the Apache Hadoop
data-processing framework
serving as the unified
repository into which raw
data is landed from multiple
sources and made available
to multiple users for multiple
purposes.
17
Photo: Myrabella / Wikimedia Commons, CC BY-SA 3.0,
https://commons.wikimedia.org/w/index.php?curid=11263585
Copyright (C) 2016 451 Research LLC
Hadoop-based data lakes
• The concept of the data lake
has taken off in recent years,
with the Apache Hadoop
data-processing framework
serving as the unified
repository into which raw
data is landed from multiple
sources and made available
to multiple users for multiple
purposes.
• Beware the data swamp
18
https://www.flickr.com/photos/lofink/4501610335/
Copyright (C) 2016 451 Research LLC
Data governance, data preparation and the data lake
• Data needs to be filtered, processed, treated
and managed to make it suitable for multiple
analytics use cases.
• Data governance
• Data catalog
• Data security
• Data lineage
• Data preparation
• Data discovery
• Data cleansing
• Data harmonization
19
• Data inventory
• Data quality
• Data pipelines
• Data enrichment
• Data matching
• Collaboration
Copyright (C) 2016 451 Research LLC
Data governance, data preparation and the data lake
20
DATA-AS-A-SERVICE
PARTNERS
SUPPLIERS
SELF-SERVICE
DATA PREPARATION
IT
DATA LAKE
APPLICATIONS
DATA GOVERNANCE
Data lineage Data inventory
Data catalog
Data security Data quality
Data pipelines
DATA STEWARDS
Data cleansing
Data harmonization
Data discovery
Collaboration
Data matching
Data enrichment
ADVANCED ANALYTICS
DATA SCIENTISTS
SELF-SERVICE ANALYTICS
SENIOR EXECUTIVES BUSINESS ANALYSTS DATA ANALYSTS
Copyright (C) 2016 451 Research LLC
Hadoop and other animals
21
Copyright (C) 2016 451 Research LLC
Recommendations
22
• Enterprises should seriously consider the data governance and management requirements before
embarking on data lake projects to ensure that the functionality is available to turn the concept into
reality.
• For flexibility and agility, employ data management approaches and technologies that abstract data
processing pipelines from the execution environment.
• Look for data integration and transformation technologies that execute natively, taking advantage of
the underlying engine (e.g. Spark, YARN).
• Seek out data management and integration technologies that enable consumption and
transformation of large volumes of structured and unstructured data.
Copyright (C) 2016 451 Research LLC
Thank You!
matthew.aslett@451research.com
@maslett
www.451research.com
SnapLogic Elastic Integration
Accelerate Your Integration. Accelerate Your Business
“We can do more in two hours with SnapLogic than we could in two days with traditional solutions.”
25
CSV
Big Data and hybrid cloud environments are making
yesterday’s approaches to integration obsolete
26
Anything
apps | data | APIs | things
SnapLogic: Unified Platform for Data and Application Integration
Anytime
batch | streaming | real-time
Anywhere
on prem | cloud | hybrid
2727
SnapLogic in the Modern Data Fabric: Ingest, Transform, Deliver
ConsumeStore&ProcessSource
z z z z
HANA
Data Warehouses &
Data Marts
Big Data and Data
Lakes
INGEST INGEST
Data Integration and
Transformation
On Prem
Applications
Relational
Databases
Cloud
Applications
NoSQL
Databases
Web
Logs
Internet of
Things
DELIVER DELIVER
28
Modern Architecture: Hybrid and Elastic Execution
Streams: No data is
stored/cached
Secure: 100%
standards-based
Elastic: Scales out &
handles data and app
integration use cases
Metadata
Data
Databases
On Prem
Apps
Big Data
Cloud Apps
and DataCloud-Based Designer, Manager,
Dashboard
Execution
Execution
Execution
Firewall
SnapLogic “respects data’s gravity.”
SnapLogic Demonstration
30
Discussion
Matt Aslett
Research Director,
Data Platforms and Analytics
451 Research
Ravi Dharnikota
Head of Enterprise Architecture
SnapLogic
31
Integrate at the speed of
modern business
+1 888-494-1570
sales@snaplogic.com
@SnapLogic
www.snaplogic.com

More Related Content

What's hot

Etl - Extract Transform Load
Etl - Extract Transform LoadEtl - Extract Transform Load
Etl - Extract Transform LoadABDUL KHALIQ
 
Analytics ROI Best Practices
Analytics ROI Best PracticesAnalytics ROI Best Practices
Analytics ROI Best PracticesDATAVERSITY
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with HadoopPhilippe Julio
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data SolutionJames Serra
 
Lambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Lambda Architecture in the Cloud with Azure Databricks with Andrei VaranovichLambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Lambda Architecture in the Cloud with Azure Databricks with Andrei VaranovichDatabricks
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationDATAVERSITY
 
Master Data Management – Aligning Data, Process, and Governance
Master Data Management – Aligning Data, Process, and GovernanceMaster Data Management – Aligning Data, Process, and Governance
Master Data Management – Aligning Data, Process, and GovernanceDATAVERSITY
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big datahktripathy
 
Denodo Data Virtualization Platform Architecture: Performance (session 2 from...
Denodo Data Virtualization Platform Architecture: Performance (session 2 from...Denodo Data Virtualization Platform Architecture: Performance (session 2 from...
Denodo Data Virtualization Platform Architecture: Performance (session 2 from...Denodo
 
Actionable Insights with AI - Snowflake for Data Science
Actionable Insights with AI - Snowflake for Data ScienceActionable Insights with AI - Snowflake for Data Science
Actionable Insights with AI - Snowflake for Data ScienceHarald Erb
 
Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?DATAVERSITY
 
Data quality architecture
Data quality architectureData quality architecture
Data quality architectureanicewick
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Introduction To Data Warehousing
Introduction To Data WarehousingIntroduction To Data Warehousing
Introduction To Data WarehousingAlex Meadows
 

What's hot (20)

Etl - Extract Transform Load
Etl - Extract Transform LoadEtl - Extract Transform Load
Etl - Extract Transform Load
 
Analytics ROI Best Practices
Analytics ROI Best PracticesAnalytics ROI Best Practices
Analytics ROI Best Practices
 
From Data Warehouse to Lakehouse
From Data Warehouse to LakehouseFrom Data Warehouse to Lakehouse
From Data Warehouse to Lakehouse
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
 
Lambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Lambda Architecture in the Cloud with Azure Databricks with Andrei VaranovichLambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Lambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
 
How to build a successful Data Lake
How to build a successful Data LakeHow to build a successful Data Lake
How to build a successful Data Lake
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital Transformation
 
Master Data Management – Aligning Data, Process, and Governance
Master Data Management – Aligning Data, Process, and GovernanceMaster Data Management – Aligning Data, Process, and Governance
Master Data Management – Aligning Data, Process, and Governance
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big data
 
Denodo Data Virtualization Platform Architecture: Performance (session 2 from...
Denodo Data Virtualization Platform Architecture: Performance (session 2 from...Denodo Data Virtualization Platform Architecture: Performance (session 2 from...
Denodo Data Virtualization Platform Architecture: Performance (session 2 from...
 
Actionable Insights with AI - Snowflake for Data Science
Actionable Insights with AI - Snowflake for Data ScienceActionable Insights with AI - Snowflake for Data Science
Actionable Insights with AI - Snowflake for Data Science
 
Big data
Big dataBig data
Big data
 
Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?
 
Applications of Big Data
Applications of Big DataApplications of Big Data
Applications of Big Data
 
Data Analytics Life Cycle
Data Analytics Life CycleData Analytics Life Cycle
Data Analytics Life Cycle
 
Data quality architecture
Data quality architectureData quality architecture
Data quality architecture
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Introduction To Data Warehousing
Introduction To Data WarehousingIntroduction To Data Warehousing
Introduction To Data Warehousing
 

Similar to Big Data Management: What's New, What's Different, and What You Need To Know

Cloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinarCloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinarHortonworks
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformVMware Tanzu
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Microservices And Fast Data: Industry And Architecture Trends [with 451 Resea...
Microservices And Fast Data: Industry And Architecture Trends [with 451 Resea...Microservices And Fast Data: Industry And Architecture Trends [with 451 Resea...
Microservices And Fast Data: Industry And Architecture Trends [with 451 Resea...Lightbend
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationDatabricks
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Geoffrey Fox
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise AnalyticsDATAVERSITY
 
Is Your Enterprise Ready to Shine This Holiday Season?
Is Your Enterprise Ready to Shine This Holiday Season?Is Your Enterprise Ready to Shine This Holiday Season?
Is Your Enterprise Ready to Shine This Holiday Season?DataStax
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationDenodo
 
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®Cambridge Semantics
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationDATAVERSITY
 
Accelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time AnalyticsAccelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time AnalyticsArcadia Data
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesDATAVERSITY
 
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...DataScienceConferenc1
 
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDATAVERSITY
 
Swimming Across the Data Lake, Lessons learned and keys to success
Swimming Across the Data Lake, Lessons learned and keys to success Swimming Across the Data Lake, Lessons learned and keys to success
Swimming Across the Data Lake, Lessons learned and keys to success DataWorks Summit/Hadoop Summit
 
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB
 
NoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive AnalyticsNoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive AnalyticsInfiniteGraph
 

Similar to Big Data Management: What's New, What's Different, and What You Need To Know (20)

Cloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinarCloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinar
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
Microservices And Fast Data: Industry And Architecture Trends [with 451 Resea...
Microservices And Fast Data: Industry And Architecture Trends [with 451 Resea...Microservices And Fast Data: Industry And Architecture Trends [with 451 Resea...
Microservices And Fast Data: Industry And Architecture Trends [with 451 Resea...
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
Is Your Enterprise Ready to Shine This Holiday Season?
Is Your Enterprise Ready to Shine This Holiday Season?Is Your Enterprise Ready to Shine This Holiday Season?
Is Your Enterprise Ready to Shine This Holiday Season?
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
 
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
 
The Power of Data
The Power of DataThe Power of Data
The Power of Data
 
Accelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time AnalyticsAccelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time Analytics
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
 
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
 
Swimming Across the Data Lake, Lessons learned and keys to success
Swimming Across the Data Lake, Lessons learned and keys to success Swimming Across the Data Lake, Lessons learned and keys to success
Swimming Across the Data Lake, Lessons learned and keys to success
 
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
MongoDB IoT City Tour LONDON: Hadoop and the future of data management. By, M...
 
NoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive AnalyticsNoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive Analytics
 

More from SnapLogic

The AI Mindset: Bridging Industry and Academic Perspectives
The AI Mindset: Bridging Industry and Academic PerspectivesThe AI Mindset: Bridging Industry and Academic Perspectives
The AI Mindset: Bridging Industry and Academic PerspectivesSnapLogic
 
Supercharging Self-Service API Integration with AI
Supercharging Self-Service API Integration with AI Supercharging Self-Service API Integration with AI
Supercharging Self-Service API Integration with AI SnapLogic
 
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?SnapLogic
 
SnapLogic Culture Deck
SnapLogic Culture DeckSnapLogic Culture Deck
SnapLogic Culture DeckSnapLogic
 
Euromoney's integration journey: Selecting SnapLogic's self-service integrati...
Euromoney's integration journey: Selecting SnapLogic's self-service integrati...Euromoney's integration journey: Selecting SnapLogic's self-service integrati...
Euromoney's integration journey: Selecting SnapLogic's self-service integrati...SnapLogic
 
Digital Transformation is Cloud-Powered
Digital Transformation is Cloud-PoweredDigital Transformation is Cloud-Powered
Digital Transformation is Cloud-PoweredSnapLogic
 
How to Build a Winning Data Culture
How to Build a Winning Data CultureHow to Build a Winning Data Culture
How to Build a Winning Data CultureSnapLogic
 
Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies SnapLogic
 
Overcoming the challenge of multiple data frameworks in a multiple cloud envi...
Overcoming the challenge of multiple data frameworks in a multiple cloud envi...Overcoming the challenge of multiple data frameworks in a multiple cloud envi...
Overcoming the challenge of multiple data frameworks in a multiple cloud envi...SnapLogic
 
SnapLogic Technology Open House – January 2018
SnapLogic Technology Open House – January 2018SnapLogic Technology Open House – January 2018
SnapLogic Technology Open House – January 2018SnapLogic
 
Self-Service Integration in the Age of Digital Transformation at Box
Self-Service Integration in the Age of Digital Transformation at BoxSelf-Service Integration in the Age of Digital Transformation at Box
Self-Service Integration in the Age of Digital Transformation at BoxSnapLogic
 
Live Demo: Accelerate the integration of workday applications
Live Demo: Accelerate the integration of workday applicationsLive Demo: Accelerate the integration of workday applications
Live Demo: Accelerate the integration of workday applicationsSnapLogic
 
The new dominant companies are running on data
The new dominant companies are running on data The new dominant companies are running on data
The new dominant companies are running on data SnapLogic
 
Spring 2017 release customer webinar
Spring 2017 release customer webinarSpring 2017 release customer webinar
Spring 2017 release customer webinarSnapLogic
 
SnapLogic unveils machine-learning-driven integration assistant
SnapLogic unveils machine-learning-driven integration assistantSnapLogic unveils machine-learning-driven integration assistant
SnapLogic unveils machine-learning-driven integration assistantSnapLogic
 
Webinar: Evolution of Data Management for the IoT
Webinar: Evolution of Data Management for the IoTWebinar: Evolution of Data Management for the IoT
Webinar: Evolution of Data Management for the IoTSnapLogic
 
SnapLogic Culture
SnapLogic CultureSnapLogic Culture
SnapLogic CultureSnapLogic
 
SnapLogic Live: Enabling the Citizen Integrator
SnapLogic Live: Enabling the Citizen IntegratorSnapLogic Live: Enabling the Citizen Integrator
SnapLogic Live: Enabling the Citizen IntegratorSnapLogic
 
SnapLogic Live: Workday Integration
SnapLogic Live: Workday IntegrationSnapLogic Live: Workday Integration
SnapLogic Live: Workday IntegrationSnapLogic
 

More from SnapLogic (20)

The AI Mindset: Bridging Industry and Academic Perspectives
The AI Mindset: Bridging Industry and Academic PerspectivesThe AI Mindset: Bridging Industry and Academic Perspectives
The AI Mindset: Bridging Industry and Academic Perspectives
 
Supercharging Self-Service API Integration with AI
Supercharging Self-Service API Integration with AI Supercharging Self-Service API Integration with AI
Supercharging Self-Service API Integration with AI
 
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
 
SnapLogic Culture Deck
SnapLogic Culture DeckSnapLogic Culture Deck
SnapLogic Culture Deck
 
Euromoney's integration journey: Selecting SnapLogic's self-service integrati...
Euromoney's integration journey: Selecting SnapLogic's self-service integrati...Euromoney's integration journey: Selecting SnapLogic's self-service integrati...
Euromoney's integration journey: Selecting SnapLogic's self-service integrati...
 
Digital Transformation is Cloud-Powered
Digital Transformation is Cloud-PoweredDigital Transformation is Cloud-Powered
Digital Transformation is Cloud-Powered
 
How to Build a Winning Data Culture
How to Build a Winning Data CultureHow to Build a Winning Data Culture
How to Build a Winning Data Culture
 
Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies
 
Overcoming the challenge of multiple data frameworks in a multiple cloud envi...
Overcoming the challenge of multiple data frameworks in a multiple cloud envi...Overcoming the challenge of multiple data frameworks in a multiple cloud envi...
Overcoming the challenge of multiple data frameworks in a multiple cloud envi...
 
SnapLogic Technology Open House – January 2018
SnapLogic Technology Open House – January 2018SnapLogic Technology Open House – January 2018
SnapLogic Technology Open House – January 2018
 
Self-Service Integration in the Age of Digital Transformation at Box
Self-Service Integration in the Age of Digital Transformation at BoxSelf-Service Integration in the Age of Digital Transformation at Box
Self-Service Integration in the Age of Digital Transformation at Box
 
Live Demo: Accelerate the integration of workday applications
Live Demo: Accelerate the integration of workday applicationsLive Demo: Accelerate the integration of workday applications
Live Demo: Accelerate the integration of workday applications
 
The new dominant companies are running on data
The new dominant companies are running on data The new dominant companies are running on data
The new dominant companies are running on data
 
Spring 2017 release customer webinar
Spring 2017 release customer webinarSpring 2017 release customer webinar
Spring 2017 release customer webinar
 
SnapLogic unveils machine-learning-driven integration assistant
SnapLogic unveils machine-learning-driven integration assistantSnapLogic unveils machine-learning-driven integration assistant
SnapLogic unveils machine-learning-driven integration assistant
 
Webinar: Evolution of Data Management for the IoT
Webinar: Evolution of Data Management for the IoTWebinar: Evolution of Data Management for the IoT
Webinar: Evolution of Data Management for the IoT
 
The API Lie
The API LieThe API Lie
The API Lie
 
SnapLogic Culture
SnapLogic CultureSnapLogic Culture
SnapLogic Culture
 
SnapLogic Live: Enabling the Citizen Integrator
SnapLogic Live: Enabling the Citizen IntegratorSnapLogic Live: Enabling the Citizen Integrator
SnapLogic Live: Enabling the Citizen Integrator
 
SnapLogic Live: Workday Integration
SnapLogic Live: Workday IntegrationSnapLogic Live: Workday Integration
SnapLogic Live: Workday Integration
 

Recently uploaded

Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfrahulyadav957181
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 

Recently uploaded (20)

Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdf
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 

Big Data Management: What's New, What's Different, and What You Need To Know

  • 1. 1 Big Data Management: What’s New, What’s Different and What You Need to Know
  • 2. 2 Today’s Featured Presenter Matt Aslett Research Director, Data Platforms and Analytics 451 Research As Research Director, Matt has overall responsibility for the data platforms and analytics research coverage, which includes operational and analytic databases, Hadoop, grid/cache, stream processing, search-based data platforms, data integration, data quality, data management, analytics, and advanced analytics. Matt's own primary area of focus includes data management, reporting and analytics, and exploring how the various data platform and analytics technology sectors are converging in the form of next-generation data platform
  • 3. 33 Agenda • Big Data Management – Matt Aslett, 451 Research • SnapLogic Overview • SnapLogic Demonstration – Ravi Dharnikota, Head of SnapLogic Enterprise Architecture • Q&A
  • 4. Copyright (C) 2016 451 Research LLC Big Data Management Matt Aslett, Research Director
  • 5. Copyright (C) 2016 451 Research LLC 451 Research is a leading IT research & advisory company 5 Founded in 2000 250+ employees, including over 100 analysts 1,000+ clients: Technology & Service providers, corporate advisory, finance, professional services, and IT decision makers 50,000+ IT professionals, business users and consumers in our research community Over 52 million data points published each quarter and 4,500+ reports published each year 2,000+ technology & service providers under coverage 451 Research and its sister company, Uptime Institute, are the two divisions of The 451 Group Headquartered in New York City, with offices in London, Boston, San Francisco, Washington DC, Mexico, Costa Rica, Brazil, Spain, UAE, Russia, Taiwan, Singapore and Malaysia Research & Data Advisory Events Go 2 Market
  • 6. Copyright (C) 2016 451 Research LLC Big data and beyond • V is for various things… but does not define big data 3
  • 7. Copyright (C) 2016 451 Research LLC Big data and beyond • V is for various things… but does not define big data • To understand the trends driving ‘big data’ 451 Research focused beyond the nature of the data on what enterprises wanted to do with it 4
  • 8. Copyright (C) 2016 451 Research LLC Big data and beyond 8 • V is for various things… but does not define big data • To understand the trends driving ‘big data’ 451 Research focused beyond the nature of the data on what enterprises wanted to do with it • Totality – storing and processing all data (or as much as is economically viable) • Exploration – schema-free approaches to analyzing data to identify new patterns • Frequency – more frequent analysis of data to enable real-time decision making
  • 9. Copyright (C) 2016 451 Research LLC ‘Big data’ is primarily driven by economics, not data 6 • ‘Big Data’ is the realization of competitive advantage based on the fact that it is now more economically feasible to store and process data that was previously ignored due to the cost and functional limitations of traditional data management technologies to handle its volume, velocity and variety
  • 10. Copyright (C) 2016 451 Research LLC ‘Big data’ is primarily driven by economics, not data 6 “Big data is what happened when the cost of keeping information became less than the cost of throwing it away.” George Dyson • ‘Big Data’ is the realization of competitive advantage based on the fact that it is now more economically feasible to store and process data that was previously ignored due to the cost and functional limitations of traditional data management technologies to handle its volume, velocity and variety
  • 11. Copyright (C) 2016 451 Research LLC ‘Big data’ is primarily driven by economics, not data 7 “Big data is what happened when the cost of keeping information became less than the cost of throwing it away.” George Dyson • ‘Big Data’ is the realization of competitive advantage based on the fact that it is now more economically feasible to store and process data that was previously ignored due to the cost and functional limitations of traditional data management technologies to handle its volume, velocity and variety • Moved from storing 1% of data for 60 days in EDW @ $100,000/TB • To 100% of data for a year in Hadoop @ $900/TB
  • 12. Copyright (C) 2016 451 Research LLC Source: 451 Research, Total Data Analytics 2016 The evolution of enterprise analytics 12 REPORTING - What happened ANALYSIS - Why did it happen? PRESCRIPTIVE - Influence what happens STATISTICAL MODELING MACHINE LEARNING DESCRIPTIVE - What is happening? PREDICTIVE - What will happen? Complexity AutomatedUser-drivenIT-driven VISUALIZATION
  • 13. Copyright (C) 2016 451 Research LLC Data sources: Multi-structured RDBMS, Hadoop, NoSQL, stream processing, historical and real-time Source: 451 Research, Total Data Analytics 2016 Data sources: Structured, RDBMS, historical The evolution of enterprise analytics 13 REPORTING - What happened ANALYSIS - Why did it happen? PRESCRIPTIVE - Influence what happens STATISTICAL MODELING MACHINE LEARNING DESCRIPTIVE - What is happening? PREDICTIVE - What will happen? Complexity AutomatedUser-drivenIT-driven VISUALIZATION
  • 14. Copyright (C) 2016 451 Research LLC EDW vs Hadoop (Schema-on-write vs schema-on-read) 14 Source: https://www.flickr.com/photos/wbaiv/16510090506/ Source: https://www.flickr.com/photos/notbrucelee/5696238930/
  • 15. Copyright (C) 2016 451 Research LLC Schema-on-write 15 Source: https://www.flickr.com/photos/wbaiv/16510090506/ • Pre-prepared • Single-purpose • Some assembly required • Inflexible
  • 16. Copyright (C) 2016 451 Research LLC Schema-on-read 16 Source: https://www.flickr.com/photos/notbrucelee/5696238930/ • Flexible • Reusable • Some imagination required* • Multi-purpose • *Instructions available if desired
  • 17. Copyright (C) 2016 451 Research LLC Hadoop-based data lakes • The concept of the data lake has taken off in recent years, with the Apache Hadoop data-processing framework serving as the unified repository into which raw data is landed from multiple sources and made available to multiple users for multiple purposes. 17 Photo: Myrabella / Wikimedia Commons, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=11263585
  • 18. Copyright (C) 2016 451 Research LLC Hadoop-based data lakes • The concept of the data lake has taken off in recent years, with the Apache Hadoop data-processing framework serving as the unified repository into which raw data is landed from multiple sources and made available to multiple users for multiple purposes. • Beware the data swamp 18 https://www.flickr.com/photos/lofink/4501610335/
  • 19. Copyright (C) 2016 451 Research LLC Data governance, data preparation and the data lake • Data needs to be filtered, processed, treated and managed to make it suitable for multiple analytics use cases. • Data governance • Data catalog • Data security • Data lineage • Data preparation • Data discovery • Data cleansing • Data harmonization 19 • Data inventory • Data quality • Data pipelines • Data enrichment • Data matching • Collaboration
  • 20. Copyright (C) 2016 451 Research LLC Data governance, data preparation and the data lake 20 DATA-AS-A-SERVICE PARTNERS SUPPLIERS SELF-SERVICE DATA PREPARATION IT DATA LAKE APPLICATIONS DATA GOVERNANCE Data lineage Data inventory Data catalog Data security Data quality Data pipelines DATA STEWARDS Data cleansing Data harmonization Data discovery Collaboration Data matching Data enrichment ADVANCED ANALYTICS DATA SCIENTISTS SELF-SERVICE ANALYTICS SENIOR EXECUTIVES BUSINESS ANALYSTS DATA ANALYSTS
  • 21. Copyright (C) 2016 451 Research LLC Hadoop and other animals 21
  • 22. Copyright (C) 2016 451 Research LLC Recommendations 22 • Enterprises should seriously consider the data governance and management requirements before embarking on data lake projects to ensure that the functionality is available to turn the concept into reality. • For flexibility and agility, employ data management approaches and technologies that abstract data processing pipelines from the execution environment. • Look for data integration and transformation technologies that execute natively, taking advantage of the underlying engine (e.g. Spark, YARN). • Seek out data management and integration technologies that enable consumption and transformation of large volumes of structured and unstructured data.
  • 23. Copyright (C) 2016 451 Research LLC Thank You! matthew.aslett@451research.com @maslett www.451research.com
  • 24. SnapLogic Elastic Integration Accelerate Your Integration. Accelerate Your Business “We can do more in two hours with SnapLogic than we could in two days with traditional solutions.”
  • 25. 25 CSV Big Data and hybrid cloud environments are making yesterday’s approaches to integration obsolete
  • 26. 26 Anything apps | data | APIs | things SnapLogic: Unified Platform for Data and Application Integration Anytime batch | streaming | real-time Anywhere on prem | cloud | hybrid
  • 27. 2727 SnapLogic in the Modern Data Fabric: Ingest, Transform, Deliver ConsumeStore&ProcessSource z z z z HANA Data Warehouses & Data Marts Big Data and Data Lakes INGEST INGEST Data Integration and Transformation On Prem Applications Relational Databases Cloud Applications NoSQL Databases Web Logs Internet of Things DELIVER DELIVER
  • 28. 28 Modern Architecture: Hybrid and Elastic Execution Streams: No data is stored/cached Secure: 100% standards-based Elastic: Scales out & handles data and app integration use cases Metadata Data Databases On Prem Apps Big Data Cloud Apps and DataCloud-Based Designer, Manager, Dashboard Execution Execution Execution Firewall SnapLogic “respects data’s gravity.”
  • 30. 30 Discussion Matt Aslett Research Director, Data Platforms and Analytics 451 Research Ravi Dharnikota Head of Enterprise Architecture SnapLogic
  • 31. 31 Integrate at the speed of modern business +1 888-494-1570 sales@snaplogic.com @SnapLogic www.snaplogic.com

Editor's Notes

  1. Cast your mind back to 2010/11 – everyone is trying to define ‘big data’ with words beginning with V. 451 Research took a different tack
  2. Cast your mind back to 2010/11 – everyone is trying to define ‘big data’ with words beginning with V. 451 Research took a different tack
  3. Cast your mind back to 2010/11 – everyone is trying to define ‘big data’ with words beginning with V. 451 Research took a different tack
  4. Connecting applications or data from multiple sources is not new – ESB, SOA, ETL have been around for a long time. But the old ways are not keeping up with today’s realities…
  5. Leading enterprises choose SnapLogic because we help them connect data and applications faster. We connect anything: sources including applications, APIs, things, or data We connect anytime: in batches, streaming, or in real time And we connect anywhere: on premises, in the cloud or a combination of both
  6. Here is an example of a SnapLogic deployment. The SnapLogic control plane – including he Designer, Manager and Dashboard - does not store your data. It’s metadata only. Once a pipeline is executed, it looks for the associated Snaplex or Hadooplex. The plex dynamically scales out, adding more nodes as needed. We like to say that SnapLogic “respects data gravity” and runs as close to the data as need be. If you are integrating only cloud applications, it would make no sense to run your integrations behind the firewall. Similarly, if you’re doing ground to ground or cloud to ground, you may want to run your Snaplex on Window or Linux servers. Note that the dotted line is sending instructions via metadata to the plex, which is waiting to run. The solid line indicates how data movies bi-directionally between systems.
  7. Leading enterprises choose SnapLogic because we help them connect data and applications faster. We connect anything: sources including applications, APIs, things, or data We connect anytime: in batches, streaming, or in real time And we connect anywhere: on premises, in the cloud or a combination of both