SlideShare a Scribd company logo
1 of 25
Download to read offline
1 | Proprietary & Confidential
Alation
Data Governance on the
Delta Lake
Raja Perumal
November 2020
2 | Proprietary & Confidential
Agenda
• The Alation Data Catalog
• Alation Active Data Governance
- Our Approach
- Capabilities Overview
• Alation and Databricks
3 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Data Culture Is Quickly Becoming an Imperative
The Benefits are
Compelling
• Insights-driven
businesses on track to
earn $1.8T by 2021
• 7x more likely to
increase revenue as
result of big data
• 2.8x more likely to have
double-digit growth
• 1.6x more likely to
increase revenue
The Stakes are High
and Mistakes Costly
• Compliance with new
laws and regulations
• Building analyses and
models on bad data
• The best defense is
a good offense
Activities Critical to Data &
Analytics Team Success
*Insights-Driven Businesses Set the Pace for Global Growth, October 2018
4 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
The Pandemic Is Accelerating Change
In May, we did a full project review and it was
determined that the enterprise data catalog initiative
was of vital strategic importance to our business.”
– LEADING AUTOMOBILE MANUFACTURER
COVID-19 is Accelerating
Digital Transformation
Data & Analytics
Are Key Enablers
Data and analytics are the key accelerant of an
organization’s digitization and transformation efforts.”
–
The future got accelerated because of #covid19! Are you ready?
#PostPandemic
”
”
5 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
The Alation Vision
To empower a curious and rational world
Data Culture
Data
Literacy
Enable proper
interpretation & analysis
Data
Governance
Take responsibility
& authority
Data Search
& Discovery
Find &
understand
6 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Data Is Supposed to Be an Asset, Not a Liability
Too much data,
not too little
Capture tribal knowledge,
enable collaboration
Balance compliance
with access
Data Explosion
• 63 ZB in 2021 (volume)
• 300+ DBMSs (variety)
• IoT and Edge (velocity)
Evolving Laws
• HIPAA / PII / PHI
• Europe / GDPR
• Calif / CCPA
Changing Workforce
• Turnover, restructuring
• New SMEs, lost stewards
• Remote work (WFH)
7 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
What is a Data Catalog?
• A repository of metadata on information
sources across an organization
- Search & discovery
- Collaboration & analysis
- Curation & data governance
• Catalogs a broad range
of information assets
- Data sets, tables, articles, reports,
queries, visualizations, conversations
Answers these core questions:
How to find information? Can it be used? Should it be used? How should it be used?
8 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Five Things To Look For in a Data Catalog
Intelligence
Behavioral, linguistic and inferential intelligence drive accelerated time to insight.
Collaboration
Crowdsourcing, conversations and expert steward identification drive usage and trust.
Guided Navigation
Guide users to the right data to accelerate analyst onboarding.
Active Governance
Enforce policies at the point of consumption to balance business value and compliance.
Broad, Deep Connectivity
Maximize access to trusted data to speed discovery and insight.
9 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Not All Data Catalogs Are Created Equal
Tool Adjunct Governance Suite Catalog as Platform
Purpose
Find assets associated
with a given tool or
infrastructure
One-stop shopping for
loosely integrated
metadata suite
Find, understand, and
govern assets across
the enterprise
Origins The tool or
infrastructure
Governance and
compliance
Data search and
discovery
Orientation Tool / infrastructure Data / control People / access
Architecture Add-on Cobbled together Integrated platform
Data strategy Offense Defense Both
There are three distinct types of catalog on the market today
10 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
7 | Proprietary & Confidential
The Catalog is a Platform (For Data Management)
Solutions
APIs/SDKs
Search Data Domains SQL Editor Collaboration Glossaries Analytics
Active Metadata Catalog
Behavior Analysis Engine
Universal Connector Framework
Platform
HDFSRelational BI …Snowflake AWS Azure GCP
Data Search &
Discovery
Governance &
Stewardship
Data Privacy Cloud Data Migration Other…
Databricks
The Catalog is a Platform (for Data Management)
11 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Data Explosion
Data Growth (IDC) and
increasing regulations.
The sum of all data
created, captured, or
replicated at core, edge,
and endpoint locations -
will expand from 33 ZB in
2018 to 175 ZB by 2025.”
Data Breaches
The number of reported
data breaches increased
33% in 2019, resulting in
more than 7.9 billion
personal records exposed.”
Top Concern
Data governance has
surpassed cybersecurity
preparedness as the top
concern of Chief Audit
Executives.”
Data Governance is on Everyone’s Mind
”
” ”
12 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Traditional Approach to Governance is Failing
Challenges with traditional approach
• Focus on governing data rather than people’s behavior
• Prolonged, Expensive
• Defensive – Focused primarily on risk mitigation
• Lacks engagement with business stakeholders
• Not exercised at the point of data use
13 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Active Data Governance - a Different, Value-
Driven Approach
People-First Operationalized Collaborative Intelligent
Operationalize policy
into action at point of use;
measure via analytics
Guide behavior;
discover and formalize
relationship with data
Automated and guided
via AI and ML
Crowdsourcing and
community-driven
Adaptive
Adjust to changes in the business
environment with agility
14 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Active Data Governance Process
• Pragmatic, not theoretical
• Business, not IT-centric
• Incremental, not 'big-bang'
implementation
• Community-driven, not
committee-controlled
• Guided, not gated
participation
• Employee value, not steward
& IT centric value
• Data usage, not
documentation dead ends
15 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Data Analyst Data Scientist IT PersonaData Steward Business User
Alation empowers a diverse set of personas along the value chain
16 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Scale and Automate Governance with Alation
Data Policies
• Singular view of enterprise data policies and agile approvals
• Policies surfaced at the point of data consumption
Data Stewardship
Data Lineage
• Automatic lineage extraction from data sources and BI tools
• Cross-system, column-level lineage
• Impact analysis
Business Glossary
• Agile, intelligent stewardship with analytics and dashboards
to measure and address curation gaps
• Business-approved definitions
• Automatic suggestion/association of business terms,
auto-titling
Crowdsourcing & Collaboration
• Out-of-the box collaboration capabilities and gamification
Metadata Security
• Role-based security
• Integration with identity management systems
Data Inventorying, Insights
• Deep, broad connectivity including QLI; automated
metadata extraction
• Behavior Insights-top users, assets
Data Quality
• Data quality alerts, scores, trust flags, warnings cataloged
adjacent to data
• Quality information surfaced at the point of data consumption
PII Discovery & Classification
• Sensitive data discovery
• Automatic classification & tagging
17 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Alation & Databricks
18 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Challenges in Machine Learning on a Cloud Platform
Moving to a cloud
environment is a
complex project
requiring significant
time and resources
Data scientists waste
time and energy
finding data to use for
experiments
Data scientists have
difficulty finding
expertise about a data
set, and analysis
already done
19 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Establish a Cloud Environment
Popularity identifies essential data assets
to migrate, lineage identifies how data is
used, warnings communicate new location
Unified data service ingests data into data
lake, manages ETL and security, and
enables access for ML and BI
Lower the risk of moving to the cloud,
build a dynamic cloud environment and
accelerate use of data lakes and cloud
processing for faster ML
Joint Value
Proposition
Moving to a cloud
environment is a complex
project requiring significant
time and resources
Solution
Establish, maintain,
and benefit from a cloud
data environment
20 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Select the Best Data for Machine Learning
Discover data sources across the
enterprise, understand usage patterns
and quickly select data for analysis
Simplifies the data pipeline across
batch and streaming data to create a
fast and reliable cloud platform
Data scientists can efficiently discover,
understand, transform and analyze data
at enterprise scale on the full dataset
Joint Value
Proposition
Data scientists waste time
and energy finding data to
use for experiments
Solution
Find and use the best data
for fast and accurate ML
21 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Collaborate to Drive Data Culture
Context and conversations enable data
scientists to collaborate, and the
catalog identifies data users and experts
Track experiments and models, publish
dashboards, and facilitate hand-offs
among 1000s of users in real-time
Collaboration results in better predictive
models and business insights are shared
with more users across the enterprise
Joint Value
Proposition
Data scientists have
difficulty finding expertise
about a data set, and
analysis already done
Solution
Collaborate to share
knowledge and increase
the value of data
22 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
DATABRICKS CONFIDENTIAL
Alation & SQL Analytics Service
DELTA LakeODBC/
JDBC
Drivers
BI & SQL
Client
Connectors
Routing
Service
Query
Planning
Query
Execution
Databricks
SQL Analytics
23 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
SQL Analytics Service Endpoints
SQL Optimized Compute
SQL Endpoints give a quick way to setup SQL /
BI optimized compute. You pick a tshirt size.
Databricks will ensure configuration that
provides the highest price/performance.
Concurrency Scaling Built-in
Virtual clusters can load balance queries across
multiple clusters behind the scenes, providing
unlimited concurrency.
24 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™
Alation and Databricks
Natural language
search across entire
enterprise, and find
data in data lakes
Find data in all
enterprise sources,
including Databricks
and Delta Lake
Identify data experts
and users for
collaboration and
connect to build data
science models in
Databricks
People who know
about the data can
add context for
more users to
understand data
Conversations to
collaborate with data
experts and users
Migrate to a cloud
environment with
confidence by
identifying the most
popular data to
prioritize
25 | Proprietary & Confidential
Who to Contact to Learn More
Raja Perumal
Business Development (ISV)
Raja.Perumal@alation.com

More Related Content

What's hot

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDatabricks
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesDATAVERSITY
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
 
Data Catalog as a Business Enabler
Data Catalog as a Business EnablerData Catalog as a Business Enabler
Data Catalog as a Business EnablerSrinivasan Sankar
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data platform architecture
Data platform architectureData platform architecture
Data platform architectureSudheer Kondla
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogDATAVERSITY
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
 
Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureDatabricks
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best PracticesDATAVERSITY
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricCambridge Semantics
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for DinnerKent Graziano
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?Precisely
 
DAS Slides: Data Architect vs. Data Engineer vs. Data Modeler
DAS Slides: Data Architect vs. Data Engineer vs. Data ModelerDAS Slides: Data Architect vs. Data Engineer vs. Data Modeler
DAS Slides: Data Architect vs. Data Engineer vs. Data ModelerDATAVERSITY
 
Azure data analytics platform - A reference architecture
Azure data analytics platform - A reference architecture Azure data analytics platform - A reference architecture
Azure data analytics platform - A reference architecture Rajesh Kumar
 
Building the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake AnalyticsBuilding the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake AnalyticsKhalid Salama
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data EngineeringC4Media
 

What's hot (20)

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Data Catalog as a Business Enabler
Data Catalog as a Business EnablerData Catalog as a Business Enabler
Data Catalog as a Business Enabler
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data platform architecture
Data platform architectureData platform architecture
Data platform architecture
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data Catalog
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse Architecture
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?
 
DAS Slides: Data Architect vs. Data Engineer vs. Data Modeler
DAS Slides: Data Architect vs. Data Engineer vs. Data ModelerDAS Slides: Data Architect vs. Data Engineer vs. Data Modeler
DAS Slides: Data Architect vs. Data Engineer vs. Data Modeler
 
Azure data analytics platform - A reference architecture
Azure data analytics platform - A reference architecture Azure data analytics platform - A reference architecture
Azure data analytics platform - A reference architecture
 
Building the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake AnalyticsBuilding the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake Analytics
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data Engineering
 

Similar to Active Governance Across the Delta Lake with Alation

¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?DATAVERSITY
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIDenodo
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Denodo
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewDenodo
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
ADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and ComparisonADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and ComparisonDATAVERSITY
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentationPriyesh Patel
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AIDATAVERSITY
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Nathan Bijnens
 
Reinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital TransformationReinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital TransformationDenodo
 
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderEmbedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderDataconomy Media
 
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDATAVERSITY
 
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...DataScienceConferenc1
 
Setting Up the Data Lake
Setting Up the Data LakeSetting Up the Data Lake
Setting Up the Data LakeCaserta
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Denodo
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationDenodo
 
Unified Information Governance, Powered by Knowledge Graph
Unified Information Governance, Powered by Knowledge GraphUnified Information Governance, Powered by Knowledge Graph
Unified Information Governance, Powered by Knowledge GraphVaticle
 

Similar to Active Governance Across the Delta Lake with Alation (20)

¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 View
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
Sgcp14dunlea
Sgcp14dunleaSgcp14dunlea
Sgcp14dunlea
 
ADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and ComparisonADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and Comparison
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentation
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AI
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
Reinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital TransformationReinvent Your Data Management Strategy for Successful Digital Transformation
Reinvent Your Data Management Strategy for Successful Digital Transformation
 
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderEmbedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern Staender
 
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the SameDAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
DAS Slides: Cloud-Based Data Warehousing – What’s New and What Stays the Same
 
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
[DSC Europe 23] Milos Solujic - Data Lakehouse Revolutionizing Data Managemen...
 
Setting Up the Data Lake
Setting Up the Data LakeSetting Up the Data Lake
Setting Up the Data Lake
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
 
Unified Information Governance, Powered by Knowledge Graph
Unified Information Governance, Powered by Knowledge GraphUnified Information Governance, Powered by Knowledge Graph
Unified Information Governance, Powered by Knowledge Graph
 

More from Databricks

Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Databricks
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Databricks
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Databricks
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of HadoopDatabricks
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDatabricks
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceDatabricks
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringDatabricks
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixDatabricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationDatabricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchDatabricks
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesDatabricks
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesDatabricks
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsDatabricks
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkDatabricks
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkDatabricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesDatabricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkDatabricks
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeDatabricks
 
Machine Learning CI/CD for Email Attack Detection
Machine Learning CI/CD for Email Attack DetectionMachine Learning CI/CD for Email Attack Detection
Machine Learning CI/CD for Email Attack DetectionDatabricks
 

More from Databricks (20)

Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on Kubernetes
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature Aggregations
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta Lake
 
Machine Learning CI/CD for Email Attack Detection
Machine Learning CI/CD for Email Attack DetectionMachine Learning CI/CD for Email Attack Detection
Machine Learning CI/CD for Email Attack Detection
 

Recently uploaded

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 

Recently uploaded (20)

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 

Active Governance Across the Delta Lake with Alation

  • 1. 1 | Proprietary & Confidential Alation Data Governance on the Delta Lake Raja Perumal November 2020
  • 2. 2 | Proprietary & Confidential Agenda • The Alation Data Catalog • Alation Active Data Governance - Our Approach - Capabilities Overview • Alation and Databricks
  • 3. 3 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Data Culture Is Quickly Becoming an Imperative The Benefits are Compelling • Insights-driven businesses on track to earn $1.8T by 2021 • 7x more likely to increase revenue as result of big data • 2.8x more likely to have double-digit growth • 1.6x more likely to increase revenue The Stakes are High and Mistakes Costly • Compliance with new laws and regulations • Building analyses and models on bad data • The best defense is a good offense Activities Critical to Data & Analytics Team Success *Insights-Driven Businesses Set the Pace for Global Growth, October 2018
  • 4. 4 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ The Pandemic Is Accelerating Change In May, we did a full project review and it was determined that the enterprise data catalog initiative was of vital strategic importance to our business.” – LEADING AUTOMOBILE MANUFACTURER COVID-19 is Accelerating Digital Transformation Data & Analytics Are Key Enablers Data and analytics are the key accelerant of an organization’s digitization and transformation efforts.” – The future got accelerated because of #covid19! Are you ready? #PostPandemic ” ”
  • 5. 5 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ The Alation Vision To empower a curious and rational world Data Culture Data Literacy Enable proper interpretation & analysis Data Governance Take responsibility & authority Data Search & Discovery Find & understand
  • 6. 6 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Data Is Supposed to Be an Asset, Not a Liability Too much data, not too little Capture tribal knowledge, enable collaboration Balance compliance with access Data Explosion • 63 ZB in 2021 (volume) • 300+ DBMSs (variety) • IoT and Edge (velocity) Evolving Laws • HIPAA / PII / PHI • Europe / GDPR • Calif / CCPA Changing Workforce • Turnover, restructuring • New SMEs, lost stewards • Remote work (WFH)
  • 7. 7 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ What is a Data Catalog? • A repository of metadata on information sources across an organization - Search & discovery - Collaboration & analysis - Curation & data governance • Catalogs a broad range of information assets - Data sets, tables, articles, reports, queries, visualizations, conversations Answers these core questions: How to find information? Can it be used? Should it be used? How should it be used?
  • 8. 8 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Five Things To Look For in a Data Catalog Intelligence Behavioral, linguistic and inferential intelligence drive accelerated time to insight. Collaboration Crowdsourcing, conversations and expert steward identification drive usage and trust. Guided Navigation Guide users to the right data to accelerate analyst onboarding. Active Governance Enforce policies at the point of consumption to balance business value and compliance. Broad, Deep Connectivity Maximize access to trusted data to speed discovery and insight.
  • 9. 9 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Not All Data Catalogs Are Created Equal Tool Adjunct Governance Suite Catalog as Platform Purpose Find assets associated with a given tool or infrastructure One-stop shopping for loosely integrated metadata suite Find, understand, and govern assets across the enterprise Origins The tool or infrastructure Governance and compliance Data search and discovery Orientation Tool / infrastructure Data / control People / access Architecture Add-on Cobbled together Integrated platform Data strategy Offense Defense Both There are three distinct types of catalog on the market today
  • 10. 10 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ 7 | Proprietary & Confidential The Catalog is a Platform (For Data Management) Solutions APIs/SDKs Search Data Domains SQL Editor Collaboration Glossaries Analytics Active Metadata Catalog Behavior Analysis Engine Universal Connector Framework Platform HDFSRelational BI …Snowflake AWS Azure GCP Data Search & Discovery Governance & Stewardship Data Privacy Cloud Data Migration Other… Databricks The Catalog is a Platform (for Data Management)
  • 11. 11 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Data Explosion Data Growth (IDC) and increasing regulations. The sum of all data created, captured, or replicated at core, edge, and endpoint locations - will expand from 33 ZB in 2018 to 175 ZB by 2025.” Data Breaches The number of reported data breaches increased 33% in 2019, resulting in more than 7.9 billion personal records exposed.” Top Concern Data governance has surpassed cybersecurity preparedness as the top concern of Chief Audit Executives.” Data Governance is on Everyone’s Mind ” ” ”
  • 12. 12 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Traditional Approach to Governance is Failing Challenges with traditional approach • Focus on governing data rather than people’s behavior • Prolonged, Expensive • Defensive – Focused primarily on risk mitigation • Lacks engagement with business stakeholders • Not exercised at the point of data use
  • 13. 13 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Active Data Governance - a Different, Value- Driven Approach People-First Operationalized Collaborative Intelligent Operationalize policy into action at point of use; measure via analytics Guide behavior; discover and formalize relationship with data Automated and guided via AI and ML Crowdsourcing and community-driven Adaptive Adjust to changes in the business environment with agility
  • 14. 14 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Active Data Governance Process • Pragmatic, not theoretical • Business, not IT-centric • Incremental, not 'big-bang' implementation • Community-driven, not committee-controlled • Guided, not gated participation • Employee value, not steward & IT centric value • Data usage, not documentation dead ends
  • 15. 15 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Data Analyst Data Scientist IT PersonaData Steward Business User Alation empowers a diverse set of personas along the value chain
  • 16. 16 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Scale and Automate Governance with Alation Data Policies • Singular view of enterprise data policies and agile approvals • Policies surfaced at the point of data consumption Data Stewardship Data Lineage • Automatic lineage extraction from data sources and BI tools • Cross-system, column-level lineage • Impact analysis Business Glossary • Agile, intelligent stewardship with analytics and dashboards to measure and address curation gaps • Business-approved definitions • Automatic suggestion/association of business terms, auto-titling Crowdsourcing & Collaboration • Out-of-the box collaboration capabilities and gamification Metadata Security • Role-based security • Integration with identity management systems Data Inventorying, Insights • Deep, broad connectivity including QLI; automated metadata extraction • Behavior Insights-top users, assets Data Quality • Data quality alerts, scores, trust flags, warnings cataloged adjacent to data • Quality information surfaced at the point of data consumption PII Discovery & Classification • Sensitive data discovery • Automatic classification & tagging
  • 17. 17 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Alation & Databricks
  • 18. 18 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Challenges in Machine Learning on a Cloud Platform Moving to a cloud environment is a complex project requiring significant time and resources Data scientists waste time and energy finding data to use for experiments Data scientists have difficulty finding expertise about a data set, and analysis already done
  • 19. 19 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Establish a Cloud Environment Popularity identifies essential data assets to migrate, lineage identifies how data is used, warnings communicate new location Unified data service ingests data into data lake, manages ETL and security, and enables access for ML and BI Lower the risk of moving to the cloud, build a dynamic cloud environment and accelerate use of data lakes and cloud processing for faster ML Joint Value Proposition Moving to a cloud environment is a complex project requiring significant time and resources Solution Establish, maintain, and benefit from a cloud data environment
  • 20. 20 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Select the Best Data for Machine Learning Discover data sources across the enterprise, understand usage patterns and quickly select data for analysis Simplifies the data pipeline across batch and streaming data to create a fast and reliable cloud platform Data scientists can efficiently discover, understand, transform and analyze data at enterprise scale on the full dataset Joint Value Proposition Data scientists waste time and energy finding data to use for experiments Solution Find and use the best data for fast and accurate ML
  • 21. 21 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Collaborate to Drive Data Culture Context and conversations enable data scientists to collaborate, and the catalog identifies data users and experts Track experiments and models, publish dashboards, and facilitate hand-offs among 1000s of users in real-time Collaboration results in better predictive models and business insights are shared with more users across the enterprise Joint Value Proposition Data scientists have difficulty finding expertise about a data set, and analysis already done Solution Collaborate to share knowledge and increase the value of data
  • 22. 22 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ DATABRICKS CONFIDENTIAL Alation & SQL Analytics Service DELTA LakeODBC/ JDBC Drivers BI & SQL Client Connectors Routing Service Query Planning Query Execution Databricks SQL Analytics
  • 23. 23 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ SQL Analytics Service Endpoints SQL Optimized Compute SQL Endpoints give a quick way to setup SQL / BI optimized compute. You pick a tshirt size. Databricks will ensure configuration that provides the highest price/performance. Concurrency Scaling Built-in Virtual clusters can load balance queries across multiple clusters behind the scenes, providing unlimited concurrency.
  • 24. 24 | © 2020 Alation, Inc. – All Rights Reserved.The Catalog is the Platform™ Alation and Databricks Natural language search across entire enterprise, and find data in data lakes Find data in all enterprise sources, including Databricks and Delta Lake Identify data experts and users for collaboration and connect to build data science models in Databricks People who know about the data can add context for more users to understand data Conversations to collaborate with data experts and users Migrate to a cloud environment with confidence by identifying the most popular data to prioritize
  • 25. 25 | Proprietary & Confidential Who to Contact to Learn More Raja Perumal Business Development (ISV) Raja.Perumal@alation.com