SlideShare a Scribd company logo
1 of 19
Accelerating BI on Big Data
Topics
• BI on Big Data Trade-Off
• SQL-on-Hadoop Performance Challenges
• Live Demo: Tableau on Hadoop
Impala / Redshift / Jethro
• Jethro Technology Overview
• What Does Jethro Do?
– Acceleration server for BI on Big Data
• How It Works?
– Full Indexing and cube caching
– Combines Columnar SQL DB design
with search-indexing technology
• When to Use It?
– Reporting, dashboards, discovery, ad-hoc
• How to Get It
– Download & free evaluation
• Partnerships
– BI & Hadoop vendors
About Us
SQL
Data
• Typical usage based on extracting selective
data from remote data sources
• Extracted data then dynamically loaded into
memory for interactive analysis
• Challenges:
– Size: performance degradation typically
~250M rows
– Refresh lag time
BI & Big Data: Extract (In-Memory)
Tableau & Big Data
Data
Extract
• For every user interaction Tableau issues
SQL queries to the target DB
• DB retrieves requested data, processes
SQL aggregations and returns to Tableau
• Challenges:
– DB performance is significantly slower than
in-mem speed
BI & Big Data: Live-Connect (In-DB)
Tableau & Big Data
Queries
Live Access
SQL enables the change of data platform while keeping the analytic apps intact
Analytics: ETL, Predictive, Reporting, BI
10x-100x
Data
1/10 HW
$cost
Open
Platform
Big Data Platforms: Hadoop vs. EDW Appliances
SQL-on-Hadoop Performance Challenges
SQL-on-Hadoop
ETL Predictive Reporting

BI
Too SLOW on Hadoopx
It’s unrealistic to expect to the same performance when data is much larger and highly optimized
hardware is replaced with commodity boxes
The Hadoop Trade-Off: Scale & Cost vs. Performance
SQL-on-Hadoop Performance Challenges
More Hardware
– Add nodes, RAM, CPU, SSD, network
Different SQL-on-Hadoop engines
– Hive, Impala, Drill, SparkSQL, HAWQ,
Presto, Actian, etc.
Rigid Data Model
– Less granularity, more pre-aggregations
– Pre-defined OLAP Cubes
– De-normalize into single large table
– Multiple partition keys (replication)
Replicate from Hadoop to EDW
– Traditional: Teradata, Vertica, Netezza,
…
– Cloud: Redshift
– As-a-Svc: BigQuery, Snowflake, Qubole
No Hadoop, No EDW
– Search: Elastic + Kibana
– NoSQL: Hbase, Cassandra, MongoDB
BI & Data Combined
– Full-stack Hadoop: Platfora, Arcadia
– As-a-Svc: DOMO, QuikSight, PowerBI,
…
BI on Big Data: Technology Alternatives
A Library Analogy:
Billions of books, Thousands of racks
Query:
List books by author “Stephen King”
Process:
Every librarian pulls out book by book
from their rack and check for Author
• Hive
• Impala
• Presto
• SparkSQL
• Drill
• Pivotal/HAWQ
• IBM/Big SQL
• Actian
• …
SQL-on-Hadoop: MPP/Full-Scan Architecture
SQL-on-Hadoop Performance Challenges
Unsuitable for BI
Query:
List books by author “Stephen King”
Process:
Access Author index, entry of “Stephen
King”, get list of books, fetch only these
books
Result:
Fast, minimal resources, scalable
SQL-on-Hadoop: Index-Access Architecture
SQL-on-Hadoop Performance Challenges
Optimal for BI
Hardware Data Format Hadoop
Cluster
Compute
Cluster
Total RAM,
CPU
AWS $
per hr.
Jethro Jethro indexes 3x m1.xlarge 2x r3.4xlarge
(spot)
290GB, 44
cores
$0.75
Impala Parquet 6x r3.2xlarge
1x r3.xlarge
390GB, 52
cores
$4.25
Redshift Redshift 6x dc1.large 90GB, 12
cores
$1.50
• Point browser to: tableau.jethrodata.com
– Login: demo / demo
• Choose workbook: Jethro, Impala, Redshift
• Dashboard interaction: choose year,
category or any other filters to drill-down
• Data
– Based on TPC-DS benchmark
– 1TB raw data (400GB fact)
– Fact table: ~2.9B rows
– 7 Dimensions
LIVE Benchmark: Tableau on Hadoop (and Redshift)
Live Benchmark
Indexing Data for Jethro Acceleration
• Identify BI-worthy datasets
– Not all data in Hadoop should have Jethro
• Jethro “loader” creates an indexed version
– Stores back in same HDFS
• If no Hadoop is used it can also be stored in local
filesystem, network storage or cloud storage (e.g. S3)
– Highly efficient: ~1B rows/hour, 3x compression
• Incremental refresh
– As frequently as every min, hour, day, …
– Does not require a full-rebuild of index
Raw Indexed
Data
Node
Data
Node
Data
Node
Data
Node
Data
Node
Jethro
Query
Node
Jethro
QN
1. Index Access 2. Read data only for required rows
Performance and resources based on the size of the working-set
Storage
- HDFS
- Cloud (S3, EFS)
- NAS/SAN
- Local FS
SELECT date, SUM(sales) FROM T1 WHERE product=‘Books’ AND state=‘NY’ GROUP BY date
Index-Access: How it Works
Jethro Indexes – Superior Technology
http://www.google.com/patents/WO2013001535A3?cl=enPatent Pending:
Complete
– Every column is indexed
Simple
– Inverted-list indexes map each
column value to a list of rows
Fast to read
– Index-of-index provides direct
access to a value entry
– No need to scan entire index,
or load index to memory
Scalable
– Distributed, highly hierarchical
compressed bitmaps
Fast to write
– Appendable index structure for
fast incremental refresh
Automated Cube & Query Cashing
• Every query is cached
– Based on result-set size vs. execution time
• Cubes generated automatically
– Identify repeat query patterns
– For example: adding the filter as a col to a
GROUP BY
• All stored in HDFS
– 10,000’s of cashed cubes and queries
• Incremental refresh
– Query executes ONLY on the incremental data
and then merges with cached results
What Is Jethro for Tableau?
An indexing & cashing server
1. Tableau uses live connect
(ODBC) to send SQL queries
2. Jethro checks if query can
be served from existing
cubes
– Yes: reply to Tableau
3. Jethro uses indexed table to
access only necessary data
– Auto create a cube based on
this and similar queries
Live
Connect
HDFS
BI Tools
Why Jethro is the Right Technology for BI on Big
Data?
Limitless BI on Big Data: Supporting the full-range of BI use-cases.
Jethro’s technology is a unique and optimal fit.
1. Full indexing enables interactive discovery and fast drill down
– Eliminates need to repeatedly read unnecessary data. The deeper you go the
faster it gets!
2. Auto cubes & cache enables interactive dashboards and fast reports
– Optimize repeat query performance
3. Incremental-refresh enables LIVE BI over streaming data
– Reduces maintenance and cuts lag time
Ready to Try Jethro?
1. Register: jethro.io
– Download and Install on-prem or cloud
2. Schedule a 30min POC review with Jethro SA (free!)
3. Index BI-worthy datasets
4. Use Tableau
5. Train Jethro with BI apps
– Continuous performance improvement
That’s It!
Accelerating BI on Big Data

More Related Content

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Recently uploaded (20)

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Jethro for Tableau - Accelerating BI on Big Data

  • 2. Topics • BI on Big Data Trade-Off • SQL-on-Hadoop Performance Challenges • Live Demo: Tableau on Hadoop Impala / Redshift / Jethro • Jethro Technology Overview
  • 3. • What Does Jethro Do? – Acceleration server for BI on Big Data • How It Works? – Full Indexing and cube caching – Combines Columnar SQL DB design with search-indexing technology • When to Use It? – Reporting, dashboards, discovery, ad-hoc • How to Get It – Download & free evaluation • Partnerships – BI & Hadoop vendors About Us SQL Data
  • 4. • Typical usage based on extracting selective data from remote data sources • Extracted data then dynamically loaded into memory for interactive analysis • Challenges: – Size: performance degradation typically ~250M rows – Refresh lag time BI & Big Data: Extract (In-Memory) Tableau & Big Data Data Extract
  • 5. • For every user interaction Tableau issues SQL queries to the target DB • DB retrieves requested data, processes SQL aggregations and returns to Tableau • Challenges: – DB performance is significantly slower than in-mem speed BI & Big Data: Live-Connect (In-DB) Tableau & Big Data Queries Live Access
  • 6. SQL enables the change of data platform while keeping the analytic apps intact Analytics: ETL, Predictive, Reporting, BI 10x-100x Data 1/10 HW $cost Open Platform Big Data Platforms: Hadoop vs. EDW Appliances SQL-on-Hadoop Performance Challenges
  • 7. SQL-on-Hadoop ETL Predictive Reporting  BI Too SLOW on Hadoopx It’s unrealistic to expect to the same performance when data is much larger and highly optimized hardware is replaced with commodity boxes The Hadoop Trade-Off: Scale & Cost vs. Performance SQL-on-Hadoop Performance Challenges
  • 8. More Hardware – Add nodes, RAM, CPU, SSD, network Different SQL-on-Hadoop engines – Hive, Impala, Drill, SparkSQL, HAWQ, Presto, Actian, etc. Rigid Data Model – Less granularity, more pre-aggregations – Pre-defined OLAP Cubes – De-normalize into single large table – Multiple partition keys (replication) Replicate from Hadoop to EDW – Traditional: Teradata, Vertica, Netezza, … – Cloud: Redshift – As-a-Svc: BigQuery, Snowflake, Qubole No Hadoop, No EDW – Search: Elastic + Kibana – NoSQL: Hbase, Cassandra, MongoDB BI & Data Combined – Full-stack Hadoop: Platfora, Arcadia – As-a-Svc: DOMO, QuikSight, PowerBI, … BI on Big Data: Technology Alternatives
  • 9. A Library Analogy: Billions of books, Thousands of racks Query: List books by author “Stephen King” Process: Every librarian pulls out book by book from their rack and check for Author • Hive • Impala • Presto • SparkSQL • Drill • Pivotal/HAWQ • IBM/Big SQL • Actian • … SQL-on-Hadoop: MPP/Full-Scan Architecture SQL-on-Hadoop Performance Challenges Unsuitable for BI
  • 10. Query: List books by author “Stephen King” Process: Access Author index, entry of “Stephen King”, get list of books, fetch only these books Result: Fast, minimal resources, scalable SQL-on-Hadoop: Index-Access Architecture SQL-on-Hadoop Performance Challenges Optimal for BI
  • 11. Hardware Data Format Hadoop Cluster Compute Cluster Total RAM, CPU AWS $ per hr. Jethro Jethro indexes 3x m1.xlarge 2x r3.4xlarge (spot) 290GB, 44 cores $0.75 Impala Parquet 6x r3.2xlarge 1x r3.xlarge 390GB, 52 cores $4.25 Redshift Redshift 6x dc1.large 90GB, 12 cores $1.50 • Point browser to: tableau.jethrodata.com – Login: demo / demo • Choose workbook: Jethro, Impala, Redshift • Dashboard interaction: choose year, category or any other filters to drill-down • Data – Based on TPC-DS benchmark – 1TB raw data (400GB fact) – Fact table: ~2.9B rows – 7 Dimensions LIVE Benchmark: Tableau on Hadoop (and Redshift) Live Benchmark
  • 12. Indexing Data for Jethro Acceleration • Identify BI-worthy datasets – Not all data in Hadoop should have Jethro • Jethro “loader” creates an indexed version – Stores back in same HDFS • If no Hadoop is used it can also be stored in local filesystem, network storage or cloud storage (e.g. S3) – Highly efficient: ~1B rows/hour, 3x compression • Incremental refresh – As frequently as every min, hour, day, … – Does not require a full-rebuild of index Raw Indexed
  • 13. Data Node Data Node Data Node Data Node Data Node Jethro Query Node Jethro QN 1. Index Access 2. Read data only for required rows Performance and resources based on the size of the working-set Storage - HDFS - Cloud (S3, EFS) - NAS/SAN - Local FS SELECT date, SUM(sales) FROM T1 WHERE product=‘Books’ AND state=‘NY’ GROUP BY date Index-Access: How it Works
  • 14. Jethro Indexes – Superior Technology http://www.google.com/patents/WO2013001535A3?cl=enPatent Pending: Complete – Every column is indexed Simple – Inverted-list indexes map each column value to a list of rows Fast to read – Index-of-index provides direct access to a value entry – No need to scan entire index, or load index to memory Scalable – Distributed, highly hierarchical compressed bitmaps Fast to write – Appendable index structure for fast incremental refresh
  • 15. Automated Cube & Query Cashing • Every query is cached – Based on result-set size vs. execution time • Cubes generated automatically – Identify repeat query patterns – For example: adding the filter as a col to a GROUP BY • All stored in HDFS – 10,000’s of cashed cubes and queries • Incremental refresh – Query executes ONLY on the incremental data and then merges with cached results
  • 16. What Is Jethro for Tableau? An indexing & cashing server 1. Tableau uses live connect (ODBC) to send SQL queries 2. Jethro checks if query can be served from existing cubes – Yes: reply to Tableau 3. Jethro uses indexed table to access only necessary data – Auto create a cube based on this and similar queries Live Connect HDFS BI Tools
  • 17. Why Jethro is the Right Technology for BI on Big Data? Limitless BI on Big Data: Supporting the full-range of BI use-cases. Jethro’s technology is a unique and optimal fit. 1. Full indexing enables interactive discovery and fast drill down – Eliminates need to repeatedly read unnecessary data. The deeper you go the faster it gets! 2. Auto cubes & cache enables interactive dashboards and fast reports – Optimize repeat query performance 3. Incremental-refresh enables LIVE BI over streaming data – Reduces maintenance and cuts lag time
  • 18. Ready to Try Jethro? 1. Register: jethro.io – Download and Install on-prem or cloud 2. Schedule a 30min POC review with Jethro SA (free!) 3. Index BI-worthy datasets 4. Use Tableau 5. Train Jethro with BI apps – Continuous performance improvement That’s It!
  • 19. Accelerating BI on Big Data