Do data quality issues, demanding business needs and increasingly stringent regulations sound familiar? You may have moved your data to a data lake on Amazon S3 or a data warehouse on Amazon Redshift, but how do you deliver the ‘single source of trust’ needed to make decisions with scale and speed? We will share best practices that have helped our customers overcome these challenges. Betfair Pty Ltd, the world’s largest online betting exchange, will also share their experience using Talend and Redshift to enable an analytics data warehouse with strong data quality and light governance practices.
6.
TALEND-AWS RELATIONSHIP
Fast Facts
• Advanced Technology Partner
• APN Data & Analytics Competency
• Global coverage
• 1,500+ AWS customers
SaaS Solutions on AWS
• Talend Cloud
• Stitch Data Loader
• API Services
Talend on AWS
• 70+ web service components
AWS Marketplace
• Talend Cloud Remote Engines
7.
70+ TALEND COMPONENTS FOR AWS
STORAGE | DATABASE | DATA WAREHOUSE | REAL-TIME | BIG DATA
S3 Aurora DynamoDB
RDS
Redshift Kinesis EMR
8.
TALEND ON AWS – KEY CAPABILITIES
POWER THE BUSINESS WITH A FULLY GOVERNED DATA LAKE
AND MODERN DATA WAREHOUSE
Scalable Data
Processing
Generate native Spark on
Amazon EMR
Control Costs
With changes in workload
on AWS
Data Cleansing &
Sharing
Across AWS and hybrid
environments
Across Environment
Ingest in minutes to
AWS
9.
TALEND ON AWS – USE-CASES
GOVERNANCE
Data Quality | Lineage | Stewardship | Catalog
Data Warehouse
Modernization
DATA INTEGRATION
Ingest | Transform | Cleanse
Cloud Data
Processing
Hive | Spark | Machine Learning
Real-time Analytics
Governed
Data Lake
11.
SHORTER TIME TO INSIGHT
INTEGRATION
SOURCES & STORAGE
Cloud
On-Premises
Apps Collect Govern Transform Share
Developers
TARGETS
Analysts /
Data Scientists
Business
Users
RAW INSIGHT-READY
12.
DATA TO INSIGHTS IN MINUTES
With Talend’s Stitch Data Loader and Amazon Redshift
90+ SaaS
SOURCES VISUALIZE & ANALYZE
SELF-SERVICE
DATA INGESTION
FOR THE
BUSINESS USER
LOB Analyst
Stitch
* Not all AWS services shown are necessary or required. Configuration depends on individual customer’s setup. Example workflow only.
Redshift
13.
GOVERNED AND TRUSTED DATA LAKE
Delivering a single source of truth with Talend Cloud on AWS
On-Premises apps, Big
Data and databases
SaaS Apps
Reporting (Looker,
Tableau, Qlik,
Amazon QuickSight)
Integrate
Transform
Ingest
Integrate
Transform
DQ/Cleanse
Integrate
Transform
DQ/Cleanse
Catalog
Lineage
Amazon S3 Redshift
EMR
Aurora
Athena
14.
ELASTIC BIG DATA PROVISIONING & CLUSTER RESIZING
Save on storage and compute costs as AWS workloads change
Spin up, spin
down
Cluster Auto-size
Resize clusters dynamically according to workload needs
within integration workflow
Spin up/down Amazon
Redshift and Amazon EMR
Create Spark
transformations
Push results through
Amazon EMR or into
Amazon Redshift for
aggregation or analysis
Redshift EMR
16.
ABOUT BETFAIR PTY LTD
Founded in 2005 & established as a joint venture in Australia
Fully owned by local business in 2014, providing the PaddyPower Betfair Exchange
The world’s largest online betting exchange
Constant state of transformation for the past 5 years
Customer of Talend for the last three years
17.
PAINS & CHALLENGES
Data Driven Business
Faster Business Decisions!
BUSINESS
OBJECTIVES
System Separation
From UK DWH to AUS DWH
Migrate Data into Cloud
CONTEXT
Explored Handcode & DVD
Import
SOLUTIONING
20.
OUTCOMES & BENEFITS
WHENEVER, WHATEVER
Autonomy for business users &
data analytics team
SPEEDY ANALYTICS
Control dataset type to be
pulled in from master
CENTRALIZED OPERATIONS
Ability to review jobs & audit
trails of sensitive data
21.
THE ROAD AHEAD
MOVING
AHEAD
Analytics DWH to full-fledged
DWH (on Talend Cloud)
DWH separation – full ops
@Betfair Pty Ltd
GOAL
Full DWH ops
Full Data Governance model
with lineage tracking for
auditability (data @ ANZ)
Fast analytics & insights
22.
WHY TALEND FOR AWS?
We provide governance to deliver trust and speed
On-Premises apps, Big Data and
databases
ANALYTICS, ML &
VISUALIZATION
INGEST TRANSFORM SHARE CLEAN
STITCH | TALEND CLOUD
STEWARDSHIP | LINEAGE | SECURE ACCESS INTEGRATION |
DATA PROCESSING
GOVERNANCE
SaaS Apps
SOURCES
CLOUD FIRST | ADDRESS COMPREHENSIVE NEEDS
23.
TALEND DELIVERS BOTH
SPEED AND TRUST
Get Started
Today with
Free Trial
https://cloud.talend.com
Learn More @
https://www.talend.com/solutions/information-technology/aws-cloud-integration/
Contact Us
https://www.talend.com/contact/