IAC 2024 - IA Fast Track to Search Focused AI Solutions
End To End Business Intelligence On Google Cloud
1. Tu Pham - CTO @ Eway
Google Cloud Next
-- Surabaya, Indonesia - 06/2019 --
End to End Business Intelligence
on Google Cloud
2. About Me - CTO at Eway JSC
- Google Developer Expert on Cloud
Platform
- 8 years experience on Big data and
Cloud Computing
- Open source contributor, blogger,
father
2
12. In Affiliate Marketing, We Are Partners With
- Indonesia
- Go-Jek
- Bukalapak
- Traveloka
- The World
- Lazada
- Shopee
- Aliexpress
- Adcombo
- Leadbit
12
31. Step 2: GC Compute Engine Instances
Convert Raw Data To Apache Parquet Files
- Technology: Compute Engine, Parquet file format
- Why Parquet:
- Self-describing, columnar storage format
- Language-independent
- High query-performance
- Spark SQL is much faster with Parquet
- High compression (up to 70%)- less disk IO
31
32. Step 2: GC Compute Engine Instances
Convert Raw Data To Apache Parquet Files
32
33. Step 2: GC Compute Engine Instances
Convert Raw Data To Apache Parquet Files
33
34. - Technology: Compute Engine, Parquet file format, Cloud Storage
- Why Cloud Storage:
- Four storage classes
- Easy to integrate
- Object Lifecycle Management
- Fast Networking
Step 3: GC Compute Engine Upload Parquet
File To GC Cloud Storage
34
57. Be 1% better everyday
tips
Create your system
principles
Design system
architecture, data flow,
data model, data
structure first
Separate realtime and
batch flows
Separate data storage
strategies between data
types
Save the cost by
network cost, instances
cost, storage cost by
metric monitoring &
alert system 57
58. Thank You - Q&A
● Eway: https://eway.vn
● My Contact: tupp@eway.vn
58