SlideShare a Scribd company logo
1 of 40
© 2014 DataStreams Corp. All Rights Reserved. 
DataStreams Corp. 
"Always find the better value of your data" 
www.datastreams.co.kr
© 2014 DataStreams Corp. All Rights Reserved. 
1.Company Profile 
2.Data Integration as Data Infrastructure 
3.TeraStream™ Overview 
Performance & Cost Savings of TeraStream™ 
Features & Benefits 
Customers & Application 
Contents
© 2014 DataStreams Corp. All Rights Reserved. 
1.Company Profile
© 2014 DataStreams Corp. All Rights Reserved. 
Name 
Data Streams Corp. 
CEO 
Mr. Young-sang Lee 
Business 
Area 
Data Integration Solutions Development and Sales 
Data Quality Solutions Development and Sales 
Data Warehouse / BI / FDS / Forensic / Audit Consulting and Construction 
Big Data Analytic Consulting and Platform Construction 
Data Governance Platform Development/Consulting/Sales 
Data Migration Consulting and Construction 
Large Volume Data Batch Processing Improvement Consulting and Construction 
Data Standardization and Quality Management Consulting and System Construction 
Data Architecture Consulting 
Office 
Address 
HQ Chungho-nais B/D 6F, 28 Saimdang-ro, Seocho-gu, Seoul, Korea 
R&D U-Spacemall #2 B-601, 670 Daewangpangyo-ro, Bundang-gu, Seongnam, Korea 
China Office Room 1216, 12th Floor, Intersection of Hopson Kirin Society Building 2, 
Wang Jing Fu Tong West Street, Wangjing, Chaoyang District, Bejing 
Contact 
Tel) +82-2-3473-9077 / Fax) +82-2-3473-9084 
Investor 
JAFCO ASIA 
Capital 
USD 2M 
Sales Amount 
USD 19M (2013) 
Established 
Sep 19, 2001 
Employees 
121 
Company Profile 
3
© 2014 DataStreams Corp. All Rights Reserved. 
2012 
•Established R&D Center in Pangyo Techno Valley 
•Released Social Cube for SNS Data Analytics 
•Participated in Original Technology Development Project for Next Generation Memory Based Big Data Analytics and Management 
2009 
•TeraStream™, Selected as Standard Data Integration Tool by The Korea Federation of Banks 
•Selected as Contractor for Building Resource Management Data Standardization and Meta-data Management System by Ministry of National Defense 
•Released DeltaStream™, QualityStream™, and ImpactStream™ 
2007 
•Awarded for Excellent Venture Company by Deputy Prime Minister 
•MetaStream™, Awarded for Digital Business Innovation by SMBA 
•Released TeraStream™ Version 2.0 
•JAFCO, Japan invested 4 million USD 
2005 
•Mr. Young-sang Lee, CEO, Was Awarded a Grand Prize for Korea Digital Competitiveness 
•TeraStream™ Won New Technology Certification from Ministry of Knowledge Economy 
•Released MetaStream™ Version 1.0 
•Acquired KDB Solution Co., Ltd., Korea’s First Meta-data Management Solution Company 
•TeraStream™ Version 1.4, Acquired GS(Good Software) Certification 
2003 
•KEB selected TeraStream™ as Standard Batch/ETL Solution for Next Generation Banking System 
•First Worldwide Sales Contract of FACT™ 
•Presented FACT™ to Oracle Open World 2003 in San Francisco, USA 
•Released MetaStream™ Version 2.7 
•CEO, Mr. Young-sang Lee, Was Elected as a Chairman of KOSEA(Korea Software Enterprise Association) 
•Released TeraStream™ Version 3.2 
2010 
•TeraStream™ Version 2.2, Acquired GS Certification 
•Changed Company Name to DataStreams Corp. 
•Contracted with Intellectual Property Office for Enterprise Data Quality Management 
2008 
•Awarded for Top Private Company for Population and Housing Census by Deputy Prime Minister 
•TeraStream™, Selected as Standard ETL Tool by Ministry of Government Administration and Home Affairs 
2006 
•Selected as Technically Innovated Company of 2004 by SMBA 
•Selected as Technically Innovated Company of 2004 by Small & Medium Business Administration(SMBA) 
•TeraStream™, Selected for a Next Generation Banking Data Migration Tool by Shinhan Bank 
2004 
•Registered TeraStream™ as a Trademark 
•Released TeraStream™ Designer Version 1.1 
•First TeraStream™ V.1.1 Contract with National Statistics Office 
2002 
2011 
•Awarded Prime Minister Citation for SW Achievement 
•Selected as an ATC(Advanced Technology Center) by Ministry of Knowledge Economy 
•Established China Office in Beijing 
2013 
•TeraStream™ for Hadoop, Selected as Base Solution for Building Government-Wide Big Data Infrastructure 
•Acquired Patent for Readable Data Encryption and Decryption 
•CEO, Mr. Young-sang Lee, Was Awarded Digital Management Innovation Prize 
•Joined Int’l SOFT China in Beijing 
• MetaStream™ Version 3.0, Acquired GS Certification 
• Launched DQ Appliance 
2001 
•Released TeraStream™ Beta Version 
•Innovative Data Solutions Corp., Was Established 
Company History 
4 
2014 
•TeraStream™, Was Awarded 2014 Korea Software Award by Ministrer of Science, ICT and Future 
•DataStreams Is Listed on KONEX(Korea New Exchange)
© 2014 DataStreams Corp. All Rights Reserved. 
Organizational Structure 
Present Condition of Engineering Employees 
96 
Consultants 
Data Governance 
36 
Data Integration & Migration 
Big Data Management 
DW & BI 
SNS Analytics 
Engineers 
Meta Data & Data Quality Management Solution 
60 
Data Integration & Migration Solution 
Big Data Management Platform 
Number 
Of 
Employees 
Total 
Consultants/Developers 
Management/Sales 
121 
96 
25 
Engineering Lev. 
Total 
Consultants 
Engineers 
Total 
96 
36 
60 
Professional 
27 
18 
9 
Qualified 
26 
11 
15 
Intermediate 
19 
3 
16 
Beginning 
24 
4 
20 
•Government Offices 
•Banking 
•Manufacturing Business 
•Logistics/Services 
•Planning Products 
•Presales & Consulting 
•DW/BI 
•Big Data 
•SNS 
•QA(Quality Assurance) 
•Marketing 
•Overseas Sales 
•Overseas Corp. 
•HR/General Affairs 
•Financial Admin. 
•Knowledge Mgmt. 
•Sales Support 
•PI 
•DI Technical Support 
•DQ Technical Support 
•DI 
•RTI 
•DQ 
•UI 
CEO 
Auditor 
Counselor 
Sales Div. 
PPC Div. 
Business Consulting Div. 
Global Business Div. 
Management Support Div. 
Technical Service Div. 
R&D Center 
5
© 2014 DataStreams Corp. All Rights Reserved. 
Business Area 
Data Governance 
• Data Governance Architecture 
• Data Quality Management 
• Meta Data Management 
• Master Data Management 
• Data Quality Appliance 
Data Integration 
• High Performance ETL ∙ Batch 
• Data Integration 
• Deferred(Near Real Time) 
• CDC, Real Time Data Transition 
• High Speed Data Extraction 
• High Speed Data Sort 
• Data Integration with Hadoop 
• Data Integration with Grid 
• Test Data Management 
Big Data 
• Big Data Platform with Hadoop 
• Big Data Anaysis & Visualization 
• Structured & Unstructured Data Analysis 
• SNS Data Analysis ∙ Monitoring 
Consulting 
• ISP & Big Data Consulting 
• Fraud Detection System(FDS) Consulting 
• DW ∙ CRM ∙ BI Consulting 
• Data Integration & Migration Consulting 
• Data Standardization ∙ Quality Management ∙ Architecture Consulting 
• Master Data Management Consulting 
• Data Lineage Management Consulting 
DW/BI 
• Building DW ∙ CRM ∙ BI 
• QPI Methodology 
• Fraud Detection System(FDS) 
• Information System Planning(ISP) 
• Alternative Trading System Consulting 
• Transaction Cost Data Analysis Framework 
• Transaction Cost Data Analysis Framework & Consulting(TCA) 
• Financial Analysis Services 
DataStreams Is a Company Which Has Expertise in Data Processing and Analysis to Provide Total Data Management Services in Data Integration and Quality Management. 
Data Lineage Management 
• Data Lineage Analysis Platform 
• Visualization for Data Lineage 
• Relative Tool, Program & Script Language Analysis 
• Table Column Search & Monitoring 
6
© 2014 DataStreams Corp. All Rights Reserved. 
Market Recognition & Share 
7 
60% 
25% 
10% 
5% 
80% 
15% 
5% 
Korean Market Share 
for ETL Solution 
Korean Market Share 
for Data Migration 
(Banking Industry) 
DataStreams Corp. 
IBM 
Informatica 
Others 
* The market share for ETL solutions is self-researched in 2013. 
55% 
30% 
15% 
Korean Market Share 
for Metadata Management 
W Company (Korean) 
G Company (Korean) 
ETL 
Data 
Migration 
Metadata 
Management 
No. 1 Total Data Management Technologies in Korea 
Vendor Report of Magic Quadrant for Data Integration and Data Quality Tools 
(2013) 
Reference URL : http://www.citia.co.uk/ 
Mentioned DataStreams’ capabilities of offering wide range of data integration 
products through ETL, CDC and near-real time technologies.
© 2014 DataStreams Corp. All Rights Reserved. 
Private 
Banking / 
Finance Companies 
Public Finance 
Companies 
Government & 
Public / 
Educational 
Institutions 
Enterprises 
Major Domestic Customers 
8
© 2014 DataStreams Corp. All Rights Reserved. 
DataStreams Is Exporting & Expanding… 
Columbia 
Banco Colpatria 
Bogota City Government 
Credibanco 
China 
Kookmin Bank 
Hana Bank 
USA 
Merklenet, Inc. 
CSC Consulting 
Bisys 
Comcast 
Merkle Data Tech 
USA 
Airweb 
Sungard 
American Airlines 
Highmark, Inc. 
Mexico 
Sodexhopass 
Procesar 
Peru 
Banco Ripley 
Chile 
Banco Estado de Chile 
Australia 
National Wealth Management 
(MLC/NAB) 
Spain 
Procecard 
Tecnocom 
Telefonica Soluciones 
ITnow! 
Germany 
Accenture GmbH 
India 
Reliance Industry 
Indonesia 
Excelcom 
Aviva 
Telkomsel 
Hana Bank 
Global Customers 
9 
Japan 
with Reliable Business Partners 
U.S.A. 
BellaDati(US) 
Vietnam 
HIPT 
FPT IS 
Lac Viet 
QTSC 
EU 
BellaDati(CZE) 
Gibkie(RUS) 
IMBI(Europe, N. Africa) 
China 
China Mobile 
Fuchen Telecom 
& Banking / Insurance Companies
© 2014 DataStreams Corp. All Rights Reserved. 
Business Challenges 
Data Integration Architecture 
2. Data Integration as Data Infrastructure
© 2014 DataStreams Corp. All Rights Reserved. 
Data Integration Landscape: Business Challenges 
 Inaccurate data leads to bad or no decisions 
 More than 30% of IT budgets typically spent on Data integration 
 Inconsistent enterprise and application architecture for integration 
Disparate data 
Inaccurate data 
Incomplete data 
Untimely data 
Fragmented Integration Approach 
Factors 
 Bad decisions 
 Lost revenue 
 Lost productivity 
 Lost market opportunity 
 Bad Citizen relationships 
Results 
Multiple versions of the “Truth” 
Wasted time and resources aggregating information 
Difficult to use Data 
Delayed Decision making 
Uninformed management 
Impacts 
11
© 2014 DataStreams Corp. All Rights Reserved. 
Data Oriented Business Intelligence Architecture 
Administration of data integration and quality should be based on solid data 
infrastructure which requires data transformation and quality management. 
DM 
Legacy 
Legacy Channel 
ERP CRM 
DBMS 
EDW 
ODS 
SAM file 
DW 
ETL/DQ/RTI 
ETL/DQ 
ETL/DQ 
Data Architecture Application Architecture 
OLAP 
… 
E 
D 
W 
P 
O 
R 
T 
A 
L 
Casual 
User 
Power 
User 
Data 
Architect 
IT 
Developer 
Metadata 
manage-ment 
system 
Aggregation 
Cu tsatbolme er 
Contract 
Revenue 
… 
… 
Fundamental 
Rule set 
Column 
Dependency 
Description of 
column 
… 
Involution 
Rule Set vs Column 
Talbe vs Column 
F(FundamentFa(lF) undamental) 
A(Association) 
I(Involution) 
D(Dependency) D(Dependency) 
Central Metadata Repository 
Relational 
Integrity 
User Defined 
Integrity 
DQI 
aggreg 
ation 
Rule 
Set 
Inspec 
-tion 
Inspec 
tion 
Result 
Inspecti 
on 
report 
Domain 
Integrity 
Metadata Oriented Service Architecture 
QualityStreamTM MetaStreamTM ImpactStreamTM 
------ 
------ 
------ 
------ 
------ 
------ 
------ 
------ 
------ 
Program source 
Pl/SQL 
Stored Procedure 
... 
Token 
Special String 
SQL script 
Analysis 
PGM source 
Impact Analysis 
Engine 
TeraStreamTM / DeltaStreamTM/TeraNRT/ QualityStreamTM 
12
© 2014 DataStreams Corp. All Rights Reserved. 
Performance & Cost Savings of TeraStream™ 
Features & Benefits 
Customers & Application 
3. TeraStream™ Overview
© 2014 DataStreams Corp. All Rights Reserved. 
TeraStream™ Overview (1/2) 
Variety of data types and formats transport from source to target as needed. 
Covers enterprise-wise data flow from operational to subject data mart. 
Also applied to high volume batch processing and near real-time data integration. 
Data extraction from a various commercial DBMS in high speed 
High performance sort engine resolves time bottleneck due to Sort//Join/Aggregation 
Automatic generation of scripts can be used for loading to various DBMSs 
Transform / Cleansing 
Load 
Files 
New Systems 
Files 
Conversion 
Sort/Join/Aggregation 
Reformat 
Databases 
Databases 
Extraction 
14
© 2014 DataStreams Corp. All Rights Reserved. 
TeraStream overview (2/2) – product configuration 
TeraStream™ includes a sort engine and a high volume data extraction engine(FACT™), and meta data is stored and managed in DBMS. 
• easy to use GUI for developers. 
 User Interface 
• High performance (FACT/CoSORT) 
• External command(shell/SortCL) 
• Query processing 
• Data conversion (Korean/Japanese) 
• Function processing 
 Data Processing 
 Metadata Management 
 Operations & Administration 
User Interface 
Operations & Administration 
Data Processing Engine 
TeraStream Designer 
Metadata Management Engine 
TeraStream DB 
(Repository) 
Log 
Manager 
Project 
Scheduler 
FFD 
Manager 
Process 
Manager 
Data 
Access 
Manager 
Message 
Broker 
FACTTM 
CoSORTTM 
Converter 
USQL 
External 
command 
User SCL 
• Job and system log management 
• Job scheduling 
• File Format Description for metadata 
• Real-time job monitoring 
• Authentication Management 
•Data format, job & system information in TSDB(Repository) 
Monitor 
15
© 2014 DataStreams Corp. All Rights Reserved. 
TeraStreamTM for Data Integration 
 TeraStream™ is a high-performance ETL solution with convenient GUI 
which is proved for its reliability in variety of enterprises for a decade. 
Experiences Cost 
Plenty of customers 
Various industry 
Decades of 
experiences 
TeraStream™ 
 Easy to use 
 Easy to operate 
 Easy to maintain 
 Sort engine 
(CoSORT™) 
High-speed 
extraction (FACT™) 
Reuse of data (EBH) 
 Low resource use 
 Low development 
cost 
 quick development 
Performance Conveniences 
16
© 2014 DataStreams Corp. All Rights Reserved. 
FILE → DB 
DB → DB 
FILE → FILE 
FILE → DB 
Excellent Performance and Resource Usage 
TeraStreamTM out-performed 3-times in speed against its competitor with 30% of CPU resource.(Data Migration in Shinhan Bank, Korea) 
D product 
TeraStream™ 
 Elapse time : 20 minutes 
 Wasted System Resource : 800 
( 40% Avg. CPU usage X 20 mins ) 
 Elapse time : 59 minutes 
 Wasted System Resource : 3000 
(50% Avg. CPU usage X 60 mins) 
 thread MAX for sort =3 
 File manipulation : 35% CPU usage 
Load : 80% of peak CPU usage 
 Parallel = 4 
 File manipulation : 58% of CPU usage. 
 Load: 58% of peak CPU usage 
Conclusion 
Conclusion 
17
© 2014 DataStreams Corp. All Rights Reserved. 
Excellent Performance in NRT Implementation 
Transportation of up to 1 million records per minute by reading flat files through 
EAI and splitting them per tables eliminating the duplicated business days to 
Sybase IQ. 
3 X 
0 
10 
20 
30 
40 
50 
60 
70 
100 1,000 5,000 10,000 20,000 
D product 
minutes 
Thousand records 
[Shinhan bank DW Benchmark in August, 2006)] 
18
© 2014 DataStreams Corp. All Rights Reserved. 
Time Table for NRT Implementation 
Unit 
(records in thousand) 
TeraStream™ 
D product 
mapping/processing/loading 
mapping/processing/loading 
start 
end 
time 
start 
end 
time 
100 
18:02:39 
18:02:55 
0:16 
15:08:16 
15:10:33 
00:53 
1000 
18:05:25 
18:06:23 
0:58 
15:11:13 
15:20:34 
03:32 
5000 
18:07:20 
18:12:02 
4:42 
15:25:14 
15:43:44 
15:28 
10,000 
18:13:54 
18:24:20 
10:26 
15:47:57 
16:23:45 
31:09 
20,000 
18:29:10 
18:49:55 
20:45 
16:31:40 
17:36:10 
58:41 
10,000 
(concurrent execution) 
11:35:48 
11:50:35 
14:47 
11:35:48 
12:17:10 
41:22 
19
© 2014 DataStreams Corp. All Rights Reserved. 
Excellent Performance in Batch Jobs 
TeraStream™’s excellent performance can be applied to not only ETL but also daily batch jobs. 
[Batch Job of POST Insurance Service Company, 2007] 
High Performance 
Effective use of resources 
Convenience 
No. of Records 
Oracle (SQL) 
TeraStream 
400,000 
1m 32s 
28s 
1,000,000 
5m 01s 
41s 
2,500,000 
12m 21s 
59s 
No. of Recs 
Oracle 
Time 
250,000~500,000 
Tth 
20
© 2014 DataStreams Corp. All Rights Reserved. 
ETL Performance Improvement 
 Using EBH, TeraStreamTM can cut down data path from Legacy to MART saving ETL time and resource usage. 
 Massive volume of files extracted from Legacy Systems are stored in EBH for further reuse in next step. 
 ETL time is reduced by avg. 56%. (In L-Telecom from D-3 to D-1) 
EDW Server 
IBM p690 
NCR 10Node 
Teradata 
D-1 
Oracle 8i 
ETL Server 
ODS 
Customer/Call/ 
Billing 
Connection 
PPS/BSS 
Mining Input Variable 
MOLAP Analysis 
Mining Analysis 
Campaign Analysis 
Sybase IQ/ASE 
OLAP 
MART Server 
CSM/AR 
Billing 
Oracle 8.0.6 
CCS/MPS/ERP 
CTI /PPS/NMS 
SRDF 
Legacy 
ETL 
EBH 
Infomatica 
EBH (ETL and Batch Hub) stores temporary and result files which is shared for further table generation in EDW and MART. 
21
© 2014 DataStreams Corp. All Rights Reserved. 
Job 
Task 
Cycle 
System 
Before 
After 
Improvementrate 
Billing 
Sales 
Month 
EDW 
12:50 
5:00 
61% 
OLAP Mart 
18:35 
8:20 
55% 
Calls 
Charges 
day 
EDW 
5:50 
3:00 
49% 
OLAP Mart 
8:00 
4:00 
50% 
ACCUM 
week 
EDW 
4:20 
1:55 
56% 
OLAP Mart 
7:20 
3:00 
60% 
receiving CDR (NMS) 
day 
EDW 
1:00 
0:30 
50% 
OLAP Mart 
2:20 
0:55 
61% 
Sending CDR (NMS) 
Day 
EDW 
1:40 
1:05 
35% 
ERP batch 
Month 
EDW 
11:20 
3:15 
71% 
receiving CDR (NMS) 
Month 
EDW 
5:00 
2:15 
55% 
OLAP mart 
11:40 
2:20 
80% 
sending CDR (NMS) 
Month 
EDW 
8:20 
4:50 
42% 
ERP provided BATCH 
Month 
EDW 
16:20 
5:15 
68% 
Customer Service 
After service 
month 
EDW 
5:30 
5:05 
9% 
Improvement Details 
22
© 2014 DataStreams Corp. All Rights Reserved. 
Cost Saving Factors 
 TeraStream has many cost saving factors. 
Ease of Maintenance 
Operational Efficiency 
Quick Development 
 Creating standard operation and new development procedures 
 Efficient operation of test system 
 Ability to fast make test file 
Ease of maintenance 
Quality improvement of developing programs - Reliability - Maintainability - Efficiency - Functionality 
 Productivity improvement of developers and administrators 
 Developing correct and efficient logic programs 
 Quality improvement 
23
© 2014 DataStreams Corp. All Rights Reserved. 
Cost Saving : Development Stage 
The higher complexity, the bigger cost saving in development . 
(Courtesy of Hanhwa Insurance Co. and SKC&C 
in 2007) 
Job 
complexity 
No. of recs 
Input 
Size 
(Gb) 
TeraStream™ 
In- house 
coding 
Speed- up 
1:1 mapping 
90 
22 
30min 
2hour 
75% 
1:N mapping 
900 
21 
2hour 
6hour 
66% 
N:1 mapping 
1700 
15 
2hour 
10hour 
80% 
N:N mapping, complex 
1300 
8 
2hour 
20hour 
90% 
Avg. 70% of development speed-up 
90% speed-up for more complex jobs 
Overhead from modification, test and preliminary data checking. 
Development 
(4Month) 
Test 
(4Month) 
Stabilization 
(1Month) 
24M/M 
48M/M 
54M/M 
TeraStream™ 
In-house coding 
(Estimated) 
40M/M 
80M/M 
90M/M 
40% 
Reduction 
24
© 2014 DataStreams Corp. All Rights Reserved. 
Features & Benefits 
TeraStream™ guarantees to meet your need for enterprise data integration as well as excellent batch job hub. 
Sort Engine 
Using CoSORT™, the first sort package since 1985, TeraStream™ can accelerate sort-related data manipulation (dedup, average, min, max, join, summary and etc.) 
FAst extraCT 
FACT™ performs high speed bulk extraction from various commercial DBMS. 
Automatic Metadata Generation 
TeraStream™ provides direct reading of DBMS data dictionary to create its own metadata information. 
High Speed Lookup 
It provides in-memory lookup function which is high speed mapping conversion using lookup tables. 
Variety of conversion function calls 
It provides more than 100 user friendly mapping functions. 
Developers can easily add their own functions. 
Pre/Post Processing 
TeraStream™ provides inter-record and inter-table conversion through pre/post mapping. 
Major Features 
Description 
25
© 2014 DataStreams Corp. All Rights Reserved. 
Features & Benefits 
TeraStream™ has been evolved to meet various parallel processing needs and to give convenience through highly efficient GUIs. 
Inter-node Operation 
Remote call is possible to initiate the projects of other nodes between TeraStream™s. 
Distributed Computing using idle nodes is possible by easy transfer of data. 
Near Real-Time ETL 
Data transportation every minute is possible including complex data mapping 
Efficient GUI 
Using GUI, no skills on programming language are necessary. 
Unified monitor and control in single screen or specialied monitoring is possible through web browser. 
Scheduling of jobs is made in unified GUI but even for distributed servers. 
Multi Language Support 
UTF-8 is supported. 
Major Features 
Description 
26
© 2014 DataStreams Corp. All Rights Reserved. 
Improved GUI 
Supports for data integration activities(develop, execute, monitor, validation) in integrated GUI environment 
Intuitive task flow 
Project monitor 
Editor window 
GUI for developers 
Intuitive task flow 
checking standard output/error/file information/ 
number of files processed 
Execution log 
real time job monitoring 
 Project Monitor 
scheduling by time/ period/ business calendar 
 Scheduler 
Mapping creation 
 Editor window 
Scheduler 
Task block execution log 
Metadata property 
Impact analysis 
Change history manager 
 Metadata Repository 
27
© 2014 DataStreams Corp. All Rights Reserved. 
DBMS Connectivity 
Powerful connection between different DBMS types. 
Both DB-to-DB and File-to-DB data transportation are supported. 
•N:N mapping 
•Conversion while transportation 
•Click to choose record processing types : (Insert/delete/update/insert- update/delete-insert) 
•DBMS types : Oracle, DB2, Sybase, Informix, Teradata, Greenplum, MSSQL, MySQL, (Altibase, Tibero) 
Transformation Logic 
Source Table 
Target Table 
28
© 2014 DataStreams Corp. All Rights Reserved. 
High speed data extraction with FACT™ 
High speed data extraction of commercial database with SQL is supported. 
Automatic extraction query is generated. 
Select * from table 
•High speed extraction engine(FACT™) with optimized database API. 
•DBMS Supported : 
- Oracle - Informix - DB2 / UDB - Sybase IQ /ASE - Teradata - Greenplum - MSSQL /MySQL - Altibase 
•File split and filtering while extraction 
•Time, time stamp, and user data format specification 
29
© 2014 DataStreams Corp. All Rights Reserved. 
Data Conversion 
By mapping source to target, conversion of formats, types, character sets, dates, bytes/bits, encryption 
•Convenient data conversion using mapping window of “converter task block” 
•Data character set conversion including EBCDIC to ASCII 
•Data conversion from NDB(Unisys 9-bit) or HDB(IBM) data type to RDB 
•300 built-in functions 
•DATE, Time Stamp Conversion between different date formats 
•CLOB/BLOB supported 
•Users can add more functions as needed 
List of provided functions 
CALLED_NO function editor 
=addday(cdate(“",”",” (N)") 
addday(cdate("2005/05/12 12:08:24", "YYYY/HH/DD HH:MI:SS"),2) 
Converter task block 
30
© 2014 DataStreams Corp. All Rights Reserved. 
Data Transport 
TeraStream uses various transportation method according to file structure, transportation distance, security, amount of record and etc. 
•File to DB data load for bulk data 
•“Load task block” generates load scripts automatically. 
•Remote transportation using FTP 
•Encryption while transporting 
•Near Real-time and Bulk transportation is possible 
Load Scripts 
31
© 2014 DataStreams Corp. All Rights Reserved. 
Korea Exchange Bank 
Expected Result 
 TeraStream™ is in charge of data flow from core-banking to data mart. 
 Batch job is also executed during ETL process in the same manner. 
Issues 
Plans 
System Configuration 
Extract 
24 hours 
DB SPLIT 
Profit Management 
RISK 
KPI 
EUC 
current ODW 
ORACLE 9i 
EDW 
Sybase ASIQ 12.5 
New ODW 
ORACLE 9i 
Bidirectional ETL 
ETL 
ETL 
ETL 
Batch 
Batch 
Batch 
FTP transfer after extraction 
Accounting System 
Information System DW 
DM & Sub System 
Open in Feb. 2005 
Batch 
• A next-generation conversion of M/F and IMS HDB 
• Interactive ETL process 
• High volume data handling within batch process time 
• Converting main frame data into Unix data (1TB → 1.5TB) including Korean character conversion 
• ETL task from accounting system server to new ODW server(extracting appx. 200 GB of daily changed data within 1 hour and 30 minutes by using FACT™ of TeraStream™) 
• Data hub is used in ETL process from new core banking system to subjective marts. (EUC, KPI, RISK, IFRS, Basel2) 
• ETL and Batch process in unified way. 
• Batch process is designed to be done in both ODW and EDW. 
 Unified Data Processing in ETL & Batch 
32
© 2014 DataStreams Corp. All Rights Reserved. 
System configuration 
Issues 
Plans 
Kookmin Bank 
IBM M/F 
HDB, DB2 
Server RDB 
Sybase ASIQ 12.7 
IMS HDB 
- Seg. split 
- conversion & Array split 
- logic applied 
- conversion 
- logic applied 
- Logic applied 
EDW 
A-SOR 
DM 
ETL 
ETL 
ETL 
Informover 
TS(FACT) 
Informover 
Source system 
File process flow 
DB QUERY 
Expected Result 
 Various DBMS(IMS HDB, HOST DB2, Oracle, DB2 UDB) integration by using TeraStream™ 
 Meeting batch target time of 2 hours and 30 minutes for 4TB of EBCDIC data. 
• M/F and IMS HDB conversion 
• Processing changed data in absence of time-series column 
• Processing large size data within batch process time(10TB/day based on source data) 
• How to process high volume files in parallel 
• Converting main frame data into data in Unix environment (10TB → 25TB) within 18 hours. 
• Various data conversion and processing including Korean character conversion 
• ETL task from accounting system server to new ODW server(extracting appx. 200 GB of daily changed data within 1 hour and 30 minutes by using FACT module of TeraStream™) 
• ETL and Batch process in unified way. 
• Batch job in core banking system within 6 hours. 
 EDW and integrated DM installation 
33
© 2014 DataStreams Corp. All Rights Reserved. 
Samsung Electronics 
• Rea-time data transportation between Germany and 
China. 
• Bi-directional synchronization between TeraStreams 
of Germany and China. 
• 20 min. MAX loading time for transported data is 
implemented using TeraStream NRT. 
•Web Monitoring is developed 
• Registration in one country should have the same 
service at other country. 
• duplicated record should be avoided due to cross 
transportation. 
• 20 minutes Near Real-time 
• Perfect Recovery scheme should be presented 
Plans 
Issues System Configuration 
Smart Phone System 
in Germany 
DBs in 
Service 
 Efficiency is maintained despite cross transportation 
Bi-directional NRT integration allows the same service regardless of system type 
and country (Time from extraction to loading is 20 minutes.) 
 Bi-directional remote data transportation using TeraStream 
NRT Extract 
프로그램 성공, 실패 등 실행 결과 
Web Monitoring 
Sam To DB 
UPSERT 
NRT Extract 
SAM To DB 
UPSERT 
 Global Database Integration using NRT ETL 
DBs in 
Service 
Smart Phone System 
in China 
Expected 
Result 
34
© 2014 DataStreams Corp. All Rights Reserved. 
Department of Domestic Administration 
 Building an E-government and improving the accessibility of government Information 
System Configuration 
•The required data is interactively transferred between DW and DM by EAI and configured with ETL. 
•Target : the central office and 16 cities and provinces 
•Work status by provinces 
1) Administration -> Seoul City 
2) Environment : 2 cities and 2 provinces 
3) Health Service : 1 city 
4) Economy and trade : 2 cities and 7 provinces 
•50 GB of data processing per day 
•How to save the processed data : transfer data to a file system and use a backup solution 
•The central office data is provided in file through EAI. 
• Oracle data from central office, other cities and provinces is directly extracted by FACT and stored in DW/DM(Sybase). 
• To complete substantial E-Government service 
• To provide integrated high-quality information 
• To build a fast and accurate decision support system 
Issues 
Plan 
Integrated Data Mart 
Data Warehouses in 16 cities and provinces 
ODS 
DW 
ODS 
DW 
DM 
DM 
Central 
governmental 
system 
Other 
provincial s 
ystems 
ETL 
EAI 
 Provincial Administrative DW 
Expected Result 
35
© 2014 DataStreams Corp. All Rights Reserved. 
System Configuration 
 Batch processing time for daily closing and unified collection reporting is reduced. 
 ETL flows unified in TeraStream. 
 ETL Batch Hub reduces data flow path. 
• Complex extraction process & difficult job tracking 
• Absence of ODS makes data referencing hard. 
• Long DW time cause delayed service start in the morning. 
•Heavy loads in closing process 
Issues 
• Unified processing of ODS, DW & DM 
• Simple extraction scheme and easy tracking of errors. 
•ETL time and load reduction using EBH 
Plans 
SQL 
script 
stage 
DW 
DM 
Procedure 
Source 
ODS 
DM 
DW 
ETL HUB 
D-1 
D-1 
Source 
Before Change 
After Change 
 Data Warehouse ETL 
Expected Result 
Korea Worker’s Compensation & Welfare Service 
36
© 2014 DataStreams Corp. All Rights Reserved. 
Health and Welfare Department’s e-Voucher 
E-Voucher DW Performance Improvement 
Statistics reporting time is dramatically reduced from 1~6 days to a few second or minutes. 
 Statistics reporting process made simple and easy to get report. 
 Consistent data delievry increase data reliability. 
• daily transportation to ODS 
• build ODS, DW and DM for better table model 
• e-Voucher System (DB2 -> DW Server) 
• Platform - OS : AIX 5.3(ASIS,TOBE ) - CPU : Power5, 2.1GHz, 6core , IBM P-serise 
- MEM : 12 GB 
- H/W : 1TB 
• Simple logic made MA easy 
•Low data integrity 
•Lack of expeditious response 
•Fraud detection was hard. 
•Low reliability of statistic data caused dispute between data users and generators 
Plans 
Issues 
System Configuration 
E-Voucher Statistical DW 
Operational 
Source DB 
(oracle) 
- ODS data conversion 
- update/insert at ODS 
- 1:1 mapping 
- Daily batch 
- Load to ODS 
IBM P-serise 
Voucher Servide 
Mis-settlement 
Pregnancy & Birth 
History 
Target DB 
(oracle) 
FACT 
ODS 
DM 
DW 
ETL 
- ODS/ DW data manuplation 
- update/insert to data mart 
ETL 
ETL 
Expected Result 
37
© 2014 DataStreams Corp. All Rights Reserved. 
Korea National Open University (Synthetic Statistic DW ) 
Expected Result 
 Building a complete view and improving the accessibility of Statistical DW . 
 Reducing the institutional service time from 15 days to 4 hours. 
 Expeditious response of university’s administration statistics report become possible. 
System Configuration 
•Extract an operational data for the institutional service and university administration service, transfer the data into ODS, DW and DM, and then load Oracle DBMS by ETL for producing typical and atypical statistic reports 
•Institutional Statistic Service Data (Initial/Alteration) : 
20GB/1GB, Alteration data load time: 4 hours 
•University Administration Statistic Service Data (Initial/Alteration) : 5GB/100MB, Alteration data load time: 1 hour 
• Reducing time and automation for the existing manual based university statistic services 
• To provide integrated high-quality academic, administration, and management information 
• To build a fast and accurate academic administration decision support system 
Issues 
Plan 
Univ, Admin. 
Graduate 
Institutional Services 
Electronic 
settlement 
Tutor 
Administ 
-ration 
Lifelong 
Education 
Institution 
Graduation 
Grade 
Registration 
Admission 
Synthetic Statistic DW Server 
TeraStream 
ODS 
DW 
DM 
Data Extract/Load 
Data 
Extract/Load 
Data Extract/Transfer/Load 
ETL Control 
Data Extract/Load 
Data Extract/Load 
Unit Process 
Processing Time 
Institutional Statistics Services 
Within 4 hours 
University Statistic Services 
Within 1 hour 
Huge time reduction from of institutional statistic sservice reports 14 days to 4 hours 
38
TeraStream - Data Integration/Migration/ETL/Batch Tool

More Related Content

What's hot

Demo Showcase: Graphs for Cybersecurity in Action
Demo Showcase: Graphs for Cybersecurity in ActionDemo Showcase: Graphs for Cybersecurity in Action
Demo Showcase: Graphs for Cybersecurity in ActionNeo4j
 
Neo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j
 
Data Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata HarmonisationData Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata HarmonisationAlan McSweeney
 
GPT and Graph Data Science to power your Knowledge Graph
GPT and Graph Data Science to power your Knowledge GraphGPT and Graph Data Science to power your Knowledge Graph
GPT and Graph Data Science to power your Knowledge GraphNeo4j
 
Gartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner
 
Building Effective Data Governance
Building Effective Data GovernanceBuilding Effective Data Governance
Building Effective Data GovernanceJeff Block
 
Demystifying Graph Neural Networks
Demystifying Graph Neural NetworksDemystifying Graph Neural Networks
Demystifying Graph Neural NetworksNeo4j
 
Airbyte - Series-B deck
Airbyte - Series-B deckAirbyte - Series-B deck
Airbyte - Series-B deckAirbyte
 
Navigating through Microsoft Dynamics 365 landscape | Dynamics 365 for Custom...
Navigating through Microsoft Dynamics 365 landscape | Dynamics 365 for Custom...Navigating through Microsoft Dynamics 365 landscape | Dynamics 365 for Custom...
Navigating through Microsoft Dynamics 365 landscape | Dynamics 365 for Custom...Firoz Muhammed
 
The Graph Database Universe: Neo4j Overview
The Graph Database Universe: Neo4j OverviewThe Graph Database Universe: Neo4j Overview
The Graph Database Universe: Neo4j OverviewNeo4j
 
Building the Rail Network Digital Twin at CSX
Building the Rail Network Digital Twin at CSXBuilding the Rail Network Digital Twin at CSX
Building the Rail Network Digital Twin at CSXNeo4j
 
Modernize & Automate Analytics Data Pipelines
Modernize & Automate Analytics Data PipelinesModernize & Automate Analytics Data Pipelines
Modernize & Automate Analytics Data PipelinesCarole Gunst
 
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...Neo4j
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceDenodo
 
Lab1-DB-Cassandra
Lab1-DB-CassandraLab1-DB-Cassandra
Lab1-DB-CassandraLilia Sfaxi
 
Intro to Graphs and Neo4j
Intro to Graphs and Neo4jIntro to Graphs and Neo4j
Intro to Graphs and Neo4jjexp
 
Pi Day 2022 - from IoT to MySQL HeatWave Database Service
Pi Day 2022 -  from IoT to MySQL HeatWave Database ServicePi Day 2022 -  from IoT to MySQL HeatWave Database Service
Pi Day 2022 - from IoT to MySQL HeatWave Database ServiceFrederic Descamps
 
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...Neo4j
 
Best Practices in Metadata Management
Best Practices in Metadata ManagementBest Practices in Metadata Management
Best Practices in Metadata ManagementDATAVERSITY
 

What's hot (20)

Demo Showcase: Graphs for Cybersecurity in Action
Demo Showcase: Graphs for Cybersecurity in ActionDemo Showcase: Graphs for Cybersecurity in Action
Demo Showcase: Graphs for Cybersecurity in Action
 
Neo4j Graph Data Science - Webinar
Neo4j Graph Data Science - WebinarNeo4j Graph Data Science - Webinar
Neo4j Graph Data Science - Webinar
 
Data Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata HarmonisationData Profiling, Data Catalogs and Metadata Harmonisation
Data Profiling, Data Catalogs and Metadata Harmonisation
 
GPT and Graph Data Science to power your Knowledge Graph
GPT and Graph Data Science to power your Knowledge GraphGPT and Graph Data Science to power your Knowledge Graph
GPT and Graph Data Science to power your Knowledge Graph
 
Gartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner: Master Data Management Functionality
Gartner: Master Data Management Functionality
 
Building Effective Data Governance
Building Effective Data GovernanceBuilding Effective Data Governance
Building Effective Data Governance
 
Demystifying Graph Neural Networks
Demystifying Graph Neural NetworksDemystifying Graph Neural Networks
Demystifying Graph Neural Networks
 
Airbyte - Series-B deck
Airbyte - Series-B deckAirbyte - Series-B deck
Airbyte - Series-B deck
 
Navigating through Microsoft Dynamics 365 landscape | Dynamics 365 for Custom...
Navigating through Microsoft Dynamics 365 landscape | Dynamics 365 for Custom...Navigating through Microsoft Dynamics 365 landscape | Dynamics 365 for Custom...
Navigating through Microsoft Dynamics 365 landscape | Dynamics 365 for Custom...
 
The Graph Database Universe: Neo4j Overview
The Graph Database Universe: Neo4j OverviewThe Graph Database Universe: Neo4j Overview
The Graph Database Universe: Neo4j Overview
 
Building the Rail Network Digital Twin at CSX
Building the Rail Network Digital Twin at CSXBuilding the Rail Network Digital Twin at CSX
Building the Rail Network Digital Twin at CSX
 
Modernize & Automate Analytics Data Pipelines
Modernize & Automate Analytics Data PipelinesModernize & Automate Analytics Data Pipelines
Modernize & Automate Analytics Data Pipelines
 
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
The Art of the Possible with Graph - Sudhir Hasbe - GraphSummit London 14 Nov...
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and Governance
 
Lab1-DB-Cassandra
Lab1-DB-CassandraLab1-DB-Cassandra
Lab1-DB-Cassandra
 
Intro to Graphs and Neo4j
Intro to Graphs and Neo4jIntro to Graphs and Neo4j
Intro to Graphs and Neo4j
 
Pi Day 2022 - from IoT to MySQL HeatWave Database Service
Pi Day 2022 -  from IoT to MySQL HeatWave Database ServicePi Day 2022 -  from IoT to MySQL HeatWave Database Service
Pi Day 2022 - from IoT to MySQL HeatWave Database Service
 
What is Cloudonomics ?
What is  Cloudonomics ?What is  Cloudonomics ?
What is Cloudonomics ?
 
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
 
Best Practices in Metadata Management
Best Practices in Metadata ManagementBest Practices in Metadata Management
Best Practices in Metadata Management
 

Viewers also liked

TeraStream for ETL
TeraStream for ETLTeraStream for ETL
TeraStream for ETL치민 최
 
IBM MDM 10.1 What's New - Aomar Bariz
IBM MDM 10.1  What's New - Aomar BarizIBM MDM 10.1  What's New - Aomar Bariz
IBM MDM 10.1 What's New - Aomar BarizIBMInfoSphereUGFR
 
Tera stream for datastreams
Tera stream for datastreamsTera stream for datastreams
Tera stream for datastreams치민 최
 
Technology organizational chart
Technology organizational chartTechnology organizational chart
Technology organizational chartRhoncla82
 
DataStreams : Corporate Overview
DataStreams : Corporate OverviewDataStreams : Corporate Overview
DataStreams : Corporate OverviewDataStreams
 
13 11-26 snia, storage networking industry association - panorama mundial do ...
13 11-26 snia, storage networking industry association - panorama mundial do ...13 11-26 snia, storage networking industry association - panorama mundial do ...
13 11-26 snia, storage networking industry association - panorama mundial do ...Carvalho Comunicação
 
IBM InfoSphere MDM v11 Overview - Aomar BARIZ
IBM InfoSphere MDM v11 Overview - Aomar BARIZIBM InfoSphere MDM v11 Overview - Aomar BARIZ
IBM InfoSphere MDM v11 Overview - Aomar BARIZIBMInfoSphereUGFR
 
Automate data warehouse etl testing and migration testing the agile way
Automate data warehouse etl testing and migration testing the agile wayAutomate data warehouse etl testing and migration testing the agile way
Automate data warehouse etl testing and migration testing the agile wayTorana, Inc.
 
Présentation IBM InfoSphere MDM 11.3
Présentation IBM InfoSphere MDM 11.3Présentation IBM InfoSphere MDM 11.3
Présentation IBM InfoSphere MDM 11.3IBMInfoSphereUGFR
 
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...DataWorks Summit/Hadoop Summit
 
(주)데이터스트림즈 발표자료: 실시간 IoT 기반의 빅데이터 분석 서비스
(주)데이터스트림즈 발표자료: 실시간 IoT 기반의 빅데이터 분석 서비스(주)데이터스트림즈 발표자료: 실시간 IoT 기반의 빅데이터 분석 서비스
(주)데이터스트림즈 발표자료: 실시간 IoT 기반의 빅데이터 분석 서비스DataStreams
 
스사모 테크톡 - GraphX
스사모 테크톡 - GraphX스사모 테크톡 - GraphX
스사모 테크톡 - GraphXSangWoo Kim
 
Master data management (mdm) & plm in context of enterprise product management
Master data management (mdm) & plm in context of enterprise product managementMaster data management (mdm) & plm in context of enterprise product management
Master data management (mdm) & plm in context of enterprise product managementTata Consultancy Services
 
Gartner: Seven Building Blocks of Master Data Management
Gartner: Seven Building Blocks of Master Data ManagementGartner: Seven Building Blocks of Master Data Management
Gartner: Seven Building Blocks of Master Data ManagementGartner
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best PracticesBoris Otto
 

Viewers also liked (16)

TeraStream for ETL
TeraStream for ETLTeraStream for ETL
TeraStream for ETL
 
Tera stream ETL
Tera stream ETLTera stream ETL
Tera stream ETL
 
IBM MDM 10.1 What's New - Aomar Bariz
IBM MDM 10.1  What's New - Aomar BarizIBM MDM 10.1  What's New - Aomar Bariz
IBM MDM 10.1 What's New - Aomar Bariz
 
Tera stream for datastreams
Tera stream for datastreamsTera stream for datastreams
Tera stream for datastreams
 
Technology organizational chart
Technology organizational chartTechnology organizational chart
Technology organizational chart
 
DataStreams : Corporate Overview
DataStreams : Corporate OverviewDataStreams : Corporate Overview
DataStreams : Corporate Overview
 
13 11-26 snia, storage networking industry association - panorama mundial do ...
13 11-26 snia, storage networking industry association - panorama mundial do ...13 11-26 snia, storage networking industry association - panorama mundial do ...
13 11-26 snia, storage networking industry association - panorama mundial do ...
 
IBM InfoSphere MDM v11 Overview - Aomar BARIZ
IBM InfoSphere MDM v11 Overview - Aomar BARIZIBM InfoSphere MDM v11 Overview - Aomar BARIZ
IBM InfoSphere MDM v11 Overview - Aomar BARIZ
 
Automate data warehouse etl testing and migration testing the agile way
Automate data warehouse etl testing and migration testing the agile wayAutomate data warehouse etl testing and migration testing the agile way
Automate data warehouse etl testing and migration testing the agile way
 
Présentation IBM InfoSphere MDM 11.3
Présentation IBM InfoSphere MDM 11.3Présentation IBM InfoSphere MDM 11.3
Présentation IBM InfoSphere MDM 11.3
 
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
 
(주)데이터스트림즈 발표자료: 실시간 IoT 기반의 빅데이터 분석 서비스
(주)데이터스트림즈 발표자료: 실시간 IoT 기반의 빅데이터 분석 서비스(주)데이터스트림즈 발표자료: 실시간 IoT 기반의 빅데이터 분석 서비스
(주)데이터스트림즈 발표자료: 실시간 IoT 기반의 빅데이터 분석 서비스
 
스사모 테크톡 - GraphX
스사모 테크톡 - GraphX스사모 테크톡 - GraphX
스사모 테크톡 - GraphX
 
Master data management (mdm) & plm in context of enterprise product management
Master data management (mdm) & plm in context of enterprise product managementMaster data management (mdm) & plm in context of enterprise product management
Master data management (mdm) & plm in context of enterprise product management
 
Gartner: Seven Building Blocks of Master Data Management
Gartner: Seven Building Blocks of Master Data ManagementGartner: Seven Building Blocks of Master Data Management
Gartner: Seven Building Blocks of Master Data Management
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 

Similar to TeraStream - Data Integration/Migration/ETL/Batch Tool

Fisher Practice Areas 2012
Fisher Practice Areas 2012Fisher Practice Areas 2012
Fisher Practice Areas 2012fish1960
 
Align Business Data & Analytics for Digital Transformation
Align Business Data & Analytics for Digital TransformationAlign Business Data & Analytics for Digital Transformation
Align Business Data & Analytics for Digital TransformationPerficient, Inc.
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Denodo
 
Mious case study presentation (2)
Mious   case study presentation (2)Mious   case study presentation (2)
Mious case study presentation (2)Emtec Inc.
 
Humana Case Study: Paradigm Shift in Reporting by Deploying Four OBIA Module...
Humana Case Study:  Paradigm Shift in Reporting by Deploying Four OBIA Module...Humana Case Study:  Paradigm Shift in Reporting by Deploying Four OBIA Module...
Humana Case Study: Paradigm Shift in Reporting by Deploying Four OBIA Module...Emtec Inc.
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
 
Trends in Enterprise Advanced Analytics
Trends in Enterprise Advanced AnalyticsTrends in Enterprise Advanced Analytics
Trends in Enterprise Advanced AnalyticsDATAVERSITY
 
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?DATAVERSITY
 
E Team Data Management Offerings
E Team Data Management OfferingsE Team Data Management Offerings
E Team Data Management Offeringsaturner_eTeam
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 
Health IT Summit Austin 2013 - Closing Keynote "100 Years of Innovation at AC...
Health IT Summit Austin 2013 - Closing Keynote "100 Years of Innovation at AC...Health IT Summit Austin 2013 - Closing Keynote "100 Years of Innovation at AC...
Health IT Summit Austin 2013 - Closing Keynote "100 Years of Innovation at AC...Health IT Conference – iHT2
 
Presentation done in GENPACT
Presentation done in GENPACTPresentation done in GENPACT
Presentation done in GENPACTDr. Amit Kapoor
 
Incedo - Corporate Overview
Incedo - Corporate OverviewIncedo - Corporate Overview
Incedo - Corporate Overviewtushar chhetri
 
Estes Group Capabilities Overview
Estes Group Capabilities OverviewEstes Group Capabilities Overview
Estes Group Capabilities OverviewBrandon_Haave
 
Estes Group Capabilities Overview
Estes Group Capabilities OverviewEstes Group Capabilities Overview
Estes Group Capabilities OverviewMatt_Thompson
 
Big Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseBig Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseCaserta
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation Caserta
 
Modernizing Integration with Data Virtualization
Modernizing Integration with Data VirtualizationModernizing Integration with Data Virtualization
Modernizing Integration with Data VirtualizationDenodo
 
Estes Group Capabilities Overview
Estes Group Capabilities OverviewEstes Group Capabilities Overview
Estes Group Capabilities OverviewJohn_Koski
 

Similar to TeraStream - Data Integration/Migration/ETL/Batch Tool (20)

Fisher Practice Areas 2012
Fisher Practice Areas 2012Fisher Practice Areas 2012
Fisher Practice Areas 2012
 
Align Business Data & Analytics for Digital Transformation
Align Business Data & Analytics for Digital TransformationAlign Business Data & Analytics for Digital Transformation
Align Business Data & Analytics for Digital Transformation
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
 
Mious case study presentation (2)
Mious   case study presentation (2)Mious   case study presentation (2)
Mious case study presentation (2)
 
Humana Case Study: Paradigm Shift in Reporting by Deploying Four OBIA Module...
Humana Case Study:  Paradigm Shift in Reporting by Deploying Four OBIA Module...Humana Case Study:  Paradigm Shift in Reporting by Deploying Four OBIA Module...
Humana Case Study: Paradigm Shift in Reporting by Deploying Four OBIA Module...
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Trends in Enterprise Advanced Analytics
Trends in Enterprise Advanced AnalyticsTrends in Enterprise Advanced Analytics
Trends in Enterprise Advanced Analytics
 
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
DAS Slides: Emerging Trends in Data Architecture — What’s the Next Big Thing?
 
ExistBI Data Integration Consulting Case Study
ExistBI Data Integration Consulting Case StudyExistBI Data Integration Consulting Case Study
ExistBI Data Integration Consulting Case Study
 
E Team Data Management Offerings
E Team Data Management OfferingsE Team Data Management Offerings
E Team Data Management Offerings
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
Health IT Summit Austin 2013 - Closing Keynote "100 Years of Innovation at AC...
Health IT Summit Austin 2013 - Closing Keynote "100 Years of Innovation at AC...Health IT Summit Austin 2013 - Closing Keynote "100 Years of Innovation at AC...
Health IT Summit Austin 2013 - Closing Keynote "100 Years of Innovation at AC...
 
Presentation done in GENPACT
Presentation done in GENPACTPresentation done in GENPACT
Presentation done in GENPACT
 
Incedo - Corporate Overview
Incedo - Corporate OverviewIncedo - Corporate Overview
Incedo - Corporate Overview
 
Estes Group Capabilities Overview
Estes Group Capabilities OverviewEstes Group Capabilities Overview
Estes Group Capabilities Overview
 
Estes Group Capabilities Overview
Estes Group Capabilities OverviewEstes Group Capabilities Overview
Estes Group Capabilities Overview
 
Big Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseBig Data's Impact on the Enterprise
Big Data's Impact on the Enterprise
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation
 
Modernizing Integration with Data Virtualization
Modernizing Integration with Data VirtualizationModernizing Integration with Data Virtualization
Modernizing Integration with Data Virtualization
 
Estes Group Capabilities Overview
Estes Group Capabilities OverviewEstes Group Capabilities Overview
Estes Group Capabilities Overview
 

Recently uploaded

Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...KarteekMane1
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 

Recently uploaded (20)

Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 

TeraStream - Data Integration/Migration/ETL/Batch Tool

  • 1. © 2014 DataStreams Corp. All Rights Reserved. DataStreams Corp. "Always find the better value of your data" www.datastreams.co.kr
  • 2. © 2014 DataStreams Corp. All Rights Reserved. 1.Company Profile 2.Data Integration as Data Infrastructure 3.TeraStream™ Overview Performance & Cost Savings of TeraStream™ Features & Benefits Customers & Application Contents
  • 3. © 2014 DataStreams Corp. All Rights Reserved. 1.Company Profile
  • 4. © 2014 DataStreams Corp. All Rights Reserved. Name Data Streams Corp. CEO Mr. Young-sang Lee Business Area Data Integration Solutions Development and Sales Data Quality Solutions Development and Sales Data Warehouse / BI / FDS / Forensic / Audit Consulting and Construction Big Data Analytic Consulting and Platform Construction Data Governance Platform Development/Consulting/Sales Data Migration Consulting and Construction Large Volume Data Batch Processing Improvement Consulting and Construction Data Standardization and Quality Management Consulting and System Construction Data Architecture Consulting Office Address HQ Chungho-nais B/D 6F, 28 Saimdang-ro, Seocho-gu, Seoul, Korea R&D U-Spacemall #2 B-601, 670 Daewangpangyo-ro, Bundang-gu, Seongnam, Korea China Office Room 1216, 12th Floor, Intersection of Hopson Kirin Society Building 2, Wang Jing Fu Tong West Street, Wangjing, Chaoyang District, Bejing Contact Tel) +82-2-3473-9077 / Fax) +82-2-3473-9084 Investor JAFCO ASIA Capital USD 2M Sales Amount USD 19M (2013) Established Sep 19, 2001 Employees 121 Company Profile 3
  • 5. © 2014 DataStreams Corp. All Rights Reserved. 2012 •Established R&D Center in Pangyo Techno Valley •Released Social Cube for SNS Data Analytics •Participated in Original Technology Development Project for Next Generation Memory Based Big Data Analytics and Management 2009 •TeraStream™, Selected as Standard Data Integration Tool by The Korea Federation of Banks •Selected as Contractor for Building Resource Management Data Standardization and Meta-data Management System by Ministry of National Defense •Released DeltaStream™, QualityStream™, and ImpactStream™ 2007 •Awarded for Excellent Venture Company by Deputy Prime Minister •MetaStream™, Awarded for Digital Business Innovation by SMBA •Released TeraStream™ Version 2.0 •JAFCO, Japan invested 4 million USD 2005 •Mr. Young-sang Lee, CEO, Was Awarded a Grand Prize for Korea Digital Competitiveness •TeraStream™ Won New Technology Certification from Ministry of Knowledge Economy •Released MetaStream™ Version 1.0 •Acquired KDB Solution Co., Ltd., Korea’s First Meta-data Management Solution Company •TeraStream™ Version 1.4, Acquired GS(Good Software) Certification 2003 •KEB selected TeraStream™ as Standard Batch/ETL Solution for Next Generation Banking System •First Worldwide Sales Contract of FACT™ •Presented FACT™ to Oracle Open World 2003 in San Francisco, USA •Released MetaStream™ Version 2.7 •CEO, Mr. Young-sang Lee, Was Elected as a Chairman of KOSEA(Korea Software Enterprise Association) •Released TeraStream™ Version 3.2 2010 •TeraStream™ Version 2.2, Acquired GS Certification •Changed Company Name to DataStreams Corp. •Contracted with Intellectual Property Office for Enterprise Data Quality Management 2008 •Awarded for Top Private Company for Population and Housing Census by Deputy Prime Minister •TeraStream™, Selected as Standard ETL Tool by Ministry of Government Administration and Home Affairs 2006 •Selected as Technically Innovated Company of 2004 by SMBA •Selected as Technically Innovated Company of 2004 by Small & Medium Business Administration(SMBA) •TeraStream™, Selected for a Next Generation Banking Data Migration Tool by Shinhan Bank 2004 •Registered TeraStream™ as a Trademark •Released TeraStream™ Designer Version 1.1 •First TeraStream™ V.1.1 Contract with National Statistics Office 2002 2011 •Awarded Prime Minister Citation for SW Achievement •Selected as an ATC(Advanced Technology Center) by Ministry of Knowledge Economy •Established China Office in Beijing 2013 •TeraStream™ for Hadoop, Selected as Base Solution for Building Government-Wide Big Data Infrastructure •Acquired Patent for Readable Data Encryption and Decryption •CEO, Mr. Young-sang Lee, Was Awarded Digital Management Innovation Prize •Joined Int’l SOFT China in Beijing • MetaStream™ Version 3.0, Acquired GS Certification • Launched DQ Appliance 2001 •Released TeraStream™ Beta Version •Innovative Data Solutions Corp., Was Established Company History 4 2014 •TeraStream™, Was Awarded 2014 Korea Software Award by Ministrer of Science, ICT and Future •DataStreams Is Listed on KONEX(Korea New Exchange)
  • 6. © 2014 DataStreams Corp. All Rights Reserved. Organizational Structure Present Condition of Engineering Employees 96 Consultants Data Governance 36 Data Integration & Migration Big Data Management DW & BI SNS Analytics Engineers Meta Data & Data Quality Management Solution 60 Data Integration & Migration Solution Big Data Management Platform Number Of Employees Total Consultants/Developers Management/Sales 121 96 25 Engineering Lev. Total Consultants Engineers Total 96 36 60 Professional 27 18 9 Qualified 26 11 15 Intermediate 19 3 16 Beginning 24 4 20 •Government Offices •Banking •Manufacturing Business •Logistics/Services •Planning Products •Presales & Consulting •DW/BI •Big Data •SNS •QA(Quality Assurance) •Marketing •Overseas Sales •Overseas Corp. •HR/General Affairs •Financial Admin. •Knowledge Mgmt. •Sales Support •PI •DI Technical Support •DQ Technical Support •DI •RTI •DQ •UI CEO Auditor Counselor Sales Div. PPC Div. Business Consulting Div. Global Business Div. Management Support Div. Technical Service Div. R&D Center 5
  • 7. © 2014 DataStreams Corp. All Rights Reserved. Business Area Data Governance • Data Governance Architecture • Data Quality Management • Meta Data Management • Master Data Management • Data Quality Appliance Data Integration • High Performance ETL ∙ Batch • Data Integration • Deferred(Near Real Time) • CDC, Real Time Data Transition • High Speed Data Extraction • High Speed Data Sort • Data Integration with Hadoop • Data Integration with Grid • Test Data Management Big Data • Big Data Platform with Hadoop • Big Data Anaysis & Visualization • Structured & Unstructured Data Analysis • SNS Data Analysis ∙ Monitoring Consulting • ISP & Big Data Consulting • Fraud Detection System(FDS) Consulting • DW ∙ CRM ∙ BI Consulting • Data Integration & Migration Consulting • Data Standardization ∙ Quality Management ∙ Architecture Consulting • Master Data Management Consulting • Data Lineage Management Consulting DW/BI • Building DW ∙ CRM ∙ BI • QPI Methodology • Fraud Detection System(FDS) • Information System Planning(ISP) • Alternative Trading System Consulting • Transaction Cost Data Analysis Framework • Transaction Cost Data Analysis Framework & Consulting(TCA) • Financial Analysis Services DataStreams Is a Company Which Has Expertise in Data Processing and Analysis to Provide Total Data Management Services in Data Integration and Quality Management. Data Lineage Management • Data Lineage Analysis Platform • Visualization for Data Lineage • Relative Tool, Program & Script Language Analysis • Table Column Search & Monitoring 6
  • 8. © 2014 DataStreams Corp. All Rights Reserved. Market Recognition & Share 7 60% 25% 10% 5% 80% 15% 5% Korean Market Share for ETL Solution Korean Market Share for Data Migration (Banking Industry) DataStreams Corp. IBM Informatica Others * The market share for ETL solutions is self-researched in 2013. 55% 30% 15% Korean Market Share for Metadata Management W Company (Korean) G Company (Korean) ETL Data Migration Metadata Management No. 1 Total Data Management Technologies in Korea Vendor Report of Magic Quadrant for Data Integration and Data Quality Tools (2013) Reference URL : http://www.citia.co.uk/ Mentioned DataStreams’ capabilities of offering wide range of data integration products through ETL, CDC and near-real time technologies.
  • 9. © 2014 DataStreams Corp. All Rights Reserved. Private Banking / Finance Companies Public Finance Companies Government & Public / Educational Institutions Enterprises Major Domestic Customers 8
  • 10. © 2014 DataStreams Corp. All Rights Reserved. DataStreams Is Exporting & Expanding… Columbia Banco Colpatria Bogota City Government Credibanco China Kookmin Bank Hana Bank USA Merklenet, Inc. CSC Consulting Bisys Comcast Merkle Data Tech USA Airweb Sungard American Airlines Highmark, Inc. Mexico Sodexhopass Procesar Peru Banco Ripley Chile Banco Estado de Chile Australia National Wealth Management (MLC/NAB) Spain Procecard Tecnocom Telefonica Soluciones ITnow! Germany Accenture GmbH India Reliance Industry Indonesia Excelcom Aviva Telkomsel Hana Bank Global Customers 9 Japan with Reliable Business Partners U.S.A. BellaDati(US) Vietnam HIPT FPT IS Lac Viet QTSC EU BellaDati(CZE) Gibkie(RUS) IMBI(Europe, N. Africa) China China Mobile Fuchen Telecom & Banking / Insurance Companies
  • 11. © 2014 DataStreams Corp. All Rights Reserved. Business Challenges Data Integration Architecture 2. Data Integration as Data Infrastructure
  • 12. © 2014 DataStreams Corp. All Rights Reserved. Data Integration Landscape: Business Challenges  Inaccurate data leads to bad or no decisions  More than 30% of IT budgets typically spent on Data integration  Inconsistent enterprise and application architecture for integration Disparate data Inaccurate data Incomplete data Untimely data Fragmented Integration Approach Factors  Bad decisions  Lost revenue  Lost productivity  Lost market opportunity  Bad Citizen relationships Results Multiple versions of the “Truth” Wasted time and resources aggregating information Difficult to use Data Delayed Decision making Uninformed management Impacts 11
  • 13. © 2014 DataStreams Corp. All Rights Reserved. Data Oriented Business Intelligence Architecture Administration of data integration and quality should be based on solid data infrastructure which requires data transformation and quality management. DM Legacy Legacy Channel ERP CRM DBMS EDW ODS SAM file DW ETL/DQ/RTI ETL/DQ ETL/DQ Data Architecture Application Architecture OLAP … E D W P O R T A L Casual User Power User Data Architect IT Developer Metadata manage-ment system Aggregation Cu tsatbolme er Contract Revenue … … Fundamental Rule set Column Dependency Description of column … Involution Rule Set vs Column Talbe vs Column F(FundamentFa(lF) undamental) A(Association) I(Involution) D(Dependency) D(Dependency) Central Metadata Repository Relational Integrity User Defined Integrity DQI aggreg ation Rule Set Inspec -tion Inspec tion Result Inspecti on report Domain Integrity Metadata Oriented Service Architecture QualityStreamTM MetaStreamTM ImpactStreamTM ------ ------ ------ ------ ------ ------ ------ ------ ------ Program source Pl/SQL Stored Procedure ... Token Special String SQL script Analysis PGM source Impact Analysis Engine TeraStreamTM / DeltaStreamTM/TeraNRT/ QualityStreamTM 12
  • 14. © 2014 DataStreams Corp. All Rights Reserved. Performance & Cost Savings of TeraStream™ Features & Benefits Customers & Application 3. TeraStream™ Overview
  • 15. © 2014 DataStreams Corp. All Rights Reserved. TeraStream™ Overview (1/2) Variety of data types and formats transport from source to target as needed. Covers enterprise-wise data flow from operational to subject data mart. Also applied to high volume batch processing and near real-time data integration. Data extraction from a various commercial DBMS in high speed High performance sort engine resolves time bottleneck due to Sort//Join/Aggregation Automatic generation of scripts can be used for loading to various DBMSs Transform / Cleansing Load Files New Systems Files Conversion Sort/Join/Aggregation Reformat Databases Databases Extraction 14
  • 16. © 2014 DataStreams Corp. All Rights Reserved. TeraStream overview (2/2) – product configuration TeraStream™ includes a sort engine and a high volume data extraction engine(FACT™), and meta data is stored and managed in DBMS. • easy to use GUI for developers.  User Interface • High performance (FACT/CoSORT) • External command(shell/SortCL) • Query processing • Data conversion (Korean/Japanese) • Function processing  Data Processing  Metadata Management  Operations & Administration User Interface Operations & Administration Data Processing Engine TeraStream Designer Metadata Management Engine TeraStream DB (Repository) Log Manager Project Scheduler FFD Manager Process Manager Data Access Manager Message Broker FACTTM CoSORTTM Converter USQL External command User SCL • Job and system log management • Job scheduling • File Format Description for metadata • Real-time job monitoring • Authentication Management •Data format, job & system information in TSDB(Repository) Monitor 15
  • 17. © 2014 DataStreams Corp. All Rights Reserved. TeraStreamTM for Data Integration  TeraStream™ is a high-performance ETL solution with convenient GUI which is proved for its reliability in variety of enterprises for a decade. Experiences Cost Plenty of customers Various industry Decades of experiences TeraStream™  Easy to use  Easy to operate  Easy to maintain  Sort engine (CoSORT™) High-speed extraction (FACT™) Reuse of data (EBH)  Low resource use  Low development cost  quick development Performance Conveniences 16
  • 18. © 2014 DataStreams Corp. All Rights Reserved. FILE → DB DB → DB FILE → FILE FILE → DB Excellent Performance and Resource Usage TeraStreamTM out-performed 3-times in speed against its competitor with 30% of CPU resource.(Data Migration in Shinhan Bank, Korea) D product TeraStream™  Elapse time : 20 minutes  Wasted System Resource : 800 ( 40% Avg. CPU usage X 20 mins )  Elapse time : 59 minutes  Wasted System Resource : 3000 (50% Avg. CPU usage X 60 mins)  thread MAX for sort =3  File manipulation : 35% CPU usage Load : 80% of peak CPU usage  Parallel = 4  File manipulation : 58% of CPU usage.  Load: 58% of peak CPU usage Conclusion Conclusion 17
  • 19. © 2014 DataStreams Corp. All Rights Reserved. Excellent Performance in NRT Implementation Transportation of up to 1 million records per minute by reading flat files through EAI and splitting them per tables eliminating the duplicated business days to Sybase IQ. 3 X 0 10 20 30 40 50 60 70 100 1,000 5,000 10,000 20,000 D product minutes Thousand records [Shinhan bank DW Benchmark in August, 2006)] 18
  • 20. © 2014 DataStreams Corp. All Rights Reserved. Time Table for NRT Implementation Unit (records in thousand) TeraStream™ D product mapping/processing/loading mapping/processing/loading start end time start end time 100 18:02:39 18:02:55 0:16 15:08:16 15:10:33 00:53 1000 18:05:25 18:06:23 0:58 15:11:13 15:20:34 03:32 5000 18:07:20 18:12:02 4:42 15:25:14 15:43:44 15:28 10,000 18:13:54 18:24:20 10:26 15:47:57 16:23:45 31:09 20,000 18:29:10 18:49:55 20:45 16:31:40 17:36:10 58:41 10,000 (concurrent execution) 11:35:48 11:50:35 14:47 11:35:48 12:17:10 41:22 19
  • 21. © 2014 DataStreams Corp. All Rights Reserved. Excellent Performance in Batch Jobs TeraStream™’s excellent performance can be applied to not only ETL but also daily batch jobs. [Batch Job of POST Insurance Service Company, 2007] High Performance Effective use of resources Convenience No. of Records Oracle (SQL) TeraStream 400,000 1m 32s 28s 1,000,000 5m 01s 41s 2,500,000 12m 21s 59s No. of Recs Oracle Time 250,000~500,000 Tth 20
  • 22. © 2014 DataStreams Corp. All Rights Reserved. ETL Performance Improvement  Using EBH, TeraStreamTM can cut down data path from Legacy to MART saving ETL time and resource usage.  Massive volume of files extracted from Legacy Systems are stored in EBH for further reuse in next step.  ETL time is reduced by avg. 56%. (In L-Telecom from D-3 to D-1) EDW Server IBM p690 NCR 10Node Teradata D-1 Oracle 8i ETL Server ODS Customer/Call/ Billing Connection PPS/BSS Mining Input Variable MOLAP Analysis Mining Analysis Campaign Analysis Sybase IQ/ASE OLAP MART Server CSM/AR Billing Oracle 8.0.6 CCS/MPS/ERP CTI /PPS/NMS SRDF Legacy ETL EBH Infomatica EBH (ETL and Batch Hub) stores temporary and result files which is shared for further table generation in EDW and MART. 21
  • 23. © 2014 DataStreams Corp. All Rights Reserved. Job Task Cycle System Before After Improvementrate Billing Sales Month EDW 12:50 5:00 61% OLAP Mart 18:35 8:20 55% Calls Charges day EDW 5:50 3:00 49% OLAP Mart 8:00 4:00 50% ACCUM week EDW 4:20 1:55 56% OLAP Mart 7:20 3:00 60% receiving CDR (NMS) day EDW 1:00 0:30 50% OLAP Mart 2:20 0:55 61% Sending CDR (NMS) Day EDW 1:40 1:05 35% ERP batch Month EDW 11:20 3:15 71% receiving CDR (NMS) Month EDW 5:00 2:15 55% OLAP mart 11:40 2:20 80% sending CDR (NMS) Month EDW 8:20 4:50 42% ERP provided BATCH Month EDW 16:20 5:15 68% Customer Service After service month EDW 5:30 5:05 9% Improvement Details 22
  • 24. © 2014 DataStreams Corp. All Rights Reserved. Cost Saving Factors  TeraStream has many cost saving factors. Ease of Maintenance Operational Efficiency Quick Development  Creating standard operation and new development procedures  Efficient operation of test system  Ability to fast make test file Ease of maintenance Quality improvement of developing programs - Reliability - Maintainability - Efficiency - Functionality  Productivity improvement of developers and administrators  Developing correct and efficient logic programs  Quality improvement 23
  • 25. © 2014 DataStreams Corp. All Rights Reserved. Cost Saving : Development Stage The higher complexity, the bigger cost saving in development . (Courtesy of Hanhwa Insurance Co. and SKC&C in 2007) Job complexity No. of recs Input Size (Gb) TeraStream™ In- house coding Speed- up 1:1 mapping 90 22 30min 2hour 75% 1:N mapping 900 21 2hour 6hour 66% N:1 mapping 1700 15 2hour 10hour 80% N:N mapping, complex 1300 8 2hour 20hour 90% Avg. 70% of development speed-up 90% speed-up for more complex jobs Overhead from modification, test and preliminary data checking. Development (4Month) Test (4Month) Stabilization (1Month) 24M/M 48M/M 54M/M TeraStream™ In-house coding (Estimated) 40M/M 80M/M 90M/M 40% Reduction 24
  • 26. © 2014 DataStreams Corp. All Rights Reserved. Features & Benefits TeraStream™ guarantees to meet your need for enterprise data integration as well as excellent batch job hub. Sort Engine Using CoSORT™, the first sort package since 1985, TeraStream™ can accelerate sort-related data manipulation (dedup, average, min, max, join, summary and etc.) FAst extraCT FACT™ performs high speed bulk extraction from various commercial DBMS. Automatic Metadata Generation TeraStream™ provides direct reading of DBMS data dictionary to create its own metadata information. High Speed Lookup It provides in-memory lookup function which is high speed mapping conversion using lookup tables. Variety of conversion function calls It provides more than 100 user friendly mapping functions. Developers can easily add their own functions. Pre/Post Processing TeraStream™ provides inter-record and inter-table conversion through pre/post mapping. Major Features Description 25
  • 27. © 2014 DataStreams Corp. All Rights Reserved. Features & Benefits TeraStream™ has been evolved to meet various parallel processing needs and to give convenience through highly efficient GUIs. Inter-node Operation Remote call is possible to initiate the projects of other nodes between TeraStream™s. Distributed Computing using idle nodes is possible by easy transfer of data. Near Real-Time ETL Data transportation every minute is possible including complex data mapping Efficient GUI Using GUI, no skills on programming language are necessary. Unified monitor and control in single screen or specialied monitoring is possible through web browser. Scheduling of jobs is made in unified GUI but even for distributed servers. Multi Language Support UTF-8 is supported. Major Features Description 26
  • 28. © 2014 DataStreams Corp. All Rights Reserved. Improved GUI Supports for data integration activities(develop, execute, monitor, validation) in integrated GUI environment Intuitive task flow Project monitor Editor window GUI for developers Intuitive task flow checking standard output/error/file information/ number of files processed Execution log real time job monitoring  Project Monitor scheduling by time/ period/ business calendar  Scheduler Mapping creation  Editor window Scheduler Task block execution log Metadata property Impact analysis Change history manager  Metadata Repository 27
  • 29. © 2014 DataStreams Corp. All Rights Reserved. DBMS Connectivity Powerful connection between different DBMS types. Both DB-to-DB and File-to-DB data transportation are supported. •N:N mapping •Conversion while transportation •Click to choose record processing types : (Insert/delete/update/insert- update/delete-insert) •DBMS types : Oracle, DB2, Sybase, Informix, Teradata, Greenplum, MSSQL, MySQL, (Altibase, Tibero) Transformation Logic Source Table Target Table 28
  • 30. © 2014 DataStreams Corp. All Rights Reserved. High speed data extraction with FACT™ High speed data extraction of commercial database with SQL is supported. Automatic extraction query is generated. Select * from table •High speed extraction engine(FACT™) with optimized database API. •DBMS Supported : - Oracle - Informix - DB2 / UDB - Sybase IQ /ASE - Teradata - Greenplum - MSSQL /MySQL - Altibase •File split and filtering while extraction •Time, time stamp, and user data format specification 29
  • 31. © 2014 DataStreams Corp. All Rights Reserved. Data Conversion By mapping source to target, conversion of formats, types, character sets, dates, bytes/bits, encryption •Convenient data conversion using mapping window of “converter task block” •Data character set conversion including EBCDIC to ASCII •Data conversion from NDB(Unisys 9-bit) or HDB(IBM) data type to RDB •300 built-in functions •DATE, Time Stamp Conversion between different date formats •CLOB/BLOB supported •Users can add more functions as needed List of provided functions CALLED_NO function editor =addday(cdate(“",”",” (N)") addday(cdate("2005/05/12 12:08:24", "YYYY/HH/DD HH:MI:SS"),2) Converter task block 30
  • 32. © 2014 DataStreams Corp. All Rights Reserved. Data Transport TeraStream uses various transportation method according to file structure, transportation distance, security, amount of record and etc. •File to DB data load for bulk data •“Load task block” generates load scripts automatically. •Remote transportation using FTP •Encryption while transporting •Near Real-time and Bulk transportation is possible Load Scripts 31
  • 33. © 2014 DataStreams Corp. All Rights Reserved. Korea Exchange Bank Expected Result  TeraStream™ is in charge of data flow from core-banking to data mart.  Batch job is also executed during ETL process in the same manner. Issues Plans System Configuration Extract 24 hours DB SPLIT Profit Management RISK KPI EUC current ODW ORACLE 9i EDW Sybase ASIQ 12.5 New ODW ORACLE 9i Bidirectional ETL ETL ETL ETL Batch Batch Batch FTP transfer after extraction Accounting System Information System DW DM & Sub System Open in Feb. 2005 Batch • A next-generation conversion of M/F and IMS HDB • Interactive ETL process • High volume data handling within batch process time • Converting main frame data into Unix data (1TB → 1.5TB) including Korean character conversion • ETL task from accounting system server to new ODW server(extracting appx. 200 GB of daily changed data within 1 hour and 30 minutes by using FACT™ of TeraStream™) • Data hub is used in ETL process from new core banking system to subjective marts. (EUC, KPI, RISK, IFRS, Basel2) • ETL and Batch process in unified way. • Batch process is designed to be done in both ODW and EDW.  Unified Data Processing in ETL & Batch 32
  • 34. © 2014 DataStreams Corp. All Rights Reserved. System configuration Issues Plans Kookmin Bank IBM M/F HDB, DB2 Server RDB Sybase ASIQ 12.7 IMS HDB - Seg. split - conversion & Array split - logic applied - conversion - logic applied - Logic applied EDW A-SOR DM ETL ETL ETL Informover TS(FACT) Informover Source system File process flow DB QUERY Expected Result  Various DBMS(IMS HDB, HOST DB2, Oracle, DB2 UDB) integration by using TeraStream™  Meeting batch target time of 2 hours and 30 minutes for 4TB of EBCDIC data. • M/F and IMS HDB conversion • Processing changed data in absence of time-series column • Processing large size data within batch process time(10TB/day based on source data) • How to process high volume files in parallel • Converting main frame data into data in Unix environment (10TB → 25TB) within 18 hours. • Various data conversion and processing including Korean character conversion • ETL task from accounting system server to new ODW server(extracting appx. 200 GB of daily changed data within 1 hour and 30 minutes by using FACT module of TeraStream™) • ETL and Batch process in unified way. • Batch job in core banking system within 6 hours.  EDW and integrated DM installation 33
  • 35. © 2014 DataStreams Corp. All Rights Reserved. Samsung Electronics • Rea-time data transportation between Germany and China. • Bi-directional synchronization between TeraStreams of Germany and China. • 20 min. MAX loading time for transported data is implemented using TeraStream NRT. •Web Monitoring is developed • Registration in one country should have the same service at other country. • duplicated record should be avoided due to cross transportation. • 20 minutes Near Real-time • Perfect Recovery scheme should be presented Plans Issues System Configuration Smart Phone System in Germany DBs in Service  Efficiency is maintained despite cross transportation Bi-directional NRT integration allows the same service regardless of system type and country (Time from extraction to loading is 20 minutes.)  Bi-directional remote data transportation using TeraStream NRT Extract 프로그램 성공, 실패 등 실행 결과 Web Monitoring Sam To DB UPSERT NRT Extract SAM To DB UPSERT  Global Database Integration using NRT ETL DBs in Service Smart Phone System in China Expected Result 34
  • 36. © 2014 DataStreams Corp. All Rights Reserved. Department of Domestic Administration  Building an E-government and improving the accessibility of government Information System Configuration •The required data is interactively transferred between DW and DM by EAI and configured with ETL. •Target : the central office and 16 cities and provinces •Work status by provinces 1) Administration -> Seoul City 2) Environment : 2 cities and 2 provinces 3) Health Service : 1 city 4) Economy and trade : 2 cities and 7 provinces •50 GB of data processing per day •How to save the processed data : transfer data to a file system and use a backup solution •The central office data is provided in file through EAI. • Oracle data from central office, other cities and provinces is directly extracted by FACT and stored in DW/DM(Sybase). • To complete substantial E-Government service • To provide integrated high-quality information • To build a fast and accurate decision support system Issues Plan Integrated Data Mart Data Warehouses in 16 cities and provinces ODS DW ODS DW DM DM Central governmental system Other provincial s ystems ETL EAI  Provincial Administrative DW Expected Result 35
  • 37. © 2014 DataStreams Corp. All Rights Reserved. System Configuration  Batch processing time for daily closing and unified collection reporting is reduced.  ETL flows unified in TeraStream.  ETL Batch Hub reduces data flow path. • Complex extraction process & difficult job tracking • Absence of ODS makes data referencing hard. • Long DW time cause delayed service start in the morning. •Heavy loads in closing process Issues • Unified processing of ODS, DW & DM • Simple extraction scheme and easy tracking of errors. •ETL time and load reduction using EBH Plans SQL script stage DW DM Procedure Source ODS DM DW ETL HUB D-1 D-1 Source Before Change After Change  Data Warehouse ETL Expected Result Korea Worker’s Compensation & Welfare Service 36
  • 38. © 2014 DataStreams Corp. All Rights Reserved. Health and Welfare Department’s e-Voucher E-Voucher DW Performance Improvement Statistics reporting time is dramatically reduced from 1~6 days to a few second or minutes.  Statistics reporting process made simple and easy to get report.  Consistent data delievry increase data reliability. • daily transportation to ODS • build ODS, DW and DM for better table model • e-Voucher System (DB2 -> DW Server) • Platform - OS : AIX 5.3(ASIS,TOBE ) - CPU : Power5, 2.1GHz, 6core , IBM P-serise - MEM : 12 GB - H/W : 1TB • Simple logic made MA easy •Low data integrity •Lack of expeditious response •Fraud detection was hard. •Low reliability of statistic data caused dispute between data users and generators Plans Issues System Configuration E-Voucher Statistical DW Operational Source DB (oracle) - ODS data conversion - update/insert at ODS - 1:1 mapping - Daily batch - Load to ODS IBM P-serise Voucher Servide Mis-settlement Pregnancy & Birth History Target DB (oracle) FACT ODS DM DW ETL - ODS/ DW data manuplation - update/insert to data mart ETL ETL Expected Result 37
  • 39. © 2014 DataStreams Corp. All Rights Reserved. Korea National Open University (Synthetic Statistic DW ) Expected Result  Building a complete view and improving the accessibility of Statistical DW .  Reducing the institutional service time from 15 days to 4 hours.  Expeditious response of university’s administration statistics report become possible. System Configuration •Extract an operational data for the institutional service and university administration service, transfer the data into ODS, DW and DM, and then load Oracle DBMS by ETL for producing typical and atypical statistic reports •Institutional Statistic Service Data (Initial/Alteration) : 20GB/1GB, Alteration data load time: 4 hours •University Administration Statistic Service Data (Initial/Alteration) : 5GB/100MB, Alteration data load time: 1 hour • Reducing time and automation for the existing manual based university statistic services • To provide integrated high-quality academic, administration, and management information • To build a fast and accurate academic administration decision support system Issues Plan Univ, Admin. Graduate Institutional Services Electronic settlement Tutor Administ -ration Lifelong Education Institution Graduation Grade Registration Admission Synthetic Statistic DW Server TeraStream ODS DW DM Data Extract/Load Data Extract/Load Data Extract/Transfer/Load ETL Control Data Extract/Load Data Extract/Load Unit Process Processing Time Institutional Statistics Services Within 4 hours University Statistic Services Within 1 hour Huge time reduction from of institutional statistic sservice reports 14 days to 4 hours 38