SlideShare a Scribd company logo
1 of 3
Download to read offline
DAUM COMMUNICATIONS
Using big data analytics to understand and predict
user behavior
ESSENTIALS
Industry
Telecommunications
Company Size
2,000+ employees
Business Challenges
• Reduced responsiveness due to
inability to perform realtime
analysis
• Increased complexity from NoSQL
database management systems
• Reliance on resource-intensive
data analysis
• Reduced capability to make ad-
hoc queries on unstructured data
Solution
• EMC VNX unified storage
• Pivotal Greenplum Database
OVERVIEW
Daum Communications (Daum) is one of the leading providers of Korean-language
online services, including the news and information portal Daum.net, web-based email
service Hanmail.net, and the Daum Cafe online community. Headquartered in Jeju
Island, the company provides mobile web services, search marketing, and electronic
mapping. It also sells online advertising products through Daum.net. Daum is the
second largest web portal service provider in terms of daily visits in Korea and has
operating centers in Seoul and on Jeju Island.
Through its extensive range of Internet services and sale of online advertising
products, Daum generates vast amounts of unstructured data. The company has one
of the largest Apache Hadoop clusters in Korea, and analyzes its data to gain critical
competitive information in a number of areas, including user preferences and
behavior, search rankings, and advertisement targeting.
COMPLEX ENVIRONMENT IMPEDES DATA ANALYSIS
Facing intense domestic and global competition from a number of search engines that
are growing market share across desktop and mobile searches, Daum’s businesses
needed to make faster and better decisions to protect the company’s 20 percent share
of the Korean search market.
The company needed to analyze and make immediate decisions on its vast data stores
by extracting knowledge from its data in real time. But Daum was more interested in
solving analytic problems than in exploring relationships between data that are
available in traditional relational database systems. As a result, Daum was using
Hadoop to store data, and was using NoSQL non-relational database management
systems such as Cassandra and Storm as the Hadoop Distributed File System (HDFS)
to provide greater speed in performing Big Data analytics on unstructured data. This
solution landscape presented the company with serious challenges.
“Performing ad-hoc and multidimensional queries and analysis through Hadoop on our
unstructured data proved difficult,” says Jun-Sik Eom, Team Manager, Data
Technology Department, Daum Communications. “We were restricted in the speed of
data analysis due the batch processing of both unstructured and structured data,
which meant we relied heavily on the capability of our developers. Data analysis of
complex forms was also challenging in the NoSQL database.”
Because Daum’s data must be constantly reviewed, the company sought a solution
that would enable employees to perform high-speed queries on the data residing in
Hadoop. Additionally, Daum wanted to improve access through tools that were already
familiar to developers and database administrators.
CUSTOMER PROFILE
Benefits
• Increased data loading and
processing speeds
• Improved accuracy in generating
search results and predicting user
behaviour
• Increased efficiency by
performing rapid queries on the
data
• Reduced expenditures through
improved scalability
PIVOTAL GREENPLUM DATABASE ENABLES HIGH-SPEED
ANALYSIS OF UNSTRUCTURED DATA
Daum evaluated solutions that could address the limitations in the resource-intensive
analysis required by Hadoop and the NoSQL database management systems. To meet
the data analysis requirements for its search engine and Internet services businesses,
the company selected Pivotal Greenplum Database, which connects to Hadoop and
enables the co-processing of both structured and unstructured data within a single
solution.
“We were attracted to Pivotal Greenplum Database because of the advantage it had in
mixing the merits of database, data warehouse, and business intelligence,” says Eom.
“We can now use a single platform to run high-speed analytic queries on our most
appropriate data stores.”
“We were attracted to Pivotal Greenplum Database because of the
advantage it had in mixing the merits of database, data
warehouse, and business intelligence. We can now use a single
platform to run high-speed analytic queries on our most
appropriate data stores.”
Jun-Sik Eom,
Team Manager, Data Technology Department, Daum Communications
DELIVERING NEW BUSINESS INSIGHTS FROM REALTIME
ANALYSIS
To support its efforts to gain market share, Daum is using Pivotal Greenplum Database
to provide improved services and search accuracy to its users. Through realtime data
gathering and analysis of Internet searches and user behavior within its various online
services, the company can better predict future behavior and demand.
Daum can now make multiple queries—both in real time and over time as user patterns
and knowledge emerge—due to massively parallel processing (MPP) architecture, which
enables fast data loading and high-speed queries on the data. In addition to performing
real-time weblog analysis, the company can re-analyze data that has already been
processed and gain meaningful results with these various interpretations. Pivotal
helped Daum achieve an increased depth of knowledge, which is just as critical as
breadth in terms of delivering services.
ELIMINATING ROADBLOCKS TO SPEEDY QUERYING
Performing ad-hoc queries on the data stored in NoSQL databases from the Pivotal
Greenplum Database means administrators can use familiar SQL commands to perform
massive and multidimensional analysis. This reduces the company’s reliance on finding
specialist NoSQL and Hadoop skill sets, and minimizes the workload for employees.
“One of the most important elements in effectively using Big Data is securing the right
people,” says Eom. “We used to struggle with having the resources needed to perform
queries, which greatly reduced our processing efficiency. Today, instead of performing
queries on the NoSQL systems, we collect the data residing in Hadoop and NoSQL, and
then save it in Pivotal Greenplum Database to execute the analysis.”
ENABLING CONTINUOUS PROCESSING WHILE REDUCING
COSTS
Because Pivotal Greenplum Database is available as a software-only distribution, Daum
can run the data warehouse on any of its existing x86 servers running Hadoop. This
ensures scalability while eliminating the need for Daum to purchase new data center
infrastructure. Pivotal Greenplum Database enables gNet for Hadoop, a parallel
communications transport, to access the Hadoop cluster and query the data efficiently
using Hadoop servers rather than those running Pivotal Greenplum Database.
“By using our existing x86 servers, we were able to reduce expenditures and expand
capacity through linear scalability,” Eom explains. “We have continuous processing
across Pivotal Greenplum and Hadoop nodes. As the data increases, we can
conveniently expand our capacity just by adding standard x86 servers.”
LEARN MORE
To learn more about Pivotal products, services and solutions, visit gopivotal.com.
CONTACT US
To learn more about how EMC
products, services, and solutions can
help solve your business and IT
challenges, contact your local
representative or authorized reseller—
or visit us at www.EMC.com.
www.EMC.com
EMC2
, EMC and the EMC logo are registered trademarks or trademarks of EMC Corporation in the
United States and other countries. GoPivotal, Pivotal, and the Pivotal logo are registered
trademarks or trademarks of GoPivotal, Inc, in the United States and other jurisdictions. All other
trademarks used herein are the property of their respective owners. © Copyright 2013 EMC
Corporation. All rights reserved. Published in the USA. 12/13 Customer Profile H12705.
EMC believes the information in this document is accurate as of its publication date. The
information is subject to change without notice.

More Related Content

What's hot

Better Together: The New Data Management Orchestra
Better Together: The New Data Management OrchestraBetter Together: The New Data Management Orchestra
Better Together: The New Data Management OrchestraCloudera, Inc.
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionAppfluent Technology
 
Dell Technology World - IT as a Business - Multi-Cloud Strategy is your Product
Dell Technology World - IT as a Business - Multi-Cloud Strategy is your ProductDell Technology World - IT as a Business - Multi-Cloud Strategy is your Product
Dell Technology World - IT as a Business - Multi-Cloud Strategy is your ProductManuel "Manny" Rodriguez-Perez
 
Hadoop is not an Island in the Enterprise
Hadoop is not an Island in the EnterpriseHadoop is not an Island in the Enterprise
Hadoop is not an Island in the EnterpriseDataWorks Summit
 
Struggling with data management
Struggling with data managementStruggling with data management
Struggling with data managementDavid Walker
 
Building a data warehouse of call data records
Building a data warehouse of call data recordsBuilding a data warehouse of call data records
Building a data warehouse of call data recordsDavid Walker
 
Hadoop World 2011: I Want to Be BIG - Lessons Learned at Scale - David "Sunny...
Hadoop World 2011: I Want to Be BIG - Lessons Learned at Scale - David "Sunny...Hadoop World 2011: I Want to Be BIG - Lessons Learned at Scale - David "Sunny...
Hadoop World 2011: I Want to Be BIG - Lessons Learned at Scale - David "Sunny...Cloudera, Inc.
 
Introduction to Microsoft HDInsight and BI Tools
Introduction to Microsoft HDInsight and BI ToolsIntroduction to Microsoft HDInsight and BI Tools
Introduction to Microsoft HDInsight and BI ToolsDataWorks Summit
 
Comparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse PlatformsComparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse PlatformsDavid Portnoy
 
Modern Data Architecture: In-Memory with Hadoop - the new BI
Modern Data Architecture: In-Memory with Hadoop - the new BIModern Data Architecture: In-Memory with Hadoop - the new BI
Modern Data Architecture: In-Memory with Hadoop - the new BIKognitio
 
Hortonworks kognitio webinar 10 dec 2013
Hortonworks kognitio webinar 10 dec 2013Hortonworks kognitio webinar 10 dec 2013
Hortonworks kognitio webinar 10 dec 2013Michael Hiskey
 
Big data and apache hadoop adoption
Big data and apache hadoop adoptionBig data and apache hadoop adoption
Big data and apache hadoop adoptionfaizrashid1995
 
Intro to HDFS and MapReduce
Intro to HDFS and MapReduceIntro to HDFS and MapReduce
Intro to HDFS and MapReduceRyan Tabora
 
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...NoSQLmatters
 
Delivering on the Hadoop/HBase Integrated Architecture
Delivering on the Hadoop/HBase Integrated ArchitectureDelivering on the Hadoop/HBase Integrated Architecture
Delivering on the Hadoop/HBase Integrated ArchitectureDataWorks Summit
 
J1 - Keynote Data Platform - Rohan Kumar
J1 - Keynote Data Platform - Rohan KumarJ1 - Keynote Data Platform - Rohan Kumar
J1 - Keynote Data Platform - Rohan KumarMS Cloud Summit
 
Hadoop by kamran khan
Hadoop by kamran khanHadoop by kamran khan
Hadoop by kamran khanKamranKhan587
 
Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems divjeev
 

What's hot (20)

Better Together: The New Data Management Orchestra
Better Together: The New Data Management OrchestraBetter Together: The New Data Management Orchestra
Better Together: The New Data Management Orchestra
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
 
Dell Technology World - IT as a Business - Multi-Cloud Strategy is your Product
Dell Technology World - IT as a Business - Multi-Cloud Strategy is your ProductDell Technology World - IT as a Business - Multi-Cloud Strategy is your Product
Dell Technology World - IT as a Business - Multi-Cloud Strategy is your Product
 
Hadoop is not an Island in the Enterprise
Hadoop is not an Island in the EnterpriseHadoop is not an Island in the Enterprise
Hadoop is not an Island in the Enterprise
 
Struggling with data management
Struggling with data managementStruggling with data management
Struggling with data management
 
Building a data warehouse of call data records
Building a data warehouse of call data recordsBuilding a data warehouse of call data records
Building a data warehouse of call data records
 
Hadoop World 2011: I Want to Be BIG - Lessons Learned at Scale - David "Sunny...
Hadoop World 2011: I Want to Be BIG - Lessons Learned at Scale - David "Sunny...Hadoop World 2011: I Want to Be BIG - Lessons Learned at Scale - David "Sunny...
Hadoop World 2011: I Want to Be BIG - Lessons Learned at Scale - David "Sunny...
 
No sql3 rmoug
No sql3 rmougNo sql3 rmoug
No sql3 rmoug
 
Introduction to Microsoft HDInsight and BI Tools
Introduction to Microsoft HDInsight and BI ToolsIntroduction to Microsoft HDInsight and BI Tools
Introduction to Microsoft HDInsight and BI Tools
 
Comparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse PlatformsComparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse Platforms
 
Modern Data Architecture: In-Memory with Hadoop - the new BI
Modern Data Architecture: In-Memory with Hadoop - the new BIModern Data Architecture: In-Memory with Hadoop - the new BI
Modern Data Architecture: In-Memory with Hadoop - the new BI
 
Hortonworks kognitio webinar 10 dec 2013
Hortonworks kognitio webinar 10 dec 2013Hortonworks kognitio webinar 10 dec 2013
Hortonworks kognitio webinar 10 dec 2013
 
Big data and apache hadoop adoption
Big data and apache hadoop adoptionBig data and apache hadoop adoption
Big data and apache hadoop adoption
 
Actian DataFlow Whitepaper
Actian DataFlow WhitepaperActian DataFlow Whitepaper
Actian DataFlow Whitepaper
 
Intro to HDFS and MapReduce
Intro to HDFS and MapReduceIntro to HDFS and MapReduce
Intro to HDFS and MapReduce
 
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
 
Delivering on the Hadoop/HBase Integrated Architecture
Delivering on the Hadoop/HBase Integrated ArchitectureDelivering on the Hadoop/HBase Integrated Architecture
Delivering on the Hadoop/HBase Integrated Architecture
 
J1 - Keynote Data Platform - Rohan Kumar
J1 - Keynote Data Platform - Rohan KumarJ1 - Keynote Data Platform - Rohan Kumar
J1 - Keynote Data Platform - Rohan Kumar
 
Hadoop by kamran khan
Hadoop by kamran khanHadoop by kamran khan
Hadoop by kamran khan
 
Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems Magic quadrant for data warehouse database management systems
Magic quadrant for data warehouse database management systems
 

Viewers also liked

Catàleg BEEP especial Nadal 2014 en Català
Catàleg BEEP especial Nadal 2014 en CatalàCatàleg BEEP especial Nadal 2014 en Català
Catàleg BEEP especial Nadal 2014 en CatalàBeep Informática
 
Як навчити дитину грамотно писати
Як навчити дитину грамотно писати Як навчити дитину грамотно писати
Як навчити дитину грамотно писати valeria karnatovska
 
19252webtoc
19252webtoc19252webtoc
19252webtocftayci
 
20130513 training on the job apprendimento permanente
20130513 training on the job apprendimento permanente20130513 training on the job apprendimento permanente
20130513 training on the job apprendimento permanenteMarco Muzzarelli
 
What are the important brand architecture decisions in developing a branding ...
What are the important brand architecture decisions in developing a branding ...What are the important brand architecture decisions in developing a branding ...
What are the important brand architecture decisions in developing a branding ...Sameer Mathur
 
Graphene Position Paper (E-Nano Newsletter Special Issue)
Graphene Position Paper (E-Nano Newsletter Special Issue)Graphene Position Paper (E-Nano Newsletter Special Issue)
Graphene Position Paper (E-Nano Newsletter Special Issue)Phantoms Foundation
 
Taxation Reforms - Palash Das
Taxation Reforms - Palash DasTaxation Reforms - Palash Das
Taxation Reforms - Palash DasPalash Das
 
Catàleg BEEP Abril 2015 en Català
Catàleg BEEP Abril 2015 en CatalàCatàleg BEEP Abril 2015 en Català
Catàleg BEEP Abril 2015 en CatalàBeep Informática
 
My four hour body
My four hour bodyMy four hour body
My four hour bodyAndy Clark
 
A contribuicao epistemologica de ludwik fleck na producao academica em educac...
A contribuicao epistemologica de ludwik fleck na producao academica em educac...A contribuicao epistemologica de ludwik fleck na producao academica em educac...
A contribuicao epistemologica de ludwik fleck na producao academica em educac...Augusto Santana
 

Viewers also liked (15)

Catàleg BEEP especial Nadal 2014 en Català
Catàleg BEEP especial Nadal 2014 en CatalàCatàleg BEEP especial Nadal 2014 en Català
Catàleg BEEP especial Nadal 2014 en Català
 
Як навчити дитину грамотно писати
Як навчити дитину грамотно писати Як навчити дитину грамотно писати
Як навчити дитину грамотно писати
 
19252webtoc
19252webtoc19252webtoc
19252webtoc
 
20130513 training on the job apprendimento permanente
20130513 training on the job apprendimento permanente20130513 training on the job apprendimento permanente
20130513 training on the job apprendimento permanente
 
Kyria
KyriaKyria
Kyria
 
What are the important brand architecture decisions in developing a branding ...
What are the important brand architecture decisions in developing a branding ...What are the important brand architecture decisions in developing a branding ...
What are the important brand architecture decisions in developing a branding ...
 
Graphene Position Paper (E-Nano Newsletter Special Issue)
Graphene Position Paper (E-Nano Newsletter Special Issue)Graphene Position Paper (E-Nano Newsletter Special Issue)
Graphene Position Paper (E-Nano Newsletter Special Issue)
 
Taxation Reforms - Palash Das
Taxation Reforms - Palash DasTaxation Reforms - Palash Das
Taxation Reforms - Palash Das
 
D6
D6D6
D6
 
Big data overview by Edgars
Big data overview by EdgarsBig data overview by Edgars
Big data overview by Edgars
 
Catàleg BEEP Abril 2015 en Català
Catàleg BEEP Abril 2015 en CatalàCatàleg BEEP Abril 2015 en Català
Catàleg BEEP Abril 2015 en Català
 
D7
D7D7
D7
 
My four hour body
My four hour bodyMy four hour body
My four hour body
 
CV_CC_engl
CV_CC_englCV_CC_engl
CV_CC_engl
 
A contribuicao epistemologica de ludwik fleck na producao academica em educac...
A contribuicao epistemologica de ludwik fleck na producao academica em educac...A contribuicao epistemologica de ludwik fleck na producao academica em educac...
A contribuicao epistemologica de ludwik fleck na producao academica em educac...
 

Similar to Daum Communications Case Study

Case Study - DataXu Uses Qubole To Make Big Data Cloud Querying, Highly Avail...
Case Study - DataXu Uses Qubole To Make Big Data Cloud Querying, Highly Avail...Case Study - DataXu Uses Qubole To Make Big Data Cloud Querying, Highly Avail...
Case Study - DataXu Uses Qubole To Make Big Data Cloud Querying, Highly Avail...Vasu S
 
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy:  A Simple, Scalable Solution for Getting Started with HadoopBig Data Made Easy:  A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with HadoopPrecisely
 
Capgemini Data Warehouse Optimization Using Hadoop
Capgemini Data Warehouse Optimization Using HadoopCapgemini Data Warehouse Optimization Using Hadoop
Capgemini Data Warehouse Optimization Using HadoopAppfluent Technology
 
IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptalmaraniabwmalk
 
Big Data Platform and Architecture Recommendation
Big Data Platform and Architecture RecommendationBig Data Platform and Architecture Recommendation
Big Data Platform and Architecture RecommendationSofyan Hadi AHmad
 
Data warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-clouderaData warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-clouderaJyrki Määttä
 
Big Data using NoSQL Technologies
Big Data using NoSQL TechnologiesBig Data using NoSQL Technologies
Big Data using NoSQL TechnologiesAmit Singh
 
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...MapR Technologies
 
GCP On Prem Buyers Guide - White-paper | Qubole
GCP On Prem Buyers Guide - White-paper | Qubole GCP On Prem Buyers Guide - White-paper | Qubole
GCP On Prem Buyers Guide - White-paper | Qubole Vasu S
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data AnalyticsAttunity
 
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | QuboleVasu S
 
mapr_case_study_experian
mapr_case_study_experianmapr_case_study_experian
mapr_case_study_experianErni Susanti
 
Hadoop and the Data Warehouse: When to Use Which
Hadoop and the Data Warehouse: When to Use Which Hadoop and the Data Warehouse: When to Use Which
Hadoop and the Data Warehouse: When to Use Which DataWorks Summit
 
Analyst Report : The Enterprise Use of Hadoop
Analyst Report : The Enterprise Use of Hadoop Analyst Report : The Enterprise Use of Hadoop
Analyst Report : The Enterprise Use of Hadoop EMC
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization Denodo
 

Similar to Daum Communications Case Study (20)

Case Study - DataXu Uses Qubole To Make Big Data Cloud Querying, Highly Avail...
Case Study - DataXu Uses Qubole To Make Big Data Cloud Querying, Highly Avail...Case Study - DataXu Uses Qubole To Make Big Data Cloud Querying, Highly Avail...
Case Study - DataXu Uses Qubole To Make Big Data Cloud Querying, Highly Avail...
 
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy:  A Simple, Scalable Solution for Getting Started with HadoopBig Data Made Easy:  A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
 
Capgemini Data Warehouse Optimization Using Hadoop
Capgemini Data Warehouse Optimization Using HadoopCapgemini Data Warehouse Optimization Using Hadoop
Capgemini Data Warehouse Optimization Using Hadoop
 
IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data
 
Lecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.pptLecture 5 - Big Data and Hadoop Intro.ppt
Lecture 5 - Big Data and Hadoop Intro.ppt
 
IBM Dash DB
IBM Dash DBIBM Dash DB
IBM Dash DB
 
Benefits of a data lake
Benefits of a data lake Benefits of a data lake
Benefits of a data lake
 
Big Data Platform and Architecture Recommendation
Big Data Platform and Architecture RecommendationBig Data Platform and Architecture Recommendation
Big Data Platform and Architecture Recommendation
 
Data warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-clouderaData warehouse-optimization-with-hadoop-informatica-cloudera
Data warehouse-optimization-with-hadoop-informatica-cloudera
 
Big Data using NoSQL Technologies
Big Data using NoSQL TechnologiesBig Data using NoSQL Technologies
Big Data using NoSQL Technologies
 
Big Data
Big DataBig Data
Big Data
 
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
Hadoop in 2015: Keys to Achieving Operational Excellence for the Real-Time En...
 
GCP On Prem Buyers Guide - White-paper | Qubole
GCP On Prem Buyers Guide - White-paper | Qubole GCP On Prem Buyers Guide - White-paper | Qubole
GCP On Prem Buyers Guide - White-paper | Qubole
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data Analytics
 
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
2020 Cloud Data Lake Platforms Buyers Guide - White paper | Qubole
 
mapr_case_study_experian
mapr_case_study_experianmapr_case_study_experian
mapr_case_study_experian
 
Hadoop and the Data Warehouse: When to Use Which
Hadoop and the Data Warehouse: When to Use Which Hadoop and the Data Warehouse: When to Use Which
Hadoop and the Data Warehouse: When to Use Which
 
Analyst Report : The Enterprise Use of Hadoop
Analyst Report : The Enterprise Use of Hadoop Analyst Report : The Enterprise Use of Hadoop
Analyst Report : The Enterprise Use of Hadoop
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
 
The new EDW
The new EDWThe new EDW
The new EDW
 

More from VMware Tanzu

What AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItWhat AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItVMware Tanzu
 
Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023VMware Tanzu
 
Enhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleEnhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleVMware Tanzu
 
Spring Update | July 2023
Spring Update | July 2023Spring Update | July 2023
Spring Update | July 2023VMware Tanzu
 
Platforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductPlatforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductVMware Tanzu
 
Building Cloud Ready Apps
Building Cloud Ready AppsBuilding Cloud Ready Apps
Building Cloud Ready AppsVMware Tanzu
 
Spring Boot 3 And Beyond
Spring Boot 3 And BeyondSpring Boot 3 And Beyond
Spring Boot 3 And BeyondVMware Tanzu
 
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfSpring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfVMware Tanzu
 
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023VMware Tanzu
 
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023VMware Tanzu
 
tanzu_developer_connect.pptx
tanzu_developer_connect.pptxtanzu_developer_connect.pptx
tanzu_developer_connect.pptxVMware Tanzu
 
Tanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchTanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchVMware Tanzu
 
Tanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishTanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishVMware Tanzu
 
Virtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVirtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVMware Tanzu
 
Tanzu Developer Connect - French
Tanzu Developer Connect - FrenchTanzu Developer Connect - French
Tanzu Developer Connect - FrenchVMware Tanzu
 
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023VMware Tanzu
 
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootSpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootVMware Tanzu
 
SpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerSpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerVMware Tanzu
 
SpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeSpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeVMware Tanzu
 
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsSpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsVMware Tanzu
 

More from VMware Tanzu (20)

What AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItWhat AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About It
 
Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023
 
Enhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleEnhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at Scale
 
Spring Update | July 2023
Spring Update | July 2023Spring Update | July 2023
Spring Update | July 2023
 
Platforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductPlatforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a Product
 
Building Cloud Ready Apps
Building Cloud Ready AppsBuilding Cloud Ready Apps
Building Cloud Ready Apps
 
Spring Boot 3 And Beyond
Spring Boot 3 And BeyondSpring Boot 3 And Beyond
Spring Boot 3 And Beyond
 
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfSpring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
 
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
 
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
 
tanzu_developer_connect.pptx
tanzu_developer_connect.pptxtanzu_developer_connect.pptx
tanzu_developer_connect.pptx
 
Tanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchTanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - French
 
Tanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishTanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - English
 
Virtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVirtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - English
 
Tanzu Developer Connect - French
Tanzu Developer Connect - FrenchTanzu Developer Connect - French
Tanzu Developer Connect - French
 
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
 
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootSpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
 
SpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerSpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software Engineer
 
SpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeSpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs Practice
 
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsSpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
 

Recently uploaded

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 

Recently uploaded (20)

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 

Daum Communications Case Study

  • 1. DAUM COMMUNICATIONS Using big data analytics to understand and predict user behavior ESSENTIALS Industry Telecommunications Company Size 2,000+ employees Business Challenges • Reduced responsiveness due to inability to perform realtime analysis • Increased complexity from NoSQL database management systems • Reliance on resource-intensive data analysis • Reduced capability to make ad- hoc queries on unstructured data Solution • EMC VNX unified storage • Pivotal Greenplum Database OVERVIEW Daum Communications (Daum) is one of the leading providers of Korean-language online services, including the news and information portal Daum.net, web-based email service Hanmail.net, and the Daum Cafe online community. Headquartered in Jeju Island, the company provides mobile web services, search marketing, and electronic mapping. It also sells online advertising products through Daum.net. Daum is the second largest web portal service provider in terms of daily visits in Korea and has operating centers in Seoul and on Jeju Island. Through its extensive range of Internet services and sale of online advertising products, Daum generates vast amounts of unstructured data. The company has one of the largest Apache Hadoop clusters in Korea, and analyzes its data to gain critical competitive information in a number of areas, including user preferences and behavior, search rankings, and advertisement targeting. COMPLEX ENVIRONMENT IMPEDES DATA ANALYSIS Facing intense domestic and global competition from a number of search engines that are growing market share across desktop and mobile searches, Daum’s businesses needed to make faster and better decisions to protect the company’s 20 percent share of the Korean search market. The company needed to analyze and make immediate decisions on its vast data stores by extracting knowledge from its data in real time. But Daum was more interested in solving analytic problems than in exploring relationships between data that are available in traditional relational database systems. As a result, Daum was using Hadoop to store data, and was using NoSQL non-relational database management systems such as Cassandra and Storm as the Hadoop Distributed File System (HDFS) to provide greater speed in performing Big Data analytics on unstructured data. This solution landscape presented the company with serious challenges. “Performing ad-hoc and multidimensional queries and analysis through Hadoop on our unstructured data proved difficult,” says Jun-Sik Eom, Team Manager, Data Technology Department, Daum Communications. “We were restricted in the speed of data analysis due the batch processing of both unstructured and structured data, which meant we relied heavily on the capability of our developers. Data analysis of complex forms was also challenging in the NoSQL database.” Because Daum’s data must be constantly reviewed, the company sought a solution that would enable employees to perform high-speed queries on the data residing in Hadoop. Additionally, Daum wanted to improve access through tools that were already familiar to developers and database administrators. CUSTOMER PROFILE
  • 2. Benefits • Increased data loading and processing speeds • Improved accuracy in generating search results and predicting user behaviour • Increased efficiency by performing rapid queries on the data • Reduced expenditures through improved scalability PIVOTAL GREENPLUM DATABASE ENABLES HIGH-SPEED ANALYSIS OF UNSTRUCTURED DATA Daum evaluated solutions that could address the limitations in the resource-intensive analysis required by Hadoop and the NoSQL database management systems. To meet the data analysis requirements for its search engine and Internet services businesses, the company selected Pivotal Greenplum Database, which connects to Hadoop and enables the co-processing of both structured and unstructured data within a single solution. “We were attracted to Pivotal Greenplum Database because of the advantage it had in mixing the merits of database, data warehouse, and business intelligence,” says Eom. “We can now use a single platform to run high-speed analytic queries on our most appropriate data stores.” “We were attracted to Pivotal Greenplum Database because of the advantage it had in mixing the merits of database, data warehouse, and business intelligence. We can now use a single platform to run high-speed analytic queries on our most appropriate data stores.” Jun-Sik Eom, Team Manager, Data Technology Department, Daum Communications DELIVERING NEW BUSINESS INSIGHTS FROM REALTIME ANALYSIS To support its efforts to gain market share, Daum is using Pivotal Greenplum Database to provide improved services and search accuracy to its users. Through realtime data gathering and analysis of Internet searches and user behavior within its various online services, the company can better predict future behavior and demand. Daum can now make multiple queries—both in real time and over time as user patterns and knowledge emerge—due to massively parallel processing (MPP) architecture, which enables fast data loading and high-speed queries on the data. In addition to performing real-time weblog analysis, the company can re-analyze data that has already been processed and gain meaningful results with these various interpretations. Pivotal helped Daum achieve an increased depth of knowledge, which is just as critical as breadth in terms of delivering services. ELIMINATING ROADBLOCKS TO SPEEDY QUERYING Performing ad-hoc queries on the data stored in NoSQL databases from the Pivotal Greenplum Database means administrators can use familiar SQL commands to perform massive and multidimensional analysis. This reduces the company’s reliance on finding specialist NoSQL and Hadoop skill sets, and minimizes the workload for employees. “One of the most important elements in effectively using Big Data is securing the right people,” says Eom. “We used to struggle with having the resources needed to perform queries, which greatly reduced our processing efficiency. Today, instead of performing queries on the NoSQL systems, we collect the data residing in Hadoop and NoSQL, and then save it in Pivotal Greenplum Database to execute the analysis.”
  • 3. ENABLING CONTINUOUS PROCESSING WHILE REDUCING COSTS Because Pivotal Greenplum Database is available as a software-only distribution, Daum can run the data warehouse on any of its existing x86 servers running Hadoop. This ensures scalability while eliminating the need for Daum to purchase new data center infrastructure. Pivotal Greenplum Database enables gNet for Hadoop, a parallel communications transport, to access the Hadoop cluster and query the data efficiently using Hadoop servers rather than those running Pivotal Greenplum Database. “By using our existing x86 servers, we were able to reduce expenditures and expand capacity through linear scalability,” Eom explains. “We have continuous processing across Pivotal Greenplum and Hadoop nodes. As the data increases, we can conveniently expand our capacity just by adding standard x86 servers.” LEARN MORE To learn more about Pivotal products, services and solutions, visit gopivotal.com. CONTACT US To learn more about how EMC products, services, and solutions can help solve your business and IT challenges, contact your local representative or authorized reseller— or visit us at www.EMC.com. www.EMC.com EMC2 , EMC and the EMC logo are registered trademarks or trademarks of EMC Corporation in the United States and other countries. GoPivotal, Pivotal, and the Pivotal logo are registered trademarks or trademarks of GoPivotal, Inc, in the United States and other jurisdictions. All other trademarks used herein are the property of their respective owners. © Copyright 2013 EMC Corporation. All rights reserved. Published in the USA. 12/13 Customer Profile H12705. EMC believes the information in this document is accurate as of its publication date. The information is subject to change without notice.