SlideShare a Scribd company logo
1 of 39
Download to read offline
R for finding the non-dominated rules
in multi-objective optimization

Bo-Han Wu
Jan 27, 2014

Taiwan R User Group/MLDM Monday
Google搜尋「資料科學實驗室」

Wu Bo-Han rippleblue2002@gmail.com
Outline
•
•
•
•
•
•
•
•
•

Introduction
Classification rule
Accuracy
Comprehensibility
Interestingness
Multi-objective optimization
Non-dominated rules
SPEA2
Case study
Wu Bo-Han rippleblue2002@gmail.com
Data growing

Wu Bo-Han rippleblue2002@gmail.com
Introduction
• Facing the age of data
explosion, the amount of
data is increasing very fast
in databases.
• Those data normally include
hidden knowledge, and they
can be used to improve the
decision-making process of
any kinds of company.
Wu Bo-Han rippleblue2002@gmail.com
Classification rule
• Classification rule mining is a common
technology in data mining.
• From the historical data, rule can be generalized
to classify unknown samples or predict the future.

Wu Bo-Han rippleblue2002@gmail.com
Classification rule
• IF <some conditions are satisfied> AND <some
conditions are satisfied> THEN <assign some
values of the goal attribute>
• Example:
IF Sex=Male AND Location = Taipei THEN
Product= beer

Wu Bo-Han rippleblue2002@gmail.com
Classification rule
• Traditional mining techniques mostly focus on
accuracy and usually generate lots of rules that
are hard to choose meaningful ones from.
• In order to select optimally meaningful rules,
accuracy, comprehensibility and interestingness
are employed as three objectives.

Wu Bo-Han rippleblue2002@gmail.com
Accuracy

sup( A & C )
A(R) 
sup( A )
•
•

is the support for the rule R
represents the support for the antecedent
of rule R
Wu Bo-Han rippleblue2002@gmail.com
Comprehensibility

Nc ( R)
C( R)  1 
Mc
• Nc(R)is the number of conditions in the rule
• Mc is the maximum number of conditions that a
rule can have
Wu Bo-Han rippleblue2002@gmail.com
Interestingness
sup( A & C ) sup( A & C )  sup( A & C ) 
I (R) 

 1 

sup( A )
sup( C )
D





• 1
• 1
•

gives the probability of generating the rule depending on the antecedent part
gives the probability of generating the rule depending on the consequent part
gives the probability of generating the rule depending on the whole data-set

Wu Bo-Han rippleblue2002@gmail.com
Multi-objective optimization
Low price and high performance
90%

Performance

40%
10k
Non‐dominated solution

Price

100k

Wu Bo-Han rippleblue2002@gmail.com
Multi-objective optimization
Low price and high performance
90%

4

5

3
2

Performance

40%

1

10k
Non‐dominated solution

Price

100k

Wu Bo-Han rippleblue2002@gmail.com
Multi-objective optimization
Low price and high performance
90%

4

5

3
2

Performance

40%
Non‐dominated solution set
Non‐dominated solution

1

10k

Price

100k

Wu Bo-Han rippleblue2002@gmail.com
Multi-objective optimization
• However, traditional methods handle multiobjective problems by converting them into a
single objective problem.
• But this approach can not guarantee to find
optimal solutions for multiple objectives.

Wu Bo-Han rippleblue2002@gmail.com
SPEA2
• SPEA2 is designed by the
concept "survival of the fittest"
from natural evolution.
• The work intended to improve
quality of individuals from
solution space in each
generation.
• SPEA2 used the strategy of
selection, crossover and
mutation to retain the best
individuals and discard worst
ones.
Wu Bo-Han rippleblue2002@gmail.com
SPEA2

Wu Bo-Han rippleblue2002@gmail.com
SPEA2

Initial population

Empty archive

Individual
Wu Bo-Han rippleblue2002@gmail.com
SPEA2

Wu Bo-Han rippleblue2002@gmail.com
Non-dominated

Wu Bo-Han rippleblue2002@gmail.com
Non-dominated solution

Wu Bo-Han rippleblue2002@gmail.com
Non-dominated solution set
E

F

Wu Bo-Han rippleblue2002@gmail.com
SPEA2

Individual
Nod-dominated Individual
Wu Bo-Han rippleblue2002@gmail.com
SPEA2

Wu Bo-Han rippleblue2002@gmail.com
SPEA2

Individual
Nod-dominated Individual
Wu Bo-Han rippleblue2002@gmail.com
SPEA2
Truncation
operator

Individual
Nod-dominated Individual
Wu Bo-Han rippleblue2002@gmail.com
SPEA2

Wu Bo-Han rippleblue2002@gmail.com
SPEA2

Wu Bo-Han rippleblue2002@gmail.com
SPEA2

2

4

1
3

Wu Bo-Han rippleblue2002@gmail.com
SPEA2

Wu Bo-Han rippleblue2002@gmail.com
SPEA2
Recombination
= 10101101011001100100010010111
= 01100110010111001011101101101

Mutation
= 01100101011001100100010010111
= 10010101011001100100010010111

Wu Bo-Han rippleblue2002@gmail.com
SPEA2

4

3

2

1

Wu Bo-Han rippleblue2002@gmail.com
Non-dominated rules
• Three objectives

IF Sex=Male AND Location = Taipei
THEN Product= beer 

A = 0.333333
C = 0.875000
I = 0.080000

– Accuracy
– Comprehensibility
– Interestingness

Non‐dominated rules
Wu Bo-Han rippleblue2002@gmail.com
Case study
Transaction data of an insurance broker company
Date : 2005 ‐ 2006

Attribute
Gender
Occupation
Payment frequency
Sales methods
Payment methods
Location
Data source
Company
Product

Attribute value index
男、女
士、工、軍
月、年、躉繳(一次性繳費)
電話行銷、臨櫃保險
信用卡、現金、郵局劃撥、轉帳
北部、中部、南部、東部(含離島)
百貨、電信業、銀行
外商壽險公司、本土壽險公司、本地產險公司
年金險、長年期壽險、短年期壽險、意外險、醫療險

Wu Bo-Han rippleblue2002@gmail.com
Case study

Data Cleaning

Data transaction

Training data and 
Test data

Example: Male→01 Female→10

Accuracy
Data transaction

SPEA2

Comprehensibility
Interestingness

Example: 01→ Male 10→Female

Wu Bo-Han rippleblue2002@gmail.com
Case study
SPEA2
RuleMing.r
Objective 
Functions.r
SPEA2
Functions.r

Truncation.r

Crossover.r

Mutation.r

Wu Bo-Han rippleblue2002@gmail.com
Case study
Non-dominated rules
Sales methods=臨櫃保險 AND Data source=百貨公司 AND Company=外商壽險公司
THEN Product=短年期壽險
Payment methods=現金 AND Data source=百貨公司 AND Company=外商壽險公司
THEN Product=短年期壽險
Payment frequency=月 AND Data source=百貨公司 Company=外商壽險公司

Wu Bo-Han rippleblue2002@gmail.com
Case study
Non-dominated rules
Sales methods=臨櫃保險 AND Data source=
百貨公司 AND Company=外商壽險公司
THEN Product=短年期壽險

「透過臨櫃保險參加保險的百貨公司
客戶,較會考慮在外商壽險公司購買
短年期壽險」

表示外商壽險公司在針對以臨櫃購買
保險的百貨公司客戶,可以推薦短年
期壽險。

Wu Bo-Han rippleblue2002@gmail.com
Thanks for your listening

Wu Bo-Han rippleblue2002@gmail.com

More Related Content

Viewers also liked

[DSC 2016] 系列活動:李宏毅 / 一天搞懂深度學習
[DSC 2016] 系列活動:李宏毅 / 一天搞懂深度學習[DSC 2016] 系列活動:李宏毅 / 一天搞懂深度學習
[DSC 2016] 系列活動:李宏毅 / 一天搞懂深度學習
台灣資料科學年會
 
21天托福核心单词
21天托福核心单词21天托福核心单词
21天托福核心单词
文嘉 董
 
計量化交易策略的開發與運用 法人版
計量化交易策略的開發與運用 法人版計量化交易策略的開發與運用 法人版
計量化交易策略的開發與運用 法人版
derekhcw168
 

Viewers also liked (10)

[DSC 2016] 系列活動:李宏毅 / 一天搞懂深度學習
[DSC 2016] 系列活動:李宏毅 / 一天搞懂深度學習[DSC 2016] 系列活動:李宏毅 / 一天搞懂深度學習
[DSC 2016] 系列活動:李宏毅 / 一天搞懂深度學習
 
一位年輕探索者的建議
一位年輕探索者的建議一位年輕探索者的建議
一位年輕探索者的建議
 
哥寫的不是程式,是軟體 - 從嵌入式系統看軟體工程全貌
哥寫的不是程式,是軟體 - 從嵌入式系統看軟體工程全貌哥寫的不是程式,是軟體 - 從嵌入式系統看軟體工程全貌
哥寫的不是程式,是軟體 - 從嵌入式系統看軟體工程全貌
 
(Deep) Neural Networks在 NLP 和 Text Mining 总结
(Deep) Neural Networks在 NLP 和 Text Mining 总结(Deep) Neural Networks在 NLP 和 Text Mining 总结
(Deep) Neural Networks在 NLP 和 Text Mining 总结
 
給初學者的Spark教學
給初學者的Spark教學給初學者的Spark教學
給初學者的Spark教學
 
21天托福核心单词
21天托福核心单词21天托福核心单词
21天托福核心单词
 
MySQL技术分享:一步到位实现mysql优化
MySQL技术分享:一步到位实现mysql优化MySQL技术分享:一步到位实现mysql优化
MySQL技术分享:一步到位实现mysql优化
 
計量化交易策略的開發與運用 法人版
計量化交易策略的開發與運用 法人版計量化交易策略的開發與運用 法人版
計量化交易策略的開發與運用 法人版
 
初學R語言的60分鐘
初學R語言的60分鐘初學R語言的60分鐘
初學R語言的60分鐘
 
手把手教你 R 語言資料分析實務/張毓倫&陳柏亨
手把手教你 R 語言資料分析實務/張毓倫&陳柏亨手把手教你 R 語言資料分析實務/張毓倫&陳柏亨
手把手教你 R 語言資料分析實務/張毓倫&陳柏亨
 

Similar to MLDM Monday -- Optimization Series Talk

4 ie 2015 rapid research
4   ie 2015 rapid research4   ie 2015 rapid research
4 ie 2015 rapid research
Maggie Nichols
 
Shawn Wallace - Test automation in brownfield applications
Shawn Wallace - Test automation in brownfield applicationsShawn Wallace - Test automation in brownfield applications
Shawn Wallace - Test automation in brownfield applications
QA or the Highway
 

Similar to MLDM Monday -- Optimization Series Talk (20)

addressing tim/quality trade-off in view maintenance
addressing tim/quality trade-off in view maintenanceaddressing tim/quality trade-off in view maintenance
addressing tim/quality trade-off in view maintenance
 
From Labelling Open data images to building a private recommender system
From Labelling Open data images to building a private recommender systemFrom Labelling Open data images to building a private recommender system
From Labelling Open data images to building a private recommender system
 
4 ie 2015 rapid research
4   ie 2015 rapid research4   ie 2015 rapid research
4 ie 2015 rapid research
 
Ml2 production
Ml2 productionMl2 production
Ml2 production
 
What Makes Training Multi-modal Classification Networks Hard?​
What Makes Training Multi-modal Classification Networks Hard?​What Makes Training Multi-modal Classification Networks Hard?​
What Makes Training Multi-modal Classification Networks Hard?​
 
The deep bootstrap framework review
The deep bootstrap framework reviewThe deep bootstrap framework review
The deep bootstrap framework review
 
Webinar: Question Answering and Virtual Assistants with Deep Learning
Webinar: Question Answering and Virtual Assistants with Deep LearningWebinar: Question Answering and Virtual Assistants with Deep Learning
Webinar: Question Answering and Virtual Assistants with Deep Learning
 
Lessons learnt at building recommendation services at industry scale
Lessons learnt at building recommendation services at industry scaleLessons learnt at building recommendation services at industry scale
Lessons learnt at building recommendation services at industry scale
 
141015 Discovering Scrum at Scrum Roma
141015 Discovering Scrum at Scrum Roma141015 Discovering Scrum at Scrum Roma
141015 Discovering Scrum at Scrum Roma
 
Recommendation engine Using Genetic Algorithm
Recommendation engine Using Genetic AlgorithmRecommendation engine Using Genetic Algorithm
Recommendation engine Using Genetic Algorithm
 
Approximate Continuous Query Answering Over Streams and Dynamic Linked Data Sets
Approximate Continuous Query Answering Over Streams and Dynamic Linked Data SetsApproximate Continuous Query Answering Over Streams and Dynamic Linked Data Sets
Approximate Continuous Query Answering Over Streams and Dynamic Linked Data Sets
 
How to design powerful experiments - Ying Zhang
How to design powerful experiments - Ying ZhangHow to design powerful experiments - Ying Zhang
How to design powerful experiments - Ying Zhang
 
Shawn Wallace - Test automation in brownfield applications
Shawn Wallace - Test automation in brownfield applicationsShawn Wallace - Test automation in brownfield applications
Shawn Wallace - Test automation in brownfield applications
 
Software Testing’s Future—According to Lee Copeland
Software Testing’s Future—According to Lee CopelandSoftware Testing’s Future—According to Lee Copeland
Software Testing’s Future—According to Lee Copeland
 
Technical Excellence Doesn't Just Happen--Igniting a Craftsmanship Culture
Technical Excellence Doesn't Just Happen--Igniting a Craftsmanship CultureTechnical Excellence Doesn't Just Happen--Igniting a Craftsmanship Culture
Technical Excellence Doesn't Just Happen--Igniting a Craftsmanship Culture
 
Defect Metrics for Organization and Project Health
Defect Metrics for Organization and Project HealthDefect Metrics for Organization and Project Health
Defect Metrics for Organization and Project Health
 
2014 State Of DevOps Findings! Velocity Conference
2014 State Of DevOps Findings! Velocity Conference2014 State Of DevOps Findings! Velocity Conference
2014 State Of DevOps Findings! Velocity Conference
 
Introducing QA Into an Agile Environment
Introducing QA Into an Agile EnvironmentIntroducing QA Into an Agile Environment
Introducing QA Into an Agile Environment
 
Agile Data Warehousing
Agile Data WarehousingAgile Data Warehousing
Agile Data Warehousing
 
Mapping Project Management Work to DevOps - style Workflows
Mapping Project Management Work to DevOps - style WorkflowsMapping Project Management Work to DevOps - style Workflows
Mapping Project Management Work to DevOps - style Workflows
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 

MLDM Monday -- Optimization Series Talk