SlideShare a Scribd company logo
1 of 35
Download to read offline
Default of Credit Card Clients
Presented By,
Hetarth Bhatt – 251056818
Khushali Patel – 25105445
Rajaraman Ganesan – 251056279
Vatsal Shah – 251041322
Subject: Data Analytics
Department of Electrical & Computer Engineering (M.Engg)
Western University, Canada
AGENDA
Credit Card
Default
Problem
(DEMO)
Problem
Overview
Dataset
Overview
Data
Preparation
Proposed
Models
Evaluation
Conclusion
Overview
● Banking / Financial Institutes plays a significant role in providing financial service.
● To maintain the integrity, bank/institute must be careful when investing in customers to avoid
financial loss.
● Before giving credit to borrowers, the bank must come to about the potential of customers.
● The term credit scoring, determines the relation between defaulters and loan characteristics.
Goal
● The model we built here will use all possible factors to predict data on customers to find who are
defaulters and non-defaulters next month.
● The goal is to find the whether the clients are able to pay their next month credit amount.
● Identify some potential customers for the bank who can settle their credit balance.
● To determine if their customers could make the credit card payments on-time.
● Default is the failure to pay interest or principal on a loan or credit card payment.
Process Design
● The process is done by Supervised learning Algorithm.
● The idea behind using this is we have a prior knowledge on our output values.
● It acts as a guide to teach the algorithms what conclusion it should come up with.
● Common algorithms in supervised learning includes logistic regression, naive bayes, support vector
machines and decision tree classifier.
Approach Design
Preparation
of data
Applying
models
Comparing the
applied models
Evaluation
Model
Analyse
the Best
model
Credit Card
Default
Problem
(DEMO)
Problem
Overview
Dataset
Overview
Data
Preparation
Proposed
Models
Evaluation
Conclusion
Dataset Overview
● Oriented: UCI Machine Learning Repository. (Link)
● Attributes: 24
● Tuples: 30,000
Customers
data
Customers
data
Continue…
Independent
variables:
• Customer ID
• Credit limit
• Gender
• Age
• Marital status
• Level of education
• History of their past payments made (April to
September) (g1 to g6)
• Amount of bill statement (h1 to h6)
• Amount of previous payment (i1 to i6)
Dependent
variables:
• default – A customer who will be default next month
payment (0: no, 1: yes)
Dataset overview
Graph shows total number
of records for defaulters
and non-defaulters.
If they would do payment
or not (yes=1 no=0) for
next month
22% - default
78% - non-default
(Screenshot: Taken during evaluation process)
Continue…
1. It shows count for ‘sex’ attribute
1 - Male and 2 - Female
2. It shows default count for ‘marriage’ attribute
0,3 - Others, 1 - Married, 2 - Single
(Screenshot 1,2: Taken during evaluation process)
Continue…
2. It shows Limit of balance with respect to age.
1. It shows count for Total ‘Credit limit’ attribute
values with respect to Number of Account
(Screenshot 1,2: Taken during evaluation process)
Credit Card
Default
Problem
(DEMO)
Problem
Overview
Dataset
Overview
Data
Preparation
Proposed
Models
Evaluation
Conclusion
Data Preprocessing
● Data set is divided in 60:40 ratio for train and test respectively.
● ID column was dropped as its unnecessary for our modeling.
● The attribute name ‘PAY_0’ was converted to ‘PAY_1’ for naming convenience.
● Pay_0: No consumption of credit card=-2, Pay duly(paid on time)=-1, payment delay for one month=1,
payment delay for two months=2, payment delay for nine months and above=9.
● Marital Status 0 is converted to value 3 (Others).
● No Null values in dataset.
● Robust Scaler – It is used to convert variable in the same scale. Ex. in Limit Balance column there are
different range of values which are converted in proper scale.
● Education ● Gender ● Marital status
Feature Engineering
Education Value
Graduate 1
University 2
High School 3
Others 4
Gender Value
Male 1
Female 2
Marital Status Value
- 0
Married 1
Single 2
Others 3
One-Hot Encoding
● The process in which Categorical variables are converted into a form, where we can apply algorithms to do
prediction easily.
● (n-1) features
● Ex. There are 4 features in Education  Converted to 3 features (n-1)
Credit Card
Default
Problem
(DEMO)
Problem
Overview
Dataset
Overview
Data
Preparation
Proposed
Models
Evaluation
Conclusion
Proposed Models
Logistic Regression
•It is used for Binary classification.
•Outputs have a nice probabilistic interpretation, and the algorithm can be regularized to avoid over fitting.
•In logistic regression the hypothesis is that the conditional probability p of class belongs to ”1”
•if probability is greater than threshold probability, generally 0.5, else it belongs to the class ”0”.
•Ex. Y 𝑖 = ቊ
1 , 𝑝 ≥ 0.5
0, 𝑝 < 0.5
Support Vector Machine
•Kernel: linear and enabled probability: ‘true’
•Soft-SVM search used in the process.
•Hard-SVM searches for the decision boundary that separates the training data separately with the largest
margin.
•Soft-SVM is based on the assumption that learning data is not perfectly separable.
Continue…
Feed Forward Neural Network
•Feed forward NN is used with Backpropagation algorithm. Below parameters decided tuned parameters
•Activation function : Rectified Linear Unit(ReLU) and Sigmoid
•SGD: adam
•Epochs: 100
•Loss function: Binary cross entropy
•Input layer : 26, Hidden layer : 2, Output: 1
•Early Stopping
Voting Classifier
•Estimators: list of classifiers
•Combined average of Logistic Regression + Decision tree Classifier + Support Vector Machine (SVM)
Naïve Bayes, KNN and Decision Tree classifier
Evaluation Process
Evaluation Metrics:
● Accuracy: Accuracy determine how often the model predicts default and non-default correctly.
● Precision: Precision calculates whenever our models predicts it is default how often it is correct.
● Recall: Recall regulate the actual default that the model is actually predict.
● Precision Recall Curve: PRC will display the tradeoff between precision and recall threshold.
Cross Validation:
● K Fold cross validation; k = 5
Confusion Metrics
True Positive – A person who is defaulter and predicted as defaulter.
True Negative – A person who is non-defaulter and predicted as non-defaulter.
False Positive – A person who is predicted defaulter is non-defaulter.
False Negative – A person who is predicted non-defaulter is defaulter.
# Non-defaulter (predicted) - 0 Defaulter (predicted) - 1
Non-defaulter (actual) - 0 TN FP
Defaulter (actual) - 1 FN TP
Credit Card
Default
Problem
(DEMO)
Problem
Overview
Dataset
Overview
Data
Preparation
Proposed
Models
Evaluation
Conclusion
Evaluation Result
No. Algorithms Accuracy Precision Recall Confusion Metrix
- Null 78 - - -
1 Logistic Regression 81.45 66.92 35.95
8927 419
1806 848
2 KNN 78.86 53.47 34.17 8557 789
1747 907
3 Naïve Byes 76.68 47.65 52.23
7736 1610
1188 1466
4 Classification Tree 78.46 52.09 32.81 8545 801
1783 871
5 SVM 81.66 63.99 39.11 8762 584
1616 1038
6 Feed Forward NN 75.65 33.91 40.74
8927 419
1806 848
7 Voting Classifier 83.95 67.49 32.83 8842 504
1822 832
Graphical representation
Bar chart representation of Precision, Recall and Accuracy values
Precision-Recall curve
Comparison of implemented algorithms with Precision-recall curve
Precision Recall curve with threshold
● Precision-Recall comparison with respect to the
threshold value.
● Adjust precision and recall value by adjusting
threshold value
● We set threshold value  0.2
● False Negative value is improved by 40%.
Credit Card
Default
Problem
(DEMO)
Problem
Overview
Dataset
Overview
Data
Preparation
Proposed
Models
Evaluation
Process
Conclusion
Conclusion
● We investigated the data, checking for data unbalancing, visualizing the features and understanding the
relationship between different features.
● We used both train-validation split and cross-validation to evaluate the model effectiveness to predict the target
value, i.e. detecting if a credit card client will default next month.
● We then investigated five predictive models:
○ We started with Logistic Regression, Naïve bayes, SVM, KNN, Classification Tree and Feed-forward NN and
Voting classifier accuracy is almost same.
○ We choose based model based on minimum value of False Negative value.
○ This would also inform the issuer’s decisions on who to give a credit card to and what credit limit to
provide.
Credit Card
Default
Problem
(DEMO)
Problem
Overview
Dataset
Overview
Data
Preparation
Proposed
Models
Evaluation
Process
Conclusion
Demo
QR Code
S C A N Q R C O D E A N D A C C ES S P P T A N D C O D E ( A C C E S S T H E C O D E : H T T P : / / V A T S A L S H A H . I N / P R O J E C T S . H T M L )
References
● Li, Xiao-Lin, and Yu Zhong. An overview of personal credit scoring: techniques and future work. Journal:
International Journal of Intelligence Science ISSN 2163-0283. 2012.
● Federal Reserve. (2017) “Report to the Congress on the Profitability of Credit Card Operations of Depository
Institutions”.
● Taiwo Oladipupo Ayodele. (2010) “Types of Machine Learning Algorithms”, New Advances In Machine
Learning, Yagang Zhang (Ed.), Intech
● Default Credit Card Clients Dataset, https://www.kaggle.com/uciml/default-of-credit-card-clients-dataset/
● Davis J., Goadrich M. The Relationship Between Precision-Recall and ROC Curves.
ACM New York, NY, USA 2006. ISBN:1-59593-383-2.
● Arlot, Sylvain, and Alain Celisse. A survey of cross-validation procedures for model
selection. eprint arXiv:0907.4728. DOI:10.1214/09-SS054.
● https://www.analyticsvidhya.com/blog/2016/03/introduction-deep-learning-fundamentals-neural-
networks/
Contact Us
You can share your feedback on below Email-IDs.
● Hetarth Bhatt – hbhatt7@uwo.ca
● Khushali Patel – kpate372@uwo.ca
● Rajaraman Ganesan – rganesa@uwo.ca
● Vatsal Shah – vshh56@uwo.ca (http://vatsalshah.in)
THANK YOU

More Related Content

What's hot

Loan Default Prediction with Machine Learning
Loan Default Prediction with Machine LearningLoan Default Prediction with Machine Learning
Loan Default Prediction with Machine LearningAlibaba Cloud
 
Credit Card Fraud Detection
Credit Card Fraud DetectionCredit Card Fraud Detection
Credit Card Fraud Detectionijtsrd
 
Credit risk scoring model final
Credit risk scoring model finalCredit risk scoring model final
Credit risk scoring model finalRitu Sarkar
 
Case Study: Loan default prediction
Case Study: Loan default predictionCase Study: Loan default prediction
Case Study: Loan default predictionALTEN Calsoft Labs
 
Analysis of-credit-card-fault-detection
Analysis of-credit-card-fault-detectionAnalysis of-credit-card-fault-detection
Analysis of-credit-card-fault-detectionJustluk Luk
 
Overview of Data Analytics in Lending Business
Overview of Data Analytics in Lending BusinessOverview of Data Analytics in Lending Business
Overview of Data Analytics in Lending BusinessSanjay Kar
 
Measuring and Managing Credit Risk With Machine Learning and Artificial Intel...
Measuring and Managing Credit Risk With Machine Learning and Artificial Intel...Measuring and Managing Credit Risk With Machine Learning and Artificial Intel...
Measuring and Managing Credit Risk With Machine Learning and Artificial Intel...accenture
 
Big Data: Banking Industry Use Case
Big Data: Banking Industry Use Case Big Data: Banking Industry Use Case
Big Data: Banking Industry Use Case Ramandeep Kaur Bagri
 
AI powered decision making in banks
AI powered decision making in banksAI powered decision making in banks
AI powered decision making in banksPankaj Baid
 
Machine Learning Applications in Credit Risk
Machine Learning Applications in Credit RiskMachine Learning Applications in Credit Risk
Machine Learning Applications in Credit RiskQuantUniversity
 
Introduction to predictive modeling v1
Introduction to predictive modeling v1Introduction to predictive modeling v1
Introduction to predictive modeling v1Venkata Reddy Konasani
 
Build Intelligent Fraud Prevention with Machine Learning and Graphs
Build Intelligent Fraud Prevention with Machine Learning and GraphsBuild Intelligent Fraud Prevention with Machine Learning and Graphs
Build Intelligent Fraud Prevention with Machine Learning and GraphsNeo4j
 
Credit Card Fraud Detection
Credit Card Fraud DetectionCredit Card Fraud Detection
Credit Card Fraud DetectionBinayakreddy
 
Machine Learning for Fraud Detection
Machine Learning for Fraud DetectionMachine Learning for Fraud Detection
Machine Learning for Fraud DetectionNitesh Kumar
 
Time series forecasting with machine learning
Time series forecasting with machine learningTime series forecasting with machine learning
Time series forecasting with machine learningDr Wei Liu
 
Credit card fraud detection
Credit card fraud detectionCredit card fraud detection
Credit card fraud detectionvineeta vineeta
 

What's hot (20)

Loan Default Prediction with Machine Learning
Loan Default Prediction with Machine LearningLoan Default Prediction with Machine Learning
Loan Default Prediction with Machine Learning
 
Loan prediction
Loan predictionLoan prediction
Loan prediction
 
Credit Card Fraud Detection
Credit Card Fraud DetectionCredit Card Fraud Detection
Credit Card Fraud Detection
 
Credit risk scoring model final
Credit risk scoring model finalCredit risk scoring model final
Credit risk scoring model final
 
Case Study: Loan default prediction
Case Study: Loan default predictionCase Study: Loan default prediction
Case Study: Loan default prediction
 
Analysis of-credit-card-fault-detection
Analysis of-credit-card-fault-detectionAnalysis of-credit-card-fault-detection
Analysis of-credit-card-fault-detection
 
Overview of Data Analytics in Lending Business
Overview of Data Analytics in Lending BusinessOverview of Data Analytics in Lending Business
Overview of Data Analytics in Lending Business
 
Measuring and Managing Credit Risk With Machine Learning and Artificial Intel...
Measuring and Managing Credit Risk With Machine Learning and Artificial Intel...Measuring and Managing Credit Risk With Machine Learning and Artificial Intel...
Measuring and Managing Credit Risk With Machine Learning and Artificial Intel...
 
Big Data: Banking Industry Use Case
Big Data: Banking Industry Use Case Big Data: Banking Industry Use Case
Big Data: Banking Industry Use Case
 
AI powered decision making in banks
AI powered decision making in banksAI powered decision making in banks
AI powered decision making in banks
 
Artificial Intelligence in Banking
Artificial Intelligence in BankingArtificial Intelligence in Banking
Artificial Intelligence in Banking
 
Machine Learning Applications in Credit Risk
Machine Learning Applications in Credit RiskMachine Learning Applications in Credit Risk
Machine Learning Applications in Credit Risk
 
Introduction to predictive modeling v1
Introduction to predictive modeling v1Introduction to predictive modeling v1
Introduction to predictive modeling v1
 
NLP in Finance
NLP in FinanceNLP in Finance
NLP in Finance
 
Build Intelligent Fraud Prevention with Machine Learning and Graphs
Build Intelligent Fraud Prevention with Machine Learning and GraphsBuild Intelligent Fraud Prevention with Machine Learning and Graphs
Build Intelligent Fraud Prevention with Machine Learning and Graphs
 
Machine learning & Time Series Analysis
Machine learning & Time Series AnalysisMachine learning & Time Series Analysis
Machine learning & Time Series Analysis
 
Credit Card Fraud Detection
Credit Card Fraud DetectionCredit Card Fraud Detection
Credit Card Fraud Detection
 
Machine Learning for Fraud Detection
Machine Learning for Fraud DetectionMachine Learning for Fraud Detection
Machine Learning for Fraud Detection
 
Time series forecasting with machine learning
Time series forecasting with machine learningTime series forecasting with machine learning
Time series forecasting with machine learning
 
Credit card fraud detection
Credit card fraud detectionCredit card fraud detection
Credit card fraud detection
 

Similar to Machine Learning Project - Default credit card clients

Summer 07-mfin7011-tang1922
Summer 07-mfin7011-tang1922Summer 07-mfin7011-tang1922
Summer 07-mfin7011-tang1922stone55
 
Supervised learning
Supervised learningSupervised learning
Supervised learningJohnson Ubah
 
Loan Approval Prediction Using Machine Learning
Loan Approval Prediction Using Machine LearningLoan Approval Prediction Using Machine Learning
Loan Approval Prediction Using Machine LearningSouma Maiti
 
Loan Risk Assessment & Scoring Model
Loan Risk Assessment & Scoring ModelLoan Risk Assessment & Scoring Model
Loan Risk Assessment & Scoring ModelSaurabh Singh
 
A high level overview of all that is Analytics
A high level overview of all that is AnalyticsA high level overview of all that is Analytics
A high level overview of all that is AnalyticsRamkumar Ravichandran
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionGirish Gore
 
Neural Network Model
Neural Network ModelNeural Network Model
Neural Network ModelEric Esajian
 
Loan Eligibility Checker
Loan Eligibility CheckerLoan Eligibility Checker
Loan Eligibility CheckerKiranVodela
 
Comparison Study of Neural Network and Deep Neural Network on Repricing GAP P...
Comparison Study of Neural Network and Deep Neural Network on Repricing GAP P...Comparison Study of Neural Network and Deep Neural Network on Repricing GAP P...
Comparison Study of Neural Network and Deep Neural Network on Repricing GAP P...Hendri Karisma
 
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUES
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUESCREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUES
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUESijaia
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - ReportAkanksha Gohil
 
Construction of a robust prediction model to forecast the likelihood of a cre...
Construction of a robust prediction model to forecast the likelihood of a cre...Construction of a robust prediction model to forecast the likelihood of a cre...
Construction of a robust prediction model to forecast the likelihood of a cre...AdekunleJoseph4
 

Similar to Machine Learning Project - Default credit card clients (20)

Machine_Learning.pptx
Machine_Learning.pptxMachine_Learning.pptx
Machine_Learning.pptx
 
Credit iconip
Credit iconipCredit iconip
Credit iconip
 
Summer 07-mfin7011-tang1922
Summer 07-mfin7011-tang1922Summer 07-mfin7011-tang1922
Summer 07-mfin7011-tang1922
 
Supervised learning
Supervised learningSupervised learning
Supervised learning
 
Loan Approval Prediction Using Machine Learning
Loan Approval Prediction Using Machine LearningLoan Approval Prediction Using Machine Learning
Loan Approval Prediction Using Machine Learning
 
Credit iconip
Credit iconipCredit iconip
Credit iconip
 
Credit iconip
Credit iconipCredit iconip
Credit iconip
 
presentationIDC - 14MAY2015
presentationIDC - 14MAY2015presentationIDC - 14MAY2015
presentationIDC - 14MAY2015
 
Loan Risk Assessment & Scoring Model
Loan Risk Assessment & Scoring ModelLoan Risk Assessment & Scoring Model
Loan Risk Assessment & Scoring Model
 
A high level overview of all that is Analytics
A high level overview of all that is AnalyticsA high level overview of all that is Analytics
A high level overview of all that is Analytics
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regression
 
scrib.pptx
scrib.pptxscrib.pptx
scrib.pptx
 
Credit Scoring Capstone Project- Pallavi Mohanty.pptx
Credit Scoring Capstone Project- Pallavi Mohanty.pptxCredit Scoring Capstone Project- Pallavi Mohanty.pptx
Credit Scoring Capstone Project- Pallavi Mohanty.pptx
 
Neural Network Model
Neural Network ModelNeural Network Model
Neural Network Model
 
Loan Eligibility Checker
Loan Eligibility CheckerLoan Eligibility Checker
Loan Eligibility Checker
 
Comparison Study of Neural Network and Deep Neural Network on Repricing GAP P...
Comparison Study of Neural Network and Deep Neural Network on Repricing GAP P...Comparison Study of Neural Network and Deep Neural Network on Repricing GAP P...
Comparison Study of Neural Network and Deep Neural Network on Repricing GAP P...
 
Credit scorecard
Credit scorecardCredit scorecard
Credit scorecard
 
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUES
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUESCREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUES
CREDIT RISK MANAGEMENT USING ARTIFICIAL INTELLIGENCE TECHNIQUES
 
Data Analytics Using R - Report
Data Analytics Using R - ReportData Analytics Using R - Report
Data Analytics Using R - Report
 
Construction of a robust prediction model to forecast the likelihood of a cre...
Construction of a robust prediction model to forecast the likelihood of a cre...Construction of a robust prediction model to forecast the likelihood of a cre...
Construction of a robust prediction model to forecast the likelihood of a cre...
 

More from Vatsal N Shah

I am sharing my journey with Indus University and Indus Family. Read Page No:...
I am sharing my journey with Indus University and Indus Family. Read Page No:...I am sharing my journey with Indus University and Indus Family. Read Page No:...
I am sharing my journey with Indus University and Indus Family. Read Page No:...Vatsal N Shah
 
Floor cleaning robot(autonomus mannual) vatsal shah-ec_4th year
Floor cleaning robot(autonomus mannual)  vatsal shah-ec_4th yearFloor cleaning robot(autonomus mannual)  vatsal shah-ec_4th year
Floor cleaning robot(autonomus mannual) vatsal shah-ec_4th yearVatsal N Shah
 
Presentation IOT Robot
Presentation IOT RobotPresentation IOT Robot
Presentation IOT RobotVatsal N Shah
 
Poster IOTROBOT vatsalnshah_ec_indusuniversity
Poster IOTROBOT vatsalnshah_ec_indusuniversityPoster IOTROBOT vatsalnshah_ec_indusuniversity
Poster IOTROBOT vatsalnshah_ec_indusuniversityVatsal N Shah
 
Project ideas ece students
Project ideas ece studentsProject ideas ece students
Project ideas ece studentsVatsal N Shah
 
Raspbeery Pi : An Introduction
Raspbeery Pi : An IntroductionRaspbeery Pi : An Introduction
Raspbeery Pi : An IntroductionVatsal N Shah
 
Report Remote communication of Robotic module using lifa
Report Remote communication of Robotic module using lifaReport Remote communication of Robotic module using lifa
Report Remote communication of Robotic module using lifaVatsal N Shah
 
Floor cleaning robot report vatsal shah_ec_7th sem
Floor cleaning robot report vatsal shah_ec_7th semFloor cleaning robot report vatsal shah_ec_7th sem
Floor cleaning robot report vatsal shah_ec_7th semVatsal N Shah
 
IRC Magazine Issue_1
IRC Magazine Issue_1IRC Magazine Issue_1
IRC Magazine Issue_1Vatsal N Shah
 
Advanced wheel chair vatsal shah
Advanced wheel chair   vatsal shah Advanced wheel chair   vatsal shah
Advanced wheel chair vatsal shah Vatsal N Shah
 
Configuring lifa for remote communication using web architecture
Configuring lifa for remote communication using web architectureConfiguring lifa for remote communication using web architecture
Configuring lifa for remote communication using web architectureVatsal N Shah
 
Control robotic module using LIFA
Control robotic module using LIFAControl robotic module using LIFA
Control robotic module using LIFAVatsal N Shah
 
E-sync-Revista-Editon-1
E-sync-Revista-Editon-1E-sync-Revista-Editon-1
E-sync-Revista-Editon-1Vatsal N Shah
 

More from Vatsal N Shah (20)

Nodejs vatsal shah
Nodejs vatsal shahNodejs vatsal shah
Nodejs vatsal shah
 
Search and rescue
Search and rescueSearch and rescue
Search and rescue
 
Am fm transmitter
Am fm transmitterAm fm transmitter
Am fm transmitter
 
I am sharing my journey with Indus University and Indus Family. Read Page No:...
I am sharing my journey with Indus University and Indus Family. Read Page No:...I am sharing my journey with Indus University and Indus Family. Read Page No:...
I am sharing my journey with Indus University and Indus Family. Read Page No:...
 
Floor cleaning robot(autonomus mannual) vatsal shah-ec_4th year
Floor cleaning robot(autonomus mannual)  vatsal shah-ec_4th yearFloor cleaning robot(autonomus mannual)  vatsal shah-ec_4th year
Floor cleaning robot(autonomus mannual) vatsal shah-ec_4th year
 
Report IOT Robot
Report IOT RobotReport IOT Robot
Report IOT Robot
 
Presentation IOT Robot
Presentation IOT RobotPresentation IOT Robot
Presentation IOT Robot
 
Poster IOTROBOT vatsalnshah_ec_indusuniversity
Poster IOTROBOT vatsalnshah_ec_indusuniversityPoster IOTROBOT vatsalnshah_ec_indusuniversity
Poster IOTROBOT vatsalnshah_ec_indusuniversity
 
Project ideas ece students
Project ideas ece studentsProject ideas ece students
Project ideas ece students
 
Projects
ProjectsProjects
Projects
 
Raspbeery Pi : An Introduction
Raspbeery Pi : An IntroductionRaspbeery Pi : An Introduction
Raspbeery Pi : An Introduction
 
Report Remote communication of Robotic module using lifa
Report Remote communication of Robotic module using lifaReport Remote communication of Robotic module using lifa
Report Remote communication of Robotic module using lifa
 
Floor cleaning robot report vatsal shah_ec_7th sem
Floor cleaning robot report vatsal shah_ec_7th semFloor cleaning robot report vatsal shah_ec_7th sem
Floor cleaning robot report vatsal shah_ec_7th sem
 
IRC Magazine Issue_1
IRC Magazine Issue_1IRC Magazine Issue_1
IRC Magazine Issue_1
 
Advanced wheel chair vatsal shah
Advanced wheel chair   vatsal shah Advanced wheel chair   vatsal shah
Advanced wheel chair vatsal shah
 
Configuring lifa for remote communication using web architecture
Configuring lifa for remote communication using web architectureConfiguring lifa for remote communication using web architecture
Configuring lifa for remote communication using web architecture
 
Control robotic module using LIFA
Control robotic module using LIFAControl robotic module using LIFA
Control robotic module using LIFA
 
Trapatt diode
Trapatt diode Trapatt diode
Trapatt diode
 
Telemedicine
Telemedicine Telemedicine
Telemedicine
 
E-sync-Revista-Editon-1
E-sync-Revista-Editon-1E-sync-Revista-Editon-1
E-sync-Revista-Editon-1
 

Recently uploaded

Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfDrew Moseley
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commercemanigoyal112
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringHironori Washizaki
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineeringssuserb3a23b
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfkalichargn70th171
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfFerryKemperman
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalLionel Briand
 

Recently uploaded (20)

Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdf
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commerce
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Advantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your BusinessAdvantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your Business
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their Engineering
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineering
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdfExploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
Exploring Selenium_Appium Frameworks for Seamless Integration with HeadSpin.pdf
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdf
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Precise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive GoalPrecise and Complete Requirements? An Elusive Goal
Precise and Complete Requirements? An Elusive Goal
 

Machine Learning Project - Default credit card clients

  • 1. Default of Credit Card Clients Presented By, Hetarth Bhatt – 251056818 Khushali Patel – 25105445 Rajaraman Ganesan – 251056279 Vatsal Shah – 251041322 Subject: Data Analytics Department of Electrical & Computer Engineering (M.Engg) Western University, Canada
  • 3. Overview ● Banking / Financial Institutes plays a significant role in providing financial service. ● To maintain the integrity, bank/institute must be careful when investing in customers to avoid financial loss. ● Before giving credit to borrowers, the bank must come to about the potential of customers. ● The term credit scoring, determines the relation between defaulters and loan characteristics.
  • 4. Goal ● The model we built here will use all possible factors to predict data on customers to find who are defaulters and non-defaulters next month. ● The goal is to find the whether the clients are able to pay their next month credit amount. ● Identify some potential customers for the bank who can settle their credit balance. ● To determine if their customers could make the credit card payments on-time. ● Default is the failure to pay interest or principal on a loan or credit card payment.
  • 5. Process Design ● The process is done by Supervised learning Algorithm. ● The idea behind using this is we have a prior knowledge on our output values. ● It acts as a guide to teach the algorithms what conclusion it should come up with. ● Common algorithms in supervised learning includes logistic regression, naive bayes, support vector machines and decision tree classifier.
  • 6. Approach Design Preparation of data Applying models Comparing the applied models Evaluation Model Analyse the Best model
  • 8. Dataset Overview ● Oriented: UCI Machine Learning Repository. (Link) ● Attributes: 24 ● Tuples: 30,000 Customers data Customers data
  • 9. Continue… Independent variables: • Customer ID • Credit limit • Gender • Age • Marital status • Level of education • History of their past payments made (April to September) (g1 to g6) • Amount of bill statement (h1 to h6) • Amount of previous payment (i1 to i6) Dependent variables: • default – A customer who will be default next month payment (0: no, 1: yes)
  • 10. Dataset overview Graph shows total number of records for defaulters and non-defaulters. If they would do payment or not (yes=1 no=0) for next month 22% - default 78% - non-default (Screenshot: Taken during evaluation process)
  • 11. Continue… 1. It shows count for ‘sex’ attribute 1 - Male and 2 - Female 2. It shows default count for ‘marriage’ attribute 0,3 - Others, 1 - Married, 2 - Single (Screenshot 1,2: Taken during evaluation process)
  • 12. Continue… 2. It shows Limit of balance with respect to age. 1. It shows count for Total ‘Credit limit’ attribute values with respect to Number of Account (Screenshot 1,2: Taken during evaluation process)
  • 14. Data Preprocessing ● Data set is divided in 60:40 ratio for train and test respectively. ● ID column was dropped as its unnecessary for our modeling. ● The attribute name ‘PAY_0’ was converted to ‘PAY_1’ for naming convenience. ● Pay_0: No consumption of credit card=-2, Pay duly(paid on time)=-1, payment delay for one month=1, payment delay for two months=2, payment delay for nine months and above=9. ● Marital Status 0 is converted to value 3 (Others). ● No Null values in dataset. ● Robust Scaler – It is used to convert variable in the same scale. Ex. in Limit Balance column there are different range of values which are converted in proper scale.
  • 15. ● Education ● Gender ● Marital status Feature Engineering Education Value Graduate 1 University 2 High School 3 Others 4 Gender Value Male 1 Female 2 Marital Status Value - 0 Married 1 Single 2 Others 3
  • 16. One-Hot Encoding ● The process in which Categorical variables are converted into a form, where we can apply algorithms to do prediction easily. ● (n-1) features ● Ex. There are 4 features in Education  Converted to 3 features (n-1)
  • 18. Proposed Models Logistic Regression •It is used for Binary classification. •Outputs have a nice probabilistic interpretation, and the algorithm can be regularized to avoid over fitting. •In logistic regression the hypothesis is that the conditional probability p of class belongs to ”1” •if probability is greater than threshold probability, generally 0.5, else it belongs to the class ”0”. •Ex. Y 𝑖 = ቊ 1 , 𝑝 ≥ 0.5 0, 𝑝 < 0.5 Support Vector Machine •Kernel: linear and enabled probability: ‘true’ •Soft-SVM search used in the process. •Hard-SVM searches for the decision boundary that separates the training data separately with the largest margin. •Soft-SVM is based on the assumption that learning data is not perfectly separable.
  • 19. Continue… Feed Forward Neural Network •Feed forward NN is used with Backpropagation algorithm. Below parameters decided tuned parameters •Activation function : Rectified Linear Unit(ReLU) and Sigmoid •SGD: adam •Epochs: 100 •Loss function: Binary cross entropy •Input layer : 26, Hidden layer : 2, Output: 1 •Early Stopping Voting Classifier •Estimators: list of classifiers •Combined average of Logistic Regression + Decision tree Classifier + Support Vector Machine (SVM) Naïve Bayes, KNN and Decision Tree classifier
  • 20. Evaluation Process Evaluation Metrics: ● Accuracy: Accuracy determine how often the model predicts default and non-default correctly. ● Precision: Precision calculates whenever our models predicts it is default how often it is correct. ● Recall: Recall regulate the actual default that the model is actually predict. ● Precision Recall Curve: PRC will display the tradeoff between precision and recall threshold. Cross Validation: ● K Fold cross validation; k = 5
  • 21. Confusion Metrics True Positive – A person who is defaulter and predicted as defaulter. True Negative – A person who is non-defaulter and predicted as non-defaulter. False Positive – A person who is predicted defaulter is non-defaulter. False Negative – A person who is predicted non-defaulter is defaulter. # Non-defaulter (predicted) - 0 Defaulter (predicted) - 1 Non-defaulter (actual) - 0 TN FP Defaulter (actual) - 1 FN TP
  • 23. Evaluation Result No. Algorithms Accuracy Precision Recall Confusion Metrix - Null 78 - - - 1 Logistic Regression 81.45 66.92 35.95 8927 419 1806 848 2 KNN 78.86 53.47 34.17 8557 789 1747 907 3 Naïve Byes 76.68 47.65 52.23 7736 1610 1188 1466 4 Classification Tree 78.46 52.09 32.81 8545 801 1783 871 5 SVM 81.66 63.99 39.11 8762 584 1616 1038 6 Feed Forward NN 75.65 33.91 40.74 8927 419 1806 848 7 Voting Classifier 83.95 67.49 32.83 8842 504 1822 832
  • 24. Graphical representation Bar chart representation of Precision, Recall and Accuracy values
  • 25. Precision-Recall curve Comparison of implemented algorithms with Precision-recall curve
  • 26. Precision Recall curve with threshold ● Precision-Recall comparison with respect to the threshold value. ● Adjust precision and recall value by adjusting threshold value ● We set threshold value  0.2 ● False Negative value is improved by 40%.
  • 28. Conclusion ● We investigated the data, checking for data unbalancing, visualizing the features and understanding the relationship between different features. ● We used both train-validation split and cross-validation to evaluate the model effectiveness to predict the target value, i.e. detecting if a credit card client will default next month. ● We then investigated five predictive models: ○ We started with Logistic Regression, Naïve bayes, SVM, KNN, Classification Tree and Feed-forward NN and Voting classifier accuracy is almost same. ○ We choose based model based on minimum value of False Negative value. ○ This would also inform the issuer’s decisions on who to give a credit card to and what credit limit to provide.
  • 30. Demo
  • 31. QR Code S C A N Q R C O D E A N D A C C ES S P P T A N D C O D E ( A C C E S S T H E C O D E : H T T P : / / V A T S A L S H A H . I N / P R O J E C T S . H T M L )
  • 32. References ● Li, Xiao-Lin, and Yu Zhong. An overview of personal credit scoring: techniques and future work. Journal: International Journal of Intelligence Science ISSN 2163-0283. 2012. ● Federal Reserve. (2017) “Report to the Congress on the Profitability of Credit Card Operations of Depository Institutions”. ● Taiwo Oladipupo Ayodele. (2010) “Types of Machine Learning Algorithms”, New Advances In Machine Learning, Yagang Zhang (Ed.), Intech ● Default Credit Card Clients Dataset, https://www.kaggle.com/uciml/default-of-credit-card-clients-dataset/ ● Davis J., Goadrich M. The Relationship Between Precision-Recall and ROC Curves. ACM New York, NY, USA 2006. ISBN:1-59593-383-2. ● Arlot, Sylvain, and Alain Celisse. A survey of cross-validation procedures for model selection. eprint arXiv:0907.4728. DOI:10.1214/09-SS054. ● https://www.analyticsvidhya.com/blog/2016/03/introduction-deep-learning-fundamentals-neural- networks/
  • 33.
  • 34. Contact Us You can share your feedback on below Email-IDs. ● Hetarth Bhatt – hbhatt7@uwo.ca ● Khushali Patel – kpate372@uwo.ca ● Rajaraman Ganesan – rganesa@uwo.ca ● Vatsal Shah – vshh56@uwo.ca (http://vatsalshah.in)