This study set out to build a predictive model that identifies whether a customer is likely to switch telecommunications providers (churn) or stay with the company. We started with a Logistic Regression classifier and moved on to methods such as Decision Tree, Random Forest, XGBoost, AdaBoost, SVM, KNN and Naive Bayes. We concluded that the best predictive model was XGBoost, which correctly identifies almost all the non-churners and the vast majority of the churners. Trailing closely was the Decision Tree model, which is more easily interpretable and more readily applicable to real business problems.
Cluster analysis, on the other hand, was more challenging. The hierarchical clustering methods we used were not very effective: with the Mahalanobis distance and the Gower distance, we produced two clusterings with silhouette values of 0.2. With the K-Means method the results improved somewhat, especially when using principal components and creating 4 clusters.
Predicting Customer Churn in Telecom (Corporate Presentation)
1. SOTIRIOS BARATSAS
MSc in Business Analytics
sotbaratsas@gmail.com
PREDICTING CUSTOMER CHURN USING CLASSIFICATION & CLUSTERING
2. The Problem
Over the previous period: ~15% churn rate*
*based on a sample of 3,333 customers
How can we predict which customers are likely to churn?
3. Classifying Churners & Non-Churners
METHODS COMPARISON
We attempted to formulate a model with good predictive ability that identifies both churners and non-churners.
| Model ID | Description | McFadden's R^2 | Cox & Snell R^2 | Nagelkerke R^2 | Hosmer-Lemeshow p-value |
|----------|-------------|----------------|-----------------|----------------|-------------------------|
| Model15  | Unifying the charges under one variable "Domestic.Charge" > Stepwise with AIC (multicollinearity fixed) | 0.257 | 0.185 | 0.338 | 0.247 |
| Model2   | Starting with all the variables and performing Stepwise Selection with AIC (multicollinearity fixed) | 0.258 | 0.186 | 0.338 | 0.117 |
| Model3   | Starting with all the variables and performing Stepwise Selection with BIC | 0.258 | 0.186 | 0.338 | 0.117 |
| Model9   | Unifying the number of calls under one variable "Domestic.Calls" > Stepwise with AIC (multicollinearity fixed) | 0.258 | 0.186 | 0.338 | 0.117 |
| Model6   | Unifying the minutes under one variable "Domestic.Mins" > Stepwise Selection with AIC (multicollinearity fixed) | 0.258 | 0.186 | 0.338 | 0.108 |
| Model12  | Domestic.Calls + Domestic.Mins (aggregates) > Stepwise with AIC (multicollinearity fixed) | 0.258 | 0.186 | 0.338 | 0.108 |
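For illustration, a minimal Python sketch of fitting one such candidate model (the original analysis may have used different tooling); the file name churn.csv and the column names Churn, Domestic.Charge and Customer.Service.Calls are assumptions based on the slide text:

```python
import pandas as pd
import statsmodels.api as sm

df = pd.read_csv("churn.csv")                    # hypothetical file name
y = df["Churn"]                                  # assumed 0/1 churn flag
X = sm.add_constant(df[["Domestic.Charge", "Customer.Service.Calls"]])

model = sm.Logit(y, X).fit(disp=0)
print(f"McFadden's R^2: {model.prsquared:.3f}")  # statsmodels reports it directly
print(f"AIC: {model.aic:.1f}")                   # the stepwise selection criterion
```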
4. Classifying Churners & Non-Churners
METHODS COMPARISON
We attempted to formulate a model with good predictive ability that identifies both churners and non-churners.
| Classification Model | Accuracy | Sensitivity (Recall) | Specificity | AUROC | Kappa Value |
|----------------------|----------|----------------------|-------------|-------|-------------|
| XGBoost              | 0.977 | 0.849 | 0.997 | 0.922 | 0.89 |
| AdaBoost             | 0.975 | 0.833 | 0.997 | 0.915 | 0.88 |
| Decision Tree        | 0.974 | 0.826 | 0.998 | 0.912 | 0.88 |
| Random Forest        | 0.969 | 0.788 | 0.997 | 0.892 | 0.85 |
| Support Vector Machines (SVM) | 0.893 | 0.235 | 0.994 | 0.614 | 0.33 |
| Logistic Regression Classifier | 0.873 | 0.265 | 0.966 | 0.616 | 0.26 |
| K-Means (6 clusters) | 0.837 | 0.083 | 0.954 | 0.518 | 0.05 |
| Naive Bayes          | 0.273 | 0.765 | 0.197 | 0.481 | -0.01 |
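The five comparison metrics can be computed from any model's validation output; a sketch with scikit-learn, where y_true, y_pred and y_score are placeholder arrays of true labels, predicted labels and predicted churn probabilities:

```python
from sklearn.metrics import (accuracy_score, cohen_kappa_score,
                             confusion_matrix, recall_score, roc_auc_score)

def churn_metrics(y_true, y_pred, y_score):
    """Return the five comparison metrics used in the table above."""
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
    return {
        "Accuracy":    accuracy_score(y_true, y_pred),
        "Sensitivity": recall_score(y_true, y_pred),    # recall on churners
        "Specificity": tn / (tn + fp),                  # recall on non-churners
        "AUROC":       roc_auc_score(y_true, y_score),  # needs churn probabilities
        "Kappa":       cohen_kappa_score(y_true, y_pred),
    }
```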
5. XGBoost was the best performing model in 4 out of 5 key metrics
Best Performing Model
✓ It classified 97.7% of the validation customers correctly
✓ It managed to classify almost all the non-churners correctly (99.7%)
✓ It was the best performer at classifying churners correctly (84.9%)

Confusion matrix (rows = prediction, columns = reference):

| Prediction | Not Churn | Churn |
|------------|-----------|-------|
| Not Churn  | 855       | 20    |
| Churn      | 3         | 112   |
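A minimal sketch of this step, again assuming a hypothetical churn.csv; the split and the hyperparameters are illustrative, not the tuned values behind the slide's numbers:

```python
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.metrics import confusion_matrix
from xgboost import XGBClassifier

df = pd.read_csv("churn.csv")                    # hypothetical file name
X, y = df.drop(columns="Churn"), df["Churn"]
X_train, X_valid, y_train, y_valid = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=42)

clf = XGBClassifier(n_estimators=300, max_depth=4, learning_rate=0.1,
                    eval_metric="logloss")       # illustrative hyperparameters
clf.fit(X_train, y_train)
print(confusion_matrix(y_valid, clf.predict(X_valid)))  # rows = actual class
```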
6. Using the Decision Tree classifier we can have equally good performance and great interpretability
Good performance & interpretability
✓ It classified 97.4% of the validation customers correctly
✓ It was the best model at classifying non-churners correctly (99.8%)
✓ It also performed well at classifying churners correctly (82.6%)

Confusion matrix (rows = prediction, columns = reference):

| Prediction | Not Churn | Churn |
|------------|-----------|-------|
| Not Churn  | 856       | 23    |
| Churn      | 2         | 109   |
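A sketch of the decision-tree alternative, reusing the hypothetical X_train/y_train split from the previous sketch; the depth and leaf-size settings are assumptions, and export_text is one way to surface the plain-language rules that make this model interpretable:

```python
from sklearn.tree import DecisionTreeClassifier, export_text

tree = DecisionTreeClassifier(max_depth=5, min_samples_leaf=20,  # assumed settings
                              random_state=42)
tree.fit(X_train, y_train)

# Print the learned splits as plain if/else rules for business stakeholders.
print(export_text(tree, feature_names=list(X_train.columns)))
```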
7. Decision Tree Example
Q1: "Are his/her total charges over the previous period less than 72?"
Answer: YES
Q2: "Did he/she perform less than 4 customer service calls during the previous period?"
Answer: NO
Q3: "Are his/her total charges over the previous period greater than or equal to 54?"
Answer: YES
We can predict that this customer is likely to churn.
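Read as code, the path above is simple business logic. The thresholds (72, 4 calls, 54) come from the slide; the function and its off-path outcomes are illustrative placeholders, not the full fitted tree:

```python
def predict_churn(total_charges: float, service_calls: int) -> str:
    """Trace the slide's example path; other leaves are placeholders."""
    if total_charges < 72:                    # Q1 -> YES for this customer
        if service_calls < 4:                 # Q2 -> NO for this customer
            return "Not Churn"                # placeholder leaf
        if total_charges >= 54:               # Q3 -> YES for this customer
            return "Churn"                    # the slide's predicted leaf
        return "Not Churn"                    # placeholder leaf
    return "Not Churn"                        # placeholder leaf

print(predict_churn(total_charges=60, service_calls=5))  # -> Churn
```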
8. Hierarchical Clustering
We can identify 2 or 3 clusters of customers, but the
separation between them is not very distinct.
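A sketch of the Gower-distance variant mentioned in the summary, assuming the same hypothetical customer frame df; it uses the third-party gower package, and the average linkage and the 3-cluster cut are assumptions:

```python
import gower                                        # pip install gower
from scipy.cluster.hierarchy import fcluster, linkage
from scipy.spatial.distance import squareform
from sklearn.metrics import silhouette_score

dist = gower.gower_matrix(df)                       # mixed-type distance matrix
link = linkage(squareform(dist, checks=False), method="average")  # assumed linkage
labels = fcluster(link, t=3, criterion="maxclust")  # cut into 3 clusters

print(silhouette_score(dist, labels, metric="precomputed"))  # ~0.2 per the summary
```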
9. A Better Approach: Principal Components & K-Means
Transform the customer data using principal components, then perform K-Means clustering to identify 4 clusters.
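A sketch of this pipeline, reusing the hypothetical frame df; k = 4 comes from the slide, while the two retained components are an assumption:

```python
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import silhouette_score

X_num = df.select_dtypes("number")                 # numeric features only
pcs = PCA(n_components=2).fit_transform(           # number of PCs is an assumption
    StandardScaler().fit_transform(X_num))

km = KMeans(n_clusters=4, n_init=10, random_state=42).fit(pcs)
print(silhouette_score(pcs, km.labels_))           # compare against the 0.2 above
```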
10. Thank You
for your attention
Sotiris Baratsas
sotbaratsas@gmail.com
MSc in Business Analytics