Predicting the Perfect Purchase: Student Presentation on Customer Transaction Prediction

•Download as PPTX, PDF•

1 like•13 views

Can you foresee a customer's next move? The Boston Institute of Analytics (BIA) proudly presents a collection of student presentations on data analysis projects exploring customer transaction prediction. Embark on a journey to uncover the secrets of anticipating customer behavior. These presentations showcase innovative approaches to analyzing customer data and predicting future transactions, providing valuable insights for: Retail and e-commerce businesses looking to optimize inventory management and targeted promotions Marketing professionals aiming to personalize customer experiences and boost conversions Data analysts and enthusiasts seeking to learn cutting-edge customer behavior analysis techniques This compelling collection offers: visit https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/ In-depth exploration of data analysis methods for transaction prediction Real-world case studies demonstrating the power of predictive analytics Insights and findings from the research of talented BIA students A springboard for developing your own customer transaction prediction strategies Gain a competitive edge by harnessing the power of data. Watch these presentations and unlock the secrets of customer behavior prediction today! visit https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/

Data & Analytics

Santander Customer Transaction
Prediction
By Atharva Kulkarni

By Atharva Kulkarni
What is Santander ?
Santander is an American bank operating as a wholly-
owned subsidiary of the Spanish Santander Group. It is
based in Boston and
its principal market is the Northeastern United States.
Why this model is
important?
To identify who will make a
transaction.
What will be the Impact of
Model?
The model will predict and help Santander with the problem of
identification of the customers who will make a transaction with
the bank in future.

By Atharva Kulkarni
WORK FLOW
Data
collection
Exploratory
Data Analysis
(EDA)
Preprocessing
Visualization
Dividing Data
into X and Y
Model
selection and
Evaluation
First importing necessary libraries like Pandas, Seaborn,
Matplotlib.pyplot and Numpy.

By Atharva Kulkarni
DATASET
Santander Customer
Transaction Prediction
This Dataset Consist of Two Files
Train Data Test Data
Rows Columns Rows Columns
200000 202 200000 201
Null Values
Train and Test Data does not
contain Null Values
Duplicate Values
Train and Test Data does not
contain Duplicate Values

By Atharva Kulkarni
Exploratory
Data Analysis
(EDA)
Exploratory Data Analysis (EDA) refersto the method of studying and exploring record sets to apprehend
their predominant traits, discover patterns, locate outliers,and identifyrelationships betweenvariables.
EDA is normally carried out as a preliminarystep beforeundertakingextra formalstatistical analyses or
modeling.
df.head() = Display first Five Rows of Dataset
df.tail() = Display last Five Rows of Dataset
df.describe() = Gives descripive Statistics of Datadet
df.isnull().sum() = Display the number of Null Values in Dataset
df.shape() = Display the number of Rows and Columns of Dataset
df.dtypes() = Display the Data Type of each Feature of Dataset
df.info() = Gives the Summary of Dataset including column name, data type,
non-null values and memory usage

By Atharva Kulkarni
Observing the Distribution of ‘Target’ in Train
data
Drop the Column ‘ID_code’

By Atharva Kulkarni
Observing the Distribution of Train Features

By Atharva Kulkarni
Observing the Distribution of Test Features

By Atharva Kulkarni
Heatmap to
understand
the
Correlation
between
Features

By Atharva Kulkarni
Dividing Dataset into
‘X’ and ‘Y’
Split the Data into Training Data and
Testing Data
The Data is imbalanced so
we used UnderSampling to
Balanced the Data

By Atharva Kulkarni
Model1 = RandomForestClassifier
Accuracy Score
Model2 = LogisticRegression
Accuracy Score
MODEL Selection

By Atharva Kulkarni
Model3 = XGBClassifier
Accuracy Score
Model4 = KNeighborsClassifier
Accuracy Score

By Atharva Kulkarni
ALGORITHM ACCURACY CONFUSION MATRIX CLASSIFICATION
REPORT
Random Forest
Classifier
0.6122
Logistic Regression 0.6122
XGBClassifier 0.62135
Kneighbors
Classifier
0.891825

By Atharva Kulkarni
CONCLUSION
● Our model can be used to find the right customers to target and
increase profits, as well as return on marketing investment.
● After dealing with data imbalance our data was ready for feature
engineering.
● Best Method – The Algorithm KNeighborsClassifier Suits best for my
dataset because it gives accuracy of 0.891825.

Similar to Predicting the Perfect Purchase: Student Presentation on Customer Transaction Prediction

Data PreprocessingVijayasankariS

Data Quality with AIVera Ekimenko

Lecture 1 Pandas Basics.pptx machine learningmy6305874

Data Science Training | Data Science For Beginners | Data Science With Python...Simplilearn

METODOLOGIA DEA EN STATALuhSm

Implementing a data_science_project (Python Version)_part1Dr Sulaimon Afolabi

Applied python for correlation on churn and stocks datasetsMahmoud Fouad

Trust Measurement Presentation_Part 3Gan Chun Chet

Recommendation SystemAnamta Sayyed

THE IMPLICATION OF STATISTICAL ANALYSIS AND FEATURE ENGINEERING FOR MODEL BUI...IJCSES Journal

THE IMPLICATION OF STATISTICAL ANALYSIS AND FEATURE ENGINEERING FOR MODEL BUI...ijcseit

De-Cluttering-ML | TechWeekendsDSCUSICT

Data Manipulation with Numpy and Pandas in PythonStarting with NOllieShoresna

interenship.pptxNaveen316549

Predicting Employee Churn: A Data-Driven Approach Project PresentationBoston Institute of Analytics

Generating test data for Statistical and ML modelsVladimir Ulogov

Introduction to data_structurePurvi Prajapati

07 learningankit_ppt

Intro to Machine Learning for non-Data ScientistsParinaz Ameri

big-data-anallytics.pptxSangamesh Kalyan

Similar to Predicting the Perfect Purchase: Student Presentation on Customer Transaction Prediction (20)

Data Preprocessing

Data Quality with AI

Lecture 1 Pandas Basics.pptx machine learning

Data Science Training | Data Science For Beginners | Data Science With Python...

METODOLOGIA DEA EN STATA

Implementing a data_science_project (Python Version)_part1

Applied python for correlation on churn and stocks datasets

Trust Measurement Presentation_Part 3

Recommendation System

THE IMPLICATION OF STATISTICAL ANALYSIS AND FEATURE ENGINEERING FOR MODEL BUI...

De-Cluttering-ML | TechWeekends

Data Manipulation with Numpy and Pandas in PythonStarting with N

interenship.pptx

Predicting Employee Churn: A Data-Driven Approach Project Presentation

Generating test data for Statistical and ML models

Introduction to data_structure

07 learning

Intro to Machine Learning for non-Data Scientists

big-data-anallytics.pptx

Recently uploaded

Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth

RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408

Industrialised data - the key to AI success.pdfLars Albertsson

04242024_CCC TUG_Joins and Relationshipsccctableauusergroup

VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor

Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh

定制英国白金汉大学毕业证（UCB毕业证书）成绩单原版一比一ffjhghh

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo

B2 Creative Industry Response Evaluation.docxStephen266013

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor

FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083

Invezz.com - Grow your wealth with trading signalsInvezz1

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh9953056974 Low Rate Call Girls In Saket, Delhi NCR

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster

Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls

Recently uploaded (20)

Unveiling Insights: The Role of a Data Analyst

RA-11058_IRR-COMPRESS Do 198 series of 1998

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps

Industrialised data - the key to AI success.pdf

04242024_CCC TUG_Joins and Relationships

VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130

Customer Service Analytics - Make Sense of All Your Data.pptx

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call

High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...

定制英国白金汉大学毕业证（UCB毕业证书）成绩单原版一比一

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改

B2 Creative Industry Response Evaluation.docx

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...

FESE Capital Markets Fact Sheet 2024 Q1.pdf

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call

Invezz.com - Grow your wealth with trading signals

Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh

EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx

Best VIP Call Girls Noida Sector 39 Call Me: 8448380779

Predicting the Perfect Purchase: Student Presentation on Customer Transaction Prediction

1. By Atharva Kulkarni

2. Santander Customer Transaction Prediction By Atharva Kulkarni

3. By Atharva Kulkarni What is Santander ? Santander is an American bank operating as a wholly- owned subsidiary of the Spanish Santander Group. It is based in Boston and its principal market is the Northeastern United States. Why this model is important? To identify who will make a transaction. What will be the Impact of Model? The model will predict and help Santander with the problem of identification of the customers who will make a transaction with the bank in future.

4. By Atharva Kulkarni WORK FLOW Data collection Exploratory Data Analysis (EDA) Preprocessing Visualization Dividing Data into X and Y Model selection and Evaluation First importing necessary libraries like Pandas, Seaborn, Matplotlib.pyplot and Numpy.

5. By Atharva Kulkarni DATASET Santander Customer Transaction Prediction This Dataset Consist of Two Files Train Data Test Data Rows Columns Rows Columns 200000 202 200000 201 Null Values Train and Test Data does not contain Null Values Duplicate Values Train and Test Data does not contain Duplicate Values

6. By Atharva Kulkarni Exploratory Data Analysis (EDA) Exploratory Data Analysis (EDA) refersto the method of studying and exploring record sets to apprehend their predominant traits, discover patterns, locate outliers,and identifyrelationships betweenvariables. EDA is normally carried out as a preliminarystep beforeundertakingextra formalstatistical analyses or modeling. df.head() = Display first Five Rows of Dataset df.tail() = Display last Five Rows of Dataset df.describe() = Gives descripive Statistics of Datadet df.isnull().sum() = Display the number of Null Values in Dataset df.shape() = Display the number of Rows and Columns of Dataset df.dtypes() = Display the Data Type of each Feature of Dataset df.info() = Gives the Summary of Dataset including column name, data type, non-null values and memory usage

7. By Atharva Kulkarni Observing the Distribution of ‘Target’ in Train data Drop the Column ‘ID_code’

8. By Atharva Kulkarni Observing the Distribution of Train Features

9. By Atharva Kulkarni Observing the Distribution of Test Features

10. By Atharva Kulkarni Heatmap to understand the Correlation between Features

11. By Atharva Kulkarni Dividing Dataset into ‘X’ and ‘Y’ Split the Data into Training Data and Testing Data The Data is imbalanced so we used UnderSampling to Balanced the Data

12. By Atharva Kulkarni Model1 = RandomForestClassifier Accuracy Score Model2 = LogisticRegression Accuracy Score MODEL Selection

13. By Atharva Kulkarni Model3 = XGBClassifier Accuracy Score Model4 = KNeighborsClassifier Accuracy Score

14. By Atharva Kulkarni ALGORITHM ACCURACY CONFUSION MATRIX CLASSIFICATION REPORT Random Forest Classifier 0.6122 Logistic Regression 0.6122 XGBClassifier 0.62135 Kneighbors Classifier 0.891825

15. By Atharva Kulkarni CONCLUSION ● Our model can be used to find the right customers to target and increase profits, as well as return on marketing investment. ● After dealing with data imbalance our data was ready for feature engineering. ● Best Method – The Algorithm KNeighborsClassifier Suits best for my dataset because it gives accuracy of 0.891825.

16. By Atharva Kulkarni

Predicting the Perfect Purchase: Student Presentation on Customer Transaction Prediction

Recommended

Recommended

More Related Content

Similar to Predicting the Perfect Purchase: Student Presentation on Customer Transaction Prediction

Similar to Predicting the Perfect Purchase: Student Presentation on Customer Transaction Prediction (20)

More from Boston Institute of Analytics

More from Boston Institute of Analytics (20)

Recently uploaded

Recently uploaded (20)

Predicting the Perfect Purchase: Student Presentation on Customer Transaction Prediction