Churn Modelling - How to predict if a bank’s customer will stay or leave the bank

The app is deployed to Render and can be found at https://bank-churn-predictions.onrender.com

Our full presentation deck can be found here.

Using a dataset of 10,000 bank customer records, we built an app that demonstrates how machine learning models can predict the likelihood of customer churn. We did this in the following steps:

1. Clean the data

After reading the dataset into a pandas DataFrame, we removed unnecessary fields, including individual customer IDs and names. This left us with columns for Credit Score, Geography, Gender, Age, Length of time as a Bank customer, Balance, Number Of Bank Products Used, Has a Credit Card, Is an Active Member, Estimated Salary and Exited.
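The cleaning step above can be sketched roughly as follows. This is a minimal illustration, not the project's actual notebook code: the column names (`CustomerId`, `Surname`, `Tenure`, and so on) are assumed spellings of the fields listed above, and the three rows are made-up stand-ins for the real 10,000 records.

```python
import pandas as pd

# Made-up stand-in rows; the real project reads 10,000 records from a file.
# All column names here are assumptions based on the field list above.
raw = pd.DataFrame({
    "CustomerId": [1001, 1002, 1003],
    "Surname": ["Alpha", "Beta", "Gamma"],
    "CreditScore": [620, 710, 505],
    "Geography": ["France", "Spain", "Germany"],
    "Gender": ["Female", "Male", "Female"],
    "Age": [42, 35, 51],
    "Tenure": [2, 7, 8],
    "Balance": [0.0, 83000.5, 159000.0],
    "NumOfProducts": [1, 2, 3],
    "HasCrCard": [1, 0, 1],
    "IsActiveMember": [1, 1, 0],
    "EstimatedSalary": [101000.0, 57000.0, 113000.0],
    "Exited": [0, 0, 1],
})

# Identifier columns carry no predictive signal, so drop them.
id_columns = ["CustomerId", "Surname"]
clean = raw.drop(columns=id_columns)

print(list(clean.columns))
```

Dropping by name with `drop(columns=...)` fails loudly if a column is missing, which is usually preferable to silently keeping an identifier in the training data.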

2. Analyze initial DataFrame

Using Matplotlib, Seaborn and Pandas, we next analyzed the data. The dataset is imbalanced: the majority class, "Stays" (0), accounts for around 80% of the data points and the minority class, "Exits" (1), for around 20%. To address this, we used SMOTE (Synthetic Minority Over-sampling Technique) in our machine learning step. More on that in step 3.
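One simple way to check the class balance is to look at the relative frequency of each label. The sketch below assumes the target column is named `Exited` and uses a ten-row toy series built to mirror the roughly 80/20 split described above; it is not the project's data.

```python
import pandas as pd

# Toy target column: 8 customers who stayed (0) and 2 who exited (1).
exited = pd.Series([0] * 8 + [1] * 2, name="Exited")

# normalize=True returns proportions instead of raw counts.
class_share = exited.value_counts(normalize=True)

print(class_share)
```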

In our sample, female customers were proportionally more likely to leave the bank: about 25% exited, compared with about 16% of male customers.

The smallest number of customers are from Germany, and they are also the most likely to leave the bank. Almost one in three German customers in our sample left the bank.
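Group-level churn rates like the ones above can be computed with a pandas `groupby`. The following is a small sketch on invented rows, with assumed column names (`Gender`, `Geography`, `Exited`); because `Exited` is coded 0/1, its mean within a group is that group's churn rate.

```python
import pandas as pd

# Invented rows for illustration only; they do not reproduce the real rates.
df = pd.DataFrame({
    "Gender": ["Female", "Female", "Female", "Female",
               "Male", "Male", "Male", "Male"],
    "Geography": ["Germany", "France", "France", "Spain",
                  "Germany", "France", "France", "Spain"],
    "Exited": [1, 0, 1, 0, 0, 0, 0, 1],
})

# Mean of a 0/1 column is the share of customers who exited.
churn_by_gender = df.groupby("Gender")["Exited"].mean()

# Report the group size next to the rate, since small groups give noisy rates.
churn_by_country = df.groupby("Geography")["Exited"].agg(["mean", "count"])

print(churn_by_gender)
print(churn_by_country)
```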

3. Machine Learning using 7 different models

We tested seven different machine learning models (and used six in the final application) to predict customer churn, including Logistic Regression, Decision Tree, Random Forest, Deep Learning (TensorFlow), K-Nearest Neighbor, Support Vector Machine and XGBoost.
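As a rough illustration of comparing several classifiers on the same split, the sketch below trains three of the model types named above with scikit-learn and records recall on a held-out test set. The data is synthetic (generated with `make_classification` to have a similar 80/20 imbalance), and the hyperparameters are arbitrary assumptions rather than the settings used in the project.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import recall_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data with roughly 80% class 0 and 20% class 1.
X, y = make_classification(
    n_samples=1000, n_features=10, weights=[0.8, 0.2], random_state=42
)

# Stratify so both splits keep the same class proportions.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)

# Hyperparameters below are illustrative assumptions.
models = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Decision Tree": DecisionTreeClassifier(random_state=42),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=42),
}

# Fit each model on the training split and score recall on the test split.
recall_by_model = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    recall_by_model[name] = recall_score(y_test, model.predict(X_test))

for name, recall in recall_by_model.items():
    print(f"{name}: recall = {recall:.2f}")
```

Keeping the models in a dictionary makes it easy to add the remaining model types to the same loop and compare them on identical data.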

As mentioned earlier, we used SMOTE on the Support Vector Machine model to handle the class imbalance. SMOTE (Synthetic Minority Over-sampling Technique) is an over-sampling method that creates new, synthetic samples from the existing samples in the minority class. It finds the k nearest neighbors of each minority sample and generates new points by interpolating between the sample and those neighbors. The synthetic samples should be generated only in the training set, so that the test set still contains only real records and reflects how the model generalizes to unseen data. We used the imblearn Python package. SMOTE gave us better recall, which is usually the main goal in churn prediction, since the aim is to catch as many departing customers as possible.
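To make the idea behind SMOTE concrete, here is a simplified NumPy sketch of its core step: pick a minority sample, pick one of its k nearest minority neighbors, and place a new point somewhere on the line segment between them. This is not the imblearn implementation used in the project, and the function name `smote_sketch` and its parameters are hypothetical.

```python
import numpy as np

def smote_sketch(X_minority, n_new, k=2, seed=0):
    """Generate n_new synthetic minority samples by neighbor interpolation.

    Simplified illustration only; requires k < number of minority samples.
    """
    rng = np.random.default_rng(seed)
    X_minority = np.asarray(X_minority, dtype=float)
    n_samples, n_features = X_minority.shape

    # Pairwise Euclidean distances between minority samples.
    diffs = X_minority[:, None, :] - X_minority[None, :, :]
    dists = np.linalg.norm(diffs, axis=2)
    np.fill_diagonal(dists, np.inf)  # a point is not its own neighbor

    # Indices of the k nearest neighbors of each minority sample.
    neighbors = np.argsort(dists, axis=1)[:, :k]

    synthetic = np.empty((n_new, n_features))
    for i in range(n_new):
        base = rng.integers(n_samples)
        neighbor = neighbors[base, rng.integers(k)]
        gap = rng.random()  # position along the segment, in [0, 1)
        synthetic[i] = X_minority[base] + gap * (
            X_minority[neighbor] - X_minority[base]
        )
    return synthetic

# Four minority points at the corners of the unit square.
minority = [[0, 0], [1, 0], [0, 1], [1, 1]]
new_points = smote_sketch(minority, n_new=5, k=2)
print(new_points)
```

In practice the imblearn equivalent is `SMOTE().fit_resample(X_train, y_train)`, called on the training split only, after the train/test split has been made.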

4. Load models to display predictions on app

Finally, using Flask and HTML/CSS, we created a user-facing app that accepts customer information matching the columns of our initial DataFrame and predicts the likelihood of that customer leaving the bank. The app was first deployed to Heroku. In November 2022 it was moved to Render, where it can be found at this URL: https://bank-churn-predictions.onrender.com
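A prediction endpoint of this kind might look roughly like the sketch below. It is an assumption-laden illustration, not the project's `app.py`: the `/predict` route, the JSON payload format, the feature names, and the `predict_churn_probability` stand-in (which replaces a real trained model loaded from disk) are all hypothetical.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

# Assumed feature names, mirroring the cleaned DataFrame columns.
FEATURES = [
    "CreditScore", "Geography", "Gender", "Age", "Tenure", "Balance",
    "NumOfProducts", "HasCrCard", "IsActiveMember", "EstimatedSalary",
]

def predict_churn_probability(features):
    """Hypothetical stand-in for a trained model's probability output."""
    # Toy rule for illustration only: inactive members are flagged as risky.
    return 0.8 if not features["IsActiveMember"] else 0.2

@app.route("/predict", methods=["POST"])
def predict():
    # silent=True returns None instead of raising on a bad or missing body.
    payload = request.get_json(silent=True) or {}

    # Reject requests that do not supply every expected feature.
    missing = [name for name in FEATURES if name not in payload]
    if missing:
        return jsonify({"error": "missing fields", "fields": missing}), 400

    probability = predict_churn_probability(payload)
    return jsonify({"churn_probability": probability})
```

For local experimentation the app could be started with `app.run()`, and Flask's built-in `app.test_client()` can post a JSON payload to `/predict` without running a server.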

References

For a deeper dive into our analysis and process, take a look at our full presentation.

zunicd/Bank-Churn-Prediction