This project builds a complete Machine Learning pipeline to predict customer churn using real telecom data.
The goal is to predict whether a customer is likely to leave the telecom service (churn) based on their usage and profile.
- Source: Telco Customer Churn (Kaggle)
- Rows: 7,043 customers
- Target column: `Churn` (Yes/No)
- Tuned with `GridSearchCV`
- Evaluated with Accuracy, Confusion Matrix, and Classification Report
- Explained using SHAP and LIME
- Data Loading using Pandas
- Data Cleaning & Preprocessing (see the loading/cleaning sketch after this list)
  - Handled missing values
  - Encoded categorical variables
  - Converted `TotalCharges` to numeric
  - Target column: `Churn` → binary
- Train/Test Split
- Model Training with XGBoost + hyperparameter tuning (see the tuning sketch after this list)
- Model Interpretation (see the SHAP/LIME sketch after this list)
  - Global + local explanations using SHAP
  - Individual predictions explained using LIME
- Saving Outputs (see the saving sketch after this list)
  - Model saved as `.pkl`
  - Predictions saved as `.csv` and `.db`
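
The loading and cleaning steps correspond roughly to the sketch below. The column names (`TotalCharges`, `Churn`, `customerID`) come from the Telco dataset; the exact imputation and encoding choices in the notebook may differ.

```python
import pandas as pd

# Load the raw Kaggle dataset (7,043 rows).
df = pd.read_csv("Telco-Customer-Churn.csv")

# TotalCharges is read as text because of blank entries; coerce to numeric and
# drop the rows that become missing (the notebook may impute them instead).
df["TotalCharges"] = pd.to_numeric(df["TotalCharges"], errors="coerce")
df = df.dropna(subset=["TotalCharges"])

# Map the target to binary and one-hot encode the remaining categoricals;
# cast to float so downstream libraries get a purely numeric matrix.
df["Churn"] = df["Churn"].map({"Yes": 1, "No": 0})
X = pd.get_dummies(df.drop(columns=["Churn", "customerID"]), drop_first=True).astype(float)
y = df["Churn"]
```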
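
A minimal sketch of the split and `GridSearchCV` tuning step, reusing `X` and `y` from the sketch above. The split ratio, seed, and search grid are illustrative; the grid simply includes the best values reported in the results below.

```python
from sklearn.model_selection import GridSearchCV, train_test_split
from xgboost import XGBClassifier

# Hold out a test set for evaluation and explanation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

# Tune a small XGBoost grid with cross-validation.
param_grid = {
    "n_estimators": [50, 100, 200],
    "learning_rate": [0.01, 0.1],
    "max_depth": [3, 4, 6],
}
search = GridSearchCV(
    XGBClassifier(eval_metric="logloss"),
    param_grid,
    scoring="accuracy",
    cv=5,
)
search.fit(X_train, y_train)
model = search.best_estimator_
print(search.best_params_)  # reported best: n_estimators=50, learning_rate=0.1, max_depth=4
```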
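
The interpretation step can look roughly like this, reusing `model`, `X_train`, and `X_test` from the sketches above. The specific plots and the LIME output file name are illustrative.

```python
import shap
from lime.lime_tabular import LimeTabularExplainer

# Global + local explanations with SHAP (TreeExplainer supports XGBoost).
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X_test)
shap.summary_plot(shap_values, X_test)  # global feature importance
shap.force_plot(explainer.expected_value, shap_values[0], X_test.iloc[0], matplotlib=True)

# One individual prediction explained with LIME.
lime_explainer = LimeTabularExplainer(
    X_train.values,
    feature_names=X_train.columns.tolist(),
    class_names=["No Churn", "Churn"],
    mode="classification",
)
exp = lime_explainer.explain_instance(X_test.iloc[0].values, model.predict_proba)
exp.save_to_file("lime_explanation.html")  # illustrative output file name
```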
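
Saving the trained model and predictions uses the file names listed in the files table below; the SQLite table name and the prediction column names here are assumptions.

```python
import pickle
import sqlite3

# Persist the tuned model.
with open("churn_model.pkl", "wb") as f:
    pickle.dump(model, f)

# Collect test-set predictions next to the true labels.
predictions = X_test.copy()
predictions["actual_churn"] = y_test.values
predictions["predicted_churn"] = model.predict(X_test)
predictions.to_csv("churn_predictions.csv", index=False)

# Mirror the same predictions into a SQLite database.
conn = sqlite3.connect("churn_results.db")
predictions.to_sql("predictions", conn, if_exists="replace", index=False)
conn.close()
```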
- **Best Parameters:** `n_estimators=50`, `learning_rate=0.1`, `max_depth=4`
- **Accuracy:** 79.3%
- **Classification Report:**
| Class | Precision | Recall | F1-Score |
|---|---|---|---|
| No Churn | 0.83 | 0.90 | 0.87 |
| Churn | 0.65 | 0.49 | 0.56 |
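
For reference, metrics like the ones above can be computed with scikit-learn, assuming the tuned `model` and the held-out `X_test` / `y_test` from the training sketch:

```python
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

y_pred = model.predict(X_test)
print("Accuracy:", accuracy_score(y_test, y_pred))  # ~0.79 reported above
print(confusion_matrix(y_test, y_pred))
print(classification_report(y_test, y_pred, target_names=["No Churn", "Churn"]))
```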


| File | Description |
|---|---|
| `churn_model.pkl` | Trained XGBoost model |
| `churn_predictions.csv` | Saved test predictions |
| `churn_results.db` | SQLite version of the predictions |
| `Telco-Customer-Churn.csv` | Raw dataset |
| `churn_notebook.ipynb` | Full end-to-end ML notebook |
| `README.md` | This project overview |
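
A quick way to reuse the saved artifacts (the `predictions` table name inside `churn_results.db` is an assumption carried over from the saving sketch above):

```python
import pickle
import sqlite3

import pandas as pd

# Reload the trained model.
with open("churn_model.pkl", "rb") as f:
    model = pickle.load(f)

# Read the stored predictions back out of SQLite.
conn = sqlite3.connect("churn_results.db")
preds = pd.read_sql("SELECT * FROM predictions", conn)
conn.close()
print(preds.head())
```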
- Data Cleaning & Preprocessing
- Feature Engineering
- XGBoost + Hyperparameter Tuning
- Model Interpretation (SHAP + LIME)
- SQLite Integration
- Git & GitHub project documentation
Sushma Sandanshiv
🔗 LinkedIn
💻 GitHub