The object of the project is to identify which customers are likely to leave the bank and identify key drivers of churn to support retention strategies.
Process: I made use of R, Decision Tree, and Regression Tree, and performed EDA.
Data Cleaning: Removed duplicates, handled missing values
Feature Engineering: Encoded categorical variables, normalized features
Modeling: Trained multiple models — XGBoost achieved the best performance
Evaluation:
Accuracy: 86%
AUC Score: 0.89
Precision-Recall optimized for business impact