I addressed several key challenges, including dealing with irrelevant columns and missing values through strategic imputation techniques. I employed Variance Inflation Factor analysis to combat multicollinearity, ensuring stable model estimates. The project also required tackling imbalanced data, a common issue in fraud detection, which I resolved using undersampling and oversampling methods. My feature engineering efforts included creating industry-standard financial ratios like Loan-to-Value (LTV) and Loan-to-Income (LTI), enhancing the model's predictive power. I applied Principal Component Analysis for dimensionality reduction to optimize performance and reduce noise.
The final model was fine-tuned using grid search