
Advanced fraud detection system for e-commerce transactions and credit card payments using machine learning
class (0 = Legitimate, 1 = Fraud)Class (0 = Legitimate, 1 = Fraud)pandas - Data manipulationnumpy - Numerical computingmatplotlib - Visualizationseaborn - Statistical visualizationscikit-learn - Machine learning algorithmsimbalanced-learn - SMOTE and class balancingxgboost - Gradient boosting ensemble modelsshap - Model explainability and interpretabilityjoblib - Model serializationjupyter - Interactive notebooksKernel → Restart & Run AllShift + Entercreditcard.csv and Fraud_Data.csvmodels/temp_charts/temp_charts/builtin_feature_importance_*.pngtemp_charts/shap_summary_plot_*.pngtemp_charts/shap_force_plot_*.htmltemp_charts/importance_comparison_*.pngSHAP_ANALYSIS_GUIDE.md for detailed documentation.data/raw/creditcard.csv and data/raw/Fraud_Data.csvmodels/notebooks/eda-fraud-data.ipynbdata/processed/fraud_data_cleaned.csvdata/processed/creditcard_cleaned.csvnotebooks/eda-fraud-data.ipynb (Sections 9-10)notebooks/eda-fraud-data.ipynb (Sections 10-11)data/processed/fraud_X_train_smote.csvdata/processed/fraud_y_train_smote.csvdata/processed/fraud_X_test.csvdata/processed/fraud_y_test.csvscripts/train_fraud_models.pyclass_weight='balanced'models/temp_charts/scripts/train_fraud_models.pyscripts/shap_model_explainability.pytemp_charts/builtin_feature_importance_*.pngtemp_charts/shap_summary_plot_*.pngtemp_charts/shap_force_plot_*.htmltemp_charts/importance_comparison_*.pngSHAP_ANALYSIS_GUIDE.md for comprehensive documentation.data/processed/ with clear naming conventions:fraud_data_cleaned.csv Cleaned e-commerce transaction data creditcard_cleaned.csv Cleaned credit card transaction data fraud_X_train_smote.csv SMOTE-balanced training features fraud_y_train_smote.csv SMOTE-balanced training labels fraud_X_test.csv Test features (imbalanced - real distribution) fraud_y_test.csv Test labelstemp_charts/:builtin_feature_importance_*.png Model's built-in feature importance shap_summary_plot_*.png Global SHAP summary showing feature impact shap_importance_*.png Mean absolute SHAP values ranking importance_comparison_*.png Comparison of built-in vs SHAP importance shap_force_plot_TP_*.html Interactive force plot for True Positive case shap_force_plot_FP_*.html Interactive force plot for False Positive case shap_force_plot_FN_*.html Interactive force plot for False Negative caseaccount_age < X strongly increases fraud probability.sample_size: Background samples for explainer (default: 1000)max_samples: Test samples to explain (default: 500)SHAP_ANALYSIS_GUIDE.md.src/ modules (not inline in notebooks).gitignore excludes large files and sensitive datagit checkout -b feature/AmazingFeature)git commit -m 'Add some AmazingFeature')git push origin feature/AmazingFeature)notebooks/eda-fraud-data.ipynb to understand the datasetspython scripts/train_fraud_models.py to build and evaluate modelspython scripts/shap_model_explainability.py to understand model decisionstemp_charts/ for visualizations and console output for metricsmodels/ for production predictionsPosted May 11, 2026
Built a machine learning system for detecting fraudulent e-commerce and credit card transactions.
0
0