Medicine Supply Chain Disruption Predictor
Built a production-ready ML system that predicts drug shortage risk using real FDA data. Collected and cleaned 1703 real drug shortage records, trained an XGBoost model, and deployed it as a live REST API — accessible to anyone worldwide.
The biggest challenge was identifying and removing 6 sources of data leakage that were causing fake 100% accuracy. After fixing this, the model delivers honest, generalisable predictions.
The entire system is containerised with Docker, automatically rebuilt and redeployed via a Jenkins CI/CD pipeline on every code push, and visualised through an interactive Power BI dashboard.
Result: A complete ML + DevOps project — from raw data to live deployed API — built independently in under 2 weeks.
Live API: https://medicine-supply-predictor.onrender.com/docs
GitHub: https://github.com/nehaM906/medicine-supply-predictor
2
2
44
Medicine Supply Chain Risk Dashboard
An interactive dashboard built on 1703 real FDA drug shortage records, visualising ML model predictions across 6 dynamic charts.
Key findings: 35.2% of monitored drugs are at risk, 2025 recorded the highest shortages in history, and Anesthesia and Cardiovascular drugs are the most vulnerable categories. Features year and disease category filters for drill-down analysis enabling hospital procurement teams to take preventive action before shortages occur.
1
39
This Superstore Sales Dashboard gives an overview of sales performance across time, regions, customers, and products. Total sales are 1bn, with peak performance in Q4, showing strong seasonal trends. California and Texas are the top-performing states, while New York and Houston lead at the city level. Binders and Phones are the best-selling sub-categories. The dashboard also highlights key customers contributing to revenue and includes filters for deeper analysis. Overall, it helps in identifying trends and making data-driven decisions.
1
42
Global Internet Shutdown Analysis
What the project was about:
I analyzed global internet shutdown data to understand where, when, and why shutdowns happen. The goal was to uncover patterns across countries and time periods and highlight their impact on digital access.
What I did:
Collected and cleaned real-world dataset
Performed EDA (Exploratory Data Analysis)
Analyzed shutdowns by:
Country
Year/month
Reasons (political, security, protests, etc.)
Built Power BI dashboards to visualize trends
Identified high-risk regions and frequent shutdown patterns
Key Findings:
Certain countries showed repeated shutdown patterns over time
Most shutdowns were linked to:
Political instability
Government control during protests/elections
Some regions experienced long-duration shutdowns, affecting communication and businesses
There was a rise in shutdown frequency during specific global events
Challenges I faced:
Data cleaning issues
Missing values, inconsistent country names
Unstructured reasons
Different formats for shutdown causes → needed standardization
Time-based analysis
Converting dates and extracting meaningful trends
Visualization complexity
Making dashboards simple yet insightful
How I solved them:
Cleaned and standardized data using Python (Pandas)
Grouped and categorized shutdown reasons
Used time-series analysis for trends
Designed clear and interactive Power BI dashboards
Impact / Outcome:
Provided a clear view of global digital disruptions
Helped identify high-risk regions and key causes
Created dashboards that make complex data easy to understand