Allergy Prediction Model Development

Uthara

healthcare-management-system

Our applied project addresses a common problem: many hospital patients develop allergies due to factors such as the medications and treatments they receive, their demographics, and more. Using a dataset from Harvard Dataverse with over 30,000 synthetic patient records, we apply machine learning to predict which factors most strongly influence the development of allergies.

File Structure:

analysis.ipynb is the notebook with the majority of our data visualization and exploration. This file also includes a few different ML models we tested. allergy_prediction.ipynb is the notebook with our final Logistic Regression model.
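To illustrate how a logistic regression model can point to influential factors, here is a minimal, dependency-free sketch. It is not the code from allergy_prediction.ipynb: the feature names (took_medication_a, is_over_60), the toy data, and the helper functions are all hypothetical, and the notebook most likely relies on a library implementation rather than hand-written gradient descent.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def fit_logistic(X, y, lr=0.1, epochs=2000):
    """Fit weights and bias by batch gradient descent on the log loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            # Prediction error for this record
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def predict_proba(w, b, x):
    """Predicted probability that a record with features x develops an allergy."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)

# Hypothetical toy records: [took_medication_a, is_over_60] -> developed_allergy
X = [[1, 0], [1, 1], [1, 0], [0, 1], [0, 0], [0, 1], [1, 1], [0, 0]]
y = [1, 1, 1, 0, 0, 0, 1, 0]

w, b = fit_logistic(X, y)
for name, weight in zip(["took_medication_a", "is_over_60"], w):
    print(f"{name}: {weight:+.3f}")
```

In this toy data the outcome follows the first feature, so its weight ends up much larger in magnitude than the second. On real records, comparing coefficient magnitudes is only meaningful when the features are on comparable scales.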

Data Set:

Built with:

pandas - Open source data analysis and manipulation tool
matplotlib - Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python
seaborn - Seaborn is a Python data visualization library based on matplotlib. It provides a high-level interface for drawing attractive and informative statistical graphics.

Recommendations

Find clearer correlations between different factors and allergies
Make more accurate predictions to reduce MSE
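
For reference, one way MSE could be tracked for a classifier is to compare predicted probabilities against the observed 0/1 outcomes (this quantity is also known as the Brier score). The sketch below is an assumption about how the metric might be computed, not the notebooks' actual evaluation code, and the labels and probabilities are made up for illustration.

```python
def mean_squared_error(y_true, y_prob):
    """Average squared gap between observed 0/1 labels and predicted probabilities."""
    if len(y_true) != len(y_prob):
        raise ValueError("y_true and y_prob must have the same length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_prob)) / len(y_true)

# Hypothetical labels and predicted probabilities, for illustration only
y_true = [1, 0, 1, 0]
baseline = [0.5, 0.5, 0.5, 0.5]   # a model that always predicts 50%
improved = [0.9, 0.2, 0.7, 0.1]   # a model that leans toward the right answer

mse_baseline = mean_squared_error(y_true, baseline)
mse_improved = mean_squared_error(y_true, improved)
print(mse_baseline, mse_improved)
```

A lower value means the predicted probabilities sit closer to the actual outcomes, so comparing a new model against a simple baseline like this shows whether a change is a real improvement.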

Team

Dhyanashri Raman
Uthara Das
Posted Sep 23, 2025

Developed ML models to predict allergy factors using patient data.