COVID-19 Infection & Fatality Risk Modeling in New York State by Chaya Chaipitakporn

📚 COVID-19 Infection & Fatality Risk Modeling in New York State

📌 Project Type: Academic Research – Environmental & Epidemiological Modeling
📍 Published in: Science of the Total Environment (STOTEN)
👨‍🔬 Role: Co-Author – Stepwise Regression & Data Support

🧩 Project Summary

This study examined how demographics and air quality influenced COVID-19 infection and fatality rates across New York State counties during the pandemic's first wave. Infection and death rates were highest near NYC, while fatality (deaths per infection) was, paradoxically, higher in rural areas.
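
To make the difference between the three metrics concrete, here is a minimal sketch in plain Python. The county names and counts are made-up illustrative values, not data from the study, and the per-100,000 scaling and the `county_metrics` helper are assumptions chosen for the example.

```python
# Hypothetical county-level counts (illustrative values, not study data).
counties = {
    "Urban A": {"population": 1_000_000, "infections": 20_000, "deaths": 1_000},
    "Rural B": {"population": 50_000, "infections": 200, "deaths": 20},
}


def county_metrics(row, per=100_000):
    """Return infection and death rates per `per` residents, plus fatality."""
    return {
        # Rates are normalized by population (assumed scale: per 100,000).
        "infection_rate": row["infections"] / row["population"] * per,
        "death_rate": row["deaths"] / row["population"] * per,
        # Fatality as defined in the summary: deaths per infection.
        "fatality": row["deaths"] / row["infections"],
    }


metrics = {name: county_metrics(row) for name, row in counties.items()}
for name, m in metrics.items():
    print(name, m)
```

In this toy example the rural county has lower infection and death rates than the urban one but a higher fatality, which is the kind of pattern the summary describes.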

🧪 My Technical Contributions

✅ Stepwise Regression Modeling
Applied forward selection and backward elimination to identify statistically significant demographic and environmental predictors of COVID-19 infection and fatality rates.
Helped determine which features most improved model accuracy (e.g., PM2.5, population age, distance to epicenter).
✅ Data Wrangling & Cleaning
Merged multiple datasets (census, pollution, and epidemiological data) at the county level.
Prepared variables for modeling: normalization, missing-value handling, and encoding.
✅ Feature Impact Interpretation
Analyzed how including or excluding each variable changed regression accuracy and output.
Supported result validation to ensure the models aligned with observed cluster behaviors.
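
The forward-selection idea described above can be sketched roughly as follows. This is a minimal illustration, not the study's code: the data are synthetic, the column names (`pm25`, `pct_over_65`, `dist_to_nyc_km`, `noise`) are hypothetical, and the selection criterion (gain in adjusted R² with a `min_gain` threshold) is an assumption, since the criterion actually used in the paper is not stated here.

```python
import numpy as np


def adjusted_r2(X, y):
    """Fit ordinary least squares with an intercept; return adjusted R^2."""
    n, k = X.shape
    design = np.column_stack([np.ones(n), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    residuals = y - design @ coef
    ss_res = float(residuals @ residuals)
    ss_tot = float(((y - y.mean()) ** 2).sum())
    r2 = 1.0 - ss_res / ss_tot
    return 1.0 - (1.0 - r2) * (n - 1) / (n - k - 1)


def forward_select(X, y, names, min_gain=0.01):
    """Greedily add the predictor that raises adjusted R^2 the most.

    Stops when the best remaining candidate improves the score by less
    than `min_gain` (a hypothetical stopping rule for this sketch).
    """
    selected, remaining = [], list(range(X.shape[1]))
    best_score = 0.0  # an intercept-only model has adjusted R^2 of 0
    while remaining:
        scores = {j: adjusted_r2(X[:, selected + [j]], y) for j in remaining}
        best_j = max(scores, key=scores.get)
        if scores[best_j] - best_score < min_gain:
            break
        selected.append(best_j)
        remaining.remove(best_j)
        best_score = scores[best_j]
    return [names[j] for j in selected], best_score


# Synthetic stand-in for a county-level table (62 rows, one per NY county).
rng = np.random.default_rng(0)
names = ["pm25", "pct_over_65", "dist_to_nyc_km", "noise"]
X = rng.normal(size=(62, len(names)))
# By construction, only pm25 and dist_to_nyc_km drive the outcome.
y = 2.0 * X[:, 0] - 1.5 * X[:, 2] + rng.normal(scale=0.5, size=62)

selected, score = forward_select(X, y, names)
print(selected, round(score, 3))
```

Backward elimination works the same way in reverse: start from the full set of predictors and repeatedly drop the one whose removal hurts the chosen criterion least.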

🔧 Techniques Used

🔍 Key Insights from the Study

PM2.5 and distance to NYC were major predictors of infection spread
Fatality was more strongly associated with the elderly share of the population and long-term pollution exposure
Spatial and demographic segmentation is crucial for targeted public health response
Model interpretability helped explain why certain rural areas had high fatality despite low infection

✅ Relevance to Freelance Work

This project shows my ability to:
Select and justify data modeling techniques
Build and interpret multivariate regression models
Understand feature importance & business impact
Handle real-world public datasets at scale
Applicable to:
Health analytics, churn modeling, marketing attribution, or KPI drivers

🛠️ Tools & Skills Demonstrated

Python • pandas • stepwise regression (forward/backward) • multivariate analysis • data cleaning • clustering analysis • scientific writing
Posted Apr 18, 2025

Modeled COVID-19 risks in NY using regression and data analysis.