This project dives into The Pima Indian Diabetes Dataset.
Originally from the National Institute of Diabetes and Digestive and Kidney Diseases, contains information of 768 women from a population near Phoenix, Arizona, USA.
My objective is to organize the dataset through an exploratory data analysis (EDA), visualize the cleaned data and understand the statistical distribution, and create a model to predict diabetes for a new person, outside original data.
Like this project
0
Posted Feb 24, 2024
After cleaning the data and exploring it, I used a Machine Learning Random Forest model that predicted diabetes about 75% of the time.