Disease Prediction Using Machine Learning: Predicting Diabetes
Francesco Stara
Data Scientist
Data Analyst
Statistician
Matplotlib
pandas
Python
This project dives into The Pima Indian Diabetes Dataset.
Originally from the National Institute of Diabetes and Digestive and Kidney Diseases, contains information of 768 women from a population near Phoenix, Arizona, USA.
My objective is to organize the dataset through an exploratory data analysis (EDA), visualize the cleaned data and understand the statistical distribution, and create a model to predict diabetes for a new person, outside original data.