Diabetes Prediction

Anderson Barros

Data Scientist
Python
In the ETL stage I carried out:
- Exclusion of null values, duplicates, and outliers;
- Normalization of predictor variables;
- Balancing the target variable;
- Correlation analysis;
I leveraged this data to compare 2 techniques widely used in data science:
- Machine Learning with Random Forest;
- Deep Learning with Neural Networking;
In this case, the accuracy of the Random Forest model was more efficient.
Partner With Anderson
View Services

More Projects by Anderson