Discover the Untold Challenges of Data Cleaning in Data Science
The network for creativity
Join 1.25M professional creatives like you
Connect with clients, get discovered, and run your business 100% commission-free
Creatives on Contra have earned over $150M and we are just getting started
I cleaned data for 3 days. Model trained in 10 minutes.

Nobody talks about this part of Data Science.

80% of the job is:
- Missing values that make no sense
- Columns named "col_1", "col_2", "col_final_FINAL"
- Duplicate rows that shouldn't exist
- Dates in 6 different formats

The model is the easy part. The data is the real work.

Next time someone says "just run a model on it", you'll know why that sentence hurts.
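To make the four problems listed above concrete, here is a minimal sketch of one cleaning pass in plain Python (standard library only). Everything in it is an assumption for illustration: the sample rows, the new column names in RENAMES, the list of date formats, and the tokens treated as missing are all hypothetical, and real data will need its own choices (for example, whether 05/02/2024 is day-first or month-first).

```python
from datetime import datetime

# Hypothetical sample rows showing the four problems from the post.
RAW_ROWS = [
    {"col_1": "alice", "col_2": "2024-01-05", "col_final_FINAL": "10.5"},
    {"col_1": "alice", "col_2": "2024-01-05", "col_final_FINAL": "10.5"},  # exact duplicate
    {"col_1": "bob", "col_2": "05/02/2024", "col_final_FINAL": "N/A"},
    {"col_1": "carol", "col_2": "March 3, 2024", "col_final_FINAL": ""},
    {"col_1": "dave", "col_2": "not a date", "col_final_FINAL": "7"},
]

# Assumed mapping from cryptic column names to meaningful ones.
RENAMES = {"col_1": "customer", "col_2": "signup_date", "col_final_FINAL": "amount"}

# Assumed date formats, tried in order; slash dates are read day-first here.
DATE_FORMATS = ["%Y-%m-%d", "%d/%m/%Y", "%B %d, %Y"]

# Assumed placeholder strings that really mean "no value".
MISSING_TOKENS = {"", "n/a", "na", "null", "none", "-"}


def parse_date(text):
    """Return an ISO date string, or None if no known format matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None


def parse_amount(text):
    """Return a float, or None for placeholders and unparseable values."""
    if text.strip().lower() in MISSING_TOKENS:
        return None
    try:
        return float(text)
    except ValueError:
        return None


def clean(rows):
    """Rename columns, normalize values, and drop exact duplicate records."""
    cleaned, seen = [], set()
    for row in rows:
        renamed = {RENAMES.get(key, key): value for key, value in row.items()}
        record = {
            "customer": renamed["customer"].strip(),
            "signup_date": parse_date(renamed["signup_date"]),
            "amount": parse_amount(renamed["amount"]),
        }
        fingerprint = tuple(record.items())
        if fingerprint in seen:  # skip rows already emitted
            continue
        seen.add(fingerprint)
        cleaned.append(record)
    return cleaned


if __name__ == "__main__":
    for record in clean(RAW_ROWS):
        print(record)
```

Unparseable dates and placeholder amounts become None instead of being silently dropped, so the decision about what to do with them (impute, flag, discard) stays visible. Even this toy version is longer than most model-training calls, which is the point of the post.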