Mastering Data Cleaning: Refresh Your SQL Skills EfficientlyMastering Data Cleaning: Refresh Your SQL Skills Efficiently
The network for creativity
Join 1.25M professional creatives like you
Connect with clients, get discovered, and run your business 100% commission-free
Creatives on Contra have earned over $150M and we are just getting started
I am currently working on a project to refresh my SQL skills and here's what I've learned (again) when it comes to data cleaning-
1. In the data world, cleansing, scraping and making the raw data compatible enough to be analyzed is 85% of the job.
2. As time passes, you will know how to skim through rows and columns to find the duplicate, blank or any other anomaly. This is an important skill to have.
3. Punctations due to human error creates a row entry which becomes a 'DISTINCT' value for that column in the database.
4. Data standardization is equally important- white spaces, punctuations, misspellings, wrong data type are some examples of non-standardized raw data.
5. And lastly- looking at the same CSV file for the umpteenth time, you will definitely start questing your own spelling and language skills. Is it 'Argentina' or 'Argantani'?
Post image
Back to feed
The network for creativity
Join 1.25M professional creatives like you
Connect with clients, get discovered, and run your business 100% commission-free
Creatives on Contra have earned over $150M and we are just getting started