Database deduplication task

Julián Acuña

0

Data Modelling Analyst

Data Scientist

Data Analyst

pandas

Python

Given a dataset, find and link duplicates, duplicates can be present due to typos, aliases, missing fields.
I use state of the art tools (statistical packages, machine learning) to find and link database records for the client
Like this project
0

Find and link duplicated records in a database

Likes

0

Views

15

Tags

Data Modelling Analyst

Data Scientist

Data Analyst

pandas

Python

Julián Acuña

Mathematician, Data Scientist, Backend Developer

Set up CI for your python project in Github
Set up CI for your python project in Github
Download publicly available dataset
Download publicly available dataset