Developed a custom Python tool to automate deal deduplication and data cleanup in a cluttered CRM environment (Pipedrive). The sales team was struggling with duplicate deals—caused by inconsistent naming, regional codes, and manual entry errors—which inflated KPIs and created reporting chaos.
The script I built:
Pulled data via Pipedrive’s API
Applied fuzzy matching, regex logic, and business-specific rules (e.g., prioritizing recent activity, deal status, or country codes)
Flagged or merged duplicates automatically, depending on risk
Generated clear logs and manual override options for edge cases
The tool helped reduce manual review by over 80%, improved reporting accuracy, and restored trust in the CRM pipeline—without involving backend devs or messy Excel macros.