Automated Web Scraper Project

Ikhsan

Ikhsan Arif

πŸš€ BizBuySell Web Scraper (Automation Project)

A client needed a fully automated scraper to collect all business listings from BizBuySell.com β€” a popular marketplace for buying and selling businesses β€” and push the data into Airtable for analysis.
πŸ’‘ Goal
Automate the entire data collection workflow β€” from fetching listings to parsing detailed info and updating Airtable β€” while staying undetected by modern Web Application Firewalls.
βš™οΈ What I Built
Asynchronous scraper using Playwright + BeautifulSoup, capable of scraping hundreds of pages efficiently.
A custom proxy integration with DataImpulse to automatically connect via a U.S. IP before every session β€” helping bypass anti-bot detection.
A modular architecture with separate layers for scraping, data parsing, proxy management, and Airtable integration.
Smart upsert logic for Airtable:
If both Link and EBITDA match β†’ record updates.
If Link is the same but EBITDA differs β†’ new record is created (tracking price or performance changes).
Daily scheduler that runs automatically at a fixed time using schedule β€” no manual trigger required.
🧩 Tech Stack
Python Β· Playwright Β· BeautifulSoup Β· Airtable API Β· Redis (optional) Β· DataImpulse Proxy Β· AsyncIO
🧠 Result
The scraper now runs fully unattended, collecting thousands of listings daily and keeping Airtable up to date β€” saving the client hours of manual work each day.
A video showing this process can be viewed here:
And the github repo for the project can be found here:
Like this project

Posted Oct 19, 2025

Developed an automated scraper for listings, integrating with Airtable.