Automated Web Scraper Project by Ikhsan ArifAutomated Web Scraper Project by Ikhsan Arif

Automated Web Scraper Project

Ikhsan Arif

Ikhsan Arif

🚀 BizBuySell Web Scraper (Automation Project)

A client needed a fully automated scraper to collect all business listings from BizBuySell.com — a popular marketplace for buying and selling businesses — and push the data into Airtable for analysis.
💡 Goal
Automate the entire data collection workflow — from fetching listings to parsing detailed info and updating Airtable — while staying undetected by modern Web Application Firewalls.
⚙️ What I Built
Asynchronous scraper using Playwright + BeautifulSoup, capable of scraping hundreds of pages efficiently.
A custom proxy integration with DataImpulse to automatically connect via a U.S. IP before every session — helping bypass anti-bot detection.
A modular architecture with separate layers for scraping, data parsing, proxy management, and Airtable integration.
Smart upsert logic for Airtable:
If both Link and EBITDA match → record updates.
If Link is the same but EBITDA differs → new record is created (tracking price or performance changes).
Daily scheduler that runs automatically at a fixed time using schedule — no manual trigger required.
🧩 Tech Stack
Python · Playwright · BeautifulSoup · Airtable API · Redis (optional) · DataImpulse Proxy · AsyncIO
🧠 Result
The scraper now runs fully unattended, collecting thousands of listings daily and keeping Airtable up to date — saving the client hours of manual work each day.
A video showing this process can be viewed here:
And the github repo for the project can be found here:
Like this project

Posted Oct 19, 2025

Developed an automated scraper for listings, integrating with Airtable.