Movies Data Scraping and Analysis

Júlio

Júlio Silva

Web_Scraping_Project

This project scrapes top-rated movie data from The Movie Database (TMDB) and compiles it into a structured dataset (CSV/Excel format). The main goal is to collect the data in a reproducible format. After the data is scraped, a light analysis is conducted to demonstrate how features such as genre, certification, and duration relate to a movie's financial performance.

Key Highlights

Scraping multiple pages from TMDB's "Top-Rated Movies" section using BeautifulSoup.
Extracted details: Title, Certification, Release Date, Genre(s), Runtime, Score, Budget, Revenue.
Dataset exported to CSV/Excel for reuse in other tools or dashboards.
Optional analysis included
Lightweight and easy to modify for scraping additional attributes or pages.

Dataset Features

Descriptive: Title, Genre(s), Release Date, Runtime
Audience: Certification (age rating), User Score
Financial: Budget, Revenue

Tools & Libraries

Python · Pandas · BeautifulSoup · Requests · Seaborn · Matplotlib
Like this project

Posted Jul 18, 2025

This project scrapes top-rated movie data from The Movie Database (TMDB) and compiles it into a structured dataset (CSV/Excel format).