I began by identifying and analyzing the websites that hosted the financial publications. Working with the client, I pinpointed the most relevant sources and studied each site's structure to determine the best extraction method. I then wrote custom web scraping scripts in Python using Scrapy, tailored to handle pagination, dynamic content, and diverse HTML layouts. Throughout, I made sure my methods complied with each site's policies and respected its robots.txt file. Once the data was extracted, I cleaned and preprocessed it: I removed duplicate entries, normalized formats, and stored the result in a structured format for easy access in later stages.
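
To make the compliance and cleaning steps concrete, here is a minimal sketch rather than the project's actual code. It uses only the Python standard library so that it runs without Scrapy (Scrapy itself offers a `ROBOTSTXT_OBEY` setting for the robots.txt part). The record fields `url`, `title`, and `published`, the list of date formats, and the function names are all assumptions made for illustration.

```python
from datetime import datetime
from typing import Optional
from urllib.robotparser import RobotFileParser

# Assumed date formats; the real sources may use different ones.
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y")


def is_allowed(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if the given robots.txt text permits fetching the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


def normalize_date(raw: str) -> Optional[str]:
    """Convert a date in one of the assumed formats to ISO 8601, else None."""
    text = raw.strip()
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    return None


def clean_records(records):
    """Normalize fields and drop duplicates, keyed on the trimmed URL."""
    seen = set()
    cleaned = []
    for record in records:
        # Treat URLs that differ only by whitespace or a trailing slash as equal.
        url = record["url"].strip().rstrip("/")
        if url in seen:
            continue
        seen.add(url)
        cleaned.append({
            "url": url,
            # Collapse runs of whitespace left over from the HTML.
            "title": " ".join(record["title"].split()),
            "published": normalize_date(record["published"]),
        })
    return cleaned
```

In this sketch a duplicate is any record whose URL matches an earlier one after trimming, and the first occurrence wins. A date that matches none of the assumed formats becomes `None` instead of raising, so one malformed entry does not stop the whole run.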