In the initial stages, I found that the raw data presented numerous issues such as missing values, outliers, and inconsistent data formats. To rectify this, I leveraged Python for data preprocessing and cleansing. For instance, I utilized the Pandas library to fill in missing values, the NumPy library for detecting and filtering outliers, and Regular Expressions to standardize data formats.