OList Store BI Analysis

Abhinav

Abhinav Dubey

OList Store BI Analysis

šŸ“Œ Overview

This project analyses the Brazilian e-commerce dataset 'OList', providing key business insights into sales performance, customer behaviour, product performance, and seller logistics.
This project leverages Python and related libraries for data manipulation and Tableau for interactive dashboards.

šŸ“Š Interactive Visualisations

The analysis is presented through an interactive Tableau dashboard, offering a dynamic exploration of key business metrics:
This dashboard enables stakeholders to explore:
Sales trends
Customer distribution
Shipping & logistics insights
Product performance
Key business KPIs

āš™ļø Installation & Usage

This project is structured as a Jupyter Notebook, guiding users through data preparation, exploratory data analysis (EDA), and business insights.

Steps to Set Up the Project Locally

1ļøāƒ£ Clone the repository:
git clone <repository_link>
2ļøāƒ£ Create a Conda environment:
conda env create -f environment.yml
3ļøāƒ£ Activate the environment:
conda activate dataanalysis
4ļøāƒ£ Launch Jupyter Lab:
jupyter lab
5ļøāƒ£ Open the project notebook and begin analysis.
šŸ”¹ The required Python libraries are listed in environment.yml. šŸ”¹ Machine Learning (ML) operations using scikit-learn are included for certain analyses. šŸ”¹ Dataset profile reports (generated using ydata_profiling) are available in the repository. šŸ”¹ The dataset consists of 8 tables, which were merged and optimised for in-depth analysis using Python. šŸ”¹ A custom function describex() is provided for more detailed descriptive statistics compared to pandas.describe().

šŸ“ Project Structure

šŸ“‚ OList-Store-BI-Analysis/ │-- šŸ“„ README (Project overview & setup instructions) │-- šŸ“„ environment.yml (Conda environment dependencies) │-- šŸ“œ BI Analysis (Olist).ipynb (Jupyter Notebook with analysis) │-- šŸ“œ Dataset Profile Reports (HTML) (Auto-generated dataset insights)

šŸ” Analysis Overview

šŸ“ˆ Sales Performance Analysis

Trends over time šŸ“Š
Performance by region šŸ™ļø
Average Order Value (AOV) šŸ’°
Seasonality

šŸŽÆ Customer Insights

Customer segmentation (RFM analysis)
Sentiment analysis on customer reviews
Customer Lifetime Value (CLTV) estimation
Geolocation analysis

šŸš› Seller & Logistics Insights

Seller performance analysis
Shipping delays & logistics efficiency
Seller ratings & ML-based performance prediction

šŸ“¢ Seller Marketing Insights

Seller lead conversion analysis
Seller classification using ML techniques

šŸ›ļø Payment Methods Insights

Payment method distribution & profitability trends

šŸ›ļø Product & Payment Insights

Most profitable products & categories
Product listing quality analysis
šŸ”¹ Key KPIs such as total sales, AOV, ratings, and delivery delays are highlighted in the Tableau dashboard for executive insights.

šŸ“‚ Dataset Reference

This project utilises the OList Brazilian e-commerce dataset, available on Kaggle:
šŸ“Œ OList Dataset
Datasets can be downloaded and explored using these links.

šŸ™Œ Acknowledgements

Special thanks to: šŸ’” ChatGPT – For assisting in structuring the analysis, refining documentation, and optimising project workflow.

šŸ“œ Licence

This project is open-source under the MIT Licence. āš ļø The dataset usage is subject to the terms and conditions of the dataset provider.
šŸš€ Contributions & feedback are welcome! Feel free to fork, modify, and enhance this project.
Like this project

Posted Apr 24, 2025

Analyzed OList e-commerce data for business insights using Python and Tableau.