This project demonstrates the creation and analysis of a data pipeline using Snowflake. The focus is on analyzing the distribution of products across different plants in a supply chain.
Data Description
ProductsPerPlant.csv: Contains the plant codes and corresponding product IDs.
TopPlants_ProductCount.csv: Shows the top plants by the number of unique products they handle.
ProductDistribution_Pattern.csv: Displays the distribution pattern of products across plants.
Detailed_ProductList_PerPlant.csv: Provides a detailed list of products for each plant.
Analysis
The analysis includes:
Count of products per plant.
Top plants by product count.
Product distribution pattern.
Detailed product list per plant.
Results
Visualizations created using Excel.
SQL scripts used for data transformation and analysis.
How to Run the Project
Load the data into Snowflake using the load_data.sql script.
Perform the analysis using the data_analysis.sql script.