This project was for a Data Engineering course. The goal of the project was to develop our analytical and data engineering skills. In the project we were given a large dataset with information about the Olympics from the first one to the most recent one. We were asked to clean the data and visualize it. Then we had to come up with 6 questions that were intriguing that we had to solve using the dataset. All of this was done in Python using its libraries such as Pandas, Numpy, Matplotlib and many others. We also used Airflow, the data engineering software, to write scripts that would run frequently to update the data and run the analysis periodically