Understand, clean and organize data to facilitate the Data Science step.
This work can be done with many technologies, for example Python, Airflow, AWS and Excel.
These can be combined with other tools that simplify the process, such as OneDrive, GitHub and Miro.
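
As a rough illustration of what "clean and organize" can mean in practice, here is a minimal Python sketch that uses only the standard library. The sample data, the column names (`name`, `city`, `age`) and the `clean_rows` helper are hypothetical and chosen purely for the example; a real pipeline would likely read from files or a database and apply rules specific to its data.

```python
import csv
import io

# Hypothetical raw input: stray whitespace, inconsistent casing,
# one duplicate record and one row with a missing age.
RAW_CSV = """name,city,age
 Alice ,new york,30
Bob,Chicago,
alice,New York,30
Carol, boston ,41
"""


def clean_rows(raw_text):
    """Return cleaned, de-duplicated rows sorted by name (illustrative helper)."""
    reader = csv.DictReader(io.StringIO(raw_text))
    seen = set()
    cleaned = []
    for row in reader:
        # Clean: trim whitespace and normalise casing.
        name = row["name"].strip().title()
        city = row["city"].strip().title()
        age = row["age"].strip()
        # Drop rows with a missing age rather than guessing a value.
        if not age:
            continue
        # Skip records that are identical after normalisation.
        key = (name, city, age)
        if key in seen:
            continue
        seen.add(key)
        cleaned.append({"name": name, "city": city, "age": int(age)})
    # Organize: sort so later steps receive a stable order.
    return sorted(cleaned, key=lambda r: r["name"])


if __name__ == "__main__":
    for row in clean_rows(RAW_CSV):
        print(row)
```

Dropping incomplete rows is only one possible policy; depending on the analysis, filling in or flagging missing values may be more appropriate.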