The dataset provided / made will be first cleaned and pre-processed to make it ready for the ML model. This will also be provided to you in the format of your choice. This clean data will ensure that if you or someone else needs to build another custom ML model for it, they already have the data ready for it.