This project will test your knowledge of the various tools related to batch processing, which you have learnt throughout this course. The project mainly revolves around Apache Sqoop, Apache PySpark, Amazon S3 and Amazon RedShift, which are some of the most widely used tools in the industry.