This project models user activity data for Sparkify, a music streaming app, in Apache Cassandra. The tables are designed around the queries needed to understand which songs users are listening to.
Build an ETL step that iterates over the raw event dataset and generates a new dataset
Create Apache Cassandra tables designed to answer 3 specific questions
Insert data from the new dataset into the Apache Cassandra tables
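
As a rough illustration of the first step, the sketch below merges a directory of raw event CSV files into one new CSV that keeps only the columns the queries need. It is a minimal sketch and not the project's actual code: the function name `combine_event_files`, the assumption that the raw events are CSV files with a header row, and the column names passed in are all hypothetical.

```python
import csv
from pathlib import Path


def combine_event_files(input_dir, output_path, columns):
    """Merge every CSV file under input_dir into one CSV holding only `columns`.

    Hypothetical helper: the file layout and column names are assumptions.
    Returns the number of data rows written.
    """
    output_path = Path(output_path)
    rows_written = 0
    with open(output_path, "w", newline="", encoding="utf-8") as out:
        writer = csv.DictWriter(out, fieldnames=columns)
        writer.writeheader()
        # sorted() keeps the output order deterministic across runs
        for path in sorted(Path(input_dir).rglob("*.csv")):
            # Never read the output file back in if it lives under input_dir
            if path.resolve() == output_path.resolve():
                continue
            with open(path, newline="", encoding="utf-8") as src:
                for row in csv.DictReader(src):
                    # Skip rows with no value in any of the wanted columns
                    if not any(row.get(col) for col in columns):
                        continue
                    writer.writerow({col: row.get(col) or "" for col in columns})
                    rows_written += 1
    return rows_written
```

A call such as `combine_event_files("event_data", "events_new.csv", ["artist", "song"])` would then produce the new dataset; both paths and both column names here are placeholders.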
In this project I built a data model with Apache Cassandra (NoSQL) and an ETL pipeline in Python. I modelled a set of data and put them into table…
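
To make the table modelling and loading concrete, here is a hedged sketch using the DataStax `cassandra-driver` package. The table `songs_by_session`, its columns, and the query it serves are invented for illustration, since the 3 questions are not spelled out here. The point it shows is the usual Cassandra pattern of one table per query, with the primary key built from the columns that query filters on. The `connect` helper assumes a node on 127.0.0.1 and a keyspace that already exists.

```python
import csv

# Hypothetical table: partitioned by session, clustered by position in session.
CREATE_TABLE = """
CREATE TABLE IF NOT EXISTS songs_by_session (
    session_id int,
    item_in_session int,
    artist text,
    song text,
    PRIMARY KEY (session_id, item_in_session)
)
"""

# Non-prepared statements in cassandra-driver use %s placeholders.
INSERT_ROW = """
INSERT INTO songs_by_session (session_id, item_in_session, artist, song)
VALUES (%s, %s, %s, %s)
"""

# The query this table is shaped for: it filters on the full primary key.
SELECT_SONG = """
SELECT artist, song FROM songs_by_session
WHERE session_id = %s AND item_in_session = %s
"""


def connect(keyspace, hosts=("127.0.0.1",)):
    """Open a session; assumes a reachable node and an existing keyspace."""
    from cassandra.cluster import Cluster  # pip install cassandra-driver

    cluster = Cluster(list(hosts))
    return cluster, cluster.connect(keyspace)


def load_songs_by_session(session, csv_path):
    """Create the table and insert one row per CSV line; returns rows inserted.

    The CSV header names used below are assumptions, not the real dataset's.
    """
    session.execute(CREATE_TABLE)
    inserted = 0
    with open(csv_path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            params = (
                int(row["session_id"]),
                int(row["item_in_session"]),
                row["artist"],
                row["song"],
            )
            session.execute(INSERT_ROW, params)
            inserted += 1
    return inserted
```

After loading, `session.execute(SELECT_SONG, (session_id, item_in_session))` would answer the illustrative question for one session and position. For larger files, a statement prepared with `session.prepare` (which uses `?` placeholders instead of `%s`) would avoid re-parsing the insert on every row.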