Implementing Real-time streaming pipeline
Built Flask api to consume and handle json payloads dynamically, adding streaming functionality using Kafka streams, spark streaming and big data
Data Engineer
Kafka
Python
Built data pipelines on AWS cloud
Built scala spark framework to enable full and incremental data load to redshift. Enabled near real time pipelines using lambda functions.
Data Engineer
AWS
AWS Lambda
AWS RDS
Python
Data scrapped from 100+ APIs
Developed application to seamlessly scrap data from multiple api's and apply transformations on top of it.
Data Scraper
Data Engineer
BeautifulSoup
Python
Scrapy