streaming pipeline and data warehousing with stock market data

Mohammad Roshaan Butt

This project aims to construct a comprehensive Streaming Pipeline using the capabilities of DBT Cloud, Snowflake, and Amazon Kinesis to handle and process Stock Market Data. The project's objectives encompass various pivotal tasks, including setting up the DBT configuration within a cloud-based environment. It involves the creation of a dynamic data ingestion pipeline seamlessly integrated with a streaming data source, specifically Amazon Kinesis Firehose. The ingested data is then directed to a Raw Ingestion Layer hosted on Amazon S3, with data extraction from the yfinance library accomplished through Python code. The project also involves crafting a robust data transformation and consumption layer utilizing DBT Cloud's advanced features. The orchestration of the DBT pipeline within the DBT Cloud ecosystem ensures efficient execution, while the project's scope extends to streamlining Git merging and pipeline execution within the DBT Cloud environment. Additionally, the integration of alerting and notification functionalities provided by DBT Cloud enhances the project's overall functionality. In summary, this project strives to harmonize these technologies, resulting in a proficient Streaming Pipeline for real-time processing, transformation, and analysis of Stock Market Data.
Like this project
0

Posted Sep 8, 2024

streaming ETL pipeline using DBT and snowflake for stock data analysis

Automated batch processing for POS transactions
Automated batch processing for POS transactions
E Commerce Analytics Pipeline
E Commerce Analytics Pipeline