I just built a real-time data dashboard to track what is
trending on YouTube! š
This tool helps creators and brands see which videos get the
most views and how viewers feel about them.
⨠Key Features
Live tracking: Updates automatically every two minutes.
Sentiment analysis: Scores viewer feelings across different video categories.
Smart filtering: Sorts data by country, category, and custom date ranges.
Top channel stats: Visualizes the biggest players by total view counts.
š ļø Tech Stack
Python, Streamlit, YouTube Data API, Data Visualization (Pandas & Plotly)
0
0
A few months ago, I was manually comparing source and target tables to find data issues.
Today, I built a Data Quality Monitoring Dashboard that automatically detects anomalies, tracks validation metrics, and provides real-time visibility into data health.
Here's what the project looked like š
āāāāāāāāāāāāāāāāāāāāā
š· The Problem
āāāāāāāāāāāāāāāāāāāāā
Modern data pipelines process thousands of records every day.
But one missing value, duplicate record, or failed transformation can impact reports, dashboards, and business decisions.
Teams often discover these issues too late.
My goal was simple:
Build a centralized dashboard that continuously monitors data quality and highlights issues before they become business problems.
āāāāāāāāāāāāāāāāāāāāā
āļø Step 1 ā Data Validation Framework
āāāāāāāāāāāāāāāāāāāāā
Created automated validation checks across multiple datasets.
The framework monitors:
ā Record count mismatches
ā Null value violations
ā Duplicate records
ā Data completeness
ā Business rule validation
ā Source-to-target reconciliation
Instead of manually checking tables, every validation runs automatically.
āāāāāāāāāāāāāāāāāāāāā
š Step 2 ā Monitoring Dashboard
āāāāāāāāāāāāāāāāāāāāā
Built an interactive dashboard to visualize data quality metrics.
Key sections included:
š¢ Data Quality Score
š Trend Analysis
ā ļø Failed Validation Checks
š Table-Level Health Monitoring
š Data Reconciliation Status
The dashboard enables teams to quickly identify issues and take corrective action.
āāāāāāāāāāāāāāāāāāāāā
š” Key Outcomes
āāāāāāāāāāāāāāāāāāāāā
ā Reduced manual validation effort
ā” Faster identification of data issues
š Improved reporting accuracy
š Increased visibility into pipeline health
š Enhanced trust in business-critical data
āāāāāāāāāāāāāāāāāāāāā
Tech Stack:
⢠SQL
⢠BigQuery
⢠ETL Pipelines
⢠Data Validation
⢠Data Quality Monitoring
⢠Data Warehousing
Data quality isn't just about fixing errors.
It's about building confidence in every business decision powered by data.
#DataEngineering #DataQuality #BigQuery #SQL #ETL #DataWarehouse #Analytics #Dashboard #PortfolioProject #DataGovernance
0
15
YouTube Trend Analyzer ā End-to-End Data Analytics Project
Built an end-to-end analytics solution to analyze YouTube trending videos using Python, SQL, and data visualization techniques. Extracted, transformed, and analyzed trending data to identify content patterns, engagement metrics, and category performance. Created interactive dashboards and generated actionable insights from large datasets.