Interactive Streamlit dashboard backed by DuckDB — 167,858,646 NYC Yellow Cab records across 4 years (2022–2025),
48 Parquet files, queried live on a single workstation.
Features: KPI cards (total trips, fare revenue, YoY change), monthly trip volume line chart by year, payment type
shift analysis 2022→2025, interactive year/month filters. All queries run in DuckDB — no database server, no
cluster, no Spark.
This is the same data stack delivered to clients. Raw Parquet files in, live dashboard out — pipeline built once,
runs automatically.
Built with: DuckDB 1.4 · Python · Streamlit · Plotly · Pandas