Interactive Streamlit dashboard backed by by Scott BakerInteractive Streamlit dashboard backed by by Scott Baker

Interactive Streamlit dashboard backed by

Scott Baker

Scott Baker

Interactive Streamlit dashboard backed by DuckDB — 167,858,646 NYC Yellow Cab records across 4 years (2022–2025),   48 Parquet files, queried live on a single workstation.                                                                                                                         Features: KPI cards (total trips, fare revenue, YoY change), monthly trip volume line chart by year, payment type   shift analysis 2022→2025, interactive year/month filters. All queries run in DuckDB — no database server, no  cluster, no Spark.                                                                                                                                                               This is the same data stack delivered to clients. Raw Parquet files in, live dashboard out — pipeline built once,   runs automatically.                                                                                                                      Built with: DuckDB 1.4 · Python · Streamlit · Plotly · Pandas    
Like this project

Posted May 14, 2026

Interactive Streamlit dashboard backed by DuckDB — 167,858,646 NYC Yellow Cab records across 4 years (2022–2025),   48 Parquet files, queried live on a sing...