Financial RAG System — Fine-Tuned LLM, 100% Citation Rate by Rushikesh SagarFinancial RAG System — Fine-Tuned LLM, 100% Citation Rate by Rushikesh Sagar

Financial RAG System — Fine-Tuned LLM, 100% Citation Rate

Rushikesh Sagar

Rushikesh Sagar

Dalal Street

Two-track system for Indian equity markets Quantitative analysis + fundamental analysis — both serving one goal: a systematic framework for Indian equity investment decisions.

The Central Goal

Indian equity markets need two kinds of analysis working together.
Quantitative track: What do the price and volatility signals say? Which regime is the market in? Which stocks have momentum? When should capital be deployed or pulled back?
Fundamental track: What does the annual report say about this company? Is it a Tier 1 compounder or a Tier 2 growth stock? Is the management guidance credible? Is ROCE above 18% for three consecutive years?
Neither track alone is sufficient. A quant signal tells you when to act. A fundamental signal tells you what to act on. Both eventually feed the three-tier graduation framework — a rule-based system for classifying Indian equities into Tier 1 (compounders), Tier 2 (growth), and Tier 3 (early stage) based on specific financial thresholds.

Two Tracks at a Glance

Quant Track RAG Track Goal Volatility regimes + portfolio signals Fundamental analysis + tier classification Data Daily OHLCV · 16 Nifty 50 assets · 2015–2025 Annual reports · 11 companies · FY22–FY25 Models GARCH · HAR-V · LightGBM HMM labels + 5-stage RAG + fine-tuned LLM Phase 1 ✅ Complete ✅ Complete Phase 1.5 — ✅ Complete — model published on HuggingFace Phase 2 🔧 Designing 🔧 Designing Headline Signal exists — cost drag consumed it Citation rate 0% → 100% README → Quant → RAG

System Vision


Current Status

Track Phase Status Headline Quant Phase 1 ✅ Complete RMSE 0.0076 · signal exists · cost drag killed it RAG Phase 1 ✅ Complete 5 companies · 6 modules · citation issue found RAG Phase 1.5 ✅ Complete 11 companies · citation 0% → 100% · model published Quant Phase 2 🔧 Designing Regime detection · conditional allocation RAG Phase 2 🔧 Designing Concalls · quarterly reports · agentic retrieval

Navigate

📊 Quant Track Volatility forecasting · GARCH · HAR-V · LightGBM · backtesting 📄 RAG Track Document AI · RAG pipeline · fine-tuned LLM · tier classification 🤗 Published Model Rushisagar221/dalal-street-financial-llm

About

Final-year CS student in Pune building toward ML engineering roles at YC-backed startups. This two-track system is the fundamental analysis component of a longer-term goal: a systematic framework for Indian equity investment — both quantitative signals and fundamental quality filters informing one allocation decision.
Other projects:
Crypto Quant — Two-phase quantitative system · LSTM + GNN + PPO · Paper trading live on Binance
Poker PPO Bot — Deployed RL agent · FastAPI backend · React frontend · three difficulty levels
Dalal Street · Quant + RAG · Indian equity markets · Pune, India
Like this project

Posted Apr 27, 2026

RAG pipeline over 44 annual reports. Fine-tuned Llama 3.2-3B with QLoRA — 100% citation rate, 100% tier accuracy across 11 companies. FastAPI deployed.