What you get: a RAG pipeline over your docs, vector store setup, source citations in every answer, system-prompt tuning, a deployed endpoint plus a simple chat UI, and a short demo of it answering correctly. Tools: Python, FastAPI, LangChain, a vector database (Qdrant or Chroma), and Next.js.