More snapshots are available in docs/ if you need different angles (Screenshot 2025-11-17 035359.png, ...035416.png, ...035444.png).

Directory Guide

    AI WAIFU/
    ├── backend/                 # FastAPI app
    │   ├── routers/             # API routes (chat, stt, tts, debug)
    │   ├── services/            # Integrations (FunASR, GPT-SoVITS, LLM, viseme)
    │   ├── audio/generated/     # TTS output + cache
    │   ├── config.py            # Settings model (Pydantic)
    │   ├── main.py              # FastAPI entrypoint
    │   └── env_template.txt     # .env reference
    ├── frontend/                # React + Vite SPA
    │   ├── src/hooks/           # useStreamingStt, useWebSpeechStt
    │   ├── src/pages/Chat.jsx   # Main UI / mic logic / avatar
    │   └── public/models/       # VRM avatars
    ├── GPT-SoVITS/              # vendored GPT-SoVITS server
    │   └── riko_reference/      # voice reference assets
    ├── scripts/                 # PowerShell launch helpers
    ├── fbx2vrma-converter/      # Mixamo FBX -> VRMA CLI from nanumatt/fbx2vrma-converter
    └── tools/                   # FFmpeg, Rhubarb, pixiv three-vrm helper packages