More snapshots are available in docs/ if you need different angles (Screenshot 2025-11-17 035359.png, ...035416.png, ...035444.png).

Directory Guide

    AI WAIFU/
    ├─ backend/                  # FastAPI app
    │  ├─ routers/               # API routes (chat, stt, tts, debug)
    │  ├─ services/              # Integrations (FunASR, GPT-SoVITS, LLM, viseme)
    │  ├─ audio/generated/       # TTS output + cache
    │  ├─ config.py              # Settings model (Pydantic)
    │  ├─ main.py                # FastAPI entrypoint
    │  └─ env_template.txt       # .env reference
    ├─ frontend/                 # React + Vite SPA
    │  ├─ src/hooks/             # useStreamingStt, useWebSpeechStt
    │  ├─ src/pages/Chat.jsx     # Main UI / mic logic / avatar
    │  └─ public/models/         # VRM avatars
    ├─ GPT-SoVITS/               # vendored GPT-SoVITS server
    │  └─ riko_reference/        # voice reference assets
    ├─ scripts/                  # PowerShell launch helpers
    ├─ fbx2vrma-converter/       # Mixamo FBX -> VRMA CLI from nanumatt/fbx2vrma-converter
    └─ tools/                    # FFmpeg, Rhubarb, pixiv three-vrm helper packages