AI-powered medical transcription tool aiming to assist doctors at private clinics to transcribe their medical examination and issue reports faster
🚀 Project Overview
Problem: Slovenian doctors waste hours manually transcribing patient notes, risking errors and burnout.
Solution: Built a secure, AI-powered tool to automate medical dictation transcription while preserving doctor-specific formatting preferences.
Key Features:
Slovenian speech-to-text fine-tuned for medical terminology
Real-time transcription progress via WebSockets
GDPR-compliant patient data workflows
Custom formatting presets per doctor
Browser-based recording with auto-save fallback
My Role: Sole Full-Stack Developer & Product Designer
Timeline: 3 months (MVP to launch)
🛠️ Technical Stack
Frontend:
Next.js (App Router) + Vite (React)
Shadcn UI + Tailwind
WebSockets for real-time updates
Backend:
Node.js 23 + Express + Typescript
PostgreSQL + Prisma ORM
Google Cloud Tasks (queue management)
AI/ML:
Google Vertex AI Speech-to-Text API
Custom model trained on 1,000+ doctor-corrected Slovenian transcripts
DevOps:
Dockerized monorepo (FE/BE/DB)
Hetzner VPS (Ubuntu 24) + Nginx
GCP Storage (temp) + Backblaze (archive)
Auth: Auth0 (JWT sessions + user caching)
💡 Technical Challenges & Solutions
1. Slovenian Medical Speech Recognition
Trained Vertex AI model using paired raw/corrected transcripts
Added context-aware correction (e.g., "kardio-log" vs "kardiolog")