Designed and built a production AI pipeline for generative models.
The system includes GPU inference services, model orchestration, and automated quality evaluation. It supports SDXL / Flux models, API-based inference, and scalable deployment using FastAPI and Docker.
The project focuses on building reliable AI infrastructure for real-world applications with optimized GPU performance and modular microservice architecture.