• Containerized a Retrieval-Augmented Generation (RAG) API using Docker for consistent, portable deployments
• Deployed the application to a Kubernetes cluster (Minikube) using Deployments & Services
• Configured NodePort Service for external API access and proper traffic routing
• Demonstrated Kubernetes self-healing by deleting pods and validating automatic recovery
• Implemented semantic testing to validate response accuracy and data quality
• Integrated CI/CD pipeline using GitHub Actions for automated testing on every push
• Designed mock LLM mode to ensure deterministic, repeatable CI test results
• Structured project for scalability with clean separation of code, tests, workflows, and embeddings
#LLMOps#GitHubActions #CICD#PlatformEngineering