Built an AI evaluation and testing workflow to measure and improve LLM quality on code-focused tasks (Swift) and general NLP tasks. Delivered automated evaluation suites, human-in-the-loop review workflows, and iterative prompt-optimization cycles that improved accuracy, reliability, and consistency across releases. Designed metrics, test sets, and dashboards that made model quality visible and actionable for product and engineering teams.
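For illustration only, a minimal sketch of the kind of automated evaluation harness this workflow centers on: running each prompt in a labeled test set through the model under test, scoring the output, and reporting a pass rate per task category. The file name, test-set format, exact_match scorer, and evaluate function are hypothetical stand-ins under assumed conventions, not the production implementation.

```python
# Minimal evaluation-harness sketch (hypothetical names and test-set format).
# Loads a labeled test set, scores model outputs against expected answers,
# and reports an aggregate pass rate per task category.
import json
from collections import defaultdict
from typing import Callable, Dict


def exact_match(output: str, expected: str) -> bool:
    """Simplest possible scorer: whitespace-normalized string equality."""
    return output.strip() == expected.strip()


def evaluate(test_set_path: str, generate: Callable[[str], str]) -> Dict[str, float]:
    """Run every test case through `generate` (the model under test) and score it."""
    passed: Dict[str, int] = defaultdict(int)
    total: Dict[str, int] = defaultdict(int)
    with open(test_set_path) as f:
        # Assumed format: [{"prompt": ..., "expected": ..., "category": ...}, ...]
        cases = json.load(f)
    for case in cases:
        category = case.get("category", "general")
        total[category] += 1
        if exact_match(generate(case["prompt"]), case["expected"]):
            passed[category] += 1
    return {category: passed[category] / total[category] for category in total}


if __name__ == "__main__":
    # Stub model for demonstration; in practice this would call the LLM under test.
    scores = evaluate("test_set.json", generate=lambda prompt: "42")
    for category, pass_rate in sorted(scores.items()):
        print(f"{category}: {pass_rate:.1%}")
```

In practice the exact-match scorer would be swapped for task-appropriate checks (for example, compiling or unit-testing generated Swift code, or rubric-based human review), and the per-category pass rates would feed the dashboards tracked across releases.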