Worked on a Reinforcement Learning from Human Feedback (RLHF) pipeline covering dataset creation, data annotation, and model evaluation. Designed and curated high-quality prompt datasets, reviewed AI-generated responses, and provided structured feedback on accuracy, relevance, safety, and helpfulness. Contributed to improved model performance by applying consistent evaluation standards and supplying high-quality human feedback for alignment training and refinement.