In training a Large Language Model, I was responsible for the model's performance through rigorous review and sanity checks. One of the key areas I specialized in was RLHF, short for Reinforcement Learning from Human Feedback. This involved refining the algorithms that enable large language models to learn from and adapt to feedback, optimizing existing code to ensure maximum efficiency. I also evaluated the quality of AI-generated code, by providing feedback to enhance response quality in English and Hindi.