Preference Alignment using the LLM-as-judge Approach
A tutorial on training CodeLlama with strong supervision from GPT-4 and other LLMs
![Conceptual image of LLM preference alignment](test.png)
Insights into models & techniques I've worked closely with.