Technical Deep Learning

Insights into models & techniques I've worked closely with.

Preference Alignment using the LLM-as-judge Approach

September 15, 2023

Tutorial to train CodeLLMA with strong supervision from GPT-4 and other LLMs

Conceptual image of LLM preference alignment