Skip to yearly menu bar Skip to main content


Improving LLM Generation with Inverse and Forward Alignment: Reward Modeling, Prompting, Fine-Tuning, and Inference-Time Optimization

Hao Sun ⋅ Thomas Pouplin ⋅ Nicolás Astorga ⋅ Tennison Liu ⋅ Mihaela van der Schaar

Abstract

Chat is not available.