Spotlight Poster Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM PST

Reinforcement Learning for Out-of-Distribution Reasoning in LLMs: An Empirical Study on Diagnosis-Related Group Coding

Hanyin Wang ⋅ Zhenbang Wu ⋅ Gururaj Kolar ⋅ Hariprasad Korsapati ⋅ Brian Bartlett ⋅ Bryan Hull ⋅ Jimeng Sun
