Skip to yearly menu bar Skip to main content


Internal Value Functions: Leveraging Hidden States for Efficient Test-Time Scaling in Large Reasoning Models

Duc Khiem Pham ⋅ Sai Muralidhar Jayanthi ⋅ Saket Dingliwal ⋅ Bhavana Ganesh ⋅ Karthik Valmeekam ⋅ Xiangchen Song ⋅ Vivek Govindan ⋅ Beidi Chen ⋅ Sravan Babu Bodapati ⋅ Aram Galstyan

Abstract

Chat is not available.