firstbacksecondback
6 Results
Workshop
|
Prioritization Strategies for LLM-Designed Restless Bandit Rewards in Public Health Shresth Verma · Niclas Boehmer · Lingkai Kong · Milind Tambe |
||
Workshop
|
Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards Shresth Verma · Niclas Boehmer · Lingkai Kong · Milind Tambe |
||
Workshop
|
From Laws to Motivation: Guiding Exploration through Law-Based Intrinsic Reasoning and Rewards Ziyu Chen · Zhiqing Xiao · Xinbei Jiang · Junbo Zhao |
||
Workshop
|
Mechanism Design for LLM Fine-tuning with Multiple Reward Models Haoran Sun · Yurong Chen · Siwei Wang · Wei Chen · Xiaotie Deng |
||
Workshop
|
Fine-Tuning Discrete Diffusion Models via Reward Optimization: Applications to DNA and Protein Design Chenyu Wang · Masatoshi Uehara · Yichun He · Amy Wang · Tommaso Biancalani · Avantika Lal · Tommi Jaakkola · Sergey Levine · Hanchen Wang · Aviv Regev |
||
Workshop
|
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design Chenyu Wang · Masatoshi Uehara · Yichun He · Amy Wang · Tommaso Biancalani · Avantika Lal · Tommi Jaakkola · Sergey Levine · Hanchen Wang · Aviv Regev |