firstbacksecondback
17 Results
Workshop
|
Weak to Strong Learning from Aggregate Labels Yukti Makhija · Rishi Saket |
||
Poster
|
Wed 16:30 |
On scalable oversight with weak LLMs judging strong LLMs Zachary Kenton · Noah Siegel · Janos Kramar · Jonah Brown-Cohen · Samuel Albanie · Jannis Bulian · Rishabh Agarwal · David Lindner · Yunhao Tang · Noah Goodman · Rohin Shah |
|
Poster
|
Fri 11:00 |
On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion Chenghao Fan · Zhenyi Lu · Wei Wei · Jie Tian · Xiaoye Qu · Dangyang Chen · Yu Cheng |
|
Oral
|
Wed 15:30 |
Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs Matthew Zurek · Yudong Chen |
|
Poster
|
Wed 16:30 |
Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs Matthew Zurek · Yudong Chen |