firstbacksecondback
2 Results
Workshop
|
Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference Anton Xue · Avishree Khare · Rajeev Alur · Surbhi Goel · Eric Wong |
||
Workshop
|
Declarative characterizations of direct preference alignment algorithms Kyle Richardson · Vivek Srikumar · Ashish Sabharwal |