40 Results
Type | Time | Title | Authors
Workshop | | Trustworthy Human-AI Interaction Through Agreement Protocols | Natalie Collina · Surbhi Goel · Varun Gupta · Aaron Roth
Workshop | | Neural Interactive Proofs | Lewis Hammond · Sam Adam-Day
Affinity Event | Tue 14:00 | Invited Talk 2 by Lama Ahmad (Technical Program Manager, Trustworthy AI at OpenAI): Human and AI Evaluations for Safety and Robustness Testing | Lama Ahmad
Poster | Wed 16:30 | Learning the Latent Causal Structure for Modeling Label Noise | Yexiong Lin · Yu Yao · Tongliang Liu
 | Thu 19:30 | Space and AI | Anne Spalding · Gabriel Sutherland · Alexander Lavin
Workshop | Sat 12:00 | Weak-to-Strong Confidence Prediction | Yukai Yang · Tracy Zhu · Marco Morucci · Tim G. J. Rudner
Poster | Thu 11:00 | What If the Input is Expanded in OOD Detection? | Boxuan Zhang · Jianing Zhu · Zengmao Wang · Tongliang Liu · Bo Du · Bo Han
Workshop | Sun 9:00 | Towards Safe & Trustworthy Agents | Alexander Pan · Kimin Lee · Bo Li · Karthik Narasimhan · Dawn Song · Isabelle Barrass
Poster | Wed 16:30 | Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic Space | Xin Qiu · Risto Miikkulainen
Poster | Wed 11:00 | Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents | Wenkai Yang · Xiaohan Bi · Yankai Lin · Sishuo Chen · Jie Zhou · Xu Sun
Poster | Fri 16:30 | CURE4Rec: A Benchmark for Recommendation Unlearning with Deeper Influence | Chaochao Chen · Jiaming Zhang · Yizhao Zhang · Li Zhang · Lingjuan Lyu · Yuyuan Li · Biao Gong · Chenggang Yan
Poster | Wed 16:30 | Few-Shot Adversarial Prompt Learning on Vision-Language Models | Yiwei Zhou · Xiaobo Xia · Zhiwei Lin · Bo Han · Tongliang Liu