Skip to yearly menu bar Skip to main content


Oral

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Boxin Wang ⋅ Weixin Chen ⋅ Hengzhi Pei ⋅ Chulin Xie ⋅ Mintong Kang ⋅ Chenhui Zhang ⋅ Chejian Xu ⋅ Zidi Xiong ⋅ Ritik Dutta ⋅ Rylan Schaeffer ⋅ Sang Truong ⋅ Simran Arora ⋅ Mantas Mazeika ⋅ Dan Hendrycks ⋅ Zinan Lin ⋅ Yu Cheng ⋅ Sanmi Koyejo ⋅ Dawn Song ⋅ Bo Li
2023 Oral

Abstract

Video

Chat is not available.