firstbacksecondback
3 Results
Poster
|
Wed 9:00 |
Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models Maribeth Rauh · John Mellor · Jonathan Uesato · Po-Sen Huang · Johannes Welbl · Laura Weidinger · Sumanth Dathathri · Amelia Glaese · Geoffrey Irving · Iason Gabriel · William Isaac · Lisa Anne Hendricks |
|
Poster
|
Tue 14:00 |
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models Boxin Wang · Wei Ping · Chaowei Xiao · Peng Xu · Mostofa Patwary · Mohammad Shoeybi · Bo Li · Anima Anandkumar · Bryan Catanzaro |
|
Workshop
|
Combating Toxicity in Online Games with HCAI Regan Mandryk · Julian Frommel |