Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Reliable ML from Unreliable Data

AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin

Shuo Yang ⋅ Qihui Zhang ⋅ Yuyang Liu ⋅ Yue Huang ⋅ Xiaojun Jia ⋅ Kun-Peng Ning ⋅ Jia-Yu Yao ⋅ jigang wang ⋅ Dai Hailiang ⋅ Yibing Song ⋅ Li Yuan
2025 Poster
in
Workshop: Reliable ML from Unreliable Data

Abstract

Chat is not available.