notesum.ai

Published at December 3

Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining

cs.CL
cs.AI
cs.CR

Released Date: December 3, 2024

Authors: Zongru Wu1, Pengzhou Cheng, Lingyong Fang, Zhuosheng Zhang, Gongshen Liu

Aff.: 1Shanghai Jiao Tong University

Arxiv: http://arxiv.org/pdf/2412.02454v1