
Published on October 18

Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation

cs.CV
cs.AI

Release Date: October 18, 2024

Authors: Shuai Zhao¹, Xiaobao Wu¹, Cong-Duy Nguyen¹, Meihuizi Jia¹, Yichao Feng¹, Luu Anh Tuan¹

Aff.: ¹Nanyang Technological University, Singapore

arXiv: https://arxiv.org/abs/2410.14425v1