Knowledge Distillation

Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation

Authors: Shuai Zhao, Xiaobao Wu, Cong-Duy Nguyen, Yanhao Jia, Meihuizi Jia, Yichao Feng, Luu Anh Tuan | Published: 2024-10-18 | Updated: 2025-05-20

Backdoor Detection

Backdoor Attack Techniques

Knowledge Distillation

2024.10.18 2025.05.28

Literature Database

Knowledge Distillation with Adversarial Samples Supporting Decision Boundary

Authors: Byeongho Heo, Minsik Lee, Sangdoo Yun, Jin Young Choi | Published: 2018-05-15 | Updated: 2018-12-14

Adversarial Example

Adversarial Attack Detection

Knowledge Distillation

2018.05.15 2025.05.28