スパース性防御

Sparse Models, Sparse Safety: Unsafe Routes in Mixture-of-Experts LLMs

Authors: Yukun Jiang, Hai Huang, Mingjie Li, Yage Zhang, Michael Backes, Yang Zhang | Published: 2026-02-09
スパース性防御
プロンプトインジェクション
安全性分析

Sparsity-based Defense against Adversarial Attacks on Linear Classifiers

Authors: Zhinus Marzi, Soorya Gopalakrishnan, Upamanyu Madhow, Ramtin Pedarsani | Published: 2018-01-15 | Updated: 2018-06-19
スパース性防御
敵対的学習
敵対的攻撃