機械学習の忘却

An Adversarial Perspective on Machine Unlearning for AI Safety

Authors: Jakub Łucki, Boyi Wei, Yangsibo Huang, Peter Henderson, Florian Tramèr, Javier Rando | Published: 2024-09-26 | Updated: 2025-04-10
プロンプトインジェクション
安全性アライメント
機械学習の忘却

Digital Forgetting in Large Language Models: A Survey of Unlearning Methods

Authors: Alberto Blanco-Justicia, Najeeb Jebreel, Benet Manzanares, David Sánchez, Josep Domingo-Ferrer, Guillem Collell, Kuan Eeik Tan | Published: 2024-04-02
LLM性能評価
プロンプトインジェクション
機械学習の忘却

Machine Unlearning for Traditional Models and Large Language Models: A Short Survey

Authors: Yi Xu | Published: 2024-04-01
データプライバシー評価
モデル性能評価
機械学習の忘却

Machine Unlearning: Taxonomy, Metrics, Applications, Challenges, and Prospects

Authors: Na Li, Chunyi Zhou, Yansong Gao, Hui Chen, Anmin Fu, Zhi Zhang, Yu Shui | Published: 2024-03-13
バックドア攻撃
メンバーシップ推論
機械学習の忘却

Towards Independence Criterion in Machine Unlearning of Features and Labels

Authors: Ling Han, Nanqing Luo, Hao Huang, Jing Chen, Mary-Anne Hartley | Published: 2024-03-12
ウォーターマーキング
プライバシー保護
機械学習の忘却

Unlearnable Algorithms for In-context Learning

Authors: Andrei Muresanu, Anvith Thudi, Michael R. Zhang, Nicolas Papernot | Published: 2024-02-01
Few-Shot Learning
アルゴリズム
機械学習の忘却

Layer Attack Unlearning: Fast and Accurate Machine Unlearning via Layer Level Attack and Knowledge Distillation

Authors: Hyunjune Kim, Sangyong Lee, Simon S. Woo | Published: 2023-12-28
ポイズニング
機械学習の忘却
透かし評価

Deep Unlearning: Fast and Efficient Gradient-free Approach to Class Forgetting

Authors: Sangamesh Kodge, Gobinda Saha, Kaushik Roy | Published: 2023-12-01 | Updated: 2024-08-05
ウォーターマーキング
機械学習の忘却
透かし評価

Selective Forgetting of Deep Networks at a Finer Level than Samples

Authors: Tomohiro Hayase, Suguru Yasutomi, Takashi Katoh | Published: 2020-12-22 | Updated: 2020-12-31
データ削除アルゴリズム
損失関数
機械学習の忘却

Amnesiac Machine Learning

Authors: Laura Graves, Vineel Nagisetty, Vijay Ganesh | Published: 2020-10-21
機械学習の忘却
法律遵守