敵対的攻撃手法

Shared Adversarial Unlearning: Backdoor Mitigation by Unlearning Shared Adversarial Examples

Authors: Shaokui Wei, Mingda Zhang, Hongyuan Zha, Baoyuan Wu | Published: 2023-07-20
バックドア攻撃
敵対的攻撃手法
透かし評価

Jailbroken: How Does LLM Safety Training Fail?

Authors: Alexander Wei, Nika Haghtalab, Jacob Steinhardt | Published: 2023-07-05
セキュリティ保証
プロンプトインジェクション
敵対的攻撃手法

Your Attack Is Too DUMB: Formalizing Attacker Scenarios for Adversarial Transferability

Authors: Marco Alecci, Mauro Conti, Francesco Marchiori, Luca Martinelli, Luca Pajola | Published: 2023-06-27
マルウェア分類
敵対的サンプル
敵対的攻撃手法

Are aligned neural networks adversarially aligned?

Authors: Nicholas Carlini, Milad Nasr, Christopher A. Choquette-Choo, Matthew Jagielski, Irena Gao, Anas Awadalla, Pang Wei Koh, Daphne Ippolito, Katherine Lee, Florian Tramer, Ludwig Schmidt | Published: 2023-06-26 | Updated: 2024-05-06
プロンプトインジェクション
敵対的サンプル
敵対的攻撃手法

On the Resilience of Machine Learning-Based IDS for Automotive Networks

Authors: Ivo Zenden, Han Wang, Alfonso Iacovazzi, Arash Vahidi, Rolf Blom, Shahid Raza | Published: 2023-06-26
マルウェア検出手法
敵対的攻撃手法
車両ネットワーク

Adversarial Robustness in Unsupervised Machine Learning: A Systematic Review

Authors: Mathias Lundteigen Mohus, Jinyue Li | Published: 2023-06-01
プライバシー保護手法
ポイズニング
敵対的攻撃手法

Constructing Semantics-Aware Adversarial Examples with a Probabilistic Perspective

Authors: Andi Zhang, Mingtian Zhang, Damon Wischik | Published: 2023-06-01 | Updated: 2024-11-24
ポイズニング
拡散モデル
敵対的攻撃手法

Verifiable Learning for Robust Tree Ensembles

Authors: Stefano Calzavara, Lorenzo Cazzaro, Giulio Ermanno Pibiri, Nicola Prezza | Published: 2023-05-05 | Updated: 2023-11-11
ランダムフォレスト
敵対的攻撃手法
決定木

SoK: Pragmatic Assessment of Machine Learning for Network Intrusion Detection

Authors: Giovanni Apruzzese, Pavel Laskov, Johannes Schneider | Published: 2023-04-30
サイバーセキュリティ
敵対的攻撃手法
運用シナリオ

SoK: Adversarial Machine Learning Attacks and Defences in Multi-Agent Reinforcement Learning

Authors: Maxwell Standen, Junae Kim, Claudia Szabo | Published: 2023-01-11
DNN IP保護手法
敵対的攻撃手法
構造的攻撃