敵対的攻撃

Adversarial Demonstration Attacks on Large Language Models

Authors: Jiongxiao Wang, Zichen Liu, Keun Hee Park, Zhuojun Jiang, Zhaoheng Zheng, Zhuofeng Wu, Muhao Chen, Chaowei Xiao | Published: 2023-05-24 | Updated: 2023-10-14
悪意のあるデモ構築
敵対的サンプル
敵対的攻撃

Poisoning Web-Scale Training Datasets is Practical

Authors: Nicholas Carlini, Matthew Jagielski, Christopher A. Choquette-Choo, Daniel Paleka, Will Pearce, Hyrum Anderson, Andreas Terzis, Kurt Thomas, Florian Tramèr | Published: 2023-02-20 | Updated: 2024-05-06
ポイズニング
攻撃シナリオ分析
敵対的攻撃

Boosting Adversarial Robustness From The Perspective of Effective Margin Regularization

Authors: Ziquan Liu, Antoni B. Chan | Published: 2022-10-11
ポイズニング
性能評価指標
敵対的攻撃

Characterizing Internal Evasion Attacks in Federated Learning

Authors: Taejin Kim, Shubhranshu Singh, Nikhil Madaan, Carlee Joe-Wong | Published: 2022-09-17 | Updated: 2023-10-21
ポイズニング
敵対的攻撃
適応型敵対的訓練

Membership Inference Attacks by Exploiting Loss Trajectory

Authors: Yiyong Liu, Zhengyu Zhao, Michael Backes, Yang Zhang | Published: 2022-08-31
メンバーシップ推論
モデルアーキテクチャ
敵対的攻撃

A Black-Box Attack on Optical Character Recognition Systems

Authors: Samet Bayram, Kenneth Barner | Published: 2022-08-30
敵対的サンプル
敵対的攻撃
最適化手法

Architectural Backdoors in Neural Networks

Authors: Mikel Bober-Irizar, Ilia Shumailov, Yiren Zhao, Robert Mullins, Nicolas Papernot | Published: 2022-06-15
敵対的学習
敵対的攻撃
脅威モデル

Statically Detecting Adversarial Malware through Randomised Chaining

Authors: Matthew Crawford, Wei Wang, Ruoxi Sun, Minhui Xue | Published: 2021-11-28 | Updated: 2021-12-04
マルウェア検出手法
敵対的攻撃
防御手法

Dissecting Malware in the Wild

Authors: Hamish Spencer, Wei Wang, Ruoxi Sun, Minhui Xue | Published: 2021-11-28 | Updated: 2021-12-04
バックドア攻撃
マルウェア検出手法
敵対的攻撃

The Geometry of Adversarial Training in Binary Classification

Authors: Leon Bungert, Nicolás García Trillos, Ryan Murray | Published: 2021-11-26 | Updated: 2022-08-01
敵対的攻撃
正則化
非局所変分正則化