敵対的サンプル

PhishLang: A Lightweight, Client-Side Phishing Detection Framework using MobileBERT for Real-Time, Explainable Threat Mitigation

Authors: Sayak Saha Roy, Shirin Nilizadeh | Published: 2024-08-11 | Updated: 2024-09-09
フィッシング検出
敵対的サンプル
説明可能なブロックリスト

LaFA: Latent Feature Attacks on Non-negative Matrix Factorization

Authors: Minh Vu, Ben Nebgen, Erik Skau, Geigh Zollicoffer, Juan Castorena, Kim Rasmussen, Boian Alexandrov, Manish Bhattarai | Published: 2024-08-07
ウォーターマーキング
攻撃手法
敵対的サンプル

Can Reinforcement Learning Unlock the Hidden Dangers in Aligned Large Language Models?

Authors: Mohammad Bahrami Karkevandi, Nishant Vishwamitra, Peyman Najafirad | Published: 2024-08-05
プロンプトインジェクション
強化学習
敵対的サンプル

On the Robustness of Malware Detectors to Adversarial Samples

Authors: Muhammad Salman, Benjamin Zi Hao Zhao, Hassan Jameel Asghar, Muhammad Ikram, Sidharth Kaushik, Mohamed Ali Kaafar | Published: 2024-08-05
ウォーターマーキング
マルウェア分類
敵対的サンプル

A Geometric Framework for Adversarial Vulnerability in Machine Learning

Authors: Brian Bell | Published: 2024-07-03
ポイズニング
敵対的サンプル
文献リスト

The Effect of Similarity Measures on Accurate Stability Estimates for Local Surrogate Models in Text-based Explainable AI

Authors: Christopher Burger, Charles Walter, Thai Le | Published: 2024-06-22 | Updated: 2025-01-17
敵対的サンプル
評価手法
類似性測定

Nonlinear Transformations Against Unlearnable Datasets

Authors: Thushari Hapuarachchi, Jing Lin, Kaiqi Xiong, Mohamed Rahouti, Gitte Ost | Published: 2024-06-05
データ保護手法
モデル性能評価
敵対的サンプル

Updating Windows Malware Detectors: Balancing Robustness and Regression against Adversarial EXEmples

Authors: Matous Kozak, Luca Demetrio, Dmitrijs Trizna, Fabio Roli | Published: 2024-05-04
マルウェア分類
敵対的サンプル
敵対的訓練

Attacking Bayes: On the Adversarial Robustness of Bayesian Neural Networks

Authors: Yunzhen Feng, Tim G. J. Rudner, Nikolaos Tsilivis, Julia Kempe | Published: 2024-04-27
不確実性の定量化
敵対的サンプル
透かし評価

Evaluations of Machine Learning Privacy Defenses are Misleading

Authors: Michael Aerni, Jie Zhang, Florian Tramèr | Published: 2024-04-26 | Updated: 2024-09-05
プライバシー保護手法
メンバーシップ推論
敵対的サンプル