Secure Transfer Learning: Training Clean Models Against Backdoor in (Both) Pre-trained Encoders and Downstream Datasets Authors: Yechao Zhang, Yuxuan Zhou, Tianyu Li, Minghui Li, Shengshan Hu, Wei Luo, Leo Yu Zhang | Published: 2025-04-16 バックドアモデルの検知学習の改善防御手法の効果分析 2025.04.16 文献データベース
STShield: Single-Token Sentinel for Real-Time Jailbreak Detection in Large Language Models Authors: Xunguang Wang, Wenxuan Wang, Zhenlan Ji, Zongjie Li, Pingchuan Ma, Daoyuan Wu, Shuai Wang | Published: 2025-03-23 プロンプトインジェクション悪意のあるプロンプト防御手法の効果分析 2025.03.23 2025.04.03 文献データベース
Bias Busters: Robustifying DL-based Lithographic Hotspot Detectors Against Backdooring Attacks Authors: Kang Liu, Benjamin Tan, Gaurav Rajavendra Reddy, Siddharth Garg, Yiorgos Makris, Ramesh Karri | Published: 2020-04-26 ポイズニング深層学習技術防御手法の効果分析 2020.04.26 2025.04.03 文献データベース
Minimax Defense against Gradient-based Adversarial Attacks Authors: Blerta Lindqvist, Rauf Izmailov | Published: 2020-02-04 敵対的摂動手法敵対的移転性防御手法の効果分析 2020.02.04 2025.04.03 文献データベース
Defending Adversarial Attacks via Semantic Feature Manipulation Authors: Shuo Wang, Tianle Chen, Surya Nepal, Carsten Rudolph, Marthie Grobler, Shangyu Chen | Published: 2020-02-03 | Updated: 2020-04-22 ロバスト性向上手法敵対的サンプル防御手法の効果分析 2020.02.03 2025.04.03 文献データベース
Ensemble Noise Simulation to Handle Uncertainty about Gradient-based Adversarial Attacks Authors: Rehana Mahfuz, Rajeev Sahay, Aly El Gamal | Published: 2020-01-26 敵対的学習敵対的攻撃検出防御手法の効果分析 2020.01.26 2025.04.03 文献データベース
ATHENA: A Framework based on Diverse Weak Defenses for Building Adversarial Defense Authors: Ying Meng, Jianhai Su, Jason O'Kane, Pooyan Jamshidi | Published: 2020-01-02 | Updated: 2020-10-16 敵対的学習透かし評価防御手法の効果分析 2020.01.02 2025.04.03 文献データベース
Benchmarking Adversarial Robustness Authors: Yinpeng Dong, Qi-An Fu, Xiao Yang, Tianyu Pang, Hang Su, Zihao Xiao, Jun Zhu | Published: 2019-12-26 ポイズニング敵対的サンプル防御手法の効果分析 2019.12.26 2025.04.03 文献データベース
Explainability and Adversarial Robustness for RNNs Authors: Alexander Hartl, Maximilian Bachl, Joachim Fabini, Tanja Zseby | Published: 2019-12-20 | Updated: 2020-02-19 攻撃の分類敵対的学習防御手法の効果分析 2019.12.20 2025.04.03 文献データベース
A Survey of Black-Box Adversarial Attacks on Computer Vision Models Authors: Siddhant Bhambri, Sumanyu Muku, Avinash Tulasi, Arun Balaji Buduru | Published: 2019-12-03 | Updated: 2020-02-07 ポイズニング敵対的サンプルの脆弱性防御手法の効果分析 2019.12.03 2025.04.03 文献データベース