DefenSee: Dissecting Threat from Sight and Text – A Multi-View Defensive Pipeline for Multi-modal Jailbreaks Authors: Zihao Wang, Kar Wai Fok, Vrizlynn L. L. Thing | Published: 2025-12-01 Prompt InjectionModel DoSRobustness Improvement Method 2025.12.01 2025.12.03 Literature Database
On the Feasibility of Hijacking MLLMs’ Decision Chain via One Perturbation Authors: Changyue Li, Jiaying Li, Youliang Yuan, Jiaming He, Zhicong Huang, Pinjia He | Published: 2025-11-25 Robustness Improvement MethodImage ProcessingAdaptive Adversarial Training 2025.11.25 2025.11.27 Literature Database
Q-MLLM: Vector Quantization for Robust Multimodal Large Language Model Security Authors: Wei Zhao, Zhe Li, Yige Li, Jun Sun | Published: 2025-11-20 Prompt leakingRobustness Improvement MethodDigital Watermarking for Generative AI 2025.11.20 2025.11.22 Literature Database
FlowPure: Continuous Normalizing Flows for Adversarial Purification Authors: Elias Collaert, Abel Rodríguez, Sander Joos, Lieven Desmet, Vera Rimmer | Published: 2025-05-19 Robustness Improvement MethodAdversarial LearningEffectiveness Analysis of Defense Methods 2025.05.19 2025.05.28 Literature Database
Addressing Neural Network Robustness with Mixup and Targeted Labeling Adversarial Training Authors: Alfred Laugros, Alice Caplier, Matthieu Ospici | Published: 2020-08-19 Robustness Improvement MethodAdversarial ExampleVulnerability of Adversarial Examples 2020.08.19 2025.05.28 Literature Database
Provably robust deep generative models Authors: Filipe Condessa, Zico Kolter | Published: 2020-04-22 Robustness Improvement MethodAdversarial attackDeep Learning Method 2020.04.22 2025.05.28 Literature Database
Certifying Joint Adversarial Robustness for Model Ensembles Authors: Mainuddin Ahmad Jonas, David Evans | Published: 2020-04-21 Model EnsembleRobustness Improvement MethodAdversarial Example 2020.04.21 2025.05.28 Literature Database
Luring of transferable adversarial perturbations in the black-box paradigm Authors: Rémi Bernhard, Pierre-Alain Moellic, Jean-Max Dutertre | Published: 2020-04-10 | Updated: 2021-03-03 Robustness Improvement MethodAttack EvaluationAdversarial Example 2020.04.10 2025.05.28 Literature Database
Adversarial Robustness for Code Authors: Pavol Bielik, Martin Vechev | Published: 2020-02-11 | Updated: 2020-08-15 PoisoningRobustness Improvement MethodAdversarial Training 2020.02.11 2025.05.28 Literature Database
Robustness of Bayesian Neural Networks to Gradient-Based Attacks Authors: Ginevra Carbone, Matthew Wicker, Luca Laurenti, Andrea Patane, Luca Bortolussi, Guido Sanguinetti | Published: 2020-02-11 | Updated: 2020-06-24 Robustness EvaluationRobustness Improvement MethodAdversarial attack 2020.02.11 2025.05.28 Literature Database