On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds Authors: Matteo Vilucchio, Nikolaos Tsilivis, Bruno Loureiro, Julia Kempe | Published: 2024-10-21 2024.10.21 2025.05.27 Literature Database
Jailbreaking and Mitigation of Vulnerabilities in Large Language Models Authors: Benji Peng, Keyu Chen, Qian Niu, Ziqian Bi, Ming Liu, Pohsun Feng, Tianyang Wang, Lawrence K. Q. Yan, Yizhu Wen, Yichao Zhang, Caitlyn Heqi Yin | Published: 2024-10-20 | Updated: 2025-05-08 2024.10.20 2025.05.27 Literature Database
A Novel Reinforcement Learning Model for Post-Incident Malware Investigations Authors: Dipo Dunsin, Mohamed Chahine Ghanem, Karim Ouazzane, Vassil Vassilev | Published: 2024-10-19 | Updated: 2025-01-12 2024.10.19 2025.05.27 Literature Database
Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation Authors: Shuai Zhao, Xiaobao Wu, Cong-Duy Nguyen, Yanhao Jia, Meihuizi Jia, Yichao Feng, Luu Anh Tuan | Published: 2024-10-18 | Updated: 2025-05-20 2024.10.18 2025.05.28 Literature Database
Low-Rank Adversarial PGD Attack Authors: Dayana Savostianova, Emanuele Zangrando, Francesco Tudisco | Published: 2024-10-16 2024.10.16 2025.05.27 Literature Database
Deep Learning Based XIoT Malware Analysis: A Comprehensive Survey, Taxonomy, and Research Challenges Authors: Rami Darwish, Mahmoud Abdelsalam, Sajad Khorsandroo | Published: 2024-10-14 2024.10.14 2025.05.27 Literature Database
Denial-of-Service Poisoning Attacks against Large Language Models Authors: Kuofeng Gao, Tianyu Pang, Chao Du, Yong Yang, Shu-Tao Xia, Min Lin | Published: 2024-10-14 2024.10.14 2025.05.27 Literature Database
Adversarially Robust Out-of-Distribution Detection Using Lyapunov-Stabilized Embeddings Authors: Hossein Mirzaei, Mackenzie W. Mathis | Published: 2024-10-14 | Updated: 2025-01-26 2024.10.14 2025.05.27 Literature Database
Towards Calibrated Losses for Adversarial Robust Reject Option Classification Authors: Vrund Shah, Tejas Chaudhari, Naresh Manwani | Published: 2024-10-14 2024.10.14 2025.05.27 Literature Database
Regularized Robustly Reliable Learners and Instance Targeted Attacks Authors: Avrim Blum, Donya Saless | Published: 2024-10-14 | Updated: 2025-05-08 2024.10.14 2025.05.27 Literature Database