Ollabench: Evaluating LLMs’ Reasoning for Human-centric Interdependent Cybersecurity Authors: Tam n. Nguyen | Published: 2024-06-11 2024.06.11 2025.04.03 文献データベース
A Survey of Recent Backdoor Attacks and Defenses in Large Language Models Authors: Shuai Zhao, Meihuizi Jia, Zhongliang Guo, Leilei Gan, Xiaoyu Xu, Xiaobao Wu, Jie Fu, Yichao Feng, Fengjun Pan, Luu Anh Tuan | Published: 2024-06-10 | Updated: 2025-01-04 2024.06.10 2025.04.03 文献データベース
An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection Authors: Shenao Yan, Shen Wang, Yue Duan, Hanbin Hong, Kiho Lee, Doowon Kim, Yuan Hong | Published: 2024-06-10 2024.06.10 2025.04.03 文献データベース
Robust Distribution Learning with Local and Global Adversarial Corruptions Authors: Sloan Nietert, Ziv Goldfeld, Soroosh Shafiee | Published: 2024-06-10 | Updated: 2024-06-24 2024.06.10 2025.04.03 文献データベース
LLM Dataset Inference: Did you train on my dataset? Authors: Pratyush Maini, Hengrui Jia, Nicolas Papernot, Adam Dziedzic | Published: 2024-06-10 2024.06.10 2025.04.03 文献データベース
SecureNet: A Comparative Study of DeBERTa and Large Language Models for Phishing Detection Authors: Sakshi Mahendru, Tejul Pandit | Published: 2024-06-10 2024.06.10 2025.04.03 文献データベース
Siren — Advancing Cybersecurity through Deception and Adaptive Analysis Authors: Girish Kulathumani, Samruth Ananthanarayanan, Ganesh Narayanan | Published: 2024-06-10 2024.06.10 2025.04.03 文献データベース
Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning Authors: Xiaoting Lyu, Yufei Han, Wei Wang, Jingkai Liu, Yongsheng Zhu, Guangquan Xu, Jiqiang Liu, Xiangliang Zhang | Published: 2024-06-10 2024.06.10 2025.04.03 文献データベース
A Survey on Machine Unlearning: Techniques and New Emerged Privacy Risks Authors: Hengzhu Liu, Ping Xiong, Tianqing Zhu, Philip S. Yu | Published: 2024-06-10 2024.06.10 2025.04.03 文献データベース
Safety Alignment Should Be Made More Than Just a Few Tokens Deep Authors: Xiangyu Qi, Ashwinee Panda, Kaifeng Lyu, Xiao Ma, Subhrajit Roy, Ahmad Beirami, Prateek Mittal, Peter Henderson | Published: 2024-06-10 2024.06.10 2025.04.03 文献データベース