Blind PRNG Hijacking: An Undetectable Integrity-Preserving Attack Against LLM Watermarking Authors: Ziyang You, Huilong He, Xiaoke Yang, Xuxing Lu | Published: 2026-05-27 2026.05.27 文献データベース
Towards Cybersecurity SuperIntelligence (CSI): What’s the best harness for cybersecurity? Authors: Víctor Mayoral-Vilches, Francesco Balassone, María Sanz-Gómez, Paul Zabalegui Landa, Daniel Sánchez Prieto, Marina Oteiza Álvarez, Davide Quarta, Martin Pinzger | Published: 2026-05-27 2026.05.27 文献データベース
SPARD: Defending Harmful Fine-Tuning Attack via Safety Projection with Relevance-Diversity Data Selection Authors: Shuhao Chen, Weisen Jiang, Yeqi Gong, Shengda Luo, Chengxiang Zhuo, Zang Li, James T. Kwok, Yu Zhang | Published: 2026-05-27 2026.05.27 文献データベース
MRMMIA: Membership Inference Attacks on Memory in Chat Agents Authors: Kai Chen, Yan Pang, Tianhao Wang | Published: 2026-05-27 2026.05.27 文献データベース
Disentangling Adversarial Prompts: A Semantic-Graph Defense for Robust LLM Security Authors: Xiang Fang, Wanlong Fang | Published: 2026-05-27 2026.05.27 文献データベース
Revisiting ML Training under Fully Homomorphic Encryption: Convergence Guarantees, Differential Privacy, and Efficient Algorithms Authors: Yvonne Zhou, Mingyu Liang, Ivan Brugere, Danial Dervovic, Yue Guo, Antigoni Polychroniadou, Min Wu, Dana Dachman-Soled | Published: 2026-05-27 2026.05.27 文献データベース
Detectability in Diversity: Improved Canary Crafting for Privacy Auditing in One Run Authors: Mathieu Dagréou, Aurélien Bellet | Published: 2026-05-26 2026.05.26 文献データベース
Privacy-Preserving Screening for Record Linkage Authors: Chenyu Huang, Fan Zhang, Huangxun Chen, Yongjun Zhao, Huaming Rao, Peng Chen, Danqing Huang | Published: 2026-05-26 2026.05.26 文献データベース
Cordyceps: Covert Control Attacks on LLMs via Data Poisoning Authors: Zedian Shao, Charles Fleming, Teodora Baluta | Published: 2026-05-26 2026.05.26 文献データベース
GradSentry: Gradient Spectral Entropy for Backdoor Sample Filtering in Large Language Model Fine-Tuning Authors: Haodong Zhao, Tianyi Xu, Tianhang Zhao, Zhuosheng Zhang, Gongshen Liu | Published: 2026-05-26 2026.05.26 文献データベース