SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism Authors: Beitao Chen, Xinyu Lyu, Lianli Gao, Jingkuan Song, Heng Tao Shen | Published: 2025-07-02 2025.07.02 文献データベース
ICLShield: Exploring and Mitigating In-Context Learning Backdoor Attacks Authors: Zhiyao Ren, Siyuan Liang, Aishan Liu, Dacheng Tao | Published: 2025-07-02 2025.07.02 文献データベース
AI Risk-Management Standards Profile for General-Purpose AI (GPAI) and Foundation Models Authors: Anthony M. Barrett, Jessica Newman, Brandie Nonnecke, Nada Madkour, Dan Hendrycks, Evan R. Murphy, Krystal Jackson, Deepika Raman | Published: 2025-06-30 2025.06.30 文献データベース
RawMal-TF: Raw Malware Dataset Labeled by Type and Family Authors: David Bálik, Martin Jureček, Mark Stamp | Published: 2025-06-30 2025.06.30 文献データベース
Breaking Out from the TESSERACT: Reassessing ML-based Malware Detection under Spatio-Temporal Drift Authors: Theo Chow, Mario D'Onghia, Lorenz Linhardt, Zeliang Kan, Daniel Arp, Lorenzo Cavallaro, Fabio Pierazzi | Published: 2025-06-30 2025.06.30 文献データベース
SoK: Semantic Privacy in Large Language Models Authors: Baihe Ma, Yanna Jiang, Xu Wang, Guangshen Yu, Qin Wang, Caijun Sun, Chen Li, Xuelei Qi, Ying He, Wei Ni, Ren Ping Liu | Published: 2025-06-30 2025.06.30 文献データベース
A Practical and Secure Byzantine Robust Aggregator Authors: De Zhang Lee, Aashish Kolluri, Prateek Saxena, Ee-Chien Chang | Published: 2025-06-29 | Updated: 2025-07-02 2025.06.29 文献データベース
SPA: Towards More Stealth and Persistent Backdoor Attacks in Federated Learning Authors: Chengcheng Zhu, Ye Li, Bosen Rao, Jiale Zhang, Yunlong Mao, Sheng Zhong | Published: 2025-06-26 2025.06.26 文献データベース
ZKPROV: A Zero-Knowledge Approach to Dataset Provenance for Large Language Models Authors: Mina Namazi, Alexander Nemecek, Erman Ayday | Published: 2025-06-26 2025.06.26 文献データベース
Counterfactual Influence as a Distributional Quantity Authors: Matthieu Meeus, Igor Shilov, Georgios Kaissis, Yves-Alexandre de Montjoye | Published: 2025-06-25 2025.06.25 文献データベース