Are AI-Generated Fixes Secure? Analyzing LLM and Agent Patches on SWE-bench Authors: Amirali Sajadi, Kostadin Damevski, Preetha Chatterjee | Published: 2025-06-30 | Updated: 2025-07-24 Software SecurityPrompt InjectionLarge Language Model 2025.06.30 2025.07.26 Literature Database
AI Risk-Management Standards Profile for General-Purpose AI (GPAI) and Foundation Models Authors: Anthony M. Barrett, Jessica Newman, Brandie Nonnecke, Nada Madkour, Dan Hendrycks, Evan R. Murphy, Krystal Jackson, Deepika Raman | Published: 2025-06-30 Model InversionRisk Assessment MethodEducation and Follow-up 2025.06.30 2025.07.02 Literature Database
PBa-LLM: Privacy- and Bias-aware NLP using Named-Entity Recognition (NER) Authors: Gonzalo Mancera, Aythami Morales, Julian Fierrez, Ruben Tolosana, Alejandro Penna, Miguel Lopez-Duran, Francisco Jurado, Alvaro Ortigosa | Published: 2025-06-30 | Updated: 2025-07-09 BiasPerformance EvaluationPrivacy Risk Management 2025.06.30 2025.07.11 Literature Database
RawMal-TF: Raw Malware Dataset Labeled by Type and Family Authors: David Bálik, Martin Jureček, Mark Stamp | Published: 2025-06-30 Backdoor DetectionPrompt InjectionDataset for Malware Classification 2025.06.30 2025.07.02 Literature Database
Breaking Out from the TESSERACT: Reassessing ML-based Malware Detection under Spatio-Temporal Drift Authors: Theo Chow, Mario D'Onghia, Lorenz Linhardt, Zeliang Kan, Daniel Arp, Lorenzo Cavallaro, Fabio Pierazzi | Published: 2025-06-30 BiasDataset for Malware Classification評価メトリクス 2025.06.30 2025.07.02 Literature Database
SoK: Semantic Privacy in Large Language Models Authors: Baihe Ma, Yanna Jiang, Xu Wang, Guangshen Yu, Qin Wang, Caijun Sun, Chen Li, Xuelei Qi, Ying He, Wei Ni, Ren Ping Liu | Published: 2025-06-30 Semantic Information ExtractionPrivacy ProtectionLarge Language Model 2025.06.30 2025.07.02 Literature Database
A Practical and Secure Byzantine Robust Aggregator Authors: De Zhang Lee, Aashish Kolluri, Prateek Saxena, Ee-Chien Chang | Published: 2025-06-29 | Updated: 2025-07-02 Poisoning attack on RAGPoisoning AttackRobust Classification 2025.06.29 2025.07.04 Literature Database
MetaCipher: A Time-Persistent and Universal Multi-Agent Framework for Cipher-Based Jailbreak Attacks for LLMs Authors: Boyuan Chen, Minghao Shao, Abdul Basit, Siddharth Garg, Muhammad Shafique | Published: 2025-06-27 | Updated: 2025-08-13 FrameworkLarge Language Model脱獄攻撃手法 2025.06.27 2025.08.15 Literature Database
ZKPROV: A Zero-Knowledge Approach to Dataset Provenance for Large Language Models Authors: Mina Namazi, Alexander Nemecek, Erman Ayday | Published: 2025-06-26 Privacy ProtectionLarge Language ModelWatermarking Technology 2025.06.26 2025.06.28 Literature Database
Counterfactual Influence as a Distributional Quantity Authors: Matthieu Meeus, Igor Shilov, Georgios Kaissis, Yves-Alexandre de Montjoye | Published: 2025-06-25 Privacy ProtectionPerformance Evaluation Metrics評価メトリクス 2025.06.25 2025.06.27 Literature Database