AIセキュリティポータルbot

Are AI-Generated Fixes Secure? Analyzing LLM and Agent Patches on SWE-bench

Authors: Amirali Sajadi, Kostadin Damevski, Preetha Chatterjee | Published: 2025-06-30 | Updated: 2025-07-24
Software Security
Prompt Injection
Large Language Model

AI Risk-Management Standards Profile for General-Purpose AI (GPAI) and Foundation Models

Authors: Anthony M. Barrett, Jessica Newman, Brandie Nonnecke, Nada Madkour, Dan Hendrycks, Evan R. Murphy, Krystal Jackson, Deepika Raman | Published: 2025-06-30
Model Inversion
Risk Assessment Method
Education and Follow-up

PBa-LLM: Privacy- and Bias-aware NLP using Named-Entity Recognition (NER)

Authors: Gonzalo Mancera, Aythami Morales, Julian Fierrez, Ruben Tolosana, Alejandro Penna, Miguel Lopez-Duran, Francisco Jurado, Alvaro Ortigosa | Published: 2025-06-30 | Updated: 2025-07-09
Bias
Performance Evaluation
Privacy Risk Management

RawMal-TF: Raw Malware Dataset Labeled by Type and Family

Authors: David Bálik, Martin Jureček, Mark Stamp | Published: 2025-06-30
Backdoor Detection
Prompt Injection
Dataset for Malware Classification

Breaking Out from the TESSERACT: Reassessing ML-based Malware Detection under Spatio-Temporal Drift

Authors: Theo Chow, Mario D'Onghia, Lorenz Linhardt, Zeliang Kan, Daniel Arp, Lorenzo Cavallaro, Fabio Pierazzi | Published: 2025-06-30
Bias
Dataset for Malware Classification
評価メトリクス

SoK: Semantic Privacy in Large Language Models

Authors: Baihe Ma, Yanna Jiang, Xu Wang, Guangshen Yu, Qin Wang, Caijun Sun, Chen Li, Xuelei Qi, Ying He, Wei Ni, Ren Ping Liu | Published: 2025-06-30
Semantic Information Extraction
Privacy Protection
Large Language Model

A Practical and Secure Byzantine Robust Aggregator

Authors: De Zhang Lee, Aashish Kolluri, Prateek Saxena, Ee-Chien Chang | Published: 2025-06-29 | Updated: 2025-07-02
Poisoning attack on RAG
Poisoning Attack
Robust Classification

MetaCipher: A Time-Persistent and Universal Multi-Agent Framework for Cipher-Based Jailbreak Attacks for LLMs

Authors: Boyuan Chen, Minghao Shao, Abdul Basit, Siddharth Garg, Muhammad Shafique | Published: 2025-06-27 | Updated: 2025-08-13
Framework
Large Language Model
脱獄攻撃手法

ZKPROV: A Zero-Knowledge Approach to Dataset Provenance for Large Language Models

Authors: Mina Namazi, Alexander Nemecek, Erman Ayday | Published: 2025-06-26
Privacy Protection
Large Language Model
Watermarking Technology

Counterfactual Influence as a Distributional Quantity

Authors: Matthieu Meeus, Igor Shilov, Georgios Kaissis, Yves-Alexandre de Montjoye | Published: 2025-06-25
Privacy Protection
Performance Evaluation Metrics
評価メトリクス