AIセキュリティポータルbot

GuardNet: Ensemble Strategies of Shallow Neural Networks for Robust Prompt Injection and Jailbreak Detection

Authors: Paulo Ricardo Ferreira Neves, Edson Rodrigues da Cruz Filho, Paulo Henrique Eleuterio Falsetti, João Vitor Pavan, Ian Degaspari, Henrique Vieira Laturrague, Patrick Vieira Laturrague, Guilherme Nielsen Dias, Marccello Wilson Perez Berto, Gustavo Voltani Von Atzingen | Published: 2026-06-04
Data Collection Method
Prompt Injection
Model Extraction Attack

Agent libOS: A Library-OS-Inspired Runtime for Long-Running, Capability-Controlled LLM Agents

Authors: Yingqi Zhang | Published: 2026-06-02
アクセス制御モデル
System Development
Data Protection

AI Agents Enable Adaptive Computer Worms

Authors: Jonas Guan, Tom Blanchard, Hanna Foerster, Hengrui Jia, Gabriel Huang, Nicolas Papernot | Published: 2026-06-02
Indirect Prompt Injection
Penetration Testing Methods
Dataset for Malware Classification

Testing LLM Arithmetic Reasoning Generalization with Automatic Numeric-Remapping Attacks

Authors: Malia Barker, Bishal Lakha, Edoardo Serra, Francesco Gullo | Published: 2026-06-02
Prompt Injection
Prompt leaking
Robustness Evaluation

Learn from Your Mistakes: Tree-like Self-Play for Secure Code LLMs

Authors: Wenqi Chen, Ziyan Zhang, Bing Wang, Lin Liu, Hengheng Zhang, Zhengsu Chen | Published: 2026-06-02
Safety of Data Generation
Prompt leaking
Risk Assessment

NeuroArmor: Safe-Variant-Guided Representation Consistency for Selective Re-Anchoring in Jailbreak Defense

Authors: Zhongyang Lin, Ziran Zhao, Feifei Zhai, Pengyuan Liu | Published: 2026-06-02
Risk Assessment
Robustness Evaluation
Large Language Model

Selective Token-Level Cryptographic Redaction for Privacy-Preserving Clinical Deployment of Large Language Models

Authors: Farhan Sheth, Ziyuan Yang, Yongying Lan, Si Yong Yeo | Published: 2026-06-02
Privacy-Preserving Algorithm
Privacy-Preserving Machine Learning
Encryption Technology

Operationalizing Cyber Attack Prediction: A Gap-Prioritized Framework with Dataset and Model Selection Guidelines

Authors: Aminu Muhammad Auwal | Published: 2026-06-02
Dataset Integration
Adversarial Example Detection
Interpretability

FLIPS: Instance-Fingerprinting for LLMs via Pseudo-random Sequences

Authors: Gurvan Richardeau, Gohar Dashyan, Erwan Le Merrer, Gilles Tredan | Published: 2026-06-02
Token Identification Method
Prompt Injection
Efficiency Evaluation

The Role of Domain-Specific Features in Malware Detection: A macOS Case Study

Authors: Biagio Montaruli, Andrea Oliveri, Savino Dambra, Davide Balzarotti | Published: 2026-06-02
API利用分析
Dataset evaluation
機械学習によるマルウェア分類