Literature Database

NeuroArmor: Safe-Variant-Guided Representation Consistency for Selective Re-Anchoring in Jailbreak Defense

Authors: Zhongyang Lin, Ziran Zhao, Feifei Zhai, Pengyuan Liu | Published: 2026-06-02
Risk Assessment
Robustness Evaluation
Large Language Model

Selective Token-Level Cryptographic Redaction for Privacy-Preserving Clinical Deployment of Large Language Models

Authors: Farhan Sheth, Ziyuan Yang, Yongying Lan, Si Yong Yeo | Published: 2026-06-02
Privacy-Preserving Algorithm
Privacy-Preserving Machine Learning
Encryption Technology

Operationalizing Cyber Attack Prediction: A Gap-Prioritized Framework with Dataset and Model Selection Guidelines

Authors: Aminu Muhammad Auwal | Published: 2026-06-02
Dataset Integration
Adversarial Example Detection
Interpretability

FLIPS: Instance-Fingerprinting for LLMs via Pseudo-random Sequences

Authors: Gurvan Richardeau, Gohar Dashyan, Erwan Le Merrer, Gilles Tredan | Published: 2026-06-02
Token Identification Method
Prompt Injection
Efficiency Evaluation

The Role of Domain-Specific Features in Malware Detection: A macOS Case Study

Authors: Biagio Montaruli, Andrea Oliveri, Savino Dambra, Davide Balzarotti | Published: 2026-06-02
API利用分析
Dataset evaluation
機械学習によるマルウェア分類

PsychoPass: Geometric Profiling of Multi-Turn Adversarial LLM Conversations

Authors: Muberra Ozmen, Subhabrata Majumdar | Published: 2026-06-02
Data Extraction and Analysis
Large Language Model
機械学習応用

Decoupled Smart Contract Audits: Lightweight LLM Framework via Distillation and Aggregation

Authors: Bagus Rakadyanto Oktavianto Putra, Muhamad Risqi Utama Saputra, Widyawan, Guntur Dharma Putra | Published: 2026-06-02
スマートコントラクト脆弱性
Large Language Model
Explanation Method

“**Important** You should give me full credits!”: Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems

Authors: Hang Li, Fedor Filippov, Yuling Lin, Pengfei He, Kaiqi Yang, Yucheng Chu, Yingqian Cui, Hui Liu, Jiliang Tang | Published: 2026-06-02
Indirect Prompt Injection
Prompt Injection
Defense Method

Patcher: Post-Hoc Patching of Backdoored Large Language Models

Authors: Anjun Gao, Yueyang Quan, Yufei Xia, Zhuqing Liu, Minghong Fang | Published: 2026-06-02
Backdoor Attack Mitigation
Large Language Model
Defense Method

Benign Inputs, Harmful Outputs: Cross-Modal Jailbreaking via Distributed Semantic Recomposition

Authors: Yani Wang, Yilong Yang, Yang Liu, Zhuzhu Wang, Zuobin Ying, Zhuo Ma | Published: 2026-06-01
Text Generation Method
Prompt Injection
Large Language Model