Literature Database

bi-GRPO: Bidirectional Optimization for Jailbreak Backdoor Injection on LLMs

Authors: Wence Ji, Jiancan Wu, Aiying Li, Shuyi Zhang, Junkang Wu, An Zhang, Xiang Wang, Xiangnan He | Published: 2025-09-24
Disabling Safety Mechanisms of LLM
Prompt Injection
Generative Model

Unmasking Fake Careers: Detecting Machine-Generated Career Trajectories via Multi-layer Heterogeneous Graphs

Authors: Michiharu Yamashita, Thanh Tran, Delvin Ce Zhang, Dongwon Lee | Published: 2025-09-24
キャリアデータ生成
構造的パターン検出
Generative Model Characteristics

Defending against Stegomalware in Deep Neural Networks with Permutation Symmetry

Authors: Birk Torpmann-Hagen, Michael A. Riegler, Pål Halvorsen, Dag Johansen | Published: 2025-09-23 | Updated: 2025-10-15
Security Analysis Method
Certified Robustness
Information Hiding Techniques

LLMs as verification oracles for Solidity

Authors: Massimo Bartoletti, Enrico Lipparini, Livio Pompianu | Published: 2025-09-23
Prompt Injection
Model DoS
Vulnerability Assessment Method

LLM-based Vulnerability Discovery through the Lens of Code Metrics

Authors: Felix Weissberg, Lukas Pirch, Erik Imgrund, Jonas Möller, Thorsten Eisenhofer, Konrad Rieck | Published: 2025-09-23
コードメトリクス評価
Prompt Injection
Large Language Model

Confidential LLM Inference: Performance and Cost Across CPU and GPU TEEs

Authors: Marcin Chrapek, Marcin Copik, Etienne Mettaz, Torsten Hoefler | Published: 2025-09-23
Cost Efficiency
Privacy-Preserving Machine Learning
Model Extraction Attack

Detection of security smells in IaC scripts through semantics-aware code and language processing

Authors: Aicha War, Adnan A. Rawass, Abdoul K. Kabore, Jordan Samhi, Jacques Klein, Tegawende F. Bissyande | Published: 2025-09-23
コード表現技術
Security Analysis
Prompt validation

MER-Inspector: Assessing model extraction risks from an attack-agnostic perspective

Authors: Xinwei Zhang, Haibo Hu, Qingqing Ye, Li Bai, Huadi Zheng | Published: 2025-09-23
Model Extraction Attack
Model Design
Risk Assessment

Coherence-driven inference for cybersecurity

Authors: Steve Huntsman | Published: 2025-09-23
Algorithm
Security Analysis
Information Gathering Methods

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM

Authors: Alexander Panfilov, Evgenii Kortukov, Kristina Nikolić, Matthias Bethge, Sebastian Lapuschkin, Wojciech Samek, Ameya Prabhu, Maksym Andriushchenko, Jonas Geiping | Published: 2025-09-22
Hallucination
武器設計手法
Fraud Techniques