文献データベース

Safety Targeted Embedding Exploit via Refinement

Authors: Joshua Adrian Cahyono | Published: 2026-07-02
プロンプトインジェクション
プロンプトリーキング
脆弱性管理

Safety Testing LLM Agents at Scale: From Risk Discovery to Evidence-Grounded Verification

Authors: Yunhao Feng, Ruixiao Lin, Ming Wen, Qinqin He, Yanming Guo, Yifan Ding, Yutao Wu, Jialuo Chen, Yunhao Chen, Xiaohu Du, Jianan Ma, Zixing Chen, Zhuoer Xu, Xingjun Ma, Xinhao Deng | Published: 2026-07-02
エージェント操作手法
脅威モデル
脆弱性管理

DRL-CLBA: A Clean Label Backdoor Attack for Speech Classification via DDPG Reinforcement Learning

Authors: Yueming Huang, Wenhan Yao, Fen Xiao, Xiarun Chen, Weiping Wen | Published: 2026-07-02
バックドア攻撃
バックドア攻撃用の毒データの検知
モデル性能評価

Pmeta-TLA: Backdoor Attacks for Speech Classification Models via Meta-Learning with Timbre Leakage Attack

Authors: Yueming Huang, Wenhan Yao, Fen Xiao, Xiarun Chen, Weiping Wen | Published: 2026-07-02
バックドア攻撃
バックドア攻撃用の毒データの検知
メタ学習

Beyond Gradient-Based Attacks: Adversarial Robustness and Explainability Stability in Cybersecurity Classifiers

Authors: Mona Rajhans, Vishal Khawarey | Published: 2026-07-02
モデルの頑健性保証
敵対的摂動手法
脆弱性評価

Forensic-Oriented Intrusion Detection Using Synthetic Network Traffic Data and Explainable Artificial Intelligence

Authors: Jose Luis Vela Alonso, Carmen Pellicer | Published: 2026-07-01
XAIの応用
データセット分析
データ流分析

HARC: Coupling Harmfulness and Refusal Directions for Robust Safety Alignment

Authors: Shei Pern Chua, Fangzhao Wu | Published: 2026-07-01
アライメント
脆弱性評価
脆弱性評価手法

Cross-Domain Generalization Failure in Lightweight Intrusion Detection Models for IIoT Networks

Authors: MD Azizul Hakim, Md Shihab Uddin, Talha Ibne Anis | Published: 2026-07-01
クロスドメイン評価
解釈可能性
評価手法

Beyond the Prompt: Jailbreaking Function-Calling LLMs via Simulated Moderation Traces

Authors: Junlong Liu, Haobo Wang, Weiqi Luo, Xiaojun Jia | Published: 2026-07-01
マルチラウンド対話
大規模言語モデル
脱獄攻撃手法

Predicting Lethal Outcome (Cause) And Understanding Key Biomarkers Linked With Acute Myocardial Infarction Using Deep Artificial Neural Network And Ensemble Of Machine Learning Methodologies

Authors: Sagnik Ghosh | Published: 2026-07-01
データセット分析
バイオマーカー分析
心疾患予測