文献データベース

A Comprehensive Survey of Attack Techniques, Implementation, and Mitigation Strategies in Large Language Models

Authors: Aysan Esmradi, Daniel Wankit Yip, Chun Fai Chan | Published: 2023-12-18
サイバー攻撃
プロンプトインジェクション
攻撃手法

JailGuard: A Universal Detection Framework for LLM Prompt-based Attacks

Authors: Xiaoyu Zhang, Cen Zhang, Tianlin Li, Yihao Huang, Xiaojun Jia, Ming Hu, Jie Zhang, Yang Liu, Shiqing Ma, Chao Shen | Published: 2023-12-17 | Updated: 2025-03-15
テキストの摂動手法
プロンプトインジェクション
攻撃手法

Android Malware Detection with Unbiased Confidence Guarantees

Authors: Harris Papadopoulos, Nestoras Georgiou, Charalambos Eliades, Andreas Konstantinidis | Published: 2023-12-17
アルゴリズム
ウォーターマーキング
クラス不均衡

SAME: Sample Reconstruction against Model Extraction Attacks

Authors: Yi Xie, Jie Zhang, Shiqian Zhao, Tianwei Zhang, Xiaofeng Chen | Published: 2023-12-17 | Updated: 2024-01-08
ウォーターマーキング
モデル性能評価
モデル抽出攻撃

Rethinking Robustness of Model Attributions

Authors: Sandesh Kamath, Sankalp Mittal, Amit Deshpande, Vineeth N Balasubramanian | Published: 2023-12-16
ロバスト性評価
透かしの耐久性
透かし評価

Towards Reliable Participation in UAV-Enabled Federated Edge Learning on Non-IID Data

Authors: Youssra Cheriguene, Wael Jaafar, Halim Yanikomeroglu, Chaker Abdelaziz Kerrache | Published: 2023-12-16
参加者選択手法
攻撃手法
連合学習

Silent Guardian: Protecting Text from Malicious Exploitation by Large Language Models

Authors: Jiawei Zhao, Kejiang Chen, Xiaojian Yuan, Yuang Qi, Weiming Zhang, Nenghai Yu | Published: 2023-12-15 | Updated: 2024-10-10
プライバシー保護手法
プロンプトインジェクション
透かし評価

What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection

Authors: Xiaohui Zhang, Jiangyan Yi, Chenglong Wang, Chuyuan Zhang, Siding Zeng, Jianhua Tao | Published: 2023-12-15
ウォーターマーキング
深層偽音声評価
音声合成技術

Unsupervised and Supervised learning by Dense Associative Memory under replica symmetry breaking

Authors: Linda Albanese, Andrea Alessandrelli, Alessia Annibale, Adriano Barra | Published: 2023-12-15
収束特性
透かしの耐久性
透かし評価

Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models

Authors: Xin Jin, Jonathan Larson, Weiwei Yang, Zhiqiang Lin | Published: 2023-12-15
LLM性能評価
プログラム解析
プロンプトインジェクション