文献データベース

PAL: Proxy-Guided Black-Box Attack on Large Language Models

Authors: Chawin Sitawarin, Norman Mu, David Wagner, Alexandre Araujo | Published: 2024-02-15

LLMセキュリティ

プロンプトインジェクション

攻撃手法

2024.02.15 2025.04.03

文献データベース

Why Does Differential Privacy with Large Epsilon Defend Against Practical Membership Inference Attacks?

Authors: Andrew Lowy, Zhuohang Li, Jing Liu, Toshiaki Koike-Akino, Kieran Parsons, Ye Wang | Published: 2024-02-14

プライバシー保護

プライバシー保護手法

メンバーシップ推論

2024.02.14 2025.04.03

文献データベース

Auditing Private Prediction

Authors: Karan Chadha, Matthew Jagielski, Nicolas Papernot, Christopher Choquette-Choo, Milad Nasr | Published: 2024-02-14

データプライバシー評価

プライバシー保護手法

メンバーシップ推論

2024.02.14 2025.04.03

文献データベース

Copyright Traps for Large Language Models

Authors: Matthieu Meeus, Igor Shilov, Manuel Faysse, Yves-Alexandre de Montjoye | Published: 2024-02-14 | Updated: 2024-06-04

トラップシーケンス生成

プロンプトインジェクション

著作権トラップ

2024.02.14 2025.04.03

文献データベース

FedSiKD: Clients Similarity and Knowledge Distillation: Addressing Non-i.i.d. and Constraints in Federated Learning

Authors: Yousef Alsenani, Rahul Mishra, Khaled R. Ahmed, Atta Ur Rahman | Published: 2024-02-14

クライアントクラスタリング

クラスタリング手法

連合学習

2024.02.14 2025.04.03

文献データベース

I can’t see it but I can Fine-tune it: On Encrypted Fine-tuning of Transformers using Fully Homomorphic Encryption

Authors: Prajwal Panzade, Daniel Takabi, Zhipeng Cai | Published: 2024-02-14

ウォーターマーキング

プライバシー保護

プライバシー保護手法

2024.02.14 2025.04.03

文献データベース

Detecting Adversarial Spectrum Attacks via Distance to Decision Boundary Statistics

Authors: Wenwei Zhao, Xiaowen Li, Shangqing Zhao, Jie Xu, Yao Liu, Zhuo Lu | Published: 2024-02-14

敵対的サンプル

敵対的スペクトル攻撃検出

敵対的攻撃検出

2024.02.14 2025.04.03

文献データベース

Test-Time Backdoor Attacks on Multimodal Large Language Models

Authors: Dong Lu, Tianyu Pang, Chao Du, Qian Liu, Xianjun Yang, Min Lin | Published: 2024-02-13

バックドア攻撃

モデル性能評価

攻撃手法

2024.02.13 2025.04.03

文献データベース

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Authors: Xiangming Gu, Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Ye Wang, Jing Jiang, Min Lin | Published: 2024-02-13 | Updated: 2024-06-03

LLMセキュリティ

プロンプトインジェクション

敵対的攻撃検出

2024.02.13 2025.04.03

文献データベース

ROSpace: Intrusion Detection Dataset for a ROS2-Based Cyber-Physical System

Authors: Tommaso Puccetti, Simone Nardi, Cosimo Cinquilli, Tommaso Zoppi, Andrea Ceccarelli | Published: 2024-02-13

サイバーセキュリティ

データ収集

侵入検知システム

2024.02.13 2025.04.03

文献データベース