プロンプティング戦略

Separator Injection Attack: Uncovering Dialogue Biases in Large Language Models Caused by Role Separators

Authors: Xitao Li, Haijun Wang, Jiang Wu, Ting Liu | Published: 2025-04-08

インダイレクトプロンプトインジェクション

プロンプティング戦略

モデル性能評価

2025.04.08

文献データベース

Can LLM Prompting Serve as a Proxy for Static Analysis in Vulnerability Detection

Authors: Ira Ceka, Feitong Qiao, Anik Dey, Aastha Valecha, Gail Kaiser, Baishakhi Ray | Published: 2024-12-16 | Updated: 2025-01-18

LLM性能評価

プロンプティング戦略

プロンプトインジェクション

2024.12.16 2025.04.03

文献データベース

Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct

Authors: Christopher Ackerman, Nina Panickssery | Published: 2024-10-02 | Updated: 2025-01-25

AIによる出力の識別

プロンプティング戦略

自己認識モデル

2024.10.02 2025.04.03

文献データベース

Extracting Memorized Training Data via Decomposition

Authors: Ellen Su, Anu Vellore, Amy Chang, Raffaele Mura, Blaine Nelson, Paul Kassianik, Amin Karbasi | Published: 2024-09-18 | Updated: 2024-10-01

トレーニングデータ抽出手法

プロンプティング戦略

モデル性能評価

2024.09.18 2025.04.03

文献データベース

ADAPT to Robustify Prompt Tuning Vision Transformers

Authors: Masih Eskandar, Tooba Imtiaz, Zifeng Wang, Jennifer Dy | Published: 2024-03-19 | Updated: 2025-02-07

プロンプティング戦略

プロンプトエンジニアリング

敵対的訓練

2024.03.19 2025.04.03

文献データベース

Efficient Toxic Content Detection by Bootstrapping and Distilling Large Language Models

Authors: Jiang Zhang, Qiong Wu, Yiming Xu, Cheng Cao, Zheng Du, Konstantinos Psounis | Published: 2023-12-13

プロンプティング戦略

出力の有害度の算出

大規模言語モデル

2023.12.13 2025.04.03

文献データベース

Harnessing the Power of LLM to Support Binary Taint Analysis

Authors: Puzhuo Liu, Chengnian Sun, Yaowen Zheng, Xuan Feng, Chuan Qin, Yuncheng Wang, Zhenyang Xu, Zhi Li, Peng Di, Yu Jiang, Limin Sun | Published: 2023-10-12 | Updated: 2025-01-09

セキュリティ分析

プロンプティング戦略

動的分析

2023.10.12 2025.04.03

文献データベース

ProPILE: Probing Privacy Leakage in Large Language Models

Authors: Siwon Kim, Sangdoo Yun, Hwaran Lee, Martin Gubri, Sungroh Yoon, Seong Joon Oh | Published: 2023-07-04

データ漏洩

プライバシー侵害

プロンプティング戦略

2023.07.04 2025.04.03

文献データベース

ADEPT: A DEbiasing PrompT Framework

Authors: Ke Yang, Charles Yu, Yi Fung, Manling Li, Heng Ji | Published: 2022-11-10 | Updated: 2022-12-23

AIによる出力のバイアスの検出

プロンプティング戦略

公平性のあるAIモデルの作成

2022.11.10 2025.04.03

文献データベース

Toxicity Detection with Generative Prompt-based Inference

Authors: Yau-Shian Wang, Yingshan Chang | Published: 2022-05-24

プロンプティング戦略

出力の有害度の算出

大規模言語モデル

2022.05.24 2025.04.03

文献データベース