Detecting Training Data of Large Language Models via Expectation Maximization Authors: Gyuwan Kim, Yang Li, Evangelia Spiliopoulou, Jie Ma, Miguel Ballesteros, William Yang Wang | Published: 2024-10-10 2024.10.10 2025.04.03 文献データベース
RealVul: Can We Detect Vulnerabilities in Web Applications with LLM? Authors: Di Cao, Yong Liao, Xiuwei Shang | Published: 2024-10-10 2024.10.10 2025.04.03 文献データベース
Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy Authors: Tong Wu, Shujian Zhang, Kaiqiang Song, Silei Xu, Sanqiang Zhao, Ravi Agrawal, Sathish Reddy Indurthi, Chong Xiang, Prateek Mittal, Wenxuan Zhou | Published: 2024-10-09 2024.10.09 2025.04.03 文献データベース
Data Taggants: Dataset Ownership Verification via Harmless Targeted Data Poisoning Authors: Wassim Bouaziz, El-Mahdi El-Mhamdi, Nicolas Usunier | Published: 2024-10-09 2024.10.09 2025.04.03 文献データベース
Diffuse or Confuse: A Diffusion Deepfake Speech Dataset Authors: Anton Firc, Kamil Malinka, Petr Hanáček | Published: 2024-10-09 2024.10.09 2025.04.03 文献データベース
Prompt Infection: LLM-to-LLM Prompt Injection within Multi-Agent Systems Authors: Donghyun Lee, Mo Tiwari | Published: 2024-10-09 2024.10.09 2025.04.03 文献データベース
FreqMark: Frequency-Based Watermark for Sentence-Level Detection of LLM-Generated Text Authors: Zhenyu Xu, Kun Zhang, Victor S. Sheng | Published: 2024-10-09 2024.10.09 2025.04.03 文献データベース
Signal Watermark on Large Language Models Authors: Zhenyu Xu, Victor S. Sheng | Published: 2024-10-09 2024.10.09 2025.04.03 文献データベース
Hallucinating AI Hijacking Attack: Large Language Models and Malicious Code Recommenders Authors: David Noever, Forrest McKee | Published: 2024-10-09 2024.10.09 2025.04.03 文献データベース
Near Exact Privacy Amplification for Matrix Mechanisms Authors: Christopher A. Choquette-Choo, Arun Ganesh, Saminul Haque, Thomas Steinke, Abhradeep Thakurta | Published: 2024-10-08 | Updated: 2025-03-20 2024.10.08 2025.04.03 文献データベース