AIによる出力の識別

Adaptive and Robust Cost-Aware Proof of Quality for Decentralized LLM Inference Networks

Authors: Arther Tian, Alex Ding, Frank Chen, Simon Wu, Aaron Chan | Published: 2026-01-29

AIによる出力の識別

インセンティブメカニズム

敵対的学習

2026.01.29

文献データベース

Machine Unlearning Meets Adversarial Robustness via Constrained Interventions on LLMs

Authors: Fatmazohra Rezkellah, Ramzi Dakhmouche | Published: 2025-10-03 | Updated: 2025-10-15

AIによる出力の識別

ロバスト性

大規模言語モデル

2025.10.03

文献データベース

Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs

Authors: Will Cai, Tianneng Shi, Xuandong Zhao, Dawn Song | Published: 2025-04-07

AIによる出力の識別

APIセキュリティ

モデル性能評価

2025.04.07

文献データベース

Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct

Authors: Christopher Ackerman, Nina Panickssery | Published: 2024-10-02 | Updated: 2025-01-25

AIによる出力の識別

プロンプティング戦略

自己認識モデル

2024.10.02 2025.04.03

文献データベース

DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature

Authors: Eric Mitchell, Yoonho Lee, Alexander Khazatsky, Christopher D. Manning, Chelsea Finn | Published: 2023-01-26 | Updated: 2023-07-23

AIによる出力の識別

テキストの摂動手法

深層学習手法

2023.01.26 2025.04.03

文献データベース

Automatic Detection of Generated Text is Easiest when Humans are Fooled

Authors: Daphne Ippolito, Daniel Duckworth, Chris Callison-Burch, Douglas Eck | Published: 2019-11-02 | Updated: 2020-05-07

AIによる出力の識別

テキストの摂動手法

深層学習手法

2019.11.02 2025.04.03

文献データベース

Real or Fake? Learning to Discriminate Machine from Human Generated Text

Authors: Anton Bakhtin, Sam Gross, Myle Ott, Yuntian Deng, Marc'Aurelio Ranzato, Arthur Szlam | Published: 2019-06-07 | Updated: 2019-11-25

AIによる出力の識別

エネルギーベースモデル

深層学習手法

2019.06.07 2025.04.03

文献データベース

Defending Against Neural Fake News

Authors: Rowan Zellers, Ari Holtzman, Hannah Rashkin, Yonatan Bisk, Ali Farhadi, Franziska Roesner, Yejin Choi | Published: 2019-05-29 | Updated: 2020-12-11

AIによる出力の識別

サイバー脅威

深層学習手法

2019.05.29 2025.04.03

文献データベース

An Adversarial Approach for Explainable AI in Intrusion Detection Systems

Authors: Daniel L. Marino, Chathurika S. Wickramasinghe, Milos Manic | Published: 2018-11-28

AIによる出力の識別

モデル性能評価

敵対的サンプル

2018.11.28 2025.04.03

文献データベース