文献データベース

RESIST: Resilient Decentralized Learning Using Consensus Gradient Descent

Authors: Cheng Fang, Rishabh Dixit, Waheed U. Bajwa, Mert Gurbuzbalaban | Published: 2025-02-11

MITM攻撃

収束分析

2025.02.11 2025.04.03

文献データベース

Trustworthy AI: Safety, Bias, and Privacy — A Survey

Authors: Xingli Fang, Jianwei Li, Varun Mulchandani, Jung-Eun Kim | Published: 2025-02-11 | Updated: 2025-06-11

バイアス

プロンプトリーキング

差分プライバシー

2025.02.11

文献データベース

Scalable and Ethical Insider Threat Detection through Data Synthesis and Analysis by LLMs

Authors: Haywood Gelman, John D. Hastings | Published: 2025-02-10 | Updated: 2025-04-07

LLMの応用

リスク分析手法

情報セキュリティ

2025.02.10

文献データベース

Membership Inference Risks in Quantized Models: A Theoretical and Empirical Study

Authors: Eric Aubinais, Philippe Formont, Pablo Piantanida, Elisabeth Gassiat | Published: 2025-02-10

メンバーシップ推論

量子化とプライバシー

2025.02.10 2025.04.03

文献データベース

Generating Privacy-Preserving Personalized Advice with Zero-Knowledge Proofs and LLMs

Authors: Hiroki Watanabe, Motonobu Uchikoshi | Published: 2025-02-10 | Updated: 2025-04-24

アライメント

プライバシー保護データマイニング

透かし

2025.02.10

文献データベース

From Counterfactuals to Trees: Competitive Analysis of Model Extraction Attacks

Authors: Awa Khouna, Julien Ferry, Thibaut Vidal | Published: 2025-02-07 | Updated: 2025-07-08

モデル抽出攻撃

モデル抽出攻撃の検知

再構成アルゴリズム

2025.02.07

文献データベース

Training Set Reconstruction from Differentially Private Forests: How Effective is DP?

Authors: Alice Gorgé, Julien Ferry, Sébastien Gambs, Thibaut Vidal | Published: 2025-02-07 | Updated: 2025-07-08

プライバシーリスク管理

再構成アルゴリズム

差分プライバシー

2025.02.07

文献データベース

Can LLMs Hack Enterprise Networks? Autonomous Assumed Breach Penetration-Testing Active Directory Networks

Authors: Andreas Happe, Jürgen Cito | Published: 2025-02-06 | Updated: 2025-09-11

インダイレクトプロンプトインジェクション

プロンプトインジェクション

攻撃戦略分析

2025.02.06

文献データベース

“Short-length” Adversarial Training Helps LLMs Defend “Long-length” Jailbreak Attacks: Theoretical and Empirical Evidence

Authors: Shaopeng Fu, Liang Ding, Di Wang | Published: 2025-02-06

プロンプトインジェクション

大規模言語モデル

敵対的訓練

2025.02.06 2025.04.03

文献データベース

ExpProof : Operationalizing Explanations for Confidential Models with ZKPs

Authors: Chhavi Yadav, Evan Monroe Laufer, Dan Boneh, Kamalika Chaudhuri | Published: 2025-02-06 | Updated: 2025-05-27

XAI（説明可能なAI）

モデル評価手法

解釈可能性

2025.02.06

文献データベース