透かし評価

Auditing Differential Privacy Guarantees Using Density Estimation

Authors: Antti Koskela, Jafar Mohammadi | Published: 2024-06-07 | Updated: 2024-10-11

プライバシー保護手法

評価手法

透かし評価

2024.06.07 2025.04.03

文献データベース

Towards Certification of Uncertainty Calibration under Adversarial Attacks

Authors: Cornelius Emde, Francesco Pinto, Thomas Lukasiewicz, Philip H. S. Torr, Adel Bibi | Published: 2024-05-22

評価手法

透かし評価

難易度キャリブレーション

2024.05.22 2025.04.03

文献データベース

Naturally Private Recommendations with Determinantal Point Processes

Authors: Jack Fitzsimons, Agustín Freitas Pasqualini, Robert Pisarczyk, Dmitrii Usynin | Published: 2024-05-22

ウォーターマーキング

プライバシー保護手法

透かし評価

2024.05.22 2025.04.03

文献データベース

WaterPool: A Watermark Mitigating Trade-offs among Imperceptibility, Efficacy and Robustness

Authors: Baizhou Huang, Xiaojun Wan | Published: 2024-05-22

ウォーターマーキング

透かしの耐久性

透かし評価

2024.05.22 2025.04.03

文献データベース

Fully Exploiting Every Real Sample: SuperPixel Sample Gradient Model Stealing

Authors: Yunlong Zhao, Xiaoheng Deng, Yijing Liu, Xinjun Pei, Jiazhi Xia, Wei Chen | Published: 2024-05-18

モデル性能評価

評価手法

透かし評価

2024.05.18 2025.04.03

文献データベース

Towards Next-Generation Steganalysis: LLMs Unleash the Power of Detecting Steganography

Authors: Minhao Bai. Jinshuai Yang, Kaiyi Pang, Huili Wang, Yongfeng Huang | Published: 2024-05-15

LLM性能評価

ドメイン非依存性

透かし評価

2024.05.15 2025.04.03

文献データベース

Model Reconstruction Using Counterfactual Explanations: A Perspective From Polytope Theory

Authors: Pasan Dissanayake, Sanghamitra Dutta | Published: 2024-05-08 | Updated: 2024-11-05

モデル性能評価

モデル抽出攻撃

透かし評価

2024.05.08 2025.04.03

文献データベース

ModelShield: Adaptive and Robust Watermark against Model Extraction Attack

Authors: Kaiyi Pang, Tao Qi, Chuhan Wu, Minhao Bai, Minghu Jiang, Yongfeng Huang | Published: 2024-05-03 | Updated: 2025-01-12

ウォーターマーキング

プロンプトインジェクション

透かし評価

2024.05.03 2025.04.03

文献データベース

Why You Should Not Trust Interpretations in Machine Learning: Adversarial Attacks on Partial Dependence Plots

Authors: Xi Xin, Giles Hooker, Fei Huang | Published: 2024-04-29 | Updated: 2024-05-01

モデルの解釈性

敵対的訓練

透かし評価

2024.04.29 2025.04.03

文献データベース

Attacking Bayes: On the Adversarial Robustness of Bayesian Neural Networks

Authors: Yunzhen Feng, Tim G. J. Rudner, Nikolaos Tsilivis, Julia Kempe | Published: 2024-04-27

不確実性の定量化

敵対的サンプル

透かし評価

2024.04.27 2025.04.03

文献データベース