透かし評価

Duwak: Dual Watermarks in Large Language Models

Authors: Chaoyi Zhu, Jeroen Galjaard, Pin-Yu Chen, Lydia Y. Chen | Published: 2024-03-12 | Updated: 2024-08-08
ウォーターマーキング
トークンの処理と収集
透かし評価

Robustness bounds on the successful adversarial examples in probabilistic models: Implications from Gaussian processes

Authors: Hiroaki Maeshima, Akira Otsuka | Published: 2024-03-04 | Updated: 2025-03-19
攻撃手法
敵対的サンプル
透かし評価

Revisiting Differentially Private Hyper-parameter Tuning

Authors: Zihang Xiang, Tianhao Wang, Chenglong Wang, Di Wang | Published: 2024-02-20 | Updated: 2024-06-04
ハイパーパラメータ調整
プライバシー保護手法
透かし評価

Bounding Reconstruction Attack Success of Adversaries Without Data Priors

Authors: Alexander Ziller, Anneliese Riess, Kristian Schwethelm, Tamara T. Mueller, Daniel Rueckert, Georgios Kaissis | Published: 2024-02-20
データプライバシー評価
プライバシー保護手法
透かし評価

DART: A Principled Approach to Adversarially Robust Unsupervised Domain Adaptation

Authors: Yunjuan Wang, Hussein Hazimeh, Natalia Ponomareva, Alexey Kurakin, Ibrahim Hammoud, Raman Arora | Published: 2024-02-16
アルゴリズム
敵対的訓練
透かし評価

Private PAC Learning May be Harder than Online Learning

Authors: Mark Bun, Aloni Cohen, Rathin Desai | Published: 2024-02-16
ウォーターマーキング
オンライン学習
透かし評価

Measuring and Reducing LLM Hallucination without Gold-Standard Answers

Authors: Jiaheng Wei, Yuanshun Yao, Jean-Francois Ton, Hongyi Guo, Andrew Estornell, Yang Liu | Published: 2024-02-16 | Updated: 2024-06-06
Few-Shot Learning
ハルシネーションの検知
透かし評価

How Much Does Each Datapoint Leak Your Privacy? Quantifying the Per-datum Membership Leakage

Authors: Achraf Azize, Debabrota Basu | Published: 2024-02-15
メンバーシップ推論
仮説検定
透かし評価

CycPUF: Cyclic Physical Unclonable Function

Authors: Michael Dominguez, Amin Rezaei | Published: 2024-02-12
FPGA
PUFの評価手法
透かし評価

ACW: Enhancing Traceability of AI-Generated Codes Based on Watermarking

Authors: Boquan Li, Mengdi Zhang, Peixin Zhang, Jun Sun, Xingmei Wang, Zirui Fu | Published: 2024-02-12 | Updated: 2024-08-21
アルゴリズム
ウォーターマーキング
透かし評価