プライバシー手法

Observational Auditing of Label Privacy

Authors: Iden Kalemaj, Luca Melis, Maxime Boucher, Ilya Mironov, Saeed Mahloujifar | Published: 2025-11-18
バックドア攻撃用の毒データの検知
プライバシー手法
差分プライバシー

GRPO Privacy Is at Risk: A Membership Inference Attack Against Reinforcement Learning With Verifiable Rewards

Authors: Yule Liu, Heyi Zhang, Jinyi Zheng, Zhen Sun, Zifan Peng, Tianshuo Cong, Yilong Yang, Xinlei He, Zhuo Ma | Published: 2025-11-18
プライバシー手法
メンバーシップ推論
差分プライバシー

Robust Client-Server Watermarking for Split Federated Learning

Authors: Jiaxiong Tang, Zhengchunmin Dai, Liantao Wu, Peng Sun, Honglong Chen, Zhenfu Cao | Published: 2025-11-17
トリガーの検知
プライバシー手法
透かし評価

Tight and Practical Privacy Auditing for Differentially Private In-Context Learning

Authors: Yuyang Xia, Ruixuan Liu, Li Xiong | Published: 2025-11-17
プライバシー手法
匿名化技術
差分プライバシー

Whistledown: Combining User-Level Privacy with Conversational Coherence in LLMs

Authors: Chelsea McMurray, Hayder Tirmazi | Published: 2025-11-17
プライバシーリスク管理
プライバシー保証
プライバシー手法

DualTAP: A Dual-Task Adversarial Protector for Mobile MLLM Agents

Authors: Fuyao Zhang, Jiaming Zhang, Che Wang, Xiongtao Sun, Yurong Hao, Guowei Guan, Wenjie Li, Longtao Huang, Wei Yang Bryan Lim | Published: 2025-11-17
プライバシー手法
生成モデル
透かし評価

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

Authors: Xuankun Rong, Wenke Huang, Tingfeng Wang, Daiguo Zhou, Bo Du, Mang Ye | Published: 2025-11-17
プライバシー手法
不適切コンテンツ生成
倫理的選択評価

DP-FedLoRA: Privacy-Enhanced Federated Fine-Tuning for On-Device Large Language Models

Authors: Honghui Xu, Shiva Shrestha, Wei Chen, Zhiyuan Li, Zhipeng Cai | Published: 2025-09-11
プライバシー手法
収束解析
差分プライバシー

Towards Confidential and Efficient LLM Inference with Dual Privacy Protection

Authors: Honglan Yu, Yibin Wang, Feifei Dai, Dong Liu, Haihui Fan, Xiaoyan Gu | Published: 2025-09-11
アルゴリズム
プライバシー手法
差分プライバシー

Gaze3P: Gaze-Based Prediction of User-Perceived Privacy

Authors: Mayar Elfares, Pascal Reisert, Ralf Küsters, Andreas Bulling | Published: 2025-07-01 | Updated: 2025-09-10
プライバシー手法
プライバシー評価
研究方法論