Bias

De-amplifying Bias from Differential Privacy in Language Model Fine-tuning

Authors: Sanjari Srivastava, Piotr Mardziel, Zhikhun Zhang, Archana Ahlawat, Anupam Datta, John C Mitchell | Published: 2024-02-07
Data privacy evaluation
Bias
Privacy protection

TESSERACT: Eliminating Experimental Bias in Malware Classification across Space and Time (Extended Version)

Authors: Zeliang Kan, Shae McFadden, Daniel Arp, Feargus Pendlebury, Roberto Jordaney, Johannes Kinder, Fabio Pierazzi, Lorenzo Cavallaro | Published: 2024-02-02
Bias
Malware classification
Time-related features

Domain-Independent Deception: A New Taxonomy and Linguistic Analysis

Authors: Rakesh M. Verma, Nachum Dershowitz, Victor Zeng, Dainis Boumber, Xuting Liu | Published: 2024-02-01
Watermarking
Domain independence
Bias

Comparing Spectral Bias and Robustness For Two-Layer Neural Networks: SGD vs Adaptive Random Fourier Features

Authors: Aku Kammonen, Lisi Liang, Anamika Pandey, Raúl Tempone | Published: 2024-02-01
Watermarking
Bias
Adversarial attack detection

MAPPING: Debiasing Graph Neural Networks for Fair Node Classification with Limited Sensitive Information Leakage

Authors: Ying Song, Balaji Palanisamy | Published: 2024-01-23 | Updated: 2025-01-26
Watermarking
Bias
Membership inference

X Hacking: The Threat of Misguided AutoML

Authors: Rahul Sharma, Sergey Redyuk, Sumantrak Mukherjee, Andrea Sipka, Sebastian Vollmer, David Selby | Published: 2024-01-16 | Updated: 2024-02-12
XAI (Explainable AI)
Bias
Model interpretability

Diffusion Denoising Process for Perceptron Bias in Out-of-distribution Detection

Authors: Luping Liu, Yi Ren, Xize Cheng, Rongjie Huang, Chongxuan Li, Zhou Zhao | Published: 2022-11-21 | Updated: 2023-06-04
Bias
Optimization methods
Image feature extraction

On the Alignment of Group Fairness with Attribute Privacy

Authors: Jan Aalmoes, Vasisht Duddu, Antoine Boutet | Published: 2022-11-18 | Updated: 2024-03-05
Bias
Privacy protection methods
Privacy evaluation

FairVFL: A Fair Vertical Federated Learning Framework with Contrastive Adversarial Learning

Authors: Tao Qi, Fangzhao Wu, Chuhan Wu, Lingjuan Lyu, Tong Xu, Zhongliang Yang, Yongfeng Huang, Xing Xie | Published: 2022-06-07 | Updated: 2022-10-31
Bias
Poisoning
Adversarial learning

Toward More Generalized Malicious URL Detection Models

Authors: YunDa Tsai, Cayon Liow, Yin Sheng Siang, Shou-De Lin | Published: 2022-02-21 | Updated: 2024-02-09
Token distribution analysis
Bias
Impact of generalization