On Adversarial Bias and the Robustness of Fair Machine Learning

TOP 文献データベース On Adversarial Bias and the Robustness of Fair Machine Learning

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2006.08669

PDF

https://arxiv.org/pdf/2006.08669

文献情報

作者: Hongyan Chang,Ta Duy Nguyen,Sasi Kumar Murakonda,Ehsan Kazemi,Reza Shokri
公開日: 2020-6-16
所属機関: National University of Singapore (NUS)
所属の国: Singapore
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

ポイズニング攻撃手法メンバーシップ推論

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Optimizing prediction accuracy can come at the expense of fairness. Towards minimizing discrimination against a group, fair machine learning algorithms strive to equalize the behavior of a model across different groups, by imposing a fairness constraint on models. However, we show that giving the same importance to groups of different sizes and distributions, to counteract the effect of bias in training data, can be in conflict with robustness. We analyze data poisoning attacks against group-based fair machine learning, with the focus on equalized odds. An adversary who can control sampling or labeling for a fraction of training data, can reduce the test accuracy significantly beyond what he can achieve on unconstrained models. Adversarial sampling and adversarial labeling attacks can also worsen the model's fairness gap on test data, even though the model satisfies the fairness constraint on training data. We analyze the robustness of fair machine learning through an empirical evaluation of attacks on multiple algorithms and benchmark datasets.