Debiasing Pre-trained Contextualised Embeddings

TOP 文献データベース Debiasing Pre-trained Contextualised Embeddings

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2101.09523

PDF

https://arxiv.org/pdf/2101.09523

文献情報

作者: Masahiro Kaneko,Danushka Bollegala
公開日: 2021-1-24
所属機関: Tokyo Metropolitan University
所属の国: Japan
会議名: Conference of the European Chapter of the Association for Computational Linguistics (EACL)

AIにより推定されたラベル

公平性のあるAIモデルの作成深層学習手法 AIによる出力のバイアスの検出

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

In comparison to the numerous debiasing methods proposed for the static non-contextualised word embeddings, the discriminative biases in contextualised embeddings have received relatively little attention. We propose a fine-tuning method that can be applied at token- or sentence-levels to debias pre-trained contextualised embeddings. Our proposed method can be applied to any pre-trained contextualised embedding model, without requiring to retrain those models. Using gender bias as an illustrative example, we then conduct a systematic study using several state-of-the-art (SoTA) contextualised representations on multiple benchmark datasets to evaluate the level of biases encoded in different contextualised embeddings before and after debiasing using the proposed method. We find that applying token-level debiasing for all tokens and across all layers of a contextualised embedding model produces the best performance. Interestingly, we observe that there is a trade-off between creating an accurate vs. unbiased contextualised embedding model, and different contextualised embedding models respond differently to this trade-off.