As natural language processing methods are increasingly deployed in
real-world scenarios such as healthcare, legal systems, and social science, it
becomes necessary to recognize the role they potentially play in shaping social
biases and stereotypes. Previous work has revealed the presence of social
biases in widely used word embeddings involving gender, race, religion, and
other social constructs. While several methods have been proposed to debias
these word-level embeddings, there is a need to perform debiasing at the
sentence level given the recent shift towards contextualized sentence
representations such as ELMo and BERT. In this paper, we investigate the
presence of social biases in sentence-level representations and propose a new
method, Sent-Debias, to reduce these biases. We show that Sent-Debias is
effective in removing biases while preserving performance on
sentence-level downstream tasks such as sentiment analysis, linguistic
acceptability, and natural language understanding. We hope that our work will
inspire future research on characterizing and removing social biases from
widely adopted sentence representations for fairer NLP.
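The abstract does not spell out how Sent-Debias operates, but subspace-based debiasing of sentence embeddings can be sketched in a few lines. The following is a minimal NumPy illustration, assuming a bias subspace is estimated from the principal components of embedding differences between counterfactual sentence pairs (e.g. the same sentence with "he"/"she" swapped); the function names are illustrative, not the authors' API:

```python
import numpy as np

def estimate_bias_subspace(pair_diffs, k=1):
    """Estimate a k-dimensional bias subspace.

    pair_diffs: (n, d) array of differences between embeddings of
    sentence pairs that differ only in a bias attribute.
    Returns a (k, d) array of orthonormal bias directions (top
    right-singular vectors of the centered differences).
    """
    centered = pair_diffs - pair_diffs.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[:k]

def debias(embeddings, bias_subspace):
    """Remove each embedding's component lying in the bias subspace."""
    # Project onto the subspace, then subtract the projection.
    proj = embeddings @ bias_subspace.T @ bias_subspace
    return embeddings - proj
```

After this projection, the debiased embeddings are orthogonal to every estimated bias direction, while components outside the subspace (which carry most task-relevant semantics) are untouched; this is the standard trade-off the abstract alludes to between bias removal and downstream performance.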