Measuring Bias in Contextualized Word Representations

TOP 文献データベース Measuring Bias in Contextualized Word Representations

Computing Research Repository (CoRR)

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1906.07337

PDF

https://arxiv.org/pdf/1906.07337

文献情報

作者: Keita Kurita,Nidhi Vyas,Ayush Pareek,Alan W Black,Yulia Tsvetkov
公開日: 2025-3-25
所属機関: Carnegie Mellon University
所属の国: United States of America
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

アルゴリズムの公平性 AIによる出力のバイアスの検出大規模言語モデル

Abstract

Contextual word embeddings such as BERT have achieved state of the art performance in numerous NLP tasks. Since they are optimized to capture the statistical properties of training data, they tend to pick up on and amplify social stereotypes present in the data as well. In this study, we (1)~propose a template-based method to quantify bias in BERT; (2)~show that this method obtains more consistent results in capturing social biases than the traditional cosine based method; and (3)~conduct a case study, evaluating gender bias in a downstream task of Gender Pronoun Resolution. Although our case study focuses on gender bias, the proposed technique is generalizable to unveiling other biases, including in multiclass settings, such as racial and religious biases.