Efficient Toxic Content Detection by Bootstrapping and Distilling Large Language Models | Authors: Jiang Zhang, Qiong Wu, Yiming Xu, Cheng Cao, Zheng Du, Konstantinos Psounis | Published: 2023-12-13 | Tags: Prompting Strategy, Calculation of Output Harmfulness, Large Language Model | 2023.12.13 2025.05.28 | Literature Database
Gender bias and stereotypes in Large Language Models | Authors: Hadas Kotek, Rikker Dockum, David Q. Sun | Published: 2023-08-28 | Tags: Bias Detection in AI Output, Algorithm Fairness, Large Language Model | 2023.08.28 2025.05.28
Toxicity Detection with Generative Prompt-based Inference | Authors: Yau-Shian Wang, Yingshan Chang | Published: 2022-05-24 | Tags: Prompting Strategy, Calculation of Output Harmfulness, Large Language Model | 2022.05.24 2025.05.28
Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases | Authors: Shrimai Prabhumoye, Rafal Kocielnik, Mohammad Shoeybi, Anima Anandkumar, Bryan Catanzaro | Published: 2021-12-15 | Updated: 2022-04-15 | Tags: Bias Detection in AI Output, Few-Shot Learning, Large Language Model | 2021.12.15 2025.05.28
Measuring Bias in Contextualized Word Representations | Authors: Keita Kurita, Nidhi Vyas, Ayush Pareek, Alan W Black, Yulia Tsvetkov | Published: 2019-06-18 | Tags: Bias Detection in AI Output, Algorithm Fairness, Large Language Model | 2019.06.18 2025.05.28
A Machine Learning Approach To Prevent Malicious Calls Over Telephony Networks | Authors: Huichen Li, Xiaojun Xu, Chang Liu, Teng Ren, Kun Wu, Xuezhi Cao, Weinan Zhang, Yong Yu, Dawn Song | Published: 2018-04-07 | Tags: Large Language Model, Time-Related Features, Statistical Analysis | 2018.04.07 2025.05.28