R1dacted: Investigating Local Censorship in DeepSeek’s R1 Language Model Authors: Ali Naseh, Harsh Chaudhari, Jaechul Roh, Mingshi Wu, Alina Oprea, Amir Houmansadr | Published: 2025-05-19 Bias Detection in AI OutputPrompt leaking検閲行動 2025.05.19 2025.05.21 Literature Database
Elevating Cyber Threat Intelligence against Disinformation Campaigns with LLM-based Concept Extraction and the FakeCTI Dataset Authors: Domenico Cotroneo, Roberto Natella, Vittorio Orbinato | Published: 2025-05-06 Bias Detection in AI OutputDetection of MisinformationInformation Extraction Method 2025.05.06 2025.05.12 Literature Database
LLM-Based Threat Detection and Prevention Framework for IoT Ecosystems Authors: Yazan Otoum, Arghavan Asad, Amiya Nayak | Published: 2025-05-01 Bias Detection in AI OutputLLM Performance EvaluationPrompt Injection 2025.05.01 2025.05.12 Literature Database
Synthesizing Access Control Policies using Large Language Models Authors: Adarsh Vatsa, Pratyush Patel, William Eiers | Published: 2025-03-14 Bias Detection in AI OutputData Generation MethodPrivacy Design Principles 2025.03.14 2025.05.12 Literature Database
PriFFT: Privacy-preserving Federated Fine-tuning of Large Language Models via Hybrid Secret Sharing Authors: Zhichao You, Xuewen Dong, Ke Cheng, Xutong Mu, Jiaxuan Fu, Shiyang Ma, Qiang Qu, Yulong Shen | Published: 2025-03-05 | Updated: 2025-05-14 Bias Detection in AI OutputPrivacy Design PrinciplesCryptography 2025.03.05 2025.05.16 Literature Database
ThreatModeling-LLM: Automating Threat Modeling using Large Language Models for Banking System Authors: Tingmin Wu, Shuiqiao Yang, Shigang Liu, David Nguyen, Seung Jang, Alsharif Abuadbba | Published: 2024-11-26 | Updated: 2025-05-14 Bias Detection in AI OutputPrompt leaking脅威モデリング自動化 2024.11.26 2025.05.16 Literature Database
Measuring Implicit Bias in Explicitly Unbiased Large Language Models Authors: Xuechunzi Bai, Angelina Wang, Ilia Sucholutsky, Thomas L. Griffiths | Published: 2024-02-06 | Updated: 2024-05-23 Bias Detection in AI OutputAlgorithm FairnessLarge Language Model 2024.02.06 2025.05.12 Literature Database
Gender bias and stereotypes in Large Language Models Authors: Hadas Kotek, Rikker Dockum, David Q. Sun | Published: 2023-08-28 Bias Detection in AI OutputAlgorithm FairnessLarge Language Model 2023.08.28 2025.05.12 Literature Database
ADEPT: A DEbiasing PrompT Framework Authors: Ke Yang, Charles Yu, Yi Fung, Manling Li, Heng Ji | Published: 2022-11-10 | Updated: 2022-12-23 Bias Detection in AI OutputPrompting StrategyCreation of Fair AI Models 2022.11.10 2025.05.12 Literature Database
Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases Authors: Shrimai Prabhumoye, Rafal Kocielnik, Mohammad Shoeybi, Anima Anandkumar, Bryan Catanzaro | Published: 2021-12-15 | Updated: 2022-04-15 Bias Detection in AI OutputFew-Shot LearningLarge Language Model 2021.12.15 2025.05.13 Literature Database