Bias Detection in AI Output

R1dacted: Investigating Local Censorship in DeepSeek’s R1 Language Model

Authors: Ali Naseh, Harsh Chaudhari, Jaechul Roh, Mingshi Wu, Alina Oprea, Amir Houmansadr | Published: 2025-05-19
Bias Detection in AI Output
Prompt leaking
Censorship Behavior

Elevating Cyber Threat Intelligence against Disinformation Campaigns with LLM-based Concept Extraction and the FakeCTI Dataset

Authors: Domenico Cotroneo, Roberto Natella, Vittorio Orbinato | Published: 2025-05-06
Bias Detection in AI Output
Detection of Misinformation
Information Extraction Method

LLM-Based Threat Detection and Prevention Framework for IoT Ecosystems

Authors: Yazan Otoum, Arghavan Asad, Amiya Nayak | Published: 2025-05-01
Bias Detection in AI Output
LLM Performance Evaluation
Prompt Injection

Synthesizing Access Control Policies using Large Language Models

Authors: Adarsh Vatsa, Pratyush Patel, William Eiers | Published: 2025-03-14
Bias Detection in AI Output
Data Generation Method
Privacy Design Principles

PriFFT: Privacy-preserving Federated Fine-tuning of Large Language Models via Hybrid Secret Sharing

Authors: Zhichao You, Xuewen Dong, Ke Cheng, Xutong Mu, Jiaxuan Fu, Shiyang Ma, Qiang Qu, Yulong Shen | Published: 2025-03-05 | Updated: 2025-05-14
Bias Detection in AI Output
Privacy Design Principles
Cryptography

ThreatModeling-LLM: Automating Threat Modeling using Large Language Models for Banking System

Authors: Tingmin Wu, Shuiqiao Yang, Shigang Liu, David Nguyen, Seung Jang, Alsharif Abuadbba | Published: 2024-11-26 | Updated: 2025-05-14
Bias Detection in AI Output
Prompt leaking
Threat Modeling Automation

Measuring Implicit Bias in Explicitly Unbiased Large Language Models

Authors: Xuechunzi Bai, Angelina Wang, Ilia Sucholutsky, Thomas L. Griffiths | Published: 2024-02-06 | Updated: 2024-05-23
Bias Detection in AI Output
Algorithm Fairness
Large Language Model

Gender bias and stereotypes in Large Language Models

Authors: Hadas Kotek, Rikker Dockum, David Q. Sun | Published: 2023-08-28
Bias Detection in AI Output
Algorithm Fairness
Large Language Model

ADEPT: A DEbiasing PrompT Framework

Authors: Ke Yang, Charles Yu, Yi Fung, Manling Li, Heng Ji | Published: 2022-11-10 | Updated: 2022-12-23
Bias Detection in AI Output
Prompting Strategy
Creation of Fair AI Models

Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases

Authors: Shrimai Prabhumoye, Rafal Kocielnik, Mohammad Shoeybi, Anima Anandkumar, Bryan Catanzaro | Published: 2021-12-15 | Updated: 2022-04-15
Bias Detection in AI Output
Few-Shot Learning
Large Language Model