Bias

Trustworthy AI-Generative Content in Intelligent 6G Network: Adversarial, Privacy, and Fairness

Authors: Siyuan Li, Xi Lin, Yaju Liu, Jianhua Li | Published: 2024-05-09
Bias
Privacy Protection
Prompt Injection

Evaluating and Mitigating Linguistic Discrimination in Large Language Models

Authors: Guoliang Dong, Haoyu Wang, Jun Sun, Xinyu Wang | Published: 2024-04-29 | Updated: 2024-05-10
LLM Performance Evaluation
Bias
Prompt Injection

Collaborative Heterogeneous Causal Inference Beyond Meta-analysis

Authors: Tianyu Guo, Sai Praneeth Karimireddy, Michael I. Jordan | Published: 2024-04-24
Algorithm
Watermarking
Bias

Can Biases in ImageNet Models Explain Generalization?

Authors: Paul Gavrikov, Janis Keuper | Published: 2024-04-01
Bias
Model Performance Evaluation
Watermark Evaluation

De-amplifying Bias from Differential Privacy in Language Model Fine-tuning

Authors: Sanjari Srivastava, Piotr Mardziel, Zhikhun Zhang, Archana Ahlawat, Anupam Datta, John C Mitchell | Published: 2024-02-07
Data Privacy Assessment
Bias
Privacy Protection

TESSERACT: Eliminating Experimental Bias in Malware Classification across Space and Time (Extended Version)

Authors: Zeliang Kan, Shae McFadden, Daniel Arp, Feargus Pendlebury, Roberto Jordaney, Johannes Kinder, Fabio Pierazzi, Lorenzo Cavallaro | Published: 2024-02-02 | Updated: 2025-04-09
Bias
Malware Classification
Time-Related Features

Domain-Independent Deception: A New Taxonomy and Linguistic Analysis

Authors: Rakesh M. Verma, Nachum Dershowitz, Victor Zeng, Dainis Boumber, Xuting Liu | Published: 2024-02-01
Watermarking
Domain Independence
Bias

Comparing Spectral Bias and Robustness For Two-Layer Neural Networks: SGD vs Adaptive Random Fourier Features

Authors: Aku Kammonen, Lisi Liang, Anamika Pandey, Raúl Tempone | Published: 2024-02-01
Watermarking
Bias
Adversarial Attack Detection

MAPPING: Debiasing Graph Neural Networks for Fair Node Classification with Limited Sensitive Information Leakage

Authors: Ying Song, Balaji Palanisamy | Published: 2024-01-23 | Updated: 2025-01-26
Watermarking
Bias
Membership Inference

X Hacking: The Threat of Misguided AutoML

Authors: Rahul Sharma, Sergey Redyuk, Sumantrak Mukherjee, Andrea Sipka, Sebastian Vollmer, David Selby | Published: 2024-01-16 | Updated: 2024-02-12
XAI (Explainable AI)
Bias
Model Interpretability