A Linear Approach to Data Poisoning

TOP 文献データベース A Linear Approach to Data Poisoning

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2505.15175

PDF

https://arxiv.org/pdf/2505.15175

文献情報

作者: Diego Granziol,Donald Flynn
公開日: 2025-5-21
所属機関: Mathematical Institute, University of Oxford
所属の国: United Kingdom
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

ポイズニング統計的分析動的分析

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

We investigate the theoretical foundations of data poisoning attacks in machine learning models. Our analysis reveals that the Hessian with respect to the input serves as a diagnostic tool for detecting poisoning, exhibiting spectral signatures that characterize compromised datasets. We use random matrix theory (RMT) to develop a theory for the impact of poisoning proportion and regularisation on attack efficacy in linear regression. Through QR stepwise regression, we study the spectral signatures of the Hessian in multi-output regression. We perform experiments on deep networks to show experimentally that this theory extends to modern convolutional and transformer networks under the cross-entropy loss. Based on these insights we develop preliminary algorithms to determine if a network has been poisoned and remedies which do not require further training.

外部データセット

MNIST

CIFAR

ImageNet