In this work, we study the possibility of defending against data-poisoning
attacks while training a shallow neural network in a regression setup. We focus
on supervised learning for a class of depth-2, finite-width neural
networks, which includes single-filter convolutional networks. In this class of
networks, we attempt to learn the network weights in the presence of a
malicious oracle that applies stochastic, bounded, additive adversarial
distortions to the true output during training. For the non-gradient stochastic algorithm
that we construct, we prove worst-case near-optimal trade-offs among the
magnitude of the adversarial attack, the weight approximation accuracy, and the
confidence achieved by the proposed algorithm. As our algorithm uses
mini-batching, we analyze how the mini-batch size affects convergence. We also
show how the scaling of the outer-layer weights can be used to counter
output-poisoning attacks, depending on the probability of attack. Lastly, we
give experimental evidence that our algorithm outperforms
stochastic gradient descent under different input data distributions, including
instances of heavy-tailed distributions.
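To make the threat model concrete, the training setup described above can be sketched as follows. This is a minimal illustration, not the paper's algorithm: the distortion bound `theta`, the attack probability `beta`, and the planted depth-2 ReLU network are assumed names chosen for the sketch, not notation taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative dimensions and attack parameters (assumptions, not from the paper).
d, k, n = 10, 4, 1000   # input dimension, hidden width, sample count
theta = 0.5             # bound on the additive distortion: |xi| <= theta
beta = 0.3              # probability that any given label is attacked

# Planted depth-2, finite-width network: f(x) = sum_i a_i * ReLU(w_i . x).
W_true = rng.standard_normal((k, d))  # inner-layer weights the learner must recover
a = np.ones(k)                        # fixed outer-layer weights

def forward(W, X):
    """Depth-2 ReLU network with outer-layer weights a."""
    return np.maximum(W @ X.T, 0.0).T @ a

# Clean regression data.
X = rng.standard_normal((n, d))
y_clean = forward(W_true, X)

# Malicious oracle: with probability beta, add a bounded, stochastic
# distortion xi in [-theta, theta] to the true output.
attacked = rng.random(n) < beta
xi = rng.uniform(-theta, theta, size=n) * attacked
y_observed = y_clean + xi

# The learner only sees (X, y_observed); every corruption stays within theta.
assert np.all(np.abs(y_observed - y_clean) <= theta)
```

The point of the sketch is only that the attack is on the *output* side: inputs are untouched, and each observed label deviates from the true network output by at most `theta`, with corruption occurring stochastically.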