Neural networks trained with backpropagation, the standard algorithm of deep
learning, which relies on weight transport, are easily fooled by existing
gradient-based adversarial attacks. This class of attacks is based on small,
carefully chosen perturbations of the inputs that cause networks to misclassify them.
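For concreteness, a minimal NumPy sketch of one such gradient-based attack, the fast gradient sign method (FGSM), on a toy logistic-regression model; the function name, dimensions, and eps value are illustrative assumptions, not the networks evaluated here:

```python
import numpy as np

def fgsm_perturb(x, y, w, b, eps):
    """Fast Gradient Sign Method on a toy logistic-regression model.

    Crafts x' = x + eps * sign(dL/dx), the kind of small input
    perturbation the gradient-based attack family relies on.
    """
    z = x @ w + b                     # logit
    p = 1.0 / (1.0 + np.exp(-z))      # sigmoid probability
    # For binary cross-entropy, dL/dx reduces to (p - y) * w
    grad_x = (p - y) * w
    return x + eps * np.sign(grad_x)

# Hypothetical toy usage: a 4-feature input with label 1.
rng = np.random.default_rng(0)
x = rng.normal(size=4)
w = rng.normal(size=4)
x_adv = fgsm_perturb(x, 1.0, w, 0.0, eps=0.25)
```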
We show that less biologically implausible deep neural networks trained with feedback
alignment, which does not use weight transport, can be harder to fool,
offering genuine robustness to these attacks.
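For illustration, a minimal NumPy sketch of a feedback-alignment update on a toy two-layer network: the backward pass routes the output error through a fixed random matrix B2 instead of the transpose of the forward weights W2, so no weight transport occurs. The architecture, tanh nonlinearity, squared-error loss, and learning rate are illustrative assumptions, not the models tested here:

```python
import numpy as np

def fa_train_step(x, y, W1, W2, B2, lr=0.1):
    """One feedback-alignment update for a two-layer network.

    The error is propagated through the fixed random matrix B2
    rather than W2.T, avoiding weight transport.
    """
    # Forward pass
    h = np.tanh(W1 @ x)
    y_hat = W2 @ h                      # linear output, squared-error loss
    e = y_hat - y                       # output error dL/dy_hat
    # Backward pass: random feedback B2 stands in for W2.T
    delta_h = (B2 @ e) * (1.0 - h**2)   # tanh'(z) = 1 - tanh(z)**2
    W2 -= lr * np.outer(e, h)
    W1 -= lr * np.outer(delta_h, x)
    return 0.5 * float(e @ e)

# Hypothetical toy usage with assumed layer sizes.
rng = np.random.default_rng(0)
W1 = rng.normal(scale=0.5, size=(8, 4))
W2 = rng.normal(scale=0.5, size=(2, 8))
B2 = rng.normal(scale=0.5, size=(8, 2))  # fixed random feedback weights
x, y = rng.normal(size=4), np.array([1.0, -1.0])
for _ in range(100):
    loss = fa_train_step(x, y, W1, W2, B2)
```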
Tested on MNIST, deep neural networks trained without weight transport (1) reach
an adversarial accuracy of 98%, compared to 0.03% for networks trained with
backpropagation, and (2) generate non-transferable adversarial examples. This gap
narrows on CIFAR-10 but remains significant, particularly for small perturbation
magnitudes below 1/2.