Feedback Learning for Improving the Robustness of Neural Networks

TOP 文献データベース Feedback Learning for Improving the Robustness of Neural Networks

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1909.05443

PDF

https://arxiv.org/pdf/1909.05443

文献情報

作者: Chang Song,Zuoguan Wang,Hai Li
公開日: 2019-9-12
所属機関: Department of ECE, Duke University
所属の国: United States of America
会議名

AIにより推定されたラベル

敵対的サンプルクラス不均衡攻撃手法

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Recent research studies revealed that neural networks are vulnerable to adversarial attacks. State-of-the-art defensive techniques add various adversarial examples in training to improve models' adversarial robustness. However, these methods are not universal and can't defend unknown or non-adversarial evasion attacks. In this paper, we analyze the model robustness in the decision space. A feedback learning method is then proposed, to understand how well a model learns and to facilitate the retraining process of remedying the defects. The evaluations according to a set of distance-based criteria show that our method can significantly improve models' accuracy and robustness against different types of evasion attacks. Moreover, we observe the existence of inter-class inequality and propose to compensate it by changing the proportions of examples generated in different classes.

外部データセット

MNIST

CIFAR-10