Adversarial examples are maliciously perturbed images that easily fool
machine learning models such as neural networks, yet are typically
indistinguishable from the original images to human observers. One of the main
defenses against them is to retrain the network on such adversarial examples,
known as adversarial training. However, standard adversarial training may not
actually change the decision boundaries; instead, it can cause gradient
masking, which merely weakens the ability of gradient-based attacks to
generate adversarial examples from the defended model.
Consequently, it does not alleviate black-box attacks, in which adversarial
examples generated from other networks transfer to the targeted one. To
mitigate black-box attacks, we propose a novel method in which two networks
learn from each other's adversarial examples and thereby become resilient to
such attacks. We further combine this method with a simple form of domain
adaptation to improve performance.
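The cross-training idea can be illustrated with a minimal toy sketch: two independently trained models each generate adversarial examples (here via the Fast Gradient Sign Method) and are then retrained on the other model's adversarial examples. Everything below is an illustrative assumption, using logistic regression in place of neural networks rather than the paper's actual implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy binary classification data (a stand-in for images).
X = rng.normal(size=(200, 10))
w_true = rng.normal(size=10)
y = (X @ w_true > 0).astype(float)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def grad_wrt_input(w, x, yi):
    # Gradient of the logistic loss with respect to the input x: (p - y) * w.
    return (sigmoid(x @ w) - yi) * w

def fgsm(w, X, y, eps=0.1):
    # FGSM: perturb each input along the sign of the loss gradient.
    G = np.stack([grad_wrt_input(w, X[i], y[i]) for i in range(len(X))])
    return X + eps * np.sign(G)

def train(w, X, y, lr=0.1, steps=100):
    # Plain gradient descent on the logistic loss.
    for _ in range(steps):
        p = sigmoid(X @ w)
        w = w - lr * X.T @ (p - y) / len(X)
    return w

# Two independently initialized models.
w1 = train(rng.normal(size=10), X, y)
w2 = train(rng.normal(size=10), X, y)

# Each model crafts adversarial examples, and each is retrained on the
# OTHER model's adversarial examples (the cross-training idea).
X_adv1 = fgsm(w1, X, y)
X_adv2 = fgsm(w2, X, y)
w1 = train(w1, np.vstack([X, X_adv2]), np.concatenate([y, y]))
w2 = train(w2, np.vstack([X, X_adv1]), np.concatenate([y, y]))

acc1 = np.mean((sigmoid(X_adv2 @ w1) > 0.5) == y)
print(f"model 1 accuracy on model 2's adversarial examples: {acc1:.2f}")
```

Because each model is hardened against perturbations crafted on a different model, the sketch targets transferability directly rather than relying on (possibly masked) gradients of the defended model itself.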