Recent work has shown deep neural networks (DNNs) to be highly susceptible to
well-designed, small perturbations at the input layer, or so-called adversarial
examples. Taking images as an example, such distortions are often
imperceptible, yet can result in 100% misclassification for a state-of-the-art
DNN. We study the structure of adversarial examples and explore network
topology, pre-processing and training strategies to improve the robustness of
DNNs. We perform various experiments to assess the removability of adversarial
examples by corrupting with additional noise and pre-processing with denoising
autoencoders (DAEs). We find that DAEs can remove substantial amounts of the
adversarial noise. However, when stacking the DAE with the original DNN, the
resulting network can again be attacked by new adversarial examples with even
smaller distortion. As a solution, we propose the Deep Contractive Network, a model
with a new end-to-end training procedure that includes a smoothness penalty
inspired by the contractive autoencoder (CAE). This increases the network
robustness to adversarial examples, without a significant performance penalty.
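
To make the idea concrete, the sketch below shows one plausible form of such a CAE-inspired smoothness penalty: the squared Frobenius norm of a sigmoid hidden layer's Jacobian with respect to its input, added to the classification loss. The network architecture, layer sizes, penalty weight `lam`, and the use of PyTorch are illustrative assumptions, not the paper's exact formulation or hyperparameters.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ContractiveNet(nn.Module):
    """Minimal MLP whose hidden layer we regularize with a contractive penalty."""

    def __init__(self, d_in=784, d_hid=256, d_out=10):
        super().__init__()
        self.fc1 = nn.Linear(d_in, d_hid)
        self.fc2 = nn.Linear(d_hid, d_out)

    def forward(self, x):
        h = torch.sigmoid(self.fc1(x))  # hidden representation
        return self.fc2(h), h

def contractive_loss(model, x, y, lam=1e-3):
    """Cross-entropy plus a CAE-style Jacobian penalty on the hidden layer.

    For h = sigmoid(W x + b), the Jacobian dh/dx is diag(h * (1 - h)) @ W,
    so its squared Frobenius norm per example reduces to the closed form
    sum_j (h_j * (1 - h_j))^2 * ||W_j||^2, avoiding an explicit Jacobian.
    """
    logits, h = model(x)
    ce = F.cross_entropy(logits, y)
    dh2 = (h * (1.0 - h)) ** 2                    # (batch, d_hid)
    w2 = (model.fc1.weight ** 2).sum(dim=1)       # (d_hid,) squared row norms
    jacobian_fro2 = (dh2 * w2).sum(dim=1).mean()  # batch-averaged ||J||_F^2
    return ce + lam * jacobian_fro2

# Usage: one training step on a random batch (shapes are illustrative).
model = ContractiveNet()
opt = torch.optim.SGD(model.parameters(), lr=0.1)
x, y = torch.rand(32, 784), torch.randint(0, 10, (32,))
opt.zero_grad()
loss = contractive_loss(model, x, y)
loss.backward()
opt.step()
```

The closed form above is exact for a single sigmoid layer; for deeper networks, one natural approximation, in the spirit of the layer-wise penalty described here, is to sum the same term over each layer's input-output pair rather than differentiating through the whole network.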