Metric Learning for Adversarial Robustness


arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1909.00900

PDF

https://arxiv.org/pdf/1909.00900

Paper information

Authors: Chengzhi Mao, Ziyuan Zhong, Junfeng Yang, Carl Vondrick, Baishakhi Ray
Published: 2019-9-3
Updated: 2019-10-28
Affiliation: Columbia University
Country of affiliation: United States of America
Conference: Conference on Neural Information Processing Systems (NeurIPS)

Labels estimated by AI

Poisoning, Vulnerability of adversarial examples, Improvement of learning

* These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the Literature Database".

Abstract

Deep networks are well-known to be fragile to adversarial attacks. We conduct an empirical analysis of deep representations under the state-of-the-art attack method called PGD, and find that the attack causes the internal representation to shift closer to the "false" class. Motivated by this observation, we propose to regularize the representation space under attack with metric learning to produce more robust classifiers. By carefully sampling examples for metric learning, our learned representation not only increases robustness, but also detects previously unseen adversarial samples. Quantitative experiments show improvement of robustness accuracy by up to 4% and detection efficiency by up to 6% according to Area Under Curve score over prior work. The code of our work is available at https://github.com/columbia/Metric_Learning_Adversarial_Robustness.
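
The abstract describes regularizing the representation space under attack with metric learning, but it does not spell out the loss. As a rough illustration only, the sketch below assumes a standard hinge-style triplet loss in which the embedding of an adversarial example is the anchor, a clean example of the true class is the positive, and a clean example of the "false" class is the negative. The function names, the margin value, and the 2-D embeddings are all hypothetical and are not taken from the paper or its repository.

```python
import math

def l2_distance(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def triplet_loss(anchor, positive, negative, margin=1.0):
    """Hinge triplet loss: zero once the anchor is closer to the positive
    than to the negative by at least `margin` (margin is an assumption)."""
    gap = l2_distance(anchor, positive) - l2_distance(anchor, negative)
    return max(0.0, gap + margin)

# Hypothetical 2-D embeddings, for illustration only.
clean_true = [0.0, 0.0]    # clean example of the true class (positive)
clean_false = [4.0, 0.0]   # clean example of the "false" class (negative)
adv_mild = [1.0, 0.0]      # adversarial embedding still near the true class
adv_shifted = [3.0, 0.0]   # adversarial embedding shifted toward the false class

loss_mild = triplet_loss(adv_mild, clean_true, clean_false)
loss_shifted = triplet_loss(adv_shifted, clean_true, clean_false)

print(loss_mild, loss_shifted)
```

Under these assumptions, the loss is zero while the attacked representation stays near its true class and grows once it drifts toward the "false" class, which is the shift the abstract reports observing under PGD. In training, such a term would be added to the usual classification loss; how the paper weights and samples it is not stated in this entry.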

External datasets

MNIST

CIFAR-10

Tiny-ImageNet