Improving Adversarial Robustness via Unlabeled Out-of-Domain Data

Authors: Zhun Deng, Linjun Zhang, Amirata Ghorbani, James Zou | Published: 2020-06-15 | Updated: 2021-02-21

2020.06.152025.05.28

Authors: Zhun Deng, Linjun Zhang, Amirata Ghorbani, James Zou
Published: 2020-06-15 | Updated: 2021-02-21

Source: https://arxiv.org/abs/2006.08476

PDF: https://arxiv.org/pdf/2006.08476

Labels Predicted by AI

Semi-Supervised Learning Adversarial Learning Statistical Methods

Please note that these labels were automatically added by AI. Therefore, they may not be entirely accurate.
For more details, please see the About the Literature Database page.

Abstract

Data augmentation by incorporating cheap unlabeled data from multiple domains is a powerful way to improve prediction especially when there is limited labeled data. In this work, we investigate how adversarial robustness can be enhanced by leveraging out-of-domain unlabeled data. We demonstrate that for broad classes of distributions and classifiers, there exists a sample complexity gap between standard and robust classification. We quantify to what degree this gap can be bridged via leveraging unlabeled samples from a shifted domain by providing both upper and lower bounds. Moreover, we show settings where we achieve better adversarial robustness when the unlabeled data come from a shifted domain rather than the same domain as the labeled data. We also investigate how to leverage out-of-domain data when some structural information, such as sparsity, is shared between labeled and unlabeled domains. Experimentally, we augment two object recognition datasets (CIFAR-10 and SVHN) with easy to obtain and unlabeled out-of-domain data and demonstrate substantial improvement in the model’s robustness against ℓ_∞ adversarial attacks on the original domain.