Towards Verifying Robustness of Neural Networks Against Semantic Perturbations

Authors: Jeet Mohapatra, Tsui-Wei Weng, Pin-Yu Chen, Sijia Liu, Luca Daniel | Published: 2019-12-19 | Updated: 2020-06-15

Source: https://arxiv.org/abs/1912.09533

PDF: https://arxiv.org/pdf/1912.09533

Labels estimated by AI

Robustness evaluation, Adversarial learning, Deep learning

Note: These labels were added automatically by AI and may therefore be inaccurate.

Abstract

Verifying robustness of neural networks given a specified threat model is a fundamental yet challenging task. While current verification methods mainly focus on the ℓ_p-norm threat model of the input instances, robustness verification against semantic adversarial attacks inducing large ℓ_p-norm perturbations, such as color shifting and lighting adjustment, is beyond their capacity. To bridge this gap, we propose Semantify-NN, a model-agnostic and generic robustness verification approach against semantic perturbations for neural networks. Because our proposed semantic perturbation layers (SP-layers) are simply inserted at the input layer of any given model, Semantify-NN is model-agnostic, and any ℓ_p-norm-based verification tool can be used to verify the model's robustness against semantic perturbations. We illustrate the principles of designing the SP-layers and provide examples including semantic perturbations to image classification in the space of hue, saturation, lightness, brightness, contrast, and rotation, respectively. In addition, we propose an efficient refinement technique that significantly improves the semantic certificate. Experiments on various network architectures and different datasets demonstrate the superior verification performance of Semantify-NN over ℓ_p-norm-based verification frameworks that naively convert semantic perturbations to ℓ_p-norm perturbations. The results show that Semantify-NN can support robustness verification against a wide range of semantic perturbations. Code is available at https://github.com/JeetMo/Semantify-NN
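The SP-layer idea can be illustrated with a small sketch. This is not the Semantify-NN code. It is a hypothetical, standard-library-only Python example that assumes a brightness/contrast perturbation of the form clip((1 + c) * x + b, 0, 1) and a toy linear classifier. All function names, weights, and parameter values are made up for illustration. It shows how placing a perturbation layer in front of a model turns the model into a function of a few semantic parameters, so that a small interval in parameter space stands in for what would be a large ℓ_p change in pixel space.

```python
import math

# Hypothetical sketch only: names, weights, and the perturbation form
# are assumptions for illustration, not the authors' implementation.

def sp_layer_brightness_contrast(image, params):
    """Map semantic parameters (contrast c, brightness b) to a perturbed image.

    Each pixel x in [0, 1] becomes clip((1 + c) * x + b, 0, 1).
    """
    c, b = params
    return [min(1.0, max(0.0, (1.0 + c) * x + b)) for x in image]

def linear_classifier(image, weights, biases):
    """Toy stand-in for 'any given model': one linear layer producing logits."""
    return [sum(w * x for w, x in zip(row, image)) + bias
            for row, bias in zip(weights, biases)]

def wrap_with_sp_layer(image, weights, biases):
    """Prepend the SP-layer so the model's input is the semantic parameters."""
    def wrapped(params):
        perturbed = sp_layer_brightness_contrast(image, params)
        return linear_classifier(perturbed, weights, biases)
    return wrapped

# A 4-pixel "image" and a 2-class toy model (made-up values).
image = [0.2, 0.5, 0.8, 0.4]
weights = [[1.0, -1.0, 0.5, 0.0], [-0.5, 1.0, 0.0, 0.5]]
biases = [0.0, 0.1]

wrapped = wrap_with_sp_layer(image, weights, biases)
clean_logits = wrapped((0.0, 0.0))    # zero parameters leave the image unchanged
shifted_logits = wrapped((0.0, 0.3))  # brightness shift of 0.3

# Size of the same perturbation measured in pixel space.
perturbed = sp_layer_brightness_contrast(image, (0.0, 0.3))
diffs = [p - x for p, x in zip(perturbed, image)]
linf_pixel = max(abs(d) for d in diffs)
l2_pixel = math.sqrt(sum(d * d for d in diffs))
```

In this sketch, a brightness shift described by the single number 0.3 moves almost every pixel by 0.3, so its pixel-space ℓ_2 norm grows with the number of pixels. An ℓ_p-norm verifier applied directly to the pixels would have to certify a ball of that radius, which also contains many images that are not brightness shifts at all. Applied to the wrapped model, the same verifier only needs to cover an interval over (c, b). The sketch does not perform any verification itself. It only shows the reparameterization.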