Delving into adversarial attacks on deep policies

TOP Literature Database Delving into adversarial attacks on deep policies

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/1705.06452

PDF

https://arxiv.org/pdf/1705.06452

Paper Information

Author: Jernej Kos,Dawn Song
Published: 5-18-2017
Affiliation: National University of Singapore
Country: Singapore
Conference: International Conference on Learning Representations (ICLR)

Labels Estimated by AI

Robustness Certified Robustness Defense Method

These labels were automatically added by AI and may be inaccurate.
For details, see About Literature Database.

Abstract

Adversarial examples have been shown to exist for a variety of deep learning architectures. Deep reinforcement learning has shown promising results on training agent policies directly on raw inputs such as image pixels. In this paper we present a novel study into adversarial attacks on deep reinforcement learning polices. We compare the effectiveness of the attacks using adversarial examples vs. random noise. We present a novel method for reducing the number of times adversarial examples need to be injected for a successful attack, based on the value function. We further explore how re-training on random noise and FGSM perturbations affects the resilience against adversarial examples.

External Datasets

Atari Pong task