Understanding the Limits of Poisoning Attacks in Episodic Reinforcement Learning

TOP 文献データベース Understanding the Limits of Poisoning Attacks in Episodic Reinforcement Learning

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2208.13663

PDF

https://arxiv.org/pdf/2208.13663

文献情報

作者: Anshuka Rangi;Haifeng Xu;Long Tran-Thanh;Massimo Franceschetti
公開日: 2022-8-30
所属機関: University of California San Diego
所属の国: United States of America
会議名: International Joint Conference on Artificial Intelligence (IJCAI)

AIにより推定されたラベル

報酬メカニズム設計サイバー攻撃最適化問題

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

To understand the security threats to reinforcement learning (RL) algorithms, this paper studies poisoning attacks to manipulate \emph{any} order-optimal learning algorithm towards a targeted policy in episodic RL and examines the potential damage of two natural types of poisoning attacks, i.e., the manipulation of \emph{reward} and \emph{action}. We discover that the effect of attacks crucially depend on whether the rewards are bounded or unbounded. In bounded reward settings, we show that only reward manipulation or only action manipulation cannot guarantee a successful attack. However, by combining reward and action manipulation, the adversary can manipulate any order-optimal learning algorithm to follow any targeted policy with $\tilde{\Theta}(\sqrt{T})$ total attack cost, which is order-optimal, without any knowledge of the underlying MDP. In contrast, in unbounded reward settings, we show that reward manipulation attacks are sufficient for an adversary to successfully manipulate any order-optimal learning algorithm to follow any targeted policy using $\tilde{O}(\sqrt{T})$ amount of contamination. Our results reveal useful insights about what can or cannot be achieved by poisoning attacks, and are set to spur more works on the design of robust RL algorithms.