Conservation efforts in green security domains to protect wildlife and
forests are constrained by the limited availability of defenders (i.e.,
patrollers), who must patrol vast areas to protect them from attackers (e.g.,
poachers or illegal loggers). Defenders must choose how much time to spend in
each region of the protected area, balancing exploration of infrequently
visited regions and exploitation of known hotspots. We formulate the problem as
a stochastic multi-armed bandit, where each action represents a patrol
strategy, enabling us to guarantee the rate of convergence of the patrolling
policy. However, a naive bandit approach would compromise short-term
performance for long-term optimality, resulting in animals poached and forests
destroyed. To accelerate learning, we leverage smoothness in the reward
function and decomposability of actions. We show a synergy between
Lipschitz continuity and decomposition, as each aids the convergence of the
other. In doing so, we bridge the gap between combinatorial and Lipschitz
bandits, presenting a no-regret approach that tightens existing guarantees
while optimizing for short-term performance. We demonstrate that our algorithm,
LIZARD, improves performance on real-world poaching data from Cambodia.
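The abstract compresses the core mechanism, so a concrete illustration may help. The sketch below is not the paper's LIZARD algorithm; it is a minimal Python simulation of the two ingredients the abstract names: decomposability (per-region rewards learned separately and combined by a knapsack-style allocation of patrol effort) and Lipschitz continuity (an observation at one effort level tightens confidence bounds at nearby levels). Every constant and name in it (N_REGIONS, BUDGET, LIPSCHITZ_L, the effort discretization, the UCB form, the noise model) is an assumption made for the sketch, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical problem sizes; all constants here are assumptions for the sketch.
N_REGIONS = 5      # patrol regions
N_LEVELS = 6       # discretized effort levels per region: 0..5 units
BUDGET = 10        # total effort units available per round
LIPSCHITZ_L = 0.3  # assumed Lipschitz constant of mean reward in effort
OPTIMISTIC = 10.0  # loose upper bound on any mean reward (optimistic init)
HORIZON = 2000

# Synthetic ground truth (simulation only): mean reward mu[i, e] for region i
# at effort e, built from bounded increments so it is Lipschitz in e.
true_mu = np.cumsum(rng.uniform(0, LIPSCHITZ_L, (N_REGIONS, N_LEVELS)), axis=1)

counts = np.zeros((N_REGIONS, N_LEVELS))
means = np.zeros((N_REGIONS, N_LEVELS))

def lipschitz_ucb(t):
    """Per-(region, effort) UCBs, tightened across effort levels: an observation
    at effort e' caps the bound at e by ucb(e') + L * |e - e'|."""
    bonus = np.sqrt(2.0 * np.log(max(t, 2)) / np.maximum(counts, 1))
    raw = np.where(counts > 0, means + bonus, OPTIMISTIC)
    levels = np.arange(N_LEVELS)
    gaps = LIPSCHITZ_L * np.abs(levels[:, None] - levels[None, :])  # L|e - e'|
    return np.min(raw[:, None, :] + gaps[None, :, :], axis=2)

def allocate(values):
    """Knapsack-style DP exploiting decomposability: pick one effort level per
    region to maximize the sum of per-region values under the shared budget."""
    dp = np.full((N_REGIONS + 1, BUDGET + 1), -np.inf)
    dp[0, 0] = 0.0
    pick = np.zeros((N_REGIONS, BUDGET + 1), dtype=int)
    for i in range(N_REGIONS):
        for b in range(BUDGET + 1):
            for e in range(min(N_LEVELS, b + 1)):
                val = dp[i, b - e] + values[i, e]
                if val > dp[i + 1, b]:
                    dp[i + 1, b] = val
                    pick[i, b] = e
    b = int(np.argmax(dp[N_REGIONS]))       # best achievable budget use
    alloc = np.zeros(N_REGIONS, dtype=int)
    for i in range(N_REGIONS - 1, -1, -1):  # backtrack the chosen levels
        alloc[i] = pick[i, b]
        b -= alloc[i]
    return alloc

total = 0.0
for t in range(1, HORIZON + 1):
    alloc = allocate(lipschitz_ucb(t))
    for i, e in enumerate(alloc):
        # Decomposability assumption: a separate noisy reward per region.
        r = true_mu[i, e] + rng.normal(0.0, 0.1)
        counts[i, e] += 1
        means[i, e] += (r - means[i, e]) / counts[i, e]
        total += r

opt = true_mu[np.arange(N_REGIONS), allocate(true_mu)].sum()
print(f"average per-round reward: {total / HORIZON:.3f} (optimal: {opt:.3f})")
```

In this toy setting, a single observation at one effort level informs a whole neighborhood of effort levels in that region, while decomposition lets each region's estimates improve independently; this is exactly the exploration savings the abstract attributes to combining the two structures.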