Recent years have witnessed significant progress in deep Reinforcement
Learning (RL). Empowered by large-scale neural networks, carefully designed
architectures, novel training algorithms, and massively parallel computing
devices, researchers are able to attack many challenging RL problems. However,
in machine learning, greater training power comes with an increased risk of
overfitting. As deep RL techniques are applied to critical domains such
as healthcare and finance, it is important to understand the generalization
behavior of the trained agents. In this paper, we conduct a systematic study
of standard RL agents and find that they can overfit in various ways.
Moreover, overfitting can occur "robustly": commonly used techniques in RL
that add stochasticity do not necessarily prevent or detect overfitting. In
particular, the same agents and learning algorithms can exhibit drastically
different test performance, even when all of them achieve optimal rewards
during training. These observations call for more principled and careful
evaluation protocols in RL. We conclude with a general discussion of
overfitting in RL and a study of generalization behavior from the
perspective of inductive bias.