Deep Reinforcement Learning with Double Q-Learning

Van Hasselt, Hado; Guez, Arthur; Silver, David

doi:10.1609/aaai.v30i1.10295

Public

Deep Reinforcement Learning with Double Q-Learning

Shared by NobleBlocks on Mar 2, 2016 • 12:00 AM UTC

Authors:

Hado Van Hasselt

Arthur Guez

David Silver

Abstract

The popular Q-learning algorithm is known to overestimate action values under certain conditions. It was not previously known whether, in practice, such overestimations are common, whether they harm performance, and whether they can generally be prevented. In this paper, we answer all these question...

Research Assistant

AI chat, annotations, notes & similar papers

Finding related papers...

Discussions

(0)

No comments yet

Be the first to share your thoughts!