该论文的页面为: http://www.cs.rhul.ac.uk/~chrisw/thesis.html 下载地址为: http://www.cs.rhul.ac.uk/~chrisw/new_thesis.pdf 论文页面对这篇文章的描述: The thesis introduces the notion of reinforcement learning as learning to control a Markov Decision Process...
下载地址为:http://www.cs.rhul.ac.uk/~chrisw/new_thesis.pdf 论文页面对这篇文章的描述: The thesis introduces the notion of reinforcement learning as learning to control a Markov Decision Process by incremental dynamic programming, and describes a range of algorithms for doing this, including Q-...