Chapter 9 On-policy Prediction with Approximation

本文爲《Reinforcement Learning: An Introduction》讀書筆記 9.1 Value-function Approximation 9.2 The Prediction Objective ( VE¯¯¯¯¯¯¯¯ V E ¯ ) 9.3 Stochastic-gradient and Semi-gradient Methods 9.4 Linear Methods
相關文章
相關標籤/搜索