Chapter 9 On-policy Prediction with Approximation

時間 2021-01-02

原文原文鏈接

本文爲《Reinforcement Learning: An Introduction》讀書筆記 9.1 Value-function Approximation 9.2 The Prediction Objective ( VE¯¯¯¯¯¯¯¯ V E ¯ ) 9.3 Stochastic-gradient and Semi-gradient Methods 9.4 Linear Methods