Reinforcement Learning: value function approximation

時間 2020-12-24

標籤強化學習 value UCL 简体版

原文原文鏈接

introduction incremental methods增量法 state value function with prediction approximation action value function with control approximation batch methods批處理 introduction 上一節講到使用採樣的方法進行，狀態和action space都比較小

>>阅读原文<<