Machine Learning (8): Reinforcement Learning

Reinforcement learning; Problem abstraction; The Markov process; The Markov property; The policy; Value function; An example of a value function; Bellman’s expectation equation; Optimal policy; Bellman’