David Silver RL課程第2課（Markov decision processes)

時間 2021-01-12

原文原文鏈接

1.Markov decision processes formally describe an environment for reinforcement learning Where the environment is fully observable The current state completely characterises the process Almost all RL p