Reinforcement Learning Exercise 4.1

Date: 2020-12-24

Example 4.1 Consider the $4 \times 4$ gridworld shown below. The nonterminal states are $\mathcal S = \{1, 2, \dots, 14\}$. There are four actions poss