Reinforcement Learning Exercise 4.1

Example 4.1 Consider the 4 × 4 4 \times 4 4×4 gridworld shown below. The nonterminal states are S = { 1 , 2 , . . . , 14 } \mathcal S = \{1, 2, . . . , 14\} S={1,2,...,14}. There are four actions poss
相關文章
相關標籤/搜索