Chapter 3 SARSA
Last updated
Last updated
Page 60, the first square bullet:
contains a typo in the index of the state. It should read:
Page 62, Figure 3.3, Episode 4, Time step 5, the target value calculation:
contains a typo, the second + should have been x. It should read:
Algorithm 3.1 SARSA, line 13:
Contains a typo; the loss function J should have been L: