Chapter 6 Advantage Actor-Critic
Page 142, Section 6.3 A2C Algorithm, Algorithm 6.1
Thanks to Jérémie Clair Coté for this correction.
Algorithm 6.1 A2C, lines 18 and 20:
each contain a typo. The second term on the right hand side of each equation should be subtracted not added since the loss is being minimized. They should read:
Last updated