PrioritizedReplay
PrioritizedReplay
Code: slm_lab/agent/memory/prioritized.py
Prioritized Experience Replay (PER) extends from Replay by calculating prioritization for sampling experiences based on errors in Q-values estimation.
Suitable for off-policy algorithms.
Source Documentation
Refer to the class documentation and example memory spec from the source: slm_lab/agent/memory/prioritized.py#L87-L104
Example Memory Spec
This specification creates a PrioritizedReplay (off-policy) memory with a maximum capacity of 10,000 elements, with a batch size of 32, and CER is disabled. The alpha
and epsilon
parameters are specific to PER in computing the errors.
For more concrete examples of memory spec specific to algorithms, refer to the existing spec files.
Last updated