SLM Lab
v4.2.3
Search
⌃
K
SLM Lab
v4.2.3
Search
⌃
K
SLM Lab
🖥Setup
Installation
Quick Start
🚀Using SLM Lab
Lab Command
Lab Organization
Train: REINFORCE CartPole
Resume and Enjoy: REINFORCE CartPole
Agent Spec: DDQN+PER on LunarLander
Env Spec: A2C on Pong
GPU Usage: PPO on Pong
Parallelizing Training: Async SAC on Humanoid
Experiment and Search Spec: PPO on Breakout
Run Benchmark: A2C on Atari Games
Meta Spec: High Level Specifications
Post-Hoc Analysis
TensorBoard: Visualizing Models and Actions
Using SLM Lab In Your Project
📈Analyzing Results
Data Locations
Graphs and Data
Performance Metrics
🥇Benchmark Results
Public Benchmark Data
Discrete Environment Benchmark
Continuous Environment Benchmark
Atari Environment Benchmark
RL GIFs
🔧Development
Modular Design
Algorithm
Memory
Net
Profiling SLM Lab
📖Publications and Talks
Book: Foundations of Deep Reinforcement Learning
Talks and Presentations
🤓Resources
Deep RL Resources
Contributing
Motivation
Help
Contact
Powered By
GitBook
Deep RL Resources
📌
Master List of Deep RL Resources
GitHub - kengz/awesome-deep-rl: A curated list of awesome Deep Reinforcement Learning resources.
GitHub
📖
Books
Graesser and Keng,
Foundations of Deep Reinforcement Learning
Sutton and Barto,
Reinforcement Learning: An Introduction
Francois-Lavet et. al.,
An Introduction to Deep Reinforcement Learning
🏛
Introductory Tutorials / Courses
Andrew Karpathy Deep Reinforcement Learning: Pong from Pixels
Arthur Juliani Simple Reinforcement Learning in Tensorflow Series
David Silver UCL Course on RL 2015
Deep RL Bootcamp 2017
DeepMind UCL Deep RL Course 2018
dennybritz/reinforcement-learning
higgsfield/RL-Adventure-2
higgsfield/RL-Adventure
MorvanZhou/Reinforcement Learning Methods and Tutorials
OpenAI Spinning Up
Sergey Levine CS294 Deep Reinforcement Learning Fall 2017
🔬
Papers
DQN
Double DQN
(DDQN)
Dueling DQN
Prioritized Experience Replay
(PER)
Combined Experience Replay
(CER)
Hindsight Experience Replay
(HER)
QT-Opt
Asynchronous Advantage Actor Critic
(A3C)
Generalized Advantage Estimation
(GAE)
Proximal Policy Optimization
(PPO)
Self Imitation Learning
(SIL)
Soft Actor-Critic
(SAC)
📖Publications and Talks - Previous
Talks and Presentations
Next - 🤓Resources
Contributing
Last modified
1yr ago