🎬 RL GIFs
Trained agents in action—watch SLM Lab's PPO and SAC algorithms play games and control robots.
slm-lab run slm_lab/spec/benchmark/ppo/ppo_cartpole.json ppo_cartpole enjoy@data/ppo_cartpole_2026_01_30_221924/ppo_cartpole_t0_spec.json

Atari (PPO)
MuJoCo (SAC)