๐Env Spec
The Env Spec
{
"spec_name": {
"agent": {...},
"env": {
// Environment name (must be in gymnasium registry)
"name": str,
// Number of parallel environment instances
"num_envs": int,
// Maximum timesteps per episode (null = use environment default)
"max_t": int|null,
// Total training frames
"max_frame": int,
// Optional: Online state normalization (recommended for MuJoCo)
"normalize_obs": bool,
// Optional: Online reward normalization (recommended for MuJoCo)
"normalize_reward": bool,
// Optional: Clip observations to [-bound, bound] (default: 10.0 if normalize_obs)
"clip_obs": float,
// Optional: Clip rewards to [-bound, bound] (default: 10.0 if normalize_reward)
"clip_reward": float
},
...
}
}Supported Environments
Category
Examples
Difficulty
Docs
Environment-Specific Settings
Example: PPO on HalfCheetah
Parameter
Value
Why
Running PPO on HalfCheetah
Results


Using Other Environments
Environment Categories
Category
Environments
Spec Examples
Switching Environments
Template Specs with Variable Substitution
Finding Environment Specs
Standard Settings for Fair Comparison
Recommended Settings by Category
Category
num_envs
max_frame
log_frequency
Notes
What to Keep Consistent
Parameter
Keep Same?
Why
Example: Fair Algorithm Comparison
Advanced Env Options
Environment Kwargs
Normalization Details
Option
What It Does
When to Use
Last updated
Was this helpful?