โกCrossQ
CrossQ: Batch Normalization in Deep RL
Algorithm: CrossQ
Basic Parameters
"agent": {
"name": str,
"algorithm": {
"name": "CrossQ",
"action_pdtype": str,
"action_policy": "default",
"gamma": float,
"training_frequency": int,
"training_iter": int,
"training_start_step": int,
},
"memory": {
"name": "Replay",
"batch_size": int,
"max_size": int
},
"net": {
"type": "TorchArcNet",
"arc": dict, // actor architecture (no BN needed)
"optim_spec": dict,
},
"critic_net": {
"type": "TorchArcNet",
"arc": dict, // critic architecture with LazyBatchRenorm1d
"optim_spec": dict,
}
}Critic Architecture: Batch Renormalization
Comparison with SAC
Feature
SAC
CrossQ
Last updated
Was this helpful?