nvidia-ml-py3
.data/ppo_pong_{ts}. The trial graphs should look like the following:

CUDA_OFFSET=4 for example. Say a machine has 8 GPUs and we are running 2 trials of 4 sessions each; we'd want to utilize all the GPUs evenly. Suppose we are running PPO on Pong and PPO on QBert. Then we can do the following:
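To make the arithmetic concrete, here is a minimal sketch of how an offset could spread the sessions of two trials across 8 GPUs. The helper `assign_gpus` and the round-robin rule `(offset + session_index) % num_gpus` are assumptions for illustration only, not the lab's confirmed implementation.

```python
def assign_gpus(num_gpus, sessions_per_trial, cuda_offset=0):
    """Hypothetical helper: map each session index of a trial to a GPU ID.

    Assumes a simple round-robin rule starting at `cuda_offset`;
    the actual assignment scheme may differ.
    """
    return [(cuda_offset + s) % num_gpus for s in range(sessions_per_trial)]


# First trial (PPO on Pong): no offset, sessions start at GPU 0.
pong_gpus = assign_gpus(num_gpus=8, sessions_per_trial=4, cuda_offset=0)

# Second trial (PPO on QBert): offset of 4, sessions start at GPU 4.
qbert_gpus = assign_gpus(num_gpus=8, sessions_per_trial=4, cuda_offset=4)

print(pong_gpus)   # [0, 1, 2, 3]
print(qbert_gpus)  # [4, 5, 6, 7]
```

Under this assumption, an offset equal to the number of sessions per trial places the second trial on the GPUs the first trial leaves idle, so all 8 GPUs carry exactly one session each.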