Perhaps this post will be useful:
mannyv