[RLlib] Ray RLlib config parameters for PPO

klausk55 · April 28, 2021, 9:00am

@sven1977 Again, thanks for your explanations!

Does this mean that a smaller last minibatch, if there is one, will be dropped and never used for an SGD update?
If so, it would always be advisable to choose train_batch_size as a multiple of sgd_minibatch_size.
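
To make the question concrete, here is a small arithmetic sketch. It does not call RLlib and does not claim to show what RLlib actually does with the remainder; it only computes how many full minibatches fit into a train batch and how many samples would be left over under the assumption being asked about. The helper name `split_minibatches` and the example values 4000, 4096 and 128 are hypothetical.

```python
def split_minibatches(train_batch_size: int, sgd_minibatch_size: int) -> tuple[int, int]:
    """Return (number of full minibatches, leftover samples).

    Pure arithmetic for illustration; this is not RLlib code.
    """
    if sgd_minibatch_size <= 0:
        raise ValueError("sgd_minibatch_size must be positive")
    return divmod(train_batch_size, sgd_minibatch_size)


# Not a multiple: 31 full minibatches, 32 samples left over.
print(split_minibatches(4000, 128))  # (31, 32)

# A multiple: 32 full minibatches, nothing left over.
print(split_minibatches(4096, 128))  # (32, 0)
```

If the leftover really is skipped, the first setting would leave 32 of 4000 samples unused in each pass, while the second would use every sample.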
