RuntimeWarning: Mean of empty slice with TensorFlow multi-agent PPO

hridayns · July 2, 2021, 10:30am

Hello, I have been struggling with this for days, so I would really appreciate it if someone could help me figure this out! I am not too familiar with the interface here yet, so I have posted the full question on Stack Overflow: numpy - RLlib PPO reward flat-lines with RuntimeWarning: Mean of empty slice - Stack Overflow

Would appreciate any guidance!
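
For anyone landing here from a search: the warning text itself comes from NumPy, which emits it when a mean is taken over an empty array and returns NaN. The snippet below is only a minimal sketch of that behaviour in isolation. It does not use RLlib, and the idea that the empty array is a list of finished-episode rewards is an assumption, not something confirmed in this thread. `safe_mean` is a hypothetical helper written for illustration.

```python
import warnings

import numpy as np


def safe_mean(values):
    """Hypothetical helper: mean of `values`, or NaN (without a warning) if empty."""
    arr = np.asarray(values, dtype=float)
    return float(arr.mean()) if arr.size else float("nan")


# Assumption for illustration: no episode finished, so the reward list is empty.
episode_rewards = []

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    result = np.mean(episode_rewards)  # triggers "Mean of empty slice"

messages = [str(w.message) for w in caught]
print(messages)
print(result, safe_mean(episode_rewards), safe_mean([1.0, 3.0]))
```

If something like this turns out to be the cause, checking how many episodes actually complete per training iteration would be a reasonable first thing to look at.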
