Num_env & agent_steps_trained 0 even though steps sampled?

wcthibault · August 17, 2023, 6:27pm

I have been experiencing a similar issue with off policy algorithms like DDPG and SAC when using replay buffers with storage units set to episodes. I made a post about it here: Replay buffer with episodes as storage unit not training

Topic		Replies	Views
Num_agent_steps_trained: 0 Configure Algorithm, Training, Evaluation, Scaling	2	266	May 4, 2024
Is there a way to set num_env_steps_sampled? RLlib	1	547	June 23, 2023
MultiAgent training Issues RLlib	1	578	April 9, 2024
Unable to replicate original PPO performance RLlib	0	211	May 10, 2024
Algo.train() calls env.step() with empty action object RLlib	1	246	December 21, 2023

Num_env & agent_steps_trained 0 even though steps sampled?

Related topics