Num_agent_steps less than num_env_steps

klausk55 · July 15, 2021, 1:44pm

I'm wondering how this can happen: num_agent_steps is sometimes less than num_(env_)steps.
E.g., see this snippet from the training result's output:

num_agent_steps_sampled: 127
num_agent_steps_trained: 127
num_steps_sampled: 128
num_steps_trained: 128

In my multi-agent use case I use the PolicyClient and PolicyServer classes, since my simulator is an external one. I have two policies and two agents, and the agents in my env interact sequentially, i.e. they don't act synchronously but always one at a time.
So I'm surprised that num_agent_steps is sometimes less than num_(env_)steps; I would expect them to be the same. In my understanding, one call to client.get_action means one step of the env, so why can this difference in the numbers occur?
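To make the two counters concrete, here is a toy sketch in plain Python of how I picture them. This is not RLlib's actual counting code: the `count_steps` helper, the agent IDs and the trajectory are all made up for illustration. It assumes an env step is one entry in the trajectory and an agent step is one per-agent record inside that entry. Under that assumption, a single env step with no agent record would be enough to produce a 127 vs. 128 style gap.

```python
from typing import Dict, List


def count_steps(steps: List[Dict[str, int]]) -> Dict[str, int]:
    """Count env steps and agent steps for a list of per-step action dicts.

    Hypothetical helper, not part of RLlib: each list entry is one env step,
    each key inside an entry is one agent that was recorded in that step.
    """
    env_steps = len(steps)
    agent_steps = sum(len(actions) for actions in steps)
    return {"env_steps": env_steps, "agent_steps": agent_steps}


# Made-up trajectory: two agents alternate, but the third env step
# carries no agent record at all (an assumption for illustration).
trajectory = [{"agent_0": 1}, {"agent_1": 0}, {}, {"agent_0": 2}]
counts = count_steps(trajectory)
print(counts)  # {'env_steps': 4, 'agent_steps': 3}
```

If every env step had exactly one acting agent, both numbers would match, which is what I expected for my setup.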
