RLlib server-client mode slows down when the actual number of clients is less than the configured num_workers

tiankaidong · September 7, 2023, 1:57pm

I just found that in server-client mode, the number of connected client nodes must equal the num_workers value given when the server starts up. Otherwise, synchronous_parallel_sample() in rollout_ops.py takes much longer.
I don't think this is reasonable, and it should be optimized.
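
One possible explanation, sketched as a toy simulation rather than actual RLlib code: a synchronous sampling step has to wait for every worker, so a worker with no client attached can hold up the whole round until it gives up. The names below (`sample_from_worker`, `synchronous_round`, `idle_timeout`, `sample_time`) are hypothetical and only illustrate the pattern; whether RLlib behaves exactly this way is an assumption.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def sample_from_worker(has_client, idle_timeout, sample_time):
    """Hypothetical worker: returns a batch quickly if a client feeds it,
    otherwise waits for the idle timeout and returns nothing."""
    if has_client:
        time.sleep(sample_time)   # client delivers data promptly
        return "batch"
    time.sleep(idle_timeout)      # no client: block until the timeout expires
    return None


def synchronous_round(num_workers, num_clients, idle_timeout=0.3, sample_time=0.02):
    """Run one synchronous sampling round and return (elapsed_seconds, num_batches).

    The round finishes only when ALL workers have returned, so a single
    idle worker sets the duration of the whole round.
    """
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        start = time.perf_counter()
        futures = [
            pool.submit(sample_from_worker, i < num_clients, idle_timeout, sample_time)
            for i in range(num_workers)
        ]
        results = [f.result() for f in futures]  # blocks on the slowest worker
        elapsed = time.perf_counter() - start
    batches = [r for r in results if r is not None]
    return elapsed, len(batches)


if __name__ == "__main__":
    print("4 workers, 4 clients:", synchronous_round(4, 4))
    print("4 workers, 2 clients:", synchronous_round(4, 2))
```

In this sketch the round with only 2 of 4 clients connected takes roughly the idle timeout instead of the sampling time, which matches the kind of slowdown described above. If that is the cause, setting num_workers to the real client count would avoid it.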
