Evaluation during training without local env_runner

How severely does this issue affect your experience of using Ray?

  • Medium: It contributes to significant difficulty in completing my task, but I can work around it.

I have an unusual constraint where multiple instances of my training environment can't exist in the same process. Fortunately, during training I can simply make sure that each env_runner only runs one environment, so this is never a problem; a rough sketch of that setup is below.
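For context, here's roughly what our training-only config looks like (PPO and the env id are just placeholders for our actual algorithm and environment):

```python
from ray.rllib.algorithms.ppo import PPOConfig

# Rough sketch of our training setup (PPO and the env id are placeholders).
# The key point: every env_runner is its own remote worker process and
# hosts exactly one copy of the environment.
config = (
    PPOConfig()
    .environment("MyCustomEnv-v0")       # placeholder for our custom env
    .env_runners(
        num_env_runners=4,               # remote workers, one process each
        num_envs_per_env_runner=1,       # never more than one env per process
    )
)
```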

Recently, however, we've been trying to add evaluation to our training. By default the Algorithm creates two env_runner_groups (one for training, one for evaluation), and both of them create a local env_runner. Those two local env_runners then instantiate our env in the same process, and everything crashes.
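The evaluation settings we're adding look roughly like this (parameter names as I understand the current AlgorithmConfig API); as described above, enabling them gives us a second env_runner_group whose local env_runner builds the env in the same process as the training group's local env_runner:

```python
# Evaluation settings we're adding (names as I understand the
# AlgorithmConfig API). Turning this on makes the Algorithm build a second
# EnvRunnerGroup for evaluation; both groups' local env_runners then
# instantiate the env in the driver process, which is where we crash.
config = config.evaluation(
    evaluation_interval=1,               # evaluate every training iteration
    evaluation_num_env_runners=1,        # remote evaluation workers
    evaluation_duration=10,
    evaluation_duration_unit="episodes",
)
```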

I was hoping to get some advice on how to solve this. I think I could create a custom Algorithm class that avoids creating the local env_runners, but maybe there's a config option I'm missing? A sketch of the kind of setting I have in mind is below.
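For concreteness, the kind of knob I'm hoping exists would look something like this; `create_env_on_local_worker` is the closest setting I've found in `AlgorithmConfig.env_runners()`, but I'm not sure whether it also applies to the evaluation group's local env_runner:

```python
# The kind of switch I'm hoping exists. create_env_on_local_worker is the
# closest thing I've found in AlgorithmConfig.env_runners(), but I don't
# know whether it also keeps the evaluation group's local env_runner from
# building an env in the driver process.
config = config.env_runners(
    create_env_on_local_worker=False,    # don't build an env in the driver?
)
```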