i am currently coupling ray with a simulation software, that uses a licence for each worker. I noticed, that rllib is not using a remote worker for the evaluation, but generates another worker/environment.
This means I am using an additional licence when doing evaluation, which can lead to failure of the worker and thus the training stops.
Is it possible to use the remote worker for evaluation?
You are getting two workers because of your config options. The num_workers :1 will create one worker with an environment in it. The create_env_on_driver: True. Will create a second environment on the driver.You can set the former to 0 or the latter to False.
Thanks for the response. Ok, so then there is no way to use for example 16 workers for the training and do the evaluation on them. We will always need an additional “worker”, whether it is the driver or a separate eval worker. Is that right?
1) Set `evaluation_interval` >= 0 to force creating a separate evaluation worker set.
2) Set `create_env_on_driver=True` to force the local (non-eval) worker to have an environment to evaluate on.
Rllib always creates a seperate worker_sets for the environments that collect experience for training and the environments that perform evaluation between training. You can have as many or as few evaluation workers as you want but they will each require a seperate license in addition to the workers used for training.