Evolution strategies - make reproducible

Fot · June 28, 2021, 12:22pm

Hi all!

I am running some experiments with the Evolution Strategies algorithm in RLlib, and I need to make the results reproducible. However, setting the “seed” in the configuration file, as suggested here (Reproducible training - setting seeds for all workers / environments), has no effect. Do you know how I can get identical runs?

The configuration file that I use is:

humanoid-v2-es:
    env: Humanoid-v2
    run: ES
    stop:
        episode_reward_mean: 200
    config:
        # Works for both torch and tf.
        framework: tf
        num_workers: 2
        seed: 42
        train_batch_size: 10000
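
For anyone launching from Python instead of a YAML file, the same settings can be written as plain data. This is only a sketch: `experiment` and `with_seed` are hypothetical names used for illustration, not part of RLlib, and the block deliberately does not import Ray.

```python
# Same settings as the YAML config, expressed as a plain Python dict.
experiment = {
    "env": "Humanoid-v2",
    "run": "ES",
    "stop": {"episode_reward_mean": 200},
    "config": {
        "framework": "tf",
        "num_workers": 2,
        "seed": 42,
        "train_batch_size": 10000,
    },
}


def with_seed(exp, seed):
    """Return a copy of the experiment with another seed (hypothetical helper)."""
    new_exp = dict(exp)
    # Copy the nested config so the original experiment is left untouched.
    new_exp["config"] = dict(exp["config"], seed=seed)
    return new_exp


# Two launches meant to be identical should carry exactly the same config.
run_a = with_seed(experiment, 42)
run_b = with_seed(experiment, 42)
print(run_a["config"] == run_b["config"])
```

With Ray 1.x, a dict like this would typically be handed to `ray.tune.run`, with the `env` entry merged into the `config` part; treat that step as an assumption about your setup.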

Thanks!

sven1977 · July 14, 2021, 3:29pm

Hey @Fot, ah, I think this makes sense. RLlib does all the seed handling in the RolloutWorkers, which ES and ARS don’t use (one more good reason to move these two over to standard Policies/RolloutWorkers/etc.).
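
To illustrate why the seed has to reach the code that draws the perturbation noise, here is a toy evolution-strategies loop in plain Python. It is a minimal sketch and not RLlib's implementation: `run_toy_es`, the quadratic objective, and the `seed + worker_index` scheme are all assumptions made for illustration.

```python
import random


def run_toy_es(seed, num_workers=2, iterations=20, sigma=0.1, lr=0.05):
    """Toy ES on f(x) = -(x - 3)^2; returns the parameter after each iteration."""
    # Assumed scheme: each worker owns an RNG seeded from the base seed plus its index.
    worker_rngs = [random.Random(seed + i) for i in range(num_workers)]
    theta = 0.0
    history = []
    for _ in range(iterations):
        grad = 0.0
        for rng in worker_rngs:
            eps = rng.gauss(0.0, 1.0)  # perturbation noise drawn by this worker
            # Antithetic pair: evaluate the objective at theta +/- sigma * eps.
            r_pos = -((theta + sigma * eps) - 3.0) ** 2
            r_neg = -((theta - sigma * eps) - 3.0) ** 2
            grad += (r_pos - r_neg) * eps
        # Average the finite-difference gradient estimate over all workers.
        theta += lr * grad / (2 * sigma * num_workers)
        history.append(theta)
    return history


# Same seed: the two runs match step for step.
print(run_toy_es(42) == run_toy_es(42))
# Different seed: the workers draw different noise, so the trajectories diverge.
print(run_toy_es(42) == run_toy_es(43))
```

If the worker RNGs were created without the seed, as effectively happens when seeding lives in a component the algorithm never instantiates, the first comparison would no longer hold.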

There is a PR here that fixes the issue:

github.com/ray-project/ray

[RLlib] Fix seeding for ES and ARS.

ray-project:master ← sven1977:fix_seeding_for_es_and_ars

opened 04:54PM - 29 Jun 21 UTC

sven1977

+83 -1

Fix `seed`-in-config-not-respected issue for ES and ARS. Also see this discussion here: https://discuss.ray.io/t/evolution-strategies-make-reproducible/2684

## Why are these changes needed?

## Related issue number

## Checks

- [x] I've run `scripts/format.sh` to lint the changes in this PR.
- [x] I've included any doc changes needed for https://docs.ray.io/en/master/.
- [x] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - [x] Unit tests
  - [ ] Release tests
  - [ ] This PR is not tested :(
