Solving multiple trials with tune.grid_search()

sai_lalith_Polawar · March 3, 2022, 1:49pm

I am working with ray.rllib and trying to run PPO on “CartPole-v0” env for different parameter combinations using tune.grid_search() as shown below in the code . But this is creating different trails for different parameter combinations initially and then running all of them in parallel. Is there any solution for running trials one after the other? Like in detail, first trial should completely run until the stop criteria is fulfilled and then it should start to run the other trial. Please, can anyone help me out with this?

target = 8000
analysis = tune.run(
    "PPO",
    stop={"timesteps_total": target},
    mode="max",
    config={
        "env": 'CartPole-v0',
        "num_workers": 1, 
        "num_gpus": 0,
        "lr": 1e-4,
        "gamma": tune.grid_search([0.95, 0.97]),
        "entropy_coeff": tune.grid_search([0.01, 0.1]),
    },
    max_concurrent_trials=0,
)
print("best hyperparameters: ", analysis.best_config)

mannyv · March 3, 2022, 6:24pm

Hi @sai_lalith_Polawar

Initialize ray with fewer cpus.

mannyv · March 4, 2022, 11:41am

@stwerner,

Setting num_workers to zero would have the opposite effect. More would run because each would use fewer cpu resources.

One approach that would work is to initialize ray with ray.init(num_cpus=SMALL_NUMBER_HERE)

sai_lalith_Polawar · March 4, 2022, 2:13pm

Ok, Thank you for the answer.
I have tried both ways
Case 1: ray_init(num_cpus =1) and num_workers: 0
Case 2: ray_init(num_cpus=1) and num_workers:1

“Case 1” was perfect and smoothly running the way I wanted but in “Case 2” all the trials were under PENDING status for long time and not at all the status was changing to RUNNING.

mannyv · March 4, 2022, 2:26pm

@sai_lalith_Polawar

The issue with the second one is that you provided fewer cpus than required. Usually the requirement is num_workers+1. Basically what you need to balance is providing enough cpus for the number of workers but not so many that you can support more than 1 trial.

Realize, that if you set num_workers to <=1 then you will not have any parallelism when sampling new experiences.

Topic		Replies	Views
Errors in Hyperparameter tuning of PPO with Bayesian Optimization (ray.tune) RLlib	2	833	March 18, 2022
How to set #of cpu and gpu per trial? RLlib	1	985	November 6, 2021
A little help for a novice RLlib	1	433	October 26, 2022
Stopping condition in Tune confusion RLlib	1	526	March 24, 2022
Tune.run() doesn't work. runs endlessly Ray Tune stopping condition & comparisons	1	543	November 2, 2023

Solving multiple trials with tune.grid_search()

Related topics