I am new to RLlib, so I may be missing something obvious. I am doing offline RL with a dataset read purely from JSON files. Is there a more concise way to specify training over the entire dataset for x epochs? At the moment I have to count the number of transitions in the dataset and multiply by the number of epochs, i.e.:
```python
for i in range(0, df_len * epochs):
    eval_res = algo.train().get("evaluation")
```
Since train() does not take any such arguments, you can't specify the number of epochs that way.
If you don't want to manage the epochs yourself, you can use Ray Tune for this; that is the recommended way.
Tune is not exclusively meant for hyperparameter tuning; it also manages the resources of experiments and manages training runs much like you are doing by hand.
You can make it stop on a number of iterations, a reward threshold, or frankly any metric that algo.step() returns.
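For illustration, here is a minimal sketch using tune.run. The "CQL" algorithm, the dataset path, and the stop threshold are assumptions you would swap for your own setup:

```python
from ray import tune

# Minimal sketch: let Tune drive the training loop and the stop condition.
# "CQL", the input path, and the stop value below are placeholders.
analysis = tune.run(
    "CQL",
    config={
        "input": "/path/to/offline_data/*.json",  # hypothetical dataset path
        "framework": "torch",
    },
    # Stop on any metric in the result dict, e.g. the iteration count
    # or a mean evaluation reward.
    stop={"training_iteration": 100},
)
```

If you want roughly x epochs, you could set training_iteration to about df_len * epochs / train_batch_size, since each training iteration reads on the order of train_batch_size transitions from the offline reader.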