I am new to rllib so maybe missing something very obvious. I am performing offline RL using an offline dataset from json files only. I wondered whether there is a more concise way to specify training over the entire dataset for x number of epochs, please? I am currently having to count the number of transitions in the dataset and multiply by the number of epochs i.e.:
for i in range(0, df_len*epochs):
eval_res = algo.train().get("evaluation")
Thanks