Offline RL passing reward data from .json into environment

@kris, great that you found some examples of how to proceed. As a rule of thumb, the configuration for the evaluation workers is identical to the one used in training; only `in_evaluation` is set to `True`, and the number of evaluation workers is configured separately.
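To make this concrete, here is a minimal sketch of how that split looks in an RLlib-style config dict. The key names (`evaluation_num_workers`, `evaluation_config`, `in_evaluation`, `explore`) follow RLlib's common config; the concrete values are illustrative assumptions, not a recommended setup:

```python
import copy

# Base training config (values are placeholders).
train_config = {
    "framework": "torch",
    "gamma": 0.99,
}

# Overrides applied only on the evaluation workers; everything
# not listed here mirrors the training config.
eval_overrides = {
    "in_evaluation": True,  # marks workers as evaluation workers
    "explore": False,       # typical override: deterministic rollouts
}

config = copy.deepcopy(train_config)
config["evaluation_num_workers"] = 1          # evaluation-specific worker count
config["evaluation_config"] = eval_overrides  # the only place the configs differ
```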

Regarding evaluation, there was another issue on this board here. Usually you need an environment to roll out the policy online. In that case, SAC was suggested due to its similar setup.

In the other case, where you want to estimate the policy's performance on an offline dataset, you need to provide `action_logp` keys in the dataset, as mentioned here.
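As a rough illustration of what that requirement means for the .json file, here is a sketch of a single dataset record in SampleBatch-style column format. The field names (`obs`, `actions`, `rewards`, `dones`, `action_logp`) follow RLlib's SampleBatch keys; all numeric values are made up:

```python
import json
import math

# One JSON line of an offline dataset. "action_logp" holds the
# log-probability of each logged action under the behavior policy,
# which off-policy estimators (e.g. importance sampling) require.
record = {
    "type": "SampleBatch",
    "obs": [[0.1, -0.2], [0.15, -0.1]],
    "actions": [0, 1],
    "rewards": [1.0, 0.5],
    "dones": [False, True],
    "action_logp": [math.log(0.8), math.log(0.6)],
}

line = json.dumps(record)      # what one line of the .json file looks like
restored = json.loads(line)    # reading it back for a sanity check
```

If your logging pipeline did not record action probabilities, you cannot reconstruct `action_logp` after the fact; it has to be captured when the behavior policy generates the data.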