Save experience from custom policy

michele-pel · July 7, 2022, 5:51am

How severe does this issue affect your experience of using Ray?

Medium: It contributes to significant difficulty to complete my task, but I can work around it.

Good morning,
I am creating a custom policy and want to use it to generate experience for replay.

The "Converting external experiences to batch format" approach in the docs is quite limited and outdated for my case: I am using a multi-agent environment with an RNN, so I also need to save the state information, and the approach has no postprocessing step. https://docs.ray.io/en/latest/rllib/rllib-offline.
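To make concrete what "also saving the state information" would involve, here is a minimal plain-Python sketch that groups per-step records into one column-oriented batch per agent, including the RNN state going in and coming out of each step. The helper `collect_episode` and the record layout are hypothetical. The column names `state_in_0` and `state_out_0` mirror the ones RLlib uses for recurrent state in sample batches, but whether RLlib's offline input accepts exactly this layout is an assumption, not something confirmed here.

```python
import json
from collections import defaultdict


def collect_episode(step_records):
    """Group per-step, per-agent records into one column dict per agent.

    Hypothetical helper: each record is a dict with the keys
    agent_id, t, obs, action, reward, done, state_in, state_out.
    """
    batches = defaultdict(lambda: defaultdict(list))
    for rec in step_records:
        cols = batches[rec["agent_id"]]
        cols["t"].append(rec["t"])
        cols["obs"].append(rec["obs"])
        cols["actions"].append(rec["action"])
        cols["rewards"].append(rec["reward"])
        cols["dones"].append(rec["done"])
        # RNN state fed into the policy at this step and returned by it.
        cols["state_in_0"].append(rec["state_in"])
        cols["state_out_0"].append(rec["state_out"])
    # Convert the nested defaultdicts to plain dicts for serialization.
    return {agent: dict(cols) for agent, cols in batches.items()}


# Toy two-agent rollout with made-up values.
records = [
    {"agent_id": "agent_0", "t": 0, "obs": [0.0, 1.0], "action": 1,
     "reward": 0.0, "done": False,
     "state_in": [0.0, 0.0], "state_out": [0.1, -0.2]},
    {"agent_id": "agent_1", "t": 0, "obs": [1.0, 0.0], "action": 0,
     "reward": 0.5, "done": False,
     "state_in": [0.0, 0.0], "state_out": [0.4, 0.4]},
    {"agent_id": "agent_0", "t": 1, "obs": [0.5, 0.5], "action": 0,
     "reward": 1.0, "done": True,
     "state_in": [0.1, -0.2], "state_out": [0.3, 0.0]},
]

batches = collect_episode(records)
# One JSON line per agent batch; this is not RLlib's own on-disk format.
line = json.dumps(batches["agent_0"])
```

Keeping one batch per agent preserves each agent's own time order, which matters because the state at step t+1 is the state output at step t for that same agent.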

Even after adapting the proposed script to my case, the batches produced with this method only give inf as the imitation loss, whereas if I save batches as the output of a training run and use them later for supervised training, the loss is meaningful. I suppose there is a format problem.
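One way to narrow down a suspected format problem is to compare the columns of a batch that trains correctly against a hand-built one, and to look for non-finite values. The sketch below does this on plain Python dicts of lists. It assumes both batches have already been decoded into that form, and the helpers `compare_columns` and `non_finite_columns` are hypothetical, not part of RLlib.

```python
import math


def compare_columns(reference, candidate):
    """Report columns present in one batch but not in the other."""
    ref_keys, cand_keys = set(reference), set(candidate)
    return {
        "missing": sorted(ref_keys - cand_keys),
        "extra": sorted(cand_keys - ref_keys),
    }


def _flatten(value):
    """Yield the scalar leaves of arbitrarily nested lists or tuples."""
    if isinstance(value, (list, tuple)):
        for item in value:
            yield from _flatten(item)
    else:
        yield value


def non_finite_columns(batch):
    """Return the names of columns containing inf or NaN floats."""
    bad = []
    for name, column in batch.items():
        if any(isinstance(v, float) and not math.isfinite(v)
               for v in _flatten(column)):
            bad.append(name)
    return sorted(bad)


# Made-up example: a batch saved from training vs. a hand-built one.
reference = {
    "obs": [[0.0, 1.0]],
    "actions": [1],
    "action_logp": [-0.7],
    "state_in_0": [[0.0, 0.0]],
}
candidate = {
    "obs": [[0.0, 1.0]],
    "actions": [1],
    "action_logp": [float("-inf")],
}

report = compare_columns(reference, candidate)
bad_columns = non_finite_columns(candidate)
```

A missing state column or a non-finite log-probability in the hand-built batch would be a natural first suspect, although this check alone cannot confirm that either one is the cause of the inf loss.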

Is there an automatic way to save some episodes given a custom policy?
