Restoring Checkpoint Does Not Include Target Net

cedros23 · January 2, 2021, 7:01pm

Hi everyone,

I have already opened an issue on the GitHub page:

I believe there is a bug when restoring an Apex-DQN agent: the restore loads only the actual (online) network, not the target network. This causes a high td_error and, as a result, low rewards after the restore.
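
To illustrate why a missing target network would show up as a high td_error, here is a minimal sketch using the standard DQN TD error, reward + gamma * max Q_target(s', a') - Q_online(s, a). All numbers and names (`td_error`, `trained_target_next`, `fresh_target_next`) are made up for illustration and are not taken from RLlib.

```python
GAMMA = 0.99  # assumed discount factor, for illustration only


def td_error(q_online_sa, q_target_next, reward, gamma=GAMMA):
    """Standard one-step DQN TD error for a single non-terminal transition."""
    return reward + gamma * max(q_target_next) - q_online_sa


# Hypothetical values: the online net was restored from the checkpoint,
# so it still predicts a large Q-value for the sampled (s, a).
q_online_sa = 10.0
reward = 1.0

# Target net that was trained alongside the online net (what we want restored).
trained_target_next = [9.0, 9.1]
# Target net left at its fresh initialization (what the post suspects happens).
fresh_target_next = [0.0, 0.05]

err_consistent = td_error(q_online_sa, trained_target_next, reward)
err_stale = td_error(q_online_sa, fresh_target_next, reward)

print("TD error with restored target:", err_consistent)
print("TD error with fresh target:   ", err_stale)
```

With these made-up values the mismatch between the two networks dominates the error, which is consistent with the symptom described above, though it does not prove that this is what happens inside RLlib.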

I have read all the discussions about checkpointing and resuming training, but could not come up with a solution.

One short-term workaround would be to store the weights myself and force them onto the target network with the ._set_weights API…
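
A rough, framework-agnostic sketch of that workaround is below. `TinyAgent`, `save_both`, and `restore_both` are hypothetical stand-ins, not RLlib classes or functions. The actual `._set_weights` call is not shown because its signature is not given here. The only point is that both the online and the target weights are written to disk and both are loaded back.

```python
import copy
import os
import pickle
import tempfile


class TinyAgent:
    """Hypothetical stand-in for a DQN-style agent (not RLlib's API)."""

    def __init__(self):
        self.online = {"w": [0.0, 0.0]}
        self.target = copy.deepcopy(self.online)

    def get_weights(self):
        # Return copies of BOTH networks, not just the online one.
        return {
            "online": copy.deepcopy(self.online),
            "target": copy.deepcopy(self.target),
        }

    def set_weights(self, weights):
        self.online = copy.deepcopy(weights["online"])
        self.target = copy.deepcopy(weights["target"])


def save_both(agent, path):
    """Pickle the online and target weights side by side."""
    with open(path, "wb") as f:
        pickle.dump(agent.get_weights(), f)


def restore_both(agent, path):
    """Load the pickled weights and force them onto both networks."""
    with open(path, "rb") as f:
        agent.set_weights(pickle.load(f))


# Pretend this agent has been trained; the target lags behind the online net.
trained = TinyAgent()
trained.online["w"] = [1.5, -0.5]
trained.target["w"] = [1.0, -0.25]

path = os.path.join(tempfile.mkdtemp(), "weights.pkl")
save_both(trained, path)

# A freshly built agent stands in for the agent after a restart.
restored = TinyAgent()
restore_both(restored, path)

online_matches = restored.online == trained.online
target_matches = restored.target == trained.target
print(online_matches, target_matches)
```

In a real setup the two dictionaries would be replaced by whatever weight objects the policy exposes, and this extra file would be saved next to the regular checkpoint.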

Any ideas or recommendations?
thx
