Ray
Restored Policy gives action that is out of bound
RLlib
Checkpointing, Restoring
arturn
April 13, 2023, 11:06pm
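Have a look at what I wrote over here.
`Algorithm.compute_single_action` unsquashes actions before returning them. A restored `Policy` used on its own does not, so the raw actions it returns can fall outside the bounds of the action space.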
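To illustrate what "unsquashing" means here, below is a minimal sketch in plain Python. It is not RLlib's implementation. It assumes the policy emits actions in a normalized [-1, 1] range (as I understand it, this is the case when `normalize_actions` is enabled) and that the environment has a bounded, continuous action space. The function name `unsquash_to_bounds` and the bounds are made up for the example.

```python
from typing import List, Sequence


def unsquash_to_bounds(
    action: Sequence[float],
    low: Sequence[float],
    high: Sequence[float],
) -> List[float]:
    """Map a normalized action in [-1, 1] to [low, high], per dimension.

    Hypothetical helper for illustration only; not part of RLlib.
    """
    result = []
    for a, lo, hi in zip(action, low, high):
        # Clip first, so a raw output slightly outside [-1, 1] stays in bounds.
        a = max(-1.0, min(1.0, a))
        # Linear rescale: -1 maps to lo, +1 maps to hi.
        result.append(lo + (a + 1.0) * (hi - lo) / 2.0)
    return result


# Hypothetical 2-dimensional action space bounds.
low = [0.0, -2.0]
high = [10.0, 2.0]

# A raw (normalized) policy output; the second entry is outside [-1, 1].
raw_action = [0.5, -1.3]

env_action = unsquash_to_bounds(raw_action, low, high)
print(env_action)
```

If you call the restored policy directly, you would need to apply a mapping like this yourself before passing the action to the environment, or go through the `Algorithm` instead.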