Hi! I’m facing a similar problem in that I want to be able to save a policy to then loading it in another process.
After exporting the policy with trainer.export_policy_model("/tmp/policy")
, how can we load that policy into a new trainer? Ideally, I would want to be able to do something like trainer.compute_single_action(...)