How can I export the compute_action function from a PPOTrainer?

weileze · July 14, 2021, 2:17am

How can I export a trained policy in one program and load it in another to compute actions?

Program 1
from ray.rllib.agents import ppo
trainer = ppo.PPOTrainer(config=config)
trainer.get_policy().export_model("dir")

Program 2
import tensorflow as tf
model = tf.saved_model.load("dir")
action = model(obs=[1,2,3,4,5])

sven1977 · July 14, 2021, 9:02pm

Hey @weileze, that looks about right.

Try passing in a numpy array instead of a list of obs.
In tf static-graph (non-eager) mode, the call returns an action tensor rather than a concrete value, so you'd have to evaluate that tensor in a tf.Session() to get the actual action value.
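
A minimal sketch of the first suggestion, covering only the numpy conversion. The helper name `to_obs_batch` is hypothetical, and the float32 dtype and the leading batch dimension are assumptions about what the exported model expects; check the input signature of your own export before relying on them.

```python
import numpy as np


def to_obs_batch(obs, dtype=np.float32):
    """Turn one observation (list or array) into a batch of size 1.

    Hypothetical helper: the dtype and the batch dimension are
    assumptions, not something confirmed in this thread.
    """
    arr = np.asarray(obs, dtype=dtype)
    # Add a leading axis so a flat observation of shape (n,)
    # becomes a batch of shape (1, n).
    return arr[np.newaxis, ...]


obs_batch = to_obs_batch([1, 2, 3, 4, 5])
print(obs_batch.shape, obs_batch.dtype)
```

The resulting array would then be passed to the loaded model in place of the plain Python list.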
