Rollout/test a already trained policy employing PolicyServerInput and PolicyClient

klausk55 · October 29, 2021, 12:14pm

Hey guys,

I need some advice for the following:
I have a already trained policy stored in a checkpoint (trainer.save()). Now I restore from that checkpoint (trainer.restore()) and want to employ the already trained policy for an “external application”, i.e. I want to employ RLlib’s PolicyServerInput and PolicyClient classes for inference.

What’s the best practice on the trainer/server side to test (“rollout”) the trained policy?
E.g.,

trainer = PPOTrainer(config={"explore": False}, ...)
trainer.restore(checkpoint)
while True:
    trainer.train()

or

trainer = PPOTrainer(config={"explore": False}, ...)
trainer.restore(checkpoint)
while True:
    trainer.evaluate()

Or is it better to directly restore a trainer from the checkpoint on the “virtual client side” and then do action = trainer.compute_action(obs) here (so neither server nor client)?

mannyv · October 30, 2021, 11:31am

Hi @klausk55,

Perhaps this example on how to use serve with a pertained rllib model will help.

https://docs.ray.io/en/latest/serve/tutorials/rllib.html

Topic		Replies	Views
Transfer Learning for Multi-Agent env. with RLlib RLlib	4	778	September 21, 2022
Custom rollout and training loop RLlib	4	717	April 26, 2023
Best practice for training on policy and off policy action together? RLlib	4	342	September 27, 2021
How to deploy a trained Ray RLlib PPO policy/model in multi-agent-case? RLlib	5	809	November 10, 2021
PolicyClient connect distributed PolicyServerInput RLlib	0	192	July 29, 2022

Rollout/test a already trained policy employing PolicyServerInput and PolicyClient

Related topics