Running test evaluation with policy server input

robfitzgerald · January 13, 2023, 5:15pm

How severe does this issue affect your experience of using Ray?

Medium: It contributes to significant difficulty to complete my task, but I can work around it.

hi! just asking for a quick head check here on a workaround i’m using.

i have been building off of the cartpole server example for an integration between RLlib and MATSim, a Java-based traffic simulator. i am finally getting acceptable training results for my problem. i opted out of running any inline tests/evaluations during training, and would now like to go back and manually run tests against saved checkpoints.

the cartpole server example does have a command line argument “–as-test” which i’m assuming was intended to mock up an example of this. but the argument is unused in that file, so there’s no example of what the original author intended. i dove into the git history of the cartpole file and --as-test first appears in this commit but it’s not wired in there either.

what i’ve come up with to approximate running a test (with the same server setup) is to set exploration to False, load the algorithm with policy_server_input, and then spin wait on the process while i send requests from the client:

# config at this point has _input: policy_server_input

if args.as_test:
  config.update({"explore": False})

algo = get_algorithm_class(args.run)(config=config)

if args.as_test:
  # run until user terminates process
  while True:
    time.sleep(0.2)
    pass

does this get me the behavior i’m expecting for a test evaluation of a given algorithm checkpoint?

thank you for reading!

rob

Topic		Replies	Views
Rollout/test a already trained policy employing PolicyServerInput and PolicyClient RLlib	1	300	October 30, 2021
Cartpole_server.py with evaluation_interval of 1 leads to Address already in use Error RLlib	3	304	August 30, 2022
Evaluation in Serving RLlib	2	471	January 28, 2022
Is it possible to restore the train result on another env for rllib? RLlib	2	170	September 21, 2023
How do I evaluate my trained policy after tune.fit() RLlib	1	719	March 30, 2023

Running test evaluation with policy server input

Related topics