Example of ExternalEnv used to implement a REST policy server

klausk55 · February 4, 2021, 12:10pm

Hello everyone,

I am looking for an example on how ExternalEnv can be used to implement a REST policy server (Note: my REST client is outside Python). Also hints how a REST policy server can be combined with ExternalEnv are welcome.

sven1977 · February 8, 2021, 8:09am

Hey @klausk55 , did you take a look at our ExternalEnv examples here?

Our Policy Server (ray/policy_server_input.py at master · ray-project/ray · GitHub) is already a REST server, accepting data from a client (e.g. ray/policy_client.py at master · ray-project/ray · GitHub, but you can write your own) and serving policy/action requests.
Your client would only have to connect and speak our RLlib protocol, which is quite simple and detailed in the above examples.

klausk55 · February 8, 2021, 8:35am

Hey @sven1977, yes I did and this is also how I solved it yet
I slightly modified the REST policy server API/class and implemented a HTTP client in C# similar to PolicyClient. Also changed the config “input” to a callable that returns my REST policy server resp. InputReader.
But what I am still confused by is the following: In the documentation of ExternalEnv class is mentioned that one can use it

by serving HTTP requests in the run loop.

Do it like in the examples or in my modification, I do not need the run() (it is paused/passed). Is the documentation imprecise or does it mean that there is an alternative way using ExternalEnv as “HTTP requests handler”?

sven1977 · February 8, 2021, 1:53pm

That’s correct, you wouldn’t need the ExternalEnv at all (like it’s done in the CartPole or Unity client/serving examples, where we simply connect a client to the server and - in the client - loop through a gym Env and send data and maybe action-requests to the server).

However, there is a RolloutWorker (with an ExternalEnv that overrides the run method simply with sleep(99999)) created automatically inside the PolicyServerInput, but then only really used in case the policy client uses inference_mode=remote.

Similarly, the PolicyClient auto generates a RolloutWorker (using the same auto-wrapped ExternalEnv with the sleep(99999) inside the run method as above), but only if inference_mode=local.

I agree, it’s a little confusing due to these auto-wrappings happening under the hood. I think the original idea was to separate the external env API from the server/client classes, which are more like examples on how one can use the external env API.

RiccardoZ · July 12, 2022, 8:18am

Hi @sven1977,
In the example in serving/unity3d_server.py the role of the variable “ioctx” @ line 106 is not clear to me since its not passed as argument once the function for PolicyServerInput is called @ line 131.
What is the correct syntax?
Thanks

Lars_Simon_Zehnder · July 12, 2022, 5:36pm

Hi @RiccardoZ and welcome to the forum!

I admit typing had been left out there so it is not directly obvious. It should be an IOContext object from RLlib’ Offline API.

Hope this helps

RiccardoZ · July 13, 2022, 7:29am

Hi @Lars_Simon_Zehnder,
thank you for the quick answer!
I just wanted to check whether leaving it as a lambda was something not to do.
I just started with RLLib and I still have many unknowns.

Topic		Replies	Views
How to use an environment that runs outside Python with RLlib? RLlib	1	416	February 1, 2021
ExternalEnv vs. External Application Clients? RLlib	3	553	July 12, 2021
Do I have to change "input": "sampler" config when working with ExternalEnv API? RLlib	4	334	February 2, 2021
ExternalEnv in a secuential simulator running locally? And how to register the environment RLlib	4	495	February 25, 2022
ValueError: Policies using the new Connector API do not support ExternalEnv RLlib	7	847	August 17, 2023

Example of ExternalEnv used to implement a REST policy server

Related topics