Custom sample distribution for Box

aram10 · March 1, 2024, 9:01pm

How severe does this issue affect your experience of using Ray?

Medium: It contributes to significant difficulty to complete my task, but I can work around it.

I need to sample integers for my RL problem. In my specific domain, there are certain known heuristic values. I want to write my own custom probability distribution for sampling from gym.spaces.Box. Ideally, the sampler should be most likely to sample integers that lie close to the heuristic values, but still reserve some probability mass for random values. If I were to create a subclass of gym.spaces.Space, could it still be used with PPO without issue? In the gymnasium documentation it says “some implementations of Reinforcement Learning algorithms might not handle custom spaces properly” so I don’t know if there are other requirements to make a custom space object suitable for an action space.

Lars_Simon_Zehnder · March 5, 2024, 2:08pm

@aram10 welcome to the forum.

If you overrid a Box space and just modify the sampling method this should be okay. Make sure that the shapes returned by the sample() method match the space.

Topic		Replies	Views
Custom action space Configure Algorithm, Training, Evaluation, Scaling	4	573	July 31, 2023
RLlib and gym.space RLlib	4	700	November 14, 2021
Observation dependent continuous action space ("Masking" continuous action space) RLlib	4	1088	February 9, 2022
Issue with Running Experiments with Custom Gym Environment RLlib	4	501	June 13, 2022
How to use Custom Action Distributions for this? RLlib	5	233	May 6, 2024

Custom sample distribution for Box

Related topics