TorchDiagGaussian from logits

H-Park · May 28, 2025, 9:50am

For Box action spaces, are the logits laid out as

[mean1, std1, mean2, std2, …]

or

[mean1, mean2, …, std1, std2, …]

?

christina · June 3, 2025, 11:55pm

Hi, can you give a bit more context on what you’re trying to build here and what Ray library you’re using? Thanks

iykim · June 4, 2025, 12:50am

Hi. It has been a while, so you may have already found the answer, but here it is anyway:

It’s the second approach: all the means come first, followed by all the stds.

e.g.
output = torch.concat((means, stds), dim=-1)
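
For an inference program that runs outside of PyTorch, the same layout can be decoded without any framework. The sketch below is only an illustration in plain Python: `split_diag_gaussian_logits` and `sample_action` are hypothetical helper names, not part of RLlib. One thing worth verifying against your Ray version: as far as I know, RLlib's `TorchDiagGaussian` treats the second half as log standard deviations and exponentiates it, so the sketch assumes that by default and exposes a flag in case your model emits plain stds.

```python
import math
import random


def split_diag_gaussian_logits(logits):
    """Split a flat [mean1, ..., meanN, x1, ..., xN] vector into two halves.

    Hypothetical helper: the first half holds the means, the second half
    holds the per-dimension spread parameters (std or log-std).
    """
    if len(logits) % 2 != 0:
        raise ValueError("expected an even number of logits")
    n = len(logits) // 2
    return list(logits[:n]), list(logits[n:])


def sample_action(logits, second_half_is_log_std=True, rng=random):
    """Draw one action from the diagonal Gaussian described by `logits`.

    Assumption: the second half is log-std unless the flag says otherwise.
    """
    means, second = split_diag_gaussian_logits(logits)
    if second_half_is_log_std:
        stds = [math.exp(s) for s in second]
    else:
        stds = second
    # Each action dimension is sampled independently (diagonal covariance).
    return [rng.gauss(m, s) for m, s in zip(means, stds)]


# Example with a 2-dimensional Box action space (4 logits in total).
logits = [0.5, -1.0, 0.0, math.log(2.0)]
means, second = split_diag_gaussian_logits(logits)
print(means)  # [0.5, -1.0]
print(sample_action(logits))
```

For a deterministic policy at inference time, using the means alone (the first half) and ignoring the second half is usually enough.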

H-Park · June 4, 2025, 6:02pm

@christina I am taking the torch model files produced by a Ray train job, converting them to ONNX files, and building a Rust-based inference program.

Thanks!

christina · June 5, 2025, 6:41pm

Thank you, @iykim! Let me know if you have any other questions, @H-Park.
