Hi,
I have a question.
How do RAY calculate the number of Parameters?
I used PPO algorithm and input_dim = 16, hiddens = [256,256] and my action space = 4 dimensional.
I have attached a picture below, but the result is strange.
where is it comes number β8β?
if I use the Compute_action function, I get the 4-dimensional action space.
Could anyone explain me?