ValueError: Expected parameter logits (...) to satisfy the constraint IndependentConstraint(Real(), 1)

Hi @Lars_Simon_Zehnder
I had a similar problem these days and "simple_optimizer=True" solved my issue. See here. But its not working in this case. I have this problem with ray2.0 and ray1.8. Can you tell me in which version it does work?
I know how to debug scripts. But the error does not happen on my local computer although it uses the same ray version as my remote container setup. My training starts like this:

tune.run(
    "PPO",
    config=tune.grid_search(configs),
    verbose=1,
    num_samples=1, 
    checkpoint_freq=0, 
    checkpoint_at_end=False, 
    local_dir=args.output_folder,
...
    )

That means, I dont have a custom training loop.