The `print` in the `compute_action` function of class `PPOTorchPolicy` produces no output

Duplicate of "Problem of using an LLM during PPO training"