- Medium: It contributes to significant difficulty to complete my task, but I can work around it.
How can I monitor the surrogate loss for SGD within a step for PPO? For example, I want to see num_epochs loss values (num_epochs being the AlgorithmConfig setting) so I can verify that it is set large enough to converge at each step.
Thanks.