Jump-Start Reinforcement Learning

Can I just use the two options discussed here?

PPO nan in actor logits - #6 by tlaurie99.