AssertionError in ppo.py: KL is None, learner stats of at least one policy are empty

Do you use a custom execution plan?
Is this ralated?