Hi @bug404,
See this comment for an explanation: "Initialise loss from dummy batch method in policy.py" (post #2, by mannyv).