I have been using get_action
and log_action
based on the comment here:
When using these two functions together, should I be using two different episodes or is it all right if both are called within the same episode?
They should be within the same episode and timestep. log_action
takes action as input, which should be ideally retrieved via get_action
.