I have been using get_action and log_action based on the comment here:
When using these two functions together, should I use two different episodes, or is it all right to call both within the same episode?
They should be called within the same episode and the same timestep. log_action takes the action as input, and that action should ideally be the one retrieved via get_action.
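As a rough illustration of that call pattern, here is a minimal sketch. It does not use the real library: StubClient, start_episode, step_once, and the argument order (episode_id, observation, action) are all assumptions made up for the example, so check the actual client's signatures before copying anything.

```python
class StubClient:
    """Hypothetical stand-in for the real client; it only records calls."""

    def __init__(self):
        self.calls = []
        self._next_id = 0

    def start_episode(self):
        # Assumed helper: hands out a fresh episode ID.
        self._next_id += 1
        return f"episode-{self._next_id}"

    def get_action(self, episode_id, observation):
        action = 0  # placeholder policy: always returns action 0
        self.calls.append(("get_action", episode_id, observation, action))
        return action

    def log_action(self, episode_id, observation, action):
        self.calls.append(("log_action", episode_id, observation, action))


def step_once(client, episode_id, observation):
    """Run one timestep: fetch the action, then log that same action."""
    action = client.get_action(episode_id, observation)
    # Same episode ID and same observation, i.e. the same timestep.
    client.log_action(episode_id, observation, action)
    return action


client = StubClient()
eid = client.start_episode()
chosen = step_once(client, eid, observation=[0.0, 1.0])
```

The point is only that one episode ID is reused for both calls, and that the value passed to log_action is the value get_action returned for that observation.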