Why do we need a lock in _compute_action_helper in torch_policy_v2.py?

How severely does this issue affect your experience of using Ray?

  • None: Just asking a question out of curiosity
```python
@with_lock
def _compute_action_helper(
    self, input_dict, state_batches, seq_lens, explore, timestep
):
    ...
```

I would like to know where the race condition comes from that makes the lock necessary.

Thanks,
James

According to ray.rllib.utils.threading.with_lock, it is an object-level lock. I guess it only avoids race conditions between methods (learn_on_batch, compute_gradients) inside torch_policy_v2.
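
For reference, here is a minimal sketch of how an object-level lock decorator in the spirit of with_lock can be implemented (the ToyPolicy class below is a made-up stand-in, not the real RLlib code): every decorated method acquires the instance's own self._lock, so any two decorated methods running on the same policy object are serialized, while separate policy objects never block each other.

```python
import threading
from functools import wraps


def with_lock(func):
    # Object-level lock: all decorated methods on the same instance
    # contend on that instance's `self._lock`, so their bodies can
    # never interleave with each other.
    @wraps(func)
    def wrapper(self, *args, **kwargs):
        with self._lock:
            return func(self, *args, **kwargs)
    return wrapper


class ToyPolicy:
    def __init__(self):
        # One lock per instance; two ToyPolicy objects do not block
        # each other, only concurrent calls on the same object do.
        self._lock = threading.RLock()

    @with_lock
    def _compute_action_helper(self, input_dict):
        ...  # forward pass reading the shared model's weights

    @with_lock
    def learn_on_batch(self, batch):
        ...  # optimizer step mutating the same weights in place
```

Under this layout, a _compute_action_helper call can never overlap with a learn_on_batch call on the same policy, which matches the "between methods inside torch_policy_v2 only" reading above.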
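
And here is a hypothetical, torch-free illustration of where such a race could come from: a learner thread updates model state in place while a sampler thread is mid-read, so the reader can observe a half-applied update. Everything below (TinyPolicy, the w1/w2 "weights") is invented purely for the demo.

```python
import threading


class TinyPolicy:
    def __init__(self):
        self._lock = threading.Lock()  # unused here, to show the race
        self.w1 = 0.0
        self.w2 = 0.0  # intended invariant: w1 == w2 at all times

    def compute_action(self):
        # Unlocked two-step read: a concurrent update can land between
        # the two attribute reads and break the invariant.
        a = self.w1
        b = self.w2
        return a, b

    def learn_on_batch(self):
        # Unlocked two-step write, standing in for an optimizer step
        # that mutates the model's parameters in place.
        self.w1 += 1.0
        self.w2 += 1.0


policy = TinyPolicy()
torn = 0

learner = threading.Thread(
    target=lambda: [policy.learn_on_batch() for _ in range(100_000)]
)
learner.start()
for _ in range(100_000):
    a, b = policy.compute_action()
    if a != b:
        torn += 1
learner.join()
print(f"torn reads observed: {torn}")  # usually > 0 without the lock
```

Wrapping both methods in `with self._lock:` (which is what @with_lock amounts to) drives the torn-read count to zero; in the real policy the analogous interleaving would be an action-computing forward pass reading weights that learn_on_batch's optimizer step is updating at the same moment.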