Updating agents' `action_distribution_fn` signature

thomaslecat · June 30, 2021, 2:34pm

Hi!

The policies of several base agents, including DQN (TF and Torch), DDPG (TF and Torch), and SAC (TF), still use an `action_distribution_fn` with the old signature. These agents therefore don’t benefit from the features of the Trajectory View API.

Is there a ticket and/or ongoing work to update them? Can we expect all agents to be updated in the 2.0.0 release?

This seems to be the last barrier to using custom models that implement shared computation graphs with built-in policies in multi-agent settings.

Thanks!
