How to configure the neural networks in A3C?

Roller44 · November 10, 2021, 12:01pm

+++++++++++++++++++

What I want to do:

Hi, I am trying to train A3C agents in my custom environment, and I want to use Tune to find out the optimal hyperparameters of the neural networks in the A3C algorithm.

I have zero TensorFlow or PyTorch background, so the only thing I can do is to use default networks/policies in RLlib.

+++++++++++++++++++

Here are what I know:

The A3C algorithm adopts the actor-critic structure, where the actor and the critic have a neural network, respectively. Furthermore, according to the A3C paper, the actor’s network and the critic’s network share some parameters.
By setting up the values in an RL agent’s config dict relating to hyperparameters of the agent’s networks and then input the config dict into the “Tune.run()” method, I can modify the agent’s network (e.g., number of layers or units, etc.) and start to train agent.
By setting the “model: {“use_lstm”: True}” in the A3C agent’s config dict, I can enable the A3C agent to adopt an LSTM algorithm.

+++++++++++++++++++

Here are my questions:

Q1: Where can I configure the neural networks in A3C? When I looked up the A3C agent’s configure dict (in the a3c.py file), there is no key relating to neural network configurations, whereas in contrast, there is a key named “hidden” in DQN’s configure dict that can be used to change the neural network in DQN.
Q2: What are the structures of the neural networks in the RLlib’s implementations of A3C?
Q3: When I enable LSTM by setting “use_lstm: True”, is the actor’s neural network or the critic’s neural network converted to LSTM?

mannyv · November 10, 2021, 1:26pm

Hi @Roller44,

Here is a link to all the model options in rllib.

The config you provide will have a key called modelthat holds a dictionary with these values.

github.com

ray-project/ray/blob/4e3e213549c2f65ee167f1f150f50033e6f93bc7/rllib/models/catalog.py#L37

    
      
          from ray.rllib.utils.spaces.space_utils import flatten_space
          from ray.rllib.utils.typing import ModelConfigDict, TensorType
          
          
tf1, tf, tfv = try_import_tf()
          torch, _ = try_import_torch()
          
          
logger = logging.getLogger(__name__)
          
          
# yapf: disable
          # __sphinx_doc_begin__
          MODEL_DEFAULTS: ModelConfigDict = {
              # Experimental flag.
              # If True, try to use a native (tf.keras.Model or torch.Module) default
              # model instead of our built-in ModelV2 defaults.
              # If False (default), use "classic" ModelV2 default models.
              # Note that this currently only works for:
              # 1) framework != torch AND
              # 2) fully connected and CNN default networks as well as
              # auto-wrapped LSTM- and attention nets.
              "_use_default_native_models": False,
              # Experimental flag.

These options are used by all of the RLlib algorithms (DQN, A3C, etc.)

Both the actor and the critic will share the lstm.

Roller44 · November 10, 2021, 2:01pm

Thanks for the reply!

Follow up question:

Do you have any idea what are the structures of the neural networks in the RLlib’s implementations of A3C? Based on your reply, it seems like the actor-network and the critic-network are identical.
I have noticed that there is a “hiddens” key (link) in the DQN’s config, while there is a “fcnet_hiddens” key (link) in the comment model config. Can I say that in DQN, there is a network (indicated by the “hiddens” key) on the top of another network (indicated by the “fcnet_hiddens” key)?

Topic		Replies	Views
How to make the A3C tutorial work? RLlib	2	402	September 27, 2021
NN model for RLLib A3CPolicy RLlib	1	433	July 23, 2021
Is there no option to train SAC with a convolutional network? RLlib	3	381	August 21, 2021
What happens when you pass a custom model to an actor-critic method RLlib	1	300	March 16, 2022
Using custom neural network in RLlib RLlib	5	1271	December 22, 2022

How to configure the neural networks in A3C?

Related topics