How to make the A3C tutorial work?

konstmish · September 20, 2021, 1:22pm

In the documentation, I found this tutorial on A3C. I really like the tutorial as it’s very simple and it covers asynchronous methods, but unfortunately it seems outdated: it uses LSTMPolicy, which is not present in the current version of RLlib. Is there a working version of the same or a similar tutorial, or maybe a fix that would make it work, such as replacing LSTMPolicy with something else?

sven1977 · September 27, 2021, 3:31pm

Hey @konstmish, great question. The example does seem quite outdated, and it lives in the Ray core docs, not RLlib’s. The algo itself works out of the box, though. You could get the results using an LSTM-wrapped default model (in this case, a classic Atari Conv2D stack) by running:

rllib train --run A3C --env Pong-v0 --config '{"model": {"use_lstm": true}}'
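
In most shells the JSON passed to --config needs quoting, otherwise the braces and double quotes may not reach rllib intact. As a minimal sketch using only the Python standard library, the command can be built in a shell-safe form like this (the helper name build_rllib_command is hypothetical and not part of RLlib):

```python
import json
import shlex

# Model settings from the reply: wrap the default model in an LSTM.
config = {"model": {"use_lstm": True}}


def build_rllib_command(run, env, config):
    """Return a shell-safe `rllib train` command line for the given config."""
    args = [
        "rllib", "train",
        "--run", run,
        "--env", env,
        "--config", json.dumps(config),  # serialize the dict as JSON
    ]
    # shlex.quote adds single quotes only around arguments that need them.
    return " ".join(shlex.quote(arg) for arg in args)


command = build_rllib_command("A3C", "Pong-v0", config)
print(command)
# prints: rllib train --run A3C --env Pong-v0 --config '{"model": {"use_lstm": true}}'
```

Building the config as a Python dict first also makes it easier to add further model options without fighting shell escaping.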

konstmish · September 27, 2021, 3:50pm

Thanks for the suggestion. What you propose makes sense for getting the results, but I actually wanted to tweak things, so I was hoping there was a way to make the tutorial work.
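
For tweaking things from Python rather than the CLI, one possible starting point is the Trainer API. This is only a rough sketch under assumptions: it assumes a Ray 1.x install where A3CTrainer is importable from ray.rllib.agents.a3c, and that the extra model keys shown (lstm_cell_size, max_seq_len) and num_workers are supported by the installed version. The function name train_a3c is hypothetical, and this is not a port of the original tutorial.

```python
import copy

# "use_lstm" comes from the reply above; the other keys are assumptions about
# what the installed RLlib version accepts in its config.
TWEAKED_CONFIG = {
    "num_workers": 2,
    "model": {
        "use_lstm": True,
        "lstm_cell_size": 256,
        "max_seq_len": 20,
    },
}


def train_a3c(config, env="Pong-v0", iterations=1):
    """Run a few A3C training iterations and return the result dicts."""
    # Imported lazily so the config above can be inspected without Ray installed.
    import ray
    from ray.rllib.agents.a3c import A3CTrainer  # Ray 1.x import path (assumption)

    ray.init(ignore_reinit_error=True)
    try:
        # Pass a copy so the caller's dict is not modified by the trainer.
        trainer = A3CTrainer(env=env, config=copy.deepcopy(config))
        return [trainer.train() for _ in range(iterations)]
    finally:
        ray.shutdown()
```

Calling train_a3c(TWEAKED_CONFIG) would then run one training iteration; changing entries in the dict (or passing a custom model through the model config) is where the tweaking would happen.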
