How does RolloutWorker work (how is experience added to the replay buffer?)

Halman · March 17, 2023, 3:56pm

High: It blocks me from completing my task.

I know this is a very simple question, but please bear with me because I am new to Ray and RLlib.

I am currently trying to train a Soft Actor Critic (SAC) agent that takes image data as input. Judging from its shape, SampleBatch.CUR_OBS seems to store the values after they have passed through the conv layer rather than the raw images.

Then I realized that I do not know where SampleBatch.CUR_OBS is defined.

I have read about how experience is stored at the following URL, but I have not been able to follow the code because the content is complex.
https://docs.ray.io/en/latest/rllib/rllib-sample-collection.html

My questions are as follows.

・Where is the code defined that saves the observations retrieved from the environment to the replay buffer during execution?
・For image input, is it expected that SampleBatch.CUR_OBS holds the values after the conv layer rather than the image data itself?
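To make the first question concrete, here is a minimal pure-Python sketch of the general pattern being asked about: a rollout loop collects transitions from an environment, and each transition is then appended to a fixed-capacity replay buffer. This is not RLlib's implementation. ToyEnv, ReplayBuffer, rollout, and the dictionary keys are all hypothetical names chosen for illustration, and the sketch assumes the raw observation is what gets stored.

```python
import random
from collections import deque


class ToyEnv:
    """Hypothetical stand-in for an image environment (2x2 'images')."""

    def __init__(self, episode_len=5):
        self.episode_len = episode_len
        self.t = 0

    def _obs(self):
        # A tiny fake image whose pixel values equal the current timestep.
        return [[self.t, self.t], [self.t, self.t]]

    def reset(self):
        self.t = 0
        return self._obs()

    def step(self, action):
        self.t += 1
        reward = float(action)
        done = self.t >= self.episode_len
        return self._obs(), reward, done


class ReplayBuffer:
    """Hypothetical fixed-capacity buffer; the oldest entries are evicted first."""

    def __init__(self, capacity):
        self.storage = deque(maxlen=capacity)

    def add(self, transition):
        self.storage.append(transition)

    def sample(self, batch_size):
        # Uniform sampling without replacement.
        return random.sample(list(self.storage), batch_size)

    def __len__(self):
        return len(self.storage)


def rollout(env, policy, num_steps):
    """Collect num_steps transitions, resetting the env when an episode ends."""
    transitions = []
    obs = env.reset()
    for _ in range(num_steps):
        action = policy(obs)
        next_obs, reward, done = env.step(action)
        # The raw observation is stored here (an assumption of this sketch).
        transitions.append({
            "obs": obs,
            "actions": action,
            "rewards": reward,
            "new_obs": next_obs,
            "dones": done,
        })
        obs = env.reset() if done else next_obs
    return transitions


buffer = ReplayBuffer(capacity=8)
for transition in rollout(ToyEnv(), policy=lambda obs: 1, num_steps=12):
    buffer.add(transition)

batch = buffer.sample(4)
```

In this sketch the buffer ends up holding only the 8 most recent of the 12 collected transitions, and each stored "obs" is still the 2x2 image from the environment. Whether a real library stores raw or preprocessed observations depends on its own pipeline and is not shown here.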

Halman · March 24, 2023, 2:11am

Sorry for the trouble, but the issue has been resolved.
