Hi,
Just trying out different algorithms on the default Gym envs. PPO works fine with CartPole, getting to 500 reward in about 15 minutes.
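For reference, this is roughly how I'm running the PPO baseline (a minimal sketch; the env name and loop length are just what I'm using, nothing special):

```python
from ray.rllib.algorithms.ppo import PPOConfig

# Plain PPO on CartPole; default config, nothing tuned.
config = PPOConfig().environment("CartPole-v1")
algo = config.build()

for _ in range(100):
    result = algo.train()
    # Older RLlib versions report "episode_reward_mean" at the top level;
    # newer ones nest the metric -- adjust the key for your Ray version.
    print(result.get("episode_reward_mean"))
```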
However, DreamerV3, using the stock tuned_examples/dreamerv3/cartpole.py, doesn't seem to learn at all, staying below 20 reward even after 30+ minutes of running.
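For what it's worth, my understanding is that the tuned example boils down to roughly this config (a sketch from reading the script; the model_size="XS" and training_ratio=1024 values are my reading of it, so correct me if that's wrong):

```python
from ray.rllib.algorithms.dreamerv3.dreamerv3 import DreamerV3Config

# Roughly what I believe tuned_examples/dreamerv3/cartpole.py sets up;
# the model_size and training_ratio values are my assumption.
config = (
    DreamerV3Config()
    .environment("CartPole-v1")
    .training(model_size="XS", training_ratio=1024)
)
algo = config.build()

for _ in range(1000):
    algo.train()
```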
Has anybody had the same experience? Is the RLlib implementation flawed, or am I missing something?
Also, as a side question: DreamerV3 uses a lot of RAM. CartPole takes about 16 GB, and atari_100k OOMs my machine (24 GB). Are the memory requirements documented anywhere? How much RAM would I need to run atari_100k? And what about an XL model for atari_200M?
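In case it matters for the memory question, this is where I understand the model size gets selected (again a sketch; the env string and training_ratio here are hypothetical placeholders, and I'm only assuming "XL" is what the atari_200M setup would use):

```python
from ray.rllib.algorithms.dreamerv3.dreamerv3 import DreamerV3Config

# Hypothetical sketch: bumping model_size to "XL" is the only change I'd
# expect for the larger Atari run; env and training_ratio are placeholders.
config = (
    DreamerV3Config()
    .environment("ALE/Pong-v5")  # illustrative env choice, not from the script
    .training(model_size="XL", training_ratio=64)
)
```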
Thanks!