After the exciting release of DeepMind’s DreamerV3 algorithm, are there plans to implement it in RLlib?
I would love to see that. Is there anyone who could help me implement it?
Which paper is the current implementation based on? The first one, or has it been updated since?