At the last Office Hours, we discussed: 1) IMPALA with MixinReplayBuffer, to answer Renos’ question; 2) deeper PettingZoo integration, rather than just using the existing RLlib wrapper; 3) custom DDPO with PyTorch, to answer Simon’s question; and 4) another question from Renos about multiple heterogeneous agents. Here is the full Office Hours playlist on YouTube.
Next Tuesday, Artur is hosting. Please add your question to this doc here. In your question, please link to either a GitHub issue or a Discuss thread; this helps share the learnings with others.
Looking forward to your questions, and thank you!