CollaQ which is introduced in this paper: https://arxiv.org/pdf/2010.08531.pdf looks to be pretty promising regarding the general performance compared with QMIX and QTran. And also the performance achieved in ad-hoc MARL looks promising.
Would be great to see this algorithm implemented in RLlib.
In the paper there is a link to the following repo: