Using multi-agent model after training

juancroldan · October 8, 2021, 12:28pm

I designed a simple custom environment for the kids game Block, Load and Fire. It works like this:

Every turn, the two players say in loud voice, at the same time “block”, “load”, or “fire”.
Initially, both players have no ammo. Saying “load” will give you one bullet (so this is the obvious choice at the first turn).
After this, one can say “shoot”, and if the other player is not blocking he will lose. If both players shoot at the same time, it is a draw. You cannot shoot without bullets.
The fifth bullet you reload makes you have a cannon, which means that if you shoot the other player will lose even if he is blocking. If both players shoot a cannon against the other, it is a draw.

The MultiAgentEnv definition as well as the training/test are in this gist.

My problem is that, after training the algorithm, I don’t see a way to use the model as something standalone. How would you complete the test method to play against the trained model?

Topic		Replies	Views
Loading pre-trained single-agent policy weights for multi-agent training RLlib	2	900	June 11, 2021
Asymmetric play multiagent environment RLlib	2	472	January 6, 2022
How to deploy a trained Ray RLlib PPO policy/model in multi-agent-case? RLlib	5	847	November 10, 2021
Best way to use rllib with player vs player games RLlib	3	804	March 15, 2021
[rllib] Applying MAML in multi-agent environments RLlib	2	285	January 25, 2021

Using multi-agent model after training

Related topics