This discusssion seems to be very similar to the one I started earlier this week. Maybe this is interesting for you, @CodingBurmer: Multi-Agent System for maximizing the overall reward of all agents? ?
1 Like