Implementation example of intrinsic reward in MARL

XavierM · March 16, 2021, 8:58am

Hello,

I am looking for some documentation or example showing how agents could cusotmize their reward, for instance by summing up the environment reward with an own intrisic reward.

The only examples I have found so far “only” show MARL agents using the environment reward (i.e. the reward coming from the environment’s step() function).

Thanks.

Topic		Replies	Views
Adding priority to MARL RLlib	5	650	October 19, 2021
An example of RLLib used with multiple neural networks RLlib	2	299	June 29, 2022
Multi agent policy optimization in competitive settings RLlib	0	249	April 20, 2023
MARL modeling issue RLlib	7	874	March 31, 2021
MARL settings - how to broadcast messages? RLlib	0	161	September 19, 2021

Implementation example of intrinsic reward in MARL

Related Topics