Using customized credit assignment

hossein836 · March 23, 2022, 11:11am

HI, I want to write my own credit assignment function instead of using discount factor, a lot of problems are not in the type of discount factor credit assignment which give more weight to last action. sometimes there is one action (for example in the middle of trajectory) that has more credit for gaining reward and we can write a customized function to assign credits. the question is:
1- how can I do this in Rllib? should I subclass postprocess?
2- do we have any example of doing so?

Example:
" in some basketball match we give more credit on an excellent pass (with some logics) against last move (maybe because its easy) while discount factor do opposite and gives more credit on last action.
Many thanks

Topic		Replies	Views
Delayed assigning of rewards resp. punishments in single-agent RL RLlib	4	402	April 27, 2021
How does agent know from what action it gets reward? RLlib	4	644	May 22, 2022
TrajectoryTracking with RLLIB RLlib	14	1363	November 17, 2021
My RLlib implementation seems to compute random actions RLlib	4	942	February 15, 2022
How to shape the reward successfully RLlib	0	19	July 29, 2025

Using customized credit assignment

Related topics