Sample Rule-Based Expert Demonstrations in Rllib

Also, maybe this thread helps you.

1 Like