In the tf-agent library, there’s a feature of StepType that
indicates whether the current observation is the first/mid/last observation of an episode.
I think this is a helpful signal for implementing a custom model.
Could you please consider adding this feature? or, am I failing to find the feature that is already implemented in Ray?