I found the answer with the help of this RLlib custom-policy example:
from typing import Type
from ray.rllib.agents.trainer import Trainer
from ray.rllib.policy.policy import Policy
from ray.rllib.utils.typing import ModelWeights, TrainerConfigDict
class FIFO(Policy):
    """Heuristic FIFO (first-in-first-out) dispatching policy.

    A fixed rule-based policy: it holds no trainable model and performs no
    learning, it only maps observations to actions.
    """

    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        # No neural network is needed for a fixed heuristic.
        self.model = None
        self.exploration = self._create_exploration()

    def compute_actions(self,
                        obs_batch,
                        state_batches=None,
                        prev_action_batch=None,
                        prev_reward_batch=None,
                        info_batch=None,
                        episodes=None,
                        **kwargs):
        """Return one action per observation in *obs_batch*.

        RLlib expects a 3-tuple ``(actions, rnn_state_outs, extra_info)``;
        the original stub returned ``...`` (Ellipsis), which would crash the
        rollout worker as soon as it tried to unpack the result.
        """
        # TODO: replace the placeholder 0-action with real FIFO dispatching
        # logic (pick the transport order that entered the queue first).
        actions = [0 for _ in obs_batch]
        return actions, [], {}

    def learn_on_batch(self, samples):
        """No-op: a fixed heuristic has nothing to learn."""
        return {}  # return (empty) learner stats

    def get_weights(self) -> ModelWeights:
        """No weights to save."""
        return {}

    def set_weights(self, weights: ModelWeights) -> None:
        """No weights to restore (counterpart of ``get_weights``)."""
        pass
class FIFOTrainer(Trainer):
    """Trainer whose default policy is the rule-based ``FIFO`` heuristic."""

    def get_default_policy_class(self, config: TrainerConfigDict) -> Type[Policy]:
        # Hand RLlib the FIFO class; it instantiates one policy per worker.
        return FIFO