RL-based recommender system with no simulator

8rigo8 · October 18, 2023, 3:52pm

Hi!

How could I train a model (let’s say SlateQ) if I don’t have a simulator environment for online training?

I could plug the model into my website and log observations and actions to a replay buffer, but the reward can only be computed in a daily batch. So I would effectively be training an online algorithm offline, on logged data. Is that possible?
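To make the setup concrete, here is a rough sketch of the data side as I imagine it: steps get logged without a reward, and the daily batch job later attaches rewards by key. Everything in it is an assumption for illustration. `LoggedStep`, `Transition`, `join_daily_rewards`, and the `event_id` key are hypothetical names, not part of SlateQ or any library.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class LoggedStep:
    """One step logged by the website; the reward is not known yet."""
    event_id: str            # hypothetical key that the daily batch job also emits
    obs: List[float]
    action: int
    next_obs: List[float]
    done: bool


@dataclass
class Transition:
    """A complete (obs, action, reward, next_obs, done) tuple for offline training."""
    obs: List[float]
    action: int
    reward: float
    next_obs: List[float]
    done: bool


def join_daily_rewards(
    steps: List[LoggedStep], rewards: Dict[str, float]
) -> Tuple[List[Transition], List[LoggedStep]]:
    """Attach batch-computed rewards to logged steps.

    Returns the labeled transitions plus the steps whose reward has not
    arrived yet, so they can be retried with the next daily batch.
    """
    labeled: List[Transition] = []
    pending: List[LoggedStep] = []
    for step in steps:
        if step.event_id in rewards:
            labeled.append(
                Transition(step.obs, step.action, rewards[step.event_id],
                           step.next_obs, step.done)
            )
        else:
            pending.append(step)
    return labeled, pending


# Example: two logged steps, but the daily job has only scored the first one.
logged = [
    LoggedStep("e1", [0.1, 0.2], 3, [0.2, 0.3], False),
    LoggedStep("e2", [0.2, 0.3], 1, [0.4, 0.5], True),
]
daily_rewards = {"e1": 1.0}
labeled, pending = join_daily_rewards(logged, daily_rewards)
```

The `labeled` list is what I would then feed to an offline training job, while `pending` waits for the next day's rewards.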

Thank you in advance!
