Using pre-trained PPO for Inference

How severely does this issue affect your experience of using Ray?

  • High: It blocks me from completing my task.

I have trained PPO on a machine with multiple GPUs and saved a checkpoint. Now I need to run inference on my own CPU-only machine, which apparently requires changing my trained model's configuration first. I used the following approach to modify the configuration, but I still can't use the model for inference.

  import os

  from ray.rllib.algorithms.algorithm import Algorithm
  from ray.rllib.algorithms.ppo import PPOConfig

  checkpoint_path = "..."  # path to the saved PPO checkpoint

  # Step 1: Load the trained algorithm from the checkpoint.
  original_ppo = Algorithm.from_checkpoint(checkpoint_path)

  # Step 2: Adjust the config for local, CPU-only inference.
  ppo_configurations = original_ppo.config.to_dict()
  ppo_configurations["num_env_runners"] = 1
  ppo_configurations["num_cpus_per_env_runner"] = 1
  ppo_configurations["num_gpus_per_env_runner"] = 0
  ppo_configurations["num_gpus"] = 0
  ppo_configurations["explore"] = False

  # Step 3: Rebuild the algorithm with the updated configuration
  # and restore the checkpointed weights.
  updated_config = PPOConfig().from_dict(ppo_configurations)
  new_ppo = updated_config.build()
  new_ppo.restore(os.path.abspath(checkpoint_path))

The following is the warning I get when I run it, and the code then halts there:

The following resource request cannot be scheduled right now: {'CPU': 6.0, 'GPU': 0.25}

I had trained the model with 6 CPUs and 0.25 GPU per env runner. So I'd like to confirm: is there something wrong with the configuration above, or with the way I'm modifying it?

My bad: for inference, setting num_env_runners to 0 was sufficient to use the model. I'll mark this as resolved rather than delete it, in case someone else makes the same mistake.
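For reference, the change that resolved it can be sketched on a plain dict (a hypothetical stand-in for the real `original_ppo.config.to_dict()` result; key names assume the new RLlib API stack). With `num_env_runners=0`, sampling happens on the local process only, so no remote env-runner resources (the `{'CPU': 6.0, 'GPU': 0.25}` request) are ever asked for:

```python
# Hypothetical stand-in for original_ppo.config.to_dict(); in practice
# this dict comes from the loaded checkpoint's config.
ppo_configurations = {
    "num_env_runners": 6,
    "num_cpus_per_env_runner": 1,
    "num_gpus_per_env_runner": 0.25,
    "num_gpus": 1,
    "explore": True,
}

# For local, CPU-only inference: no remote env runners, no GPUs,
# and deterministic (non-exploring) actions.
ppo_configurations.update(
    num_env_runners=0,
    num_gpus_per_env_runner=0,
    num_gpus=0,
    explore=False,
)
```

The resulting dict is then passed to `PPOConfig().from_dict(...)` exactly as in the snippet above.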