Hey! What is the total amount of resources you want to schedule for a single TorchTrainer?
Based on your current ScalingConfig, each Trainer/Trial will request a total of
trainer_resources + num_workers * resources_per_worker
where:
trainer_resources = 1 CPU
num_workers = 2
resources_per_worker = 0.5 GPU
So in this case, you’ll be requesting a total of 1 CPU and 1 GPU - does that match what you’re seeing in the console output?
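To double-check the arithmetic, here's a small illustrative sketch (plain Python, not the actual Ray API) that sums trainer_resources + num_workers * resources_per_worker per resource key, using the values from the ScalingConfig above:

```python
# Hypothetical helper to total the resources one TorchTrainer would request.
def total_resources(trainer_resources, num_workers, resources_per_worker):
    """Return trainer_resources + num_workers * resources_per_worker, per key."""
    total = dict(trainer_resources)
    for key, amount in resources_per_worker.items():
        total[key] = total.get(key, 0) + num_workers * amount
    return total

# Values from the ScalingConfig in this thread.
print(total_resources({"CPU": 1}, 2, {"GPU": 0.5}))
# -> {'CPU': 1, 'GPU': 1.0}
```

With 2 workers at 0.5 GPU each plus the 1 CPU for the trainer, that comes out to 1 CPU and 1 GPU total.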