Utilization of resources by RLlib

I am working on a project for algorithmic trading and Black-Litterman portfolio optimization with reinforcement learning. I am using RLlib's PPO implementation, with hyperparameter optimization via Ray Tune.
Link to project: GitHub - Athe-kunal/Black-Litterman-Portfolio-Optimization-using-RL

On my university cluster, I have one V100 GPU and a 2-core Xeon CPU. Here are my configuration parameters:

num_workers = 1
num_samples = 20
num_gpus = 1
num_cpus = 2
training_iterations = 200
checkpoint_freq = 1
num_envs_per_worker = 100
worker_cpu = 0.5
worker_gpu = 0.5
log_level = "DEBUG"
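
Roughly, the intended mapping of these onto RLlib's config dict and `tune.run` looks like this. This is a simplified sketch, not the exact code in main.py: the env name is a placeholder, and I am assuming worker_cpu/worker_gpu correspond to num_cpus_per_worker/num_gpus_per_worker.

```python
from ray import tune

config = {
    "env": "BlackLittermanEnv",      # placeholder name (registered below)
    "num_workers": 1,                # rollout workers per trial
    "num_envs_per_worker": 100,      # vectorized copies of the small env
    "num_gpus": 1,                   # GPUs for the learner process
    "num_cpus_per_worker": 0.5,     # presumably what worker_cpu sets
    "num_gpus_per_worker": 0.5,     # presumably what worker_gpu sets
    "log_level": "DEBUG",
}

analysis = tune.run(
    "PPO",
    config=config,
    num_samples=20,                       # 20 Tune trials
    stop={"training_iteration": 200},
    checkpoint_freq=1,
)
```

Tune computes each trial's resource request from the config: num_gpus for the learner plus num_workers × num_gpus_per_worker, and one driver CPU plus num_workers × num_cpus_per_worker. That sum decides how many of the 20 samples can run concurrently. If each trial claims the whole V100, only one trial runs at a time, which would match a flat utilization line; fractional values such as num_gpus = 0.5 let two trials share the card.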

It is a small financial environment with only 206 time steps. To run the code:

python main.py --if_confidence true --model mlp
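
Since it is a custom environment, it has to be registered with Ray by name before "PPO" can resolve it. A generic sketch of that registration, with placeholder module and class names rather than the repo's actual identifiers:

```python
from ray.tune.registry import register_env

def env_creator(env_config):
    # hypothetical import; the real environment class lives in this repo
    from my_envs import BlackLittermanEnv
    return BlackLittermanEnv(env_config)

# makes "BlackLittermanEnv" resolvable via the config's "env" field
register_env("BlackLittermanEnv", env_creator)
```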

Issue that I am facing:
The Ray trials are not utilizing the hardware properly. I have only 2 CPU cores (though they are Xeon cores, which may expose additional logical threads via hyper-threading). I am logging all my results to Weights & Biases here: Weights & Biases

In the sample_perf tab, you can see the resource utilization: a flat line. How can I ensure that I am using the hardware effectively? This is a headless server environment, so I cannot access the Ray dashboard; the Weights & Biases report is my only view into utilization. As I am still learning Ray and RLlib, can someone help me debug this and understand how to use my resources effectively?
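
One low-tech check that works without the dashboard (my own workaround, not an RLlib feature) is to attach to the running cluster and print Ray's view of allocation:

```python
import ray

# attach to the already-running Ray cluster instead of starting a new one
ray.init(address="auto", ignore_reinit_error=True)

print("Total resources:    ", ray.cluster_resources())
print("Currently available:", ray.available_resources())
# If available_resources() shows most of the GPU/CPUs free while trials
# are running, the per-trial request is too small, or the bottleneck is
# the environment stepping itself rather than the hardware.
```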

@sven1977 Can you please take a look and suggest how I can improve performance? Currently, a single training iteration of one trial takes 40-50 seconds; with 200 training iterations per trial followed by hyperparameter optimization across 20 samples, the full run will take a very long time.

Hi @sven1977, please do have a look at this.