Image on HeadNode vs WorkerNode

max1 · June 24, 2024, 4:43pm

I am trying to run an LLM inference. The docker image is quite big, like 15 GB. It seems OK for the worker nodes, but should the head node have a smaller docker image, since it doesn’t need most of the components? Also, I am setting lower cpu/memory requirements for the head node.

Thanks.

shrekris · July 26, 2024, 7:03pm

I would recommend using the same image for all nodes for consistency sake. The image should realistically only get pulled once as the service is starting up, so the size shouldn’t impact the service during runtime.

Topic		Replies	Views
Determining Compute Limits for Head Node Kubernetes	1	383	August 16, 2023
LLM model loading LLMs/Generative AI/Aviary	0	34	July 29, 2024
Problem with worker node Ray Clusters	3	381	July 22, 2024
Worker nodes are IDLE	1	540	January 26, 2022
Local cluster with multiple nodes in YAML config, while there's only head being started... Any hints? Ray Clusters	11	1596	June 17, 2022

Image on HeadNode vs WorkerNode

Related topics