Hi,
I’m running ray on a Kubernetes cluster using Kuberay operator. Everything is working great, but I’m now stuck with an issue when trying accessing the serve endpoint (from pods in the same Kubernetes cluster).
When using a port forward to ray head (port 8000), I’m able to reach the service on port 8000 and perform the inference with no issues, but when trying to access from other pods with either “ray-cluster-head-svc.ray-workload.svc.cluster.local” or the clusterIP of the service, it gives me the ECONNREFUSED error, only for port 8000.
The service/deployment is running with -p 0.0.0.0
.
I’m able to access the dashboard from other pods in the cluster normally, using http://ray-cluster-head-svc.ray-workload.svc.cluster.local::8265
but no luck reaching port 8000.
Any suggestions on what to try next?