Hi. I have Ray Serve running on a Ray Cluster on Kubernetes, which I deployed using the ray/doc/kubernetes/ray-cluster.yaml
configuration file. Is there a simple or canonical way to expose this server so that I can make network requests to it from the host machine or other machines on the LAN?
I am new to Kubernetes. It seems like I might need to add a new Kubernetes service and/or a NodePort or Load Balancer. I thought I would ask here in case there is already some simpler or built-in way of doing this that I am missing.
Thanks.