Hello,
I am trying to run test job as mentioned in the tutorial here :
https://docs.ray.io/en/master/cluster/kubernetes.html
The initial state of minikube cluster is:
$ kubectl -n ray get pods
NAME READY STATUS RESTARTS AGE
example-cluster-ray-head-mmr8n 1/1 Running 0 8s
ray-operator-pod 1/1 Running 0 42s
$ kubectl -n ray get services
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
example-cluster-ray-head ClusterIP 10.102.35.227 10001/TCP,8265/TCP,8000/TCP 83s
I submit the test job:
$ kubectl create -f ray/doc/kubernetes/job-example.yaml
job.batch/ray-test-job-5tcp8 created
I see two workers spawned up but do not see the job:
$ kubectl -n ray get pods
NAME READY STATUS RESTARTS AGE
example-cluster-ray-head-mmr8n 1/1 Running 0 2m47s
example-cluster-ray-worker-mqmxd 1/1 Running 0 2m36s
example-cluster-ray-worker-rmqrj 1/1 Running 0 2m36s
ray-operator-pod 1/1 Running 0 3m21s
$ kubectl -n ray logs ray-test-job-5tcp8
Error from server (NotFound): pods βray-test-job-5tcp8β not found
can you please confirm if this is the correct way of launching jobs?
Thanks