Hello,
I have a Kubernetes cluster with version as:
kubectl version
Client Version: version.Info{Major:“1”, Minor:“16”, GitVersion:“v1.16.1”, GitCommit:“d647ddbd755faf07169599a625faf302ffc34458”, GitTreeState:“clean”, BuildDate:“2019-10-02T23:49:20Z”, GoVersion:“go1.12.9”, Compiler:“gc”, Platform:“darwin/amd64”}
Server Version: version.Info{Major:“1”, Minor:“19”, GitVersion:“v1.19.8+IKS”, GitCommit:“2051f2b131d2ba8ca584e7734e8c5284dac3630d”, GitTreeState:“clean”, BuildDate:“2021-02-24T04:17:23Z”, GoVersion:“go1.15.8”, Compiler:“gc”, Platform:“linux/amd64”}
I am trying to deploy the ray cluster on Kubernetes by following the instructions mentioned on below link:
https://docs.ray.io/en/master/cluster/kubernetes.html#k8s-operator
I am trying to launch the operator but it is failing:
kubectl get pods
NAME READY STATUS RESTARTS AGE
ray-operator-pod 0/1 CrashLoopBackOff 5 5m
Here is the log:
$ kubectl logs ray-operator-pod
Traceback (most recent call last):
File “/home/ray/anaconda3/lib/python3.7/site-packages/ray/ray_operator/operator.py”, line 150, in main
for event in cluster_cr_stream:
File “/home/ray/anaconda3/lib/python3.7/site-packages/kubernetes/watch/watch.py”, line 157, in stream
resp = func(*args, **kwargs)
File “/home/ray/anaconda3/lib/python3.7/site-packages/kubernetes/client/api/custom_objects_api.py”, line 2113, in list_namespaced_custom_object
return self.list_namespaced_custom_object_with_http_info(group, version, namespace, plural, **kwargs) # noqa: E501
File “/home/ray/anaconda3/lib/python3.7/site-packages/kubernetes/client/api/custom_objects_api.py”, line 2258, in list_namespaced_custom_object_with_http_info
collection_formats=collection_formats)
File “/home/ray/anaconda3/lib/python3.7/site-packages/kubernetes/client/api_client.py”, line 353, in call_api
_preload_content, _request_timeout, _host)
File “/home/ray/anaconda3/lib/python3.7/site-packages/kubernetes/client/api_client.py”, line 184, in __call_api
_request_timeout=_request_timeout)
File “/home/ray/anaconda3/lib/python3.7/site-packages/kubernetes/client/api_client.py”, line 377, in request
headers=headers)
File “/home/ray/anaconda3/lib/python3.7/site-packages/kubernetes/client/rest.py”, line 243, in GET
query_params=query_params)
File “/home/ray/anaconda3/lib/python3.7/site-packages/kubernetes/client/rest.py”, line 233, in request
raise ApiException(http_resp=r)
kubernetes.client.exceptions.ApiException: (404)
Reason: Not Found
HTTP response headers: HTTPHeaderDict({‘Cache-Control’: ‘no-cache, private’, ‘Content-Type’: ‘text/plain; charset=utf-8’, ‘X-Content-Type-Options’: ‘nosniff’, ‘Date’: ‘Fri, 26 Mar 2021 22:37:23 GMT’, ‘Content-Length’: ‘19’})
HTTP response body: b’404 page not found\n’
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File “/home/ray/anaconda3/bin/ray-operator”, line 8, in
sys.exit(main())
File “/home/ray/anaconda3/lib/python3.7/site-packages/ray/ray_operator/operator.py”, line 158, in main
“Caught a 404 error. Has the RayCluster CRD been created?”)
Exception: Caught a 404 error. Has the RayCluster CRD been created?
Can you please help?