Hi, I am also running into this issue using Azure Container Registry and wondering if anyone has a solution? I have logged into my ACR from the host OS (and can pull from the ACR), but when I run ray up example-full.yaml I get:
Error response from daemon: Head “https://myregistry.azurecr.io/v2/myimage/manifests/mytag”: unauthorized:
Shared connection to xx.x.x.xx closed.
2025-01-22 04:45:37,636 INFO node_provider.py:114 – ClusterState: Writing cluster state: [‘xx.x.x.xx’]
New status: update-failed
!!!
Exception details: {‘message’: ‘SSH command failed.’}
Full traceback: Traceback (most recent call last):
File “/usr/local/lib/python3.10/dist-packages/ray/autoscaler/_private/updater.py”, line 159, in run
self.do_update()
File “/usr/local/lib/python3.10/dist-packages/ray/autoscaler/_private/updater.py”, line 451, in do_update
self.cmd_runner.run_init(
File “/usr/local/lib/python3.10/dist-packages/ray/autoscaler/_private/command_runner.py”, line 722, in run_init
self.run(
File “/usr/local/lib/python3.10/dist-packages/ray/autoscaler/_private/command_runner.py”, line 493, in run
return self.ssh_command_runner.run(
File “/usr/local/lib/python3.10/dist-packages/ray/autoscaler/_private/command_runner.py”, line 379, in run
return self._run_helper(
File “/usr/local/lib/python3.10/dist-packages/ray/autoscaler/_private/command_runner.py”, line 298, in _run_helper
raise click.ClickException(fail_msg) from None
click.exceptions.ClickException: SSH command failed.