Trying to update my latest ray images, and I’m getting failures on trying to install horovod. The root error appears to be:
cmake: symbol lookup error: cmake: undefined symbol: archive_write_add_filter_zstd
unless something with my base image changed (which seems unlikely, but maybe not impossible), the only other thing that changed would have been the ray nightly. All my python lib versions are pinned.
Technically, there is a first image build that installed the ray nightly, cython, flatbuffers, and dataclasses, and then there is a subsequent build that installs the remainder (ml stuff) on top.
Looks like CMake was built to use a certain library that is missing at runtime. Can you run ldd /usr/bin/cmake and share the result?
By the way, might be worth checking out the Horovod on Ray Docker images we’re publishing as a point of comparison:
The fact that these are building correctly suggests it’s not an issue specific to Horovod or Ray or how they interact, but rather an environment issue.