Hi, I would like to use Ray Serve for an image pipeline. I don’t like the idea of encoding to and decoding from JSON at every step. Ideally, if two agents are on the same host, I would like to pass images through shared memory or as a GPU pointer (it would be great if Ray could take care of this for me).
I can’t see any examples like this in the Serve docs.