My model sering consists of several components, and I found Deployment composition suits my requirement. My questions:
- Can I select which downstream deployment instances to call? IIUC, calling via
DeploymentHandles
it will route to any instances meet the load balance requirement? But if my business logic need to schedule specific downstream deployment instances, how can I do that? - There are requirement of huge memory exchange between deployment instances. How can I do that? for example, via RDMA, distributed memory sharing, etc?