Set max_num_models_per_replica for multiplexed deployments in the serve config file

How severely does this issue affect your experience of using Ray?

  • High: It blocks me from completing my task.

Is it possible to set the max_num_models_per_replica for multiplexed deployments in the serve config file, so that the application can be updated in place?


Could you submit a feature request on GitHub, so we can track this?

Done! See GitHub issue #46422 in ray-project/ray: "[Serve] Make `max_num_models_per_replica` in `@serve.multiplexed` reconfigurable".
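
In the meantime, one possible workaround is to manage the model cache yourself and take its capacity from `user_config`, which Serve passes to a deployment's `reconfigure()` method when the config file is updated, without restarting replicas. The sketch below is plain Python and not a Ray API: the class `ReconfigurableModelCache`, the `load_model` callback, and the `max_num_models_per_replica` key inside `user_config` are all hypothetical names chosen for illustration. Note that a hand-rolled cache like this replaces `@serve.multiplexed`, so it likely gives up Serve's model-aware request routing.

```python
from collections import OrderedDict
from typing import Any, Callable, Dict, List


class ReconfigurableModelCache:
    """LRU cache of loaded models whose capacity can change at runtime.

    Hypothetical helper, not part of Ray. A deployment class could hold one
    instance and forward its own reconfigure(config) call to this object.
    """

    def __init__(self, load_model: Callable[[str], Any], max_models: int = 3):
        if max_models < 1:
            raise ValueError("max_models must be at least 1")
        self._load_model = load_model
        self._max_models = max_models
        self._models: "OrderedDict[str, Any]" = OrderedDict()

    def reconfigure(self, config: Dict[str, Any]) -> None:
        # Assumed config key; it would live under user_config in the config file.
        new_max = int(config.get("max_num_models_per_replica", self._max_models))
        if new_max < 1:
            raise ValueError("max_num_models_per_replica must be at least 1")
        self._max_models = new_max
        self._evict()  # shrink immediately if the limit went down

    def get(self, model_id: str) -> Any:
        if model_id in self._models:
            # Mark as most recently used.
            self._models.move_to_end(model_id)
            return self._models[model_id]
        model = self._load_model(model_id)
        self._models[model_id] = model
        self._evict()
        return model

    def loaded_ids(self) -> List[str]:
        # Least recently used first.
        return list(self._models.keys())

    def _evict(self) -> None:
        # Drop least recently used models until the cache fits the limit.
        while len(self._models) > self._max_models:
            self._models.popitem(last=False)
```

Inside a deployment, `reconfigure(self, config)` would simply call `self.cache.reconfigure(config)`, and the request handler would call `self.cache.get(model_id)` with the model ID taken from the request.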