Put balancer desired state
This endpoint allows you to apply the desired state of the balancer, including data such as inference parameters, selected model, or whether you want to override the chat template with your own.
Endpoint
Method: PUT
Path: /api/v1/balancer_desired_statePayload
Model sources
The model (and multimodal_projection) field selects where the model comes from. The example above uses Hugging Face; the other accepted forms are:
"model": "model": "model": "None"A Url model is downloaded over http(s) — a direct link to the .gguf file that the agent can reach (including a server on your local network); a LocalToAgent path is read from the agent's own filesystem; "None" leaves no model loaded.
Response
Responds with 204 No Content if the request was successful.