You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
use server side backend selection instead of duplicating logic in CLI
Remove the CLI side GetRequiredBackendFromModelInfo mapping and delegate
backend resolution to the server via a new GET /models/backend?model=<ref>
endpoint. The endpoint reuses Scheduler.selectBackendForModel so the
platform aware fallback chain (vLLM, MLX, SGLang, diffusers) stays in
one place. Also address earlier review feedback: sentinel error for
cancelled install, cmd.InOrStdin for prompt input, and unit tests for
the new helpers.
0 commit comments