litellm: re-index models with local_/proxy_/direct_ prefixes + scaffold OpenAI+Gemini
Backend-prefix taxonomy so the Open WebUI picker is self-documenting and a model name can't lie about where it routes: local_* -> Anvil/Ollama (free) e.g. local_qwen2.5-72b proxy_* -> Claude via Meridian/Max e.g. proxy_claude-sonnet-4-6 direct_* -> metered OpenAI/Gemini e.g. direct_gpt-4o, direct_gemini-2.0-flash Drops the redundant -max suffix (proxy_ already implies Max). api_base is now emitted only when a model defines it, so direct_* hit the provider default endpoint instead of Meridian. direct_* are SCAFFOLDED (no live keys): litellm.env writes a placeholder so the proxy boots; deploy.sh pulls OPENAI_API_KEY/ GEMINI_API_KEY from Infisical /meridian if present (non-fatal). They 401 until real keys land. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -1,2 +1,7 @@
|
||||
# {{ ansible_managed }}
|
||||
LITELLM_MASTER_KEY={{ litellm_master_key }}
|
||||
# direct_* provider keys. A placeholder keeps os.environ/<KEY> resolvable so the
|
||||
# proxy boots; real keys come from Infisical /meridian via deploy.sh. Until then
|
||||
# direct_* models 401 on call (proxy_* + local_* are unaffected).
|
||||
OPENAI_API_KEY={{ litellm_openai_api_key | default('', true) or 'sk-PLACEHOLDER-set-in-infisical-meridian' }}
|
||||
GEMINI_API_KEY={{ litellm_gemini_api_key | default('', true) or 'PLACEHOLDER-set-in-infisical-meridian' }}
|
||||
|
||||
Reference in New Issue
Block a user