a6b26c500f
LiteLLM sits in front of Meridian for clients that can't talk Anthropic's /v1/messages format (Pulse OpenAI provider, paperless-ai, etc.). Routes OpenAI-shaped requests to localhost:3456 (Meridian) which forwards to the Max sub. - New roles/litellm/ — Python venv, pip install litellm[proxy], systemd - vars/main.yml — model map (haiku/sonnet/opus) + LITELLM_MASTER_KEY env lookup - site.yml — adds litellm role + sanity-check assert - deploy.sh — pulls LITELLM_MASTER_KEY from Infisical (/meridian/) on the controller and exports it for the playbook - New Infisical secret /meridian/vault_litellm_master_key Smoke: Pulse → LiteLLM /v1/chat/completions → Meridian /v1/messages → Max sub returns "pong" through both the LiteLLM master key auth and the Claude Code SDK OAuth. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
22 lines
627 B
Django/Jinja
22 lines
627 B
Django/Jinja
# {{ ansible_managed }}
|
|
#
|
|
# LiteLLM proxy config. Routes OpenAI-shaped requests to Meridian's
|
|
# /v1/messages (Anthropic format). Meridian (same host, :3456) ignores the
|
|
# upstream API key, so we pass a placeholder.
|
|
|
|
model_list:
|
|
{% for m in litellm_models %}
|
|
- model_name: {{ m.name }}
|
|
litellm_params:
|
|
model: {{ m.backend }}
|
|
api_base: http://127.0.0.1:{{ meridian_port }}
|
|
api_key: placeholder-meridian-ignores-this
|
|
{% endfor %}
|
|
|
|
general_settings:
|
|
master_key: os.environ/LITELLM_MASTER_KEY
|
|
|
|
litellm_settings:
|
|
drop_params: true # tolerate clients sending unsupported params
|
|
set_verbose: false
|