Complete activity-core LLM endpoint handoff (LLM-WP-0006)
Some checks failed
CI / test (3.10) (push) Has been cancelled
CI / test (3.11) (push) Has been cancelled
CI / test (3.12) (push) Has been cancelled

Switch the custodian triage default from anthropic/claude-sonnet-4 to
google/gemini-2.5-flash, which advertises structured-output support on
OpenRouter. Tighten the OpenRouter adapter to send strict JSON schema
requests and set provider.require_parameters=true so routing only hits
providers that honor the requested response_format.

Update Kubernetes deploy docs and config for the verified coulombcore
handoff: Containerfile build path, image-pull-policy=Never for smoke
pods, credential-routing notes, and live smoke evidence. Mark
LLM-WP-0006 finished with closure notes from 2026-06-18.
This commit is contained in:
2026-06-19 13:51:12 +02:00
parent 6a0319ee86
commit 90eb39c247
12 changed files with 176 additions and 27 deletions

View File

@@ -17,10 +17,14 @@ kubectl -n activity-core create secret generic llm-connect-provider-secrets \
--from-literal=OPENROUTER_API_KEY="$OPENROUTER_API_KEY"
```
Provider API key custody belongs to the operator/OpenBao-to-Kubernetes Secret
path. ops-warden documents this as outside its issuance scope; do not paste key
values into Git, State Hub, logs, or chat.
Apply:
```bash
docker build -t docker.io/library/llm-connect:latest .
docker build -f Containerfile -t docker.io/library/llm-connect:latest .
docker save docker.io/library/llm-connect:latest | ssh coulombcore sudo k3s ctr -n k8s.io images import -
kubectl apply -k deploy/k8s/activity-core-llm-connect
kubectl -n activity-core rollout status deployment/llm-connect
@@ -33,6 +37,7 @@ fixtures and `scripts/smoke_activity_core_endpoint.py`:
kubectl -n activity-core run llm-connect-smoke \
--rm -i --restart=Never \
--image=llm-connect:latest \
--image-pull-policy=Never \
--env=LLM_CONNECT_URL=http://llm-connect.activity-core.svc.cluster.local:8080 \
--env=LLM_CONNECT_TIMEOUT_SECONDS=300 \
-- python scripts/smoke_activity_core_endpoint.py

View File

@@ -10,9 +10,9 @@ data:
LLM_CONNECT_HOST: "0.0.0.0"
LLM_CONNECT_PORT: "8080"
LLM_CONNECT_PROVIDER: "openrouter"
LLM_CONNECT_MODEL: "anthropic/claude-sonnet-4"
LLM_CONNECT_MODEL: "google/gemini-2.5-flash"
LLM_CONNECT_CUSTODIAN_TRIAGE_PROVIDER: "openrouter"
LLM_CONNECT_CUSTODIAN_TRIAGE_MODEL: "anthropic/claude-sonnet-4"
LLM_CONNECT_CUSTODIAN_TRIAGE_MODEL: "google/gemini-2.5-flash"
LLM_CONNECT_CUSTODIAN_TRIAGE_TEMPERATURE: "0.2"
LLM_CONNECT_CUSTODIAN_TRIAGE_MAX_TOKENS: "1800"
LLM_CONNECT_CUSTODIAN_TRIAGE_MAX_DEPTH: "2"