generated from coulomb/repo-seed
Complete activity-core LLM endpoint handoff (LLM-WP-0006)
Switch the custodian triage default from anthropic/claude-sonnet-4 to google/gemini-2.5-flash, which advertises structured-output support on OpenRouter. Tighten the OpenRouter adapter to send strict JSON schema requests and set provider.require_parameters=true so routing only hits providers that honor the requested response_format. Update Kubernetes deploy docs and config for the verified coulombcore handoff: Containerfile build path, image-pull-policy=Never for smoke pods, credential-routing notes, and live smoke evidence. Mark LLM-WP-0006 finished with closure notes from 2026-06-18.
This commit is contained in:
@@ -17,10 +17,14 @@ kubectl -n activity-core create secret generic llm-connect-provider-secrets \
|
||||
--from-literal=OPENROUTER_API_KEY="$OPENROUTER_API_KEY"
|
||||
```
|
||||
|
||||
Provider API key custody belongs to the operator/OpenBao-to-Kubernetes Secret
|
||||
path. ops-warden documents this as outside its issuance scope; do not paste key
|
||||
values into Git, State Hub, logs, or chat.
|
||||
|
||||
Apply:
|
||||
|
||||
```bash
|
||||
docker build -t docker.io/library/llm-connect:latest .
|
||||
docker build -f Containerfile -t docker.io/library/llm-connect:latest .
|
||||
docker save docker.io/library/llm-connect:latest | ssh coulombcore sudo k3s ctr -n k8s.io images import -
|
||||
kubectl apply -k deploy/k8s/activity-core-llm-connect
|
||||
kubectl -n activity-core rollout status deployment/llm-connect
|
||||
@@ -33,6 +37,7 @@ fixtures and `scripts/smoke_activity_core_endpoint.py`:
|
||||
kubectl -n activity-core run llm-connect-smoke \
|
||||
--rm -i --restart=Never \
|
||||
--image=llm-connect:latest \
|
||||
--image-pull-policy=Never \
|
||||
--env=LLM_CONNECT_URL=http://llm-connect.activity-core.svc.cluster.local:8080 \
|
||||
--env=LLM_CONNECT_TIMEOUT_SECONDS=300 \
|
||||
-- python scripts/smoke_activity_core_endpoint.py
|
||||
|
||||
@@ -10,9 +10,9 @@ data:
|
||||
LLM_CONNECT_HOST: "0.0.0.0"
|
||||
LLM_CONNECT_PORT: "8080"
|
||||
LLM_CONNECT_PROVIDER: "openrouter"
|
||||
LLM_CONNECT_MODEL: "anthropic/claude-sonnet-4"
|
||||
LLM_CONNECT_MODEL: "google/gemini-2.5-flash"
|
||||
LLM_CONNECT_CUSTODIAN_TRIAGE_PROVIDER: "openrouter"
|
||||
LLM_CONNECT_CUSTODIAN_TRIAGE_MODEL: "anthropic/claude-sonnet-4"
|
||||
LLM_CONNECT_CUSTODIAN_TRIAGE_MODEL: "google/gemini-2.5-flash"
|
||||
LLM_CONNECT_CUSTODIAN_TRIAGE_TEMPERATURE: "0.2"
|
||||
LLM_CONNECT_CUSTODIAN_TRIAGE_MAX_TOKENS: "1800"
|
||||
LLM_CONNECT_CUSTODIAN_TRIAGE_MAX_DEPTH: "2"
|
||||
|
||||
Reference in New Issue
Block a user