Files
the-custodian/ops

Ops Documentation

Operational runbooks and incident reports for the Railiance/Custodian infrastructure.

Structure

ops/
  runbooks/   — how-to guides for recurring operational tasks and known issues
  incidents/  — post-incident reports (append-only, one file per incident)

Runbooks

Runbook Covers
gitea-coulombcore.md Gitea on COULOMBCORE k3s — access, known issues, recovery checklist

Incidents

ID Date Summary Status
INC-001 2026-03-25 Gitea down 13d — PGPool containerd StartError + CPU exhaustion Resolved