Add automation status surface

2026-07-01 20:12:04 +02:00
parent 3f85274916
commit ffe10f098e
20 changed files with 1732 additions and 11 deletions
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -158,6 +158,21 @@ get wrong.

 ---

+## Automation Scheduling Preference
+
+Durable activity-core automations must use this repo's own infrastructure:
+Temporal Schedules, NATS JetStream, activity-core run records, State Hub
+progress, and configured report/evidence sinks. Do not use coding
+assistant-provided automation, reminder, or heartbeat tooling as the execution
+or evidence source for production or operational recurrence.
+
+Coding assistants may run repo-native inspection commands and summarize their
+outputs, but the baseline answer to questions like "How did our automations go
+since Friday?" must come from deterministic local tooling such as the
+ACTIVITY-WP-0018 automation status surface.
+
+---
+
 ## Workplan Convention (ADR-001)

 Work items originate as files in this repo — not in the hub. The hub is a
--- a/12
+++ b/12
@@ -2,6 +2,7 @@
 export

 .PHONY: sync-event-types sync-activity-definitions sync-schedules test migrate sync-all \
+        automation-status automation-status-json \
        dev-up dev-down railiance-up railiance-down \
        start-worker start-api start-event-router help

@@ -24,6 +25,17 @@ migrate:  ## Apply all pending Alembic migrations

 sync-all: sync-event-types sync-activity-definitions  ## Sync event types and activity definitions

+# -- Automation status ---------------------------------------------------------
+
+SINCE ?= today
+FORMAT ?= human
+
+automation-status:  ## Report recent automation status from repo-owned evidence
+	uv run python scripts/automation_status.py --since "$(SINCE)" $(if $(UNTIL),--until "$(UNTIL)",) --format "$(FORMAT)"
+
+automation-status-json:  ## Report recent automation status as JSON
+	$(MAKE) automation-status FORMAT=json
+
 # ── Infrastructure ─────────────────────────────────────────────────────────────

 dev-up:  ## Start full dev stack (Temporal + PG + ES + NATS)
--- a/SCOPE.md
+++ b/SCOPE.md
@@ -90,6 +90,9 @@ The two evaluation modes:
 - **REST admin API** (FastAPI): CRUD for ActivityDefinitions, manual trigger,
  event type registry queries.
 - **Prometheus metrics**: Temporal SDK metrics exposed for scraping.
+- **Automation status surface**: deterministic, non-LLM status reporting via
+  `make automation-status` / `scripts/automation_status.py`, using repo-owned
+  evidence sources rather than coding assistant scheduler state.
 - **Operational runbook**: `docs/runbook.md`.

 ---
@@ -116,6 +119,10 @@ The two evaluation modes:
  runs on Railiance infrastructure (or Docker Compose for dev).
 - **End-user task UI** — tasks land in issue-core; presentation is separate.
 - **Synchronous request-response patterns** — Temporal is async-first.
+- **Coding assistant automation infrastructure** — assistant-provided reminders,
+  heartbeats, or scheduled jobs are not the execution or evidence authority for
+  activity-core automations. Assistants may run and summarize repo-native
+  commands only.

 ---

@@ -132,6 +139,8 @@ The two evaluation modes:
  commands.
 - You are replacing scattered bespoke cron jobs and manual coordination with
  a governed, observable automation layer.
+- You need to answer "how did our automations go since Friday?" from
+  deterministic repo-native evidence before any optional LLM summary.

 ---

--- a/docs/runbook.md
+++ b/docs/runbook.md
@@ -136,6 +136,47 @@ The response reports:
 - `schedules.deleted_orphans`
 - bounded `errors[]`

+## Automation status
+
+Use the repo-native status command to answer operator questions such as "how did
+our automations go since Friday?". This is the baseline evidence surface; LLMs
+or coding assistants may summarize the output, but they are not the scheduler or
+source of truth.
+
+```bash
+# Human-readable status. `friday` resolves in Europe/Berlin by default.
+make automation-status SINCE=friday
+
+# JSON for scripts or assistant summarization.
+make automation-status-json SINCE=2026-06-26
+```
+
+The command reads activity-core owned evidence only: ActivityDefinition files or
+DB rows, `activity_runs`, State Hub progress, working-memory report notes, and
+Temporal visibility when `TEMPORAL_HOST` is configured. Missing live sources are
+reported as warnings rather than hidden. It exits non-zero for real automation
+failures such as `missed`, `validation_failed`, or `sink_failed`.
+
+Useful knobs:
+
+```bash
+AUTOMATION_STATUS_TIMEOUT_SECONDS=10 make automation-status SINCE=friday
+make automation-status SINCE=2026-06-26 FORMAT=json
+make automation-status SINCE=2026-06-26 UNTIL=2026-06-27 ACTCORE_DB_URL=
+```
+
+Example distinction from the June 2026 daily triage evidence:
+
+```text
+- Activity 6fca51fa-387a-4fd0-bc4e-d62c29eb859a [validation_failed] expected=0 runs=0 evidence=2
+  evidence state_hub_progress event_type=daily_triage run=ebec6e41... output_validated=false validation_error=Unterminated string...
+  evidence state_hub_progress event_type=daily_triage run=c7370f9c... output_validated=false validation_error=Expecting ',' delimiter...
+```
+
+That means the schedule/report path left evidence, but the report was not a
+clean validated output. Disabled schedules, such as the gated weekly coding
+retro, are reported as `disabled` and are not counted as missed runs.
+
 `event_types` defaults to `false` for this endpoint because event-triggered
 definitions already reload from the DB in the event router path; opt in when
 the operator intentionally changed event type definition files:
--- a/k8s/railiance/20-runtime.yaml
+++ b/k8s/railiance/20-runtime.yaml
@@ -95,7 +95,8 @@ data:
      (strategic_value + time_criticality + risk_reduction +
      opportunity_enablement) / job_size. Use integer factor values from 1 to 5,
      round score to one decimal place, sort recommendations by rank, and return at
-      most 10 recommendations.
+      most 7 recommendations. If uncertain, emit fewer well-formed
+      recommendations rather than more.

      Curated digest:
      {context.daily_triage_digest}
@@ -432,7 +433,7 @@ data:
        "recommendations": {
          "type": "array",
          "minItems": 1,
-          "maxItems": 10,
+          "maxItems": 7,
          "items": {
            "type": "object",
            "required": ["rank", "candidate", "action", "why", "confidence", "wsjf"],
@@ -441,7 +442,7 @@ data:
              "rank": {
                "type": "integer",
                "minimum": 1,
-                "maximum": 10
+                "maximum": 7
              },
              "candidate": {
                "type": "string"
--- a/scripts/automation_status.py
+++ b/scripts/automation_status.py
@@ -0,0 +1,8 @@
+#!/usr/bin/env python3
+"""CLI wrapper for the repo-native automation status report."""
+
+from activity_core.automation_status import main
+
+
+if __name__ == "__main__":
+    raise SystemExit(main())
--- a/src/activity_core/activities.py
+++ b/src/activity_core/activities.py
@@ -366,6 +366,7 @@ async def evaluate_instructions(payload: dict) -> dict:
                "output_validated": result.output_validated,
                "review_required": result.review_required,
                "validation_error": result.validation_error,
+                "llm_response_metadata": result.llm_response_metadata,
            })
        for spec in result.tasks:
            task_specs.append({
--- a/src/activity_core/automation_status.py
+++ b/src/activity_core/automation_status.py
@@ -0,0 +1,811 @@
+"""Repo-native automation status reporting without LLM calls."""
+
+from __future__ import annotations
+
+import argparse
+import asyncio
+import json
+import os
+import sys
+import uuid
+from datetime import date, datetime, time, timedelta, timezone
+from pathlib import Path
+from typing import Any
+from zoneinfo import ZoneInfo
+
+import httpx
+import yaml
+from sqlalchemy import bindparam, text
+
+from activity_core.db import make_engine
+from activity_core.definition_parser import scan_and_parse
+from activity_core.schedule_manager import schedule_id
+from activity_core.sync_activity_definitions import ACTIVITY_DEFINITION_ID_NAMESPACE
+
+DEFAULT_TIMEZONE = "Europe/Berlin"
+DEFAULT_STATE_HUB_URL = "http://127.0.0.1:8000"
+DEFAULT_WORKING_MEMORY_DIR = "/home/worsch/the-custodian/memory/working"
+DEFAULT_TEMPORAL_NAMESPACE = "default"
+FAILURE_STATUSES = {"missed", "validation_failed", "sink_failed"}
+WEEKDAYS = {
+    "monday": 0,
+    "tuesday": 1,
+    "wednesday": 2,
+    "thursday": 3,
+    "friday": 4,
+    "saturday": 5,
+    "sunday": 6,
+}
+
+
+def parse_args(argv: list[str] | None = None) -> argparse.Namespace:
+    parser = argparse.ArgumentParser(
+        description="Report recent activity-core automation status without LLMs."
+    )
+    parser.add_argument(
+        "--since",
+        default=os.environ.get("SINCE", "today"),
+        help="Window start: YYYY-MM-DD, ISO datetime, today, yesterday, friday, or last-friday.",
+    )
+    parser.add_argument(
+        "--until",
+        default=os.environ.get("UNTIL"),
+        help="Window end. Defaults to now; date-only values use that day's end.",
+    )
+    parser.add_argument("--timezone", default=os.environ.get("AUTOMATION_STATUS_TIMEZONE", DEFAULT_TIMEZONE))
+    parser.add_argument("--activity-id", action="append", default=[])
+    parser.add_argument("--activity-name", action="append", default=[])
+    parser.add_argument("--db-url", default=os.environ.get("ACTCORE_DB_URL"))
+    parser.add_argument("--state-hub-url", default=os.environ.get("STATE_HUB_URL", DEFAULT_STATE_HUB_URL))
+    parser.add_argument("--working-memory-dir", default=os.environ.get("AUTOMATION_STATUS_WORKING_MEMORY_DIR", DEFAULT_WORKING_MEMORY_DIR))
+    parser.add_argument("--temporal-host", default=os.environ.get("TEMPORAL_HOST"))
+    parser.add_argument("--temporal-namespace", default=os.environ.get("TEMPORAL_NAMESPACE", DEFAULT_TEMPORAL_NAMESPACE))
+    parser.add_argument("--timeout-seconds", type=float, default=float(os.environ.get("AUTOMATION_STATUS_TIMEOUT_SECONDS", "5")))
+    parser.add_argument("--progress-limit", type=int, default=int(os.environ.get("AUTOMATION_STATUS_PROGRESS_LIMIT", "100")))
+    parser.add_argument("--progress-event-type", action="append", default=None, help="State Hub progress event type to read; repeatable. Use all for the unfiltered feed.")
+    parser.add_argument("--format", choices=("human", "json"), default=os.environ.get("FORMAT", "human"))
+    return parser.parse_args(argv)
+
+
+def resolve_window(since: str, until: str | None, timezone_name: str, *, now: datetime | None = None) -> dict[str, Any]:
+    tz = ZoneInfo(timezone_name)
+    base = now or datetime.now(tz=tz)
+    if base.tzinfo is None:
+        base = base.replace(tzinfo=tz)
+    base = base.astimezone(tz)
+    since_local = _resolve_time(since, tz, base, "start")
+    until_local = _resolve_time(until, tz, base, "end") if until else base
+    if since_local > until_local:
+        raise ValueError(f"since {since_local.isoformat()} is after until {until_local.isoformat()}")
+    return {
+        "since": since_local,
+        "until": until_local,
+        "since_utc": since_local.astimezone(timezone.utc),
+        "until_utc": until_local.astimezone(timezone.utc),
+        "timezone": timezone_name,
+    }
+
+
+def _resolve_time(raw: str, tz: ZoneInfo, now: datetime, boundary: str) -> datetime:
+    token = raw.strip().lower()
+    if token == "now":
+        return now
+    if token == "today":
+        return _day_boundary(now.date(), tz, boundary)
+    if token == "yesterday":
+        return _day_boundary(now.date() - timedelta(days=1), tz, boundary)
+    strict_last = False
+    if token.startswith("last-"):
+        strict_last = True
+        token = token.removeprefix("last-")
+    elif token.startswith("last "):
+        strict_last = True
+        token = token.removeprefix("last ")
+    if token in WEEKDAYS:
+        days_back = (now.weekday() - WEEKDAYS[token]) % 7
+        if strict_last and days_back == 0:
+            days_back = 7
+        return _day_boundary((now - timedelta(days=days_back)).date(), tz, boundary)
+    if "T" in raw or " " in raw:
+        value = datetime.fromisoformat(raw)
+        if value.tzinfo is None:
+            value = value.replace(tzinfo=tz)
+        return value.astimezone(tz)
+    try:
+        return _day_boundary(date.fromisoformat(raw), tz, boundary)
+    except ValueError as exc:
+        raise ValueError(f"unsupported time expression: {raw!r}") from exc
+
+
+def _day_boundary(day: date, tz: ZoneInfo, boundary: str) -> datetime:
+    if boundary == "end":
+        return datetime.combine(day, time.max, tzinfo=tz)
+    return datetime.combine(day, time.min, tzinfo=tz)
+
+
+def definition_uuid(raw_id: str) -> str:
+    try:
+        return str(uuid.UUID(raw_id))
+    except ValueError:
+        return str(uuid.uuid5(ACTIVITY_DEFINITION_ID_NAMESPACE, raw_id))
+
+
+def file_definitions() -> list[dict[str, Any]]:
+    records = []
+    for definition in scan_and_parse():
+        trigger = dict(definition.trigger_config or {})
+        trigger_type = str(trigger.get("trigger_type") or trigger.get("type") or "")
+        if trigger_type not in {"cron", "scheduled"}:
+            continue
+        records.append({
+            "id": definition_uuid(definition.id),
+            "name": definition.name,
+            "enabled": bool(definition.enabled),
+            "trigger_type": trigger_type,
+            "trigger_config": trigger,
+            "instructions": list(definition.instructions or []),
+            "source": "files",
+        })
+    return sorted(records, key=lambda item: item["name"])
+
+
+def filter_definitions(definitions: list[dict[str, Any]], ids: list[str], names: list[str]) -> list[dict[str, Any]]:
+    wanted_ids = {item.lower() for item in ids}
+    wanted_names = {item.lower() for item in names}
+    if not wanted_ids and not wanted_names:
+        return definitions
+    return [
+        item for item in definitions
+        if item["id"].lower() in wanted_ids or item["name"].lower() in wanted_names
+    ]
+
+
+def progress_event_types(args: argparse.Namespace) -> list[str | None]:
+    raw = args.progress_event_type
+    if raw is None:
+        env_value = os.environ.get("AUTOMATION_STATUS_PROGRESS_EVENT_TYPES")
+        raw = env_value.split(",") if env_value else ["daily_triage", "schedule_miss", "ops_inventory_probe"]
+    values = [item.strip() for item in raw if item and item.strip()]
+    return [None if item == "all" else item for item in values]
+
+
+def expected_fires(definition: dict[str, Any], window: dict[str, Any]) -> list[str]:
+    cfg = definition.get("trigger_config") or {}
+    if definition.get("trigger_type") == "scheduled":
+        at = coerce_datetime(cfg.get("at"))
+        if at and in_window(at, window):
+            return [at.astimezone(ZoneInfo(window["timezone"])).isoformat()]
+        return []
+    if definition.get("trigger_type") != "cron" or not cfg.get("cron_expression"):
+        return []
+    tz = ZoneInfo(str(cfg.get("timezone") or window["timezone"]))
+    start = window["since_utc"].astimezone(tz).replace(second=0, microsecond=0)
+    end = window["until_utc"].astimezone(tz).replace(second=0, microsecond=0)
+    minutes = int((end - start).total_seconds() // 60) + 1
+    if minutes < 0:
+        return []
+    if minutes > 366 * 24 * 60:
+        raise ValueError("automation-status refuses to estimate cron windows longer than 366 days")
+    cron = parse_cron(cfg["cron_expression"])
+    fires = []
+    current = start
+    for _ in range(minutes):
+        if cron_matches(current, cron):
+            fires.append(current.isoformat())
+        current += timedelta(minutes=1)
+    return fires
+
+
+def parse_cron(expr: str) -> tuple[set[int], set[int], set[int], set[int], set[int]]:
+    parts = expr.split()
+    if len(parts) != 5:
+        raise ValueError(f"unsupported cron expression {expr!r}: expected 5 fields")
+    return (
+        parse_cron_field(parts[0], 0, 59),
+        parse_cron_field(parts[1], 0, 23),
+        parse_cron_field(parts[2], 1, 31),
+        parse_cron_field(parts[3], 1, 12),
+        parse_cron_field(parts[4], 0, 7),
+    )
+
+
+def parse_cron_field(field: str, minimum: int, maximum: int) -> set[int]:
+    values: set[int] = set()
+    for chunk in field.split(","):
+        base, _, step_text = chunk.partition("/")
+        step = int(step_text) if step_text else 1
+        if base == "*":
+            start, stop = minimum, maximum
+        elif "-" in base:
+            left, right = base.split("-", 1)
+            start, stop = int(left), int(right)
+        else:
+            start = stop = int(base)
+        if step <= 0 or start < minimum or stop > maximum or start > stop:
+            raise ValueError(f"cron field {field!r} outside {minimum}-{maximum}")
+        values.update(range(start, stop + 1, step))
+    if minimum == 0 and maximum == 7 and 7 in values:
+        values.add(0)
+        values.discard(7)
+    return values
+
+
+def cron_matches(value: datetime, cron: tuple[set[int], set[int], set[int], set[int], set[int]]) -> bool:
+    minute, hour, day, month, weekday = cron
+    cron_weekday = (value.weekday() + 1) % 7
+    return value.minute in minute and value.hour in hour and value.day in day and value.month in month and cron_weekday in weekday
+
+
+async def db_definitions(db_url: str) -> list[dict[str, Any]]:
+    engine = make_engine(db_url)
+    try:
+        async with engine.connect() as conn:
+            result = await conn.execute(text(
+                "select id, name, enabled, trigger_type, trigger_config, instructions_json, version "
+                "from activity_definitions where trigger_type in ('cron', 'scheduled') order by name"
+            ))
+            return [{
+                "id": str(row["id"]),
+                "name": row["name"],
+                "enabled": bool(row["enabled"]),
+                "trigger_type": row["trigger_type"],
+                "trigger_config": dict(row["trigger_config"] or {}),
+                "instructions": list(row["instructions_json"] or []),
+                "version": row["version"],
+                "source": "db",
+            } for row in result.mappings().all()]
+    finally:
+        await engine.dispose()
+
+
+async def load_definitions(args: argparse.Namespace, warnings: list[str]) -> tuple[list[dict[str, Any]], dict[str, Any]]:
+    if args.db_url:
+        try:
+            return await db_definitions(args.db_url), {"status": "ok", "source": "db"}
+        except Exception as exc:  # pragma: no cover - depends on local DB driver/runtime
+            warning = f"definition DB unavailable; using file definitions: {exc}"
+            warnings.append(warning)
+            return file_definitions(), {"status": "degraded", "source": "files", "warning": warning}
+    warning = "ACTCORE_DB_URL is not set; using file definitions and skipping run-history DB checks"
+    warnings.append(warning)
+    return file_definitions(), {"status": "degraded", "source": "files", "warning": warning}
+
+
+async def load_runs(db_url: str | None, definitions: list[dict[str, Any]], window: dict[str, Any]) -> tuple[dict[str, list[dict[str, Any]]], dict[str, Any]]:
+    if not db_url:
+        return {}, {"status": "unavailable", "warning": "ACTCORE_DB_URL is not set"}
+    if not definitions:
+        return {}, {"status": "ok", "count": 0}
+    ids = [uuid.UUID(item["id"]) for item in definitions]
+    stmt = text(
+        "select run_id, activity_id, scheduled_for, fired_at, tasks_spawned, version_used from activity_runs "
+        "where activity_id in :ids and coalesce(scheduled_for, fired_at) >= :since "
+        "and coalesce(scheduled_for, fired_at) <= :until order by fired_at"
+    ).bindparams(bindparam("ids", expanding=True))
+    engine = make_engine(db_url)
+    try:
+        async with engine.connect() as conn:
+            result = await conn.execute(stmt, {"ids": ids, "since": window["since_utc"], "until": window["until_utc"]})
+            rows = result.mappings().all()
+    except Exception as exc:  # pragma: no cover - depends on local DB driver/runtime
+        return {}, {"status": "unavailable", "warning": f"activity_runs unavailable: {exc}"}
+    finally:
+        await engine.dispose()
+    grouped: dict[str, list[dict[str, Any]]] = {}
+    for row in rows:
+        record = {
+            "run_id": str(row["run_id"]),
+            "activity_id": str(row["activity_id"]),
+            "scheduled_for": iso(coerce_datetime(row["scheduled_for"])),
+            "fired_at": iso(coerce_datetime(row["fired_at"])),
+            "tasks_spawned": row["tasks_spawned"],
+            "version_used": row["version_used"],
+        }
+        grouped.setdefault(record["activity_id"], []).append(record)
+    return grouped, {"status": "ok", "count": len(rows)}
+
+
+async def load_spawn_validation(db_url: str | None, definitions: list[dict[str, Any]], window: dict[str, Any]) -> tuple[list[dict[str, Any]], dict[str, Any]]:
+    if not db_url:
+        return [], {"status": "unavailable", "warning": "ACTCORE_DB_URL is not set"}
+    if not definitions:
+        return [], {"status": "ok", "count": 0}
+    ids = [uuid.UUID(item["id"]) for item in definitions]
+    stmt = text(
+        "select activity_def_id, source_id, output_validated, created_at from task_spawn_log "
+        "where activity_def_id in :ids and created_at >= :since and created_at <= :until "
+        "and output_validated is not null"
+    ).bindparams(bindparam("ids", expanding=True))
+    engine = make_engine(db_url)
+    try:
+        async with engine.connect() as conn:
+            result = await conn.execute(stmt, {"ids": ids, "since": window["since_utc"], "until": window["until_utc"]})
+            rows = result.mappings().all()
+    except Exception as exc:  # pragma: no cover - depends on local DB driver/runtime
+        return [], {"status": "unavailable", "warning": f"task_spawn_log unavailable: {exc}"}
+    finally:
+        await engine.dispose()
+    return [{
+        "source": "task_spawn_log",
+        "activity_id": str(row["activity_def_id"]),
+        "created_at": iso(coerce_datetime(row["created_at"])),
+        "output_validated": bool(row["output_validated"]),
+        "summary": f"instruction {row['source_id']} task output validation",
+    } for row in rows], {"status": "ok", "count": len(rows)}
+
+
+def load_state_hub_progress(state_hub_url: str | None, window: dict[str, Any], *, limit: int, timeout_seconds: float, event_types: list[str | None] | None = None) -> tuple[list[dict[str, Any]], dict[str, Any]]:
+    if not state_hub_url:
+        return [], {"status": "unavailable", "warning": "STATE_HUB_URL is not set"}
+    event_types = event_types or [None]
+    payload: list[Any] = []
+    try:
+        for event_type in event_types:
+            params: dict[str, Any] = {"limit": limit}
+            if event_type:
+                params["event_type"] = event_type
+            response = httpx.get(f"{state_hub_url.rstrip('/')}/progress/", params=params, timeout=timeout_seconds)
+            response.raise_for_status()
+            items = response.json()
+            if not isinstance(items, list):
+                return [], {"status": "unavailable", "warning": "State Hub progress response was not a list"}
+            payload.extend(items)
+    except Exception as exc:
+        detail = str(exc) or exc.__class__.__name__
+        return [], {"status": "unavailable", "warning": f"State Hub progress unavailable: {detail}"}
+
+    evidence: list[dict[str, Any]] = []
+    seen: set[str] = set()
+    for item in payload:
+        if not isinstance(item, dict):
+            continue
+        item_id = str(item.get("id") or id(item))
+        if item_id in seen:
+            continue
+        seen.add(item_id)
+        detail = item.get("detail") if isinstance(item.get("detail"), dict) else {}
+        created_at = coerce_datetime(item.get("created_at"))
+        scheduled_for = coerce_datetime(detail.get("scheduled_for"))
+        event_time = scheduled_for or created_at
+        if event_time and not in_window(event_time, window):
+            continue
+        run_id = string_or_none(detail.get("activity_core_run_id"))
+        activity_id = string_or_none(detail.get("activity_id"))
+        if not run_id and not activity_id and item.get("event_type") != "schedule_miss":
+            continue
+        evidence.append({
+            "source": "state_hub_progress",
+            "activity_id": activity_id,
+            "run_id": run_id,
+            "scheduled_for": iso(scheduled_for),
+            "created_at": iso(created_at),
+            "event_type": string_or_none(item.get("event_type")),
+            "output_validated": bool_or_none(detail.get("output_validated")),
+            "validation_error": shorten(string_or_none(detail.get("validation_error"))),
+            "status": string_or_none(detail.get("status")),
+            "summary": shorten(string_or_none(item.get("summary"))),
+            "path": string_or_none(detail.get("working_memory_path")),
+        })
+    labels = [item or "all" for item in event_types]
+    return evidence, {"status": "ok", "count": len(evidence), "event_types": labels}
+
+
+def load_working_memory_evidence(working_memory_dir: str | None, window: dict[str, Any]) -> tuple[list[dict[str, Any]], dict[str, Any]]:
+    if not working_memory_dir:
+        return [], {"status": "unavailable", "warning": "working-memory directory is not configured"}
+    root = Path(working_memory_dir).expanduser()
+    if not root.exists():
+        return [], {"status": "unavailable", "warning": f"working-memory directory does not exist: {root}"}
+    evidence: list[dict[str, Any]] = []
+    for path in sorted(root.glob("*.md")):
+        meta = read_frontmatter(path)
+        if not meta or meta.get("source") != "activity-core":
+            continue
+        scheduled_for = coerce_datetime(meta.get("scheduled_for"))
+        created_at = coerce_datetime(meta.get("created"))
+        event_time = scheduled_for or created_at
+        if event_time and not in_window(event_time, window):
+            continue
+        evidence.append({
+            "source": "working_memory",
+            "activity_id": string_or_none(meta.get("activity_id")),
+            "run_id": string_or_none(meta.get("activity_core_run_id")),
+            "scheduled_for": iso(scheduled_for),
+            "created_at": iso(created_at),
+            "output_validated": bool_or_none(meta.get("output_validated")),
+            "path": str(path),
+            "summary": path.name,
+        })
+    return evidence, {"status": "ok", "count": len(evidence), "path": str(root)}
+
+
+def read_frontmatter(path: Path) -> dict[str, Any]:
+    try:
+        value = path.read_text(encoding="utf-8")
+    except OSError:
+        return {}
+    if not value.startswith("---\n"):
+        return {}
+    parts = value.split("---\n", 2)
+    if len(parts) < 3:
+        return {}
+    loaded = yaml.safe_load(parts[1])
+    return loaded if isinstance(loaded, dict) else {}
+
+
+async def load_temporal_visibility(temporal_host: str | None, namespace: str, definitions: list[dict[str, Any]], *, timeout_seconds: float) -> tuple[dict[str, dict[str, Any]], dict[str, Any]]:
+    if not temporal_host:
+        return {}, {"status": "skipped", "warning": "TEMPORAL_HOST is not set"}
+    try:
+        from temporalio.client import Client
+        client = await asyncio.wait_for(Client.connect(temporal_host, namespace=namespace), timeout=timeout_seconds)
+    except Exception as exc:
+        return {}, {"status": "unavailable", "warning": f"Temporal unavailable: {exc}"}
+    records: dict[str, dict[str, Any]] = {}
+    for definition in definitions:
+        sid = automation_schedule_id(definition)
+        try:
+            desc = await asyncio.wait_for(client.get_schedule_handle(sid).describe(), timeout=timeout_seconds)
+            state = getattr(getattr(desc, "schedule", None), "state", None)
+            info = getattr(desc, "info", None)
+            records[definition["id"]] = {
+                "schedule_id": sid,
+                "available": True,
+                "paused": getattr(state, "paused", None),
+                "missed_catchup_window": int(getattr(info, "num_actions_missed_catchup_window", 0) or 0),
+                "last_fired_at": iso(latest_recent_action_time(getattr(info, "recent_actions", None) or [])),
+                "workflows": await list_workflows(client, definition["id"], timeout_seconds),
+            }
+        except Exception as exc:
+            records[definition["id"]] = {"schedule_id": sid, "available": False, "warning": str(exc)}
+    return records, {"status": "ok", "count": len(records)}
+
+
+async def list_workflows(client: Any, activity_id: str, timeout_seconds: float) -> list[dict[str, Any]]:
+    async def collect() -> list[dict[str, Any]]:
+        workflows: list[dict[str, Any]] = []
+        async for item in client.list_workflows(query=f'ActivityId="{activity_id}"'):
+            workflows.append({
+                "id": item.id,
+                "run_id": item.run_id,
+                "status": str(item.status),
+                "start_time": iso(coerce_datetime(getattr(item, "start_time", None))),
+                "close_time": iso(coerce_datetime(getattr(item, "close_time", None))),
+            })
+            if len(workflows) >= 5:
+                break
+        return workflows
+    return await asyncio.wait_for(collect(), timeout=timeout_seconds)
+
+
+def latest_recent_action_time(actions: list[Any]) -> datetime | None:
+    times = [coerce_datetime(getattr(item, "scheduled_at", None) or getattr(item, "started_at", None)) for item in actions]
+    times = [item for item in times if item is not None]
+    return max(times) if times else None
+
+
+def automation_schedule_id(definition: dict[str, Any]) -> str:
+    activity_id = definition["id"]
+    if definition.get("trigger_type") == "scheduled":
+        return f"activity-schedule-{activity_id}-once"
+    return schedule_id(activity_id)
+
+
+async def build_report(args: argparse.Namespace) -> tuple[dict[str, Any], int]:
+    window = resolve_window(args.since, args.until, args.timezone)
+    timeout = max(float(args.timeout_seconds), 0.1)
+    warnings: list[str] = []
+    sources: dict[str, dict[str, Any]] = {}
+
+    try:
+        definitions, sources["definitions"] = await asyncio.wait_for(
+            load_definitions(args, warnings),
+            timeout=timeout,
+        )
+    except asyncio.TimeoutError:
+        warning = "definition DB timed out; using file definitions"
+        warnings.append(warning)
+        definitions = file_definitions()
+        sources["definitions"] = {"status": "degraded", "source": "files", "warning": warning}
+
+    definitions = filter_definitions(definitions, args.activity_id, args.activity_name)
+
+    try:
+        runs_by_activity, sources["activity_runs"] = await asyncio.wait_for(
+            load_runs(args.db_url, definitions, window),
+            timeout=timeout,
+        )
+    except asyncio.TimeoutError:
+        runs_by_activity = {}
+        sources["activity_runs"] = {"status": "unavailable", "warning": "activity_runs timed out"}
+
+    try:
+        spawn_evidence, sources["task_spawn_log"] = await asyncio.wait_for(
+            load_spawn_validation(args.db_url, definitions, window),
+            timeout=timeout,
+        )
+    except asyncio.TimeoutError:
+        spawn_evidence = []
+        sources["task_spawn_log"] = {"status": "unavailable", "warning": "task_spawn_log timed out"}
+
+    progress_evidence, sources["state_hub_progress"] = load_state_hub_progress(
+        args.state_hub_url,
+        window,
+        limit=args.progress_limit,
+        timeout_seconds=timeout,
+        event_types=progress_event_types(args),
+    )
+    wm_evidence, sources["working_memory"] = load_working_memory_evidence(args.working_memory_dir, window)
+    temporal_by_activity, sources["temporal"] = await load_temporal_visibility(
+        args.temporal_host,
+        args.temporal_namespace,
+        definitions,
+        timeout_seconds=timeout,
+    )
+    for source in sources.values():
+        if source.get("status") not in {"ok", "skipped"} and source.get("warning"):
+            warnings.append(str(source["warning"]))
+
+    all_evidence = progress_evidence + wm_evidence + spawn_evidence
+    definitions = add_evidence_only_definitions(definitions, all_evidence)
+    activities = []
+    runs_available = sources["activity_runs"].get("status") == "ok"
+    for definition in definitions:
+        runs = runs_by_activity.get(definition["id"], [])
+        activities.append(classify_activity(
+            definition,
+            window,
+            runs,
+            evidence_for_activity(definition, runs, all_evidence, window),
+            temporal_by_activity.get(definition["id"]),
+            expected_fires(definition, window),
+            runs_available=runs_available,
+        ))
+
+    report = {
+        "mode": "automation-status",
+        "generated_at": datetime.now(tz=timezone.utc).isoformat(),
+        "window": {
+            "since": window["since"].isoformat(),
+            "until": window["until"].isoformat(),
+            "timezone": window["timezone"],
+            "since_utc": window["since_utc"].isoformat(),
+            "until_utc": window["until_utc"].isoformat(),
+        },
+        "sources": sources,
+        "summary": summarize(activities),
+        "activities": activities,
+        "warnings": sorted(set(warnings)),
+    }
+    exit_code = 1 if any(item["status"] in FAILURE_STATUSES for item in activities) else 0
+    return report, exit_code
+
+
+def add_evidence_only_definitions(definitions: list[dict[str, Any]], evidence: list[dict[str, Any]]) -> list[dict[str, Any]]:
+    known = {item["id"] for item in definitions}
+    result = list(definitions)
+    for item in evidence:
+        activity_id = item.get("activity_id")
+        if not activity_id or activity_id in known:
+            continue
+        known.add(activity_id)
+        result.append({
+            "id": activity_id,
+            "name": f"Activity {activity_id}",
+            "enabled": True,
+            "trigger_type": "evidence",
+            "trigger_config": {},
+            "instructions": [],
+            "source": "evidence",
+        })
+    return result
+
+
+def evidence_for_activity(definition: dict[str, Any], runs: list[dict[str, Any]], all_evidence: list[dict[str, Any]], window: dict[str, Any]) -> list[dict[str, Any]]:
+    run_ids = {item["run_id"] for item in runs}
+    selected: list[dict[str, Any]] = []
+    for item in all_evidence:
+        if item.get("activity_id") == definition["id"] or (item.get("run_id") and item["run_id"] in run_ids):
+            selected.append(item)
+            continue
+        event_time = coerce_datetime(item.get("scheduled_for") or item.get("created_at"))
+        if item.get("activity_id") is None and item.get("run_id") in run_ids and (not event_time or in_window(event_time, window)):
+            selected.append(item)
+    return selected
+
+
+def classify_activity(definition: dict[str, Any], window: dict[str, Any], runs: list[dict[str, Any]], evidence: list[dict[str, Any]], temporal: dict[str, Any] | None, expected: list[str], *, runs_available: bool) -> dict[str, Any]:
+    warnings: list[str] = []
+    if not runs_available:
+        warnings.append("activity_runs source unavailable; missed-run verdict is unknown")
+    if temporal and not temporal.get("available", False) and temporal.get("warning"):
+        warnings.append(f"Temporal schedule visibility unavailable: {temporal['warning']}")
+    if temporal and temporal.get("paused") and definition.get("enabled"):
+        warnings.append("Temporal schedule is paused while ActivityDefinition is enabled")
+
+    workflows = temporal.get("workflows", []) if temporal else []
+    if not definition.get("enabled"):
+        status = "disabled"
+    elif any(item.get("output_validated") is False for item in evidence):
+        status = "validation_failed"
+    elif any(workflow_status_matches(item, {"RUNNING"}) for item in workflows):
+        status = "running"
+    elif any(workflow_status_matches(item, {"FAILED", "TIMED_OUT", "TERMINATED"}) for item in workflows):
+        status = "retrying"
+    elif temporal and int(temporal.get("missed_catchup_window") or 0) > 0:
+        status = "missed"
+    elif runs_available and definition.get("enabled") and len(expected) > len(runs):
+        status = "missed"
+    elif runs or evidence:
+        status = "completed"
+    elif expected:
+        status = "unknown"
+    else:
+        status = "no_due"
+
+    return {
+        "id": definition["id"],
+        "name": definition["name"],
+        "enabled": definition["enabled"],
+        "trigger_type": definition["trigger_type"],
+        "trigger_config": public_trigger_config(definition.get("trigger_config", {})),
+        "schedule_id": automation_schedule_id(definition),
+        "definition_source": definition.get("source"),
+        "status": status,
+        "expected_fires": expected,
+        "expected_fire_count": len(expected),
+        "observed_run_count": len(runs),
+        "runs": runs,
+        "evidence": evidence,
+        "temporal": temporal,
+        "warnings": warnings,
+        "window": {"since": window["since"].isoformat(), "until": window["until"].isoformat(), "timezone": window["timezone"]},
+    }
+
+
+def workflow_status_matches(workflow: dict[str, Any], names: set[str]) -> bool:
+    value = str(workflow.get("status") or "").upper()
+    return any(name in value for name in names)
+
+
+def public_trigger_config(config: dict[str, Any]) -> dict[str, Any]:
+    allowed = {"trigger_type", "cron_expression", "timezone", "misfire_policy", "catchup_window_seconds", "jitter_seconds", "at"}
+    return {key: config.get(key) for key in allowed if key in config}
+
+
+def summarize(activities: list[dict[str, Any]]) -> dict[str, Any]:
+    counts: dict[str, int] = {}
+    for item in activities:
+        counts[item["status"]] = counts.get(item["status"], 0) + 1
+    return {"activity_count": len(activities), "status_counts": counts, "failure_count": sum(counts.get(status, 0) for status in FAILURE_STATUSES)}
+
+
+def render_human(report: dict[str, Any]) -> str:
+    window = report["window"]
+    lines = [
+        f"Automation status {window['since']} -> {window['until']} ({window['timezone']})",
+        render_source_line(report["sources"]),
+        render_summary_line(report["summary"]),
+        "",
+    ]
+    for activity in report["activities"]:
+        lines.append(f"- {activity['name']} [{activity['status']}] expected={activity['expected_fire_count']} runs={activity['observed_run_count']} evidence={len(activity['evidence'])}")
+        if activity["expected_fires"]:
+            suffix = " ..." if len(activity["expected_fires"]) > 3 else ""
+            lines.append("  expected fires: " + ", ".join(activity["expected_fires"][:3]) + suffix)
+        for run in activity["runs"][:3]:
+            lines.append(f"  run {run['run_id']} scheduled_for={run['scheduled_for']} fired_at={run['fired_at']} tasks={run['tasks_spawned']}")
+        for evidence in activity["evidence"][:3]:
+            detail = f"  evidence {evidence['source']}"
+            if evidence.get("event_type"):
+                detail += f" event_type={evidence['event_type']}"
+            if evidence.get("run_id"):
+                detail += f" run={evidence['run_id']}"
+            if evidence.get("output_validated") is not None:
+                detail += f" output_validated={str(evidence['output_validated']).lower()}"
+            if evidence.get("validation_error"):
+                detail += f" validation_error={evidence['validation_error']}"
+            lines.append(detail)
+        for warning in activity["warnings"]:
+            lines.append(f"  warning: {warning}")
+    if report["warnings"]:
+        lines.extend(["", "Warnings:"])
+        lines.extend(f"- {warning}" for warning in report["warnings"])
+    return "\n".join(lines)
+
+
+def render_source_line(sources: dict[str, dict[str, Any]]) -> str:
+    parts = []
+    for name, source in sources.items():
+        label = f"{name}={source.get('status', 'unknown')}"
+        if source.get("source"):
+            label += f"/{source['source']}"
+        parts.append(label)
+    return "Sources: " + ", ".join(parts)
+
+
+def render_summary_line(summary: dict[str, Any]) -> str:
+    counts = summary.get("status_counts") or {}
+    if not counts:
+        return "Summary: no scheduled automations found"
+    return "Summary: " + ", ".join(f"{status}={count}" for status, count in sorted(counts.items()))
+
+
+def coerce_datetime(value: Any) -> datetime | None:
+    if value is None:
+        return None
+    if isinstance(value, datetime):
+        result = value
+    elif isinstance(value, date):
+        result = datetime.combine(value, time.min)
+    elif isinstance(value, str):
+        raw = value.strip()
+        if not raw:
+            return None
+        if raw.endswith("Z"):
+            raw = raw[:-1] + "+00:00"
+        try:
+            result = datetime.fromisoformat(raw)
+        except ValueError:
+            return None
+    else:
+        return None
+    if result.tzinfo is None:
+        result = result.replace(tzinfo=timezone.utc)
+    return result.astimezone(timezone.utc)
+
+
+def in_window(value: datetime, window: dict[str, Any]) -> bool:
+    instant = coerce_datetime(value)
+    return bool(instant and window["since_utc"] <= instant <= window["until_utc"])
+
+
+def iso(value: datetime | None) -> str | None:
+    return value.isoformat() if value else None
+
+
+def string_or_none(value: Any) -> str | None:
+    if value is None:
+        return None
+    result = str(value)
+    return result or None
+
+
+def bool_or_none(value: Any) -> bool | None:
+    if value is None or isinstance(value, bool):
+        return value
+    if isinstance(value, str):
+        lowered = value.strip().lower()
+        if lowered in {"true", "yes", "1"}:
+            return True
+        if lowered in {"false", "no", "0"}:
+            return False
+    return None
+
+
+def shorten(value: str | None, limit: int = 240) -> str | None:
+    if value is None:
+        return None
+    return value if len(value) <= limit else value[: limit - 3] + "..."
+
+
+async def async_main(argv: list[str] | None = None) -> int:
+    args = parse_args(argv)
+    try:
+        report, exit_code = await build_report(args)
+    except ValueError as exc:
+        print(f"automation_status: {exc}", file=sys.stderr)
+        return 2
+    if args.format == "json":
+        print(json.dumps(report, indent=2, sort_keys=True))
+    else:
+        print(render_human(report))
+    return exit_code
+
+
+def main(argv: list[str] | None = None) -> int:
+    return asyncio.run(async_main(argv))
+
+
+if __name__ == "__main__":
+    raise SystemExit(main())
--- a/src/activity_core/llm_client.py
+++ b/src/activity_core/llm_client.py
@@ -17,6 +17,8 @@ import httpx
 class DisabledLLMClient:
    """LLM client used when no llm-connect endpoint is configured."""

+    last_response_metadata: dict[str, Any] | None = None
+
    def complete(
        self,
        prompt: str,
@@ -32,6 +34,7 @@ class LLMConnectClient:
    def __init__(self, base_url: str, timeout_seconds: float = 300.0) -> None:
        self.base_url = base_url.rstrip("/")
        self.timeout_seconds = timeout_seconds
+        self.last_response_metadata: dict[str, Any] | None = None

    def complete(
        self,
@@ -54,12 +57,48 @@ class LLMConnectClient:
        )
        resp.raise_for_status()
        data = resp.json()
+        self.last_response_metadata = _extract_response_metadata(data)
        content = data.get("content")
        if not isinstance(content, str):
            raise ValueError("llm-connect response missing string content")
        return content


+_SAFE_RESPONSE_METADATA_KEYS = {
+    "finish_reason",
+    "usage",
+    "model",
+    "model_name",
+    "provider",
+    "request_id",
+    "response_id",
+    "trace_id",
+    "latency_ms",
+    "duration_ms",
+    "elapsed_ms",
+    "created",
+    "created_at",
+}
+
+
+def _extract_response_metadata(data: dict[str, Any]) -> dict[str, Any]:
+    """Keep non-secret llm-connect diagnostics alongside the returned content."""
+    return {
+        key: value for key, value in data.items()
+        if key in _SAFE_RESPONSE_METADATA_KEYS and _json_safe(value)
+    }
+
+
+def _json_safe(value: Any) -> bool:
+    try:
+        import json
+
+        json.dumps(value)
+    except (TypeError, ValueError):
+        return False
+    return True
+
+
 def get_llm_client() -> DisabledLLMClient | LLMConnectClient:
    base_url = os.environ.get("LLM_CONNECT_URL", "").strip()
    if not base_url:
--- a/src/activity_core/report_sinks.py
+++ b/src/activity_core/report_sinks.py
@@ -136,6 +136,7 @@ def _post_state_hub_progress(
            "output_validated": report_entry.get("output_validated"),
            "review_required": report_entry.get("review_required"),
            "validation_error": report_entry.get("validation_error"),
+            "llm_response_metadata": report_entry.get("llm_response_metadata"),
            "report": report,
        },
    }
@@ -224,6 +225,16 @@ def _render_markdown(
        lines.extend([summary, ""])
    if validation_error:
        lines.extend(["Validation error:", "", f"`{validation_error}`", ""])
+    metadata = report_entry.get("llm_response_metadata")
+    if metadata:
+        lines.extend([
+            "LLM response metadata:",
+            "",
+            "```json",
+            json.dumps(metadata, indent=2, sort_keys=True),
+            "```",
+            "",
+        ])
    lines.extend([
        "```json",
        json.dumps(report, indent=2, sort_keys=True),
--- a/src/activity_core/rules/executor.py
+++ b/src/activity_core/rules/executor.py
@@ -41,6 +41,7 @@ class InstructionResult:
    review_required: bool = False
    condition_matched: str | None = None
    validation_error: str | None = None
+    llm_response_metadata: dict[str, Any] | None = None


 def _resolve_path(obj: Any, path: str) -> Any:
@@ -167,12 +168,14 @@ def _execute(

    # Step 3 — call LLM
    raw_output = llm_client.complete(rendered, model=instr.model, config=llm_config)
+    response_metadata = _llm_response_metadata(llm_client)

    # Step 4 — validate and optionally retry
    task_specs, report, error = _validate_output(raw_output, instr, allow_list)
    if error:
        retry_prompt = rendered + f"\n\nPrevious output was invalid: {error}\nPlease fix."
        raw_output = llm_client.complete(retry_prompt, model=instr.model, config=llm_config)
+        response_metadata = _llm_response_metadata(llm_client)
        task_specs, report, error = _validate_output(raw_output, instr, allow_list)
        if error:
            # Truncate to keep log volume bounded but long enough to see the
@@ -188,10 +191,13 @@ def _execute(
            # loss. One bad item should cost one item, not the whole report.
            recovered = _resilient_report(
                instr, raw_output, error, prompt_hash, allow_list,
+                response_metadata=response_metadata,
            )
            if recovered is not None:
                return recovered
-            failure_report = _invalid_output_report(instr, error, raw_output)
+            failure_report = _invalid_output_report(
+                instr, error, raw_output, response_metadata=response_metadata,
+            )
            if failure_report is not None:
                return InstructionResult(
                    tasks=[],
@@ -202,6 +208,7 @@ def _execute(
                    review_required=True,
                    condition_matched=instr.condition or None,
                    validation_error=error,
+                    llm_response_metadata=response_metadata,
                )
            return _empty_result(instr, prompt_hash=prompt_hash, validation_error=error)

@@ -213,6 +220,7 @@ def _execute(
        output_validated=True,
        review_required=bool(getattr(instr, "review_required", False)),
        condition_matched=instr.condition or None,
+        llm_response_metadata=response_metadata,
    )


@@ -252,6 +260,7 @@ def _invalid_output_report(
    instr: Any,
    validation_error: str,
    raw_output: Any,
+    response_metadata: dict[str, Any] | None = None,
 ) -> dict[str, Any] | None:
    """Build a durable diagnostic report for invalid report-sink output.

@@ -269,7 +278,7 @@ def _invalid_output_report(
            partial_output = _parse_json_output(raw_output)
        except json.JSONDecodeError:
            partial_output = None
-            raw_preview = raw_output[:4000]
+            raw_preview = raw_output[:_RAW_OUTPUT_PREVIEW_LIMIT]
    else:
        partial_output = raw_output

@@ -281,6 +290,8 @@ def _invalid_output_report(
        "status": "validation_failed",
        "validation_error": validation_error,
    }
+    if response_metadata:
+        report["llm_response_metadata"] = response_metadata
    if isinstance(partial_output, dict):
        if isinstance(partial_output.get("summary"), str):
            report["partial_summary"] = partial_output["summary"]
@@ -310,9 +321,43 @@ _SNIPPET_LIMIT = 200
 # fail the whole report or flow unbounded into a downstream consumer.
 _MAX_STRING_LEN = 4000
 _MAX_DEPTH = 8
+_RAW_OUTPUT_PREVIEW_LIMIT = 12000
 _SUMMARY_RE = re.compile(r'"summary"\s*:\s*"((?:[^"\\]|\\.)*)"')


+_SAFE_RESPONSE_METADATA_KEYS = {
+    "finish_reason",
+    "usage",
+    "model",
+    "model_name",
+    "provider",
+    "request_id",
+    "response_id",
+    "trace_id",
+    "latency_ms",
+    "duration_ms",
+    "elapsed_ms",
+    "created",
+    "created_at",
+}
+
+
+def _llm_response_metadata(llm_client: Any) -> dict[str, Any] | None:
+    metadata = getattr(llm_client, "last_response_metadata", None)
+    if not isinstance(metadata, dict) or not metadata:
+        return None
+    safe: dict[str, Any] = {}
+    for key, value in metadata.items():
+        if key not in _SAFE_RESPONSE_METADATA_KEYS:
+            continue
+        try:
+            json.dumps(value)
+        except (TypeError, ValueError):
+            continue
+        safe[str(key)] = value
+    return safe or None
+
+
 def _snippet(value: Any) -> str:
    text = value if isinstance(value, str) else json.dumps(value, default=str)
    return text[:_SNIPPET_LIMIT]
@@ -561,6 +606,7 @@ def _resilient_report(
    original_error: str,
    prompt_hash: str | None,
    allow_list: set[str] | None = None,
+    response_metadata: dict[str, Any] | None = None,
 ) -> InstructionResult | None:
    """Recover a partial-but-usable report from output that failed validation.

@@ -590,6 +636,8 @@ def _resilient_report(
        "quarantined_items": quarantined[:_QUARANTINE_LIMIT],
        "recovery_note": f"original validation error: {original_error}",
    }
+    if response_metadata:
+        report["llm_response_metadata"] = response_metadata
    logger.warning(
        "instruction_output_recovered: instruction=%r, kept=%d, quarantined=%d",
        getattr(instr, "id", None), len(valid), len(quarantined),
@@ -603,6 +651,7 @@ def _resilient_report(
        review_required=True,
        condition_matched=getattr(instr, "condition", "") or None,
        validation_error=None,
+        llm_response_metadata=response_metadata,
    )


--- a/tests/rules/test_executor.py
+++ b/tests/rules/test_executor.py
@@ -573,6 +573,47 @@ def test_resilient_recovery_against_real_2026_06_26_fixture():
    assert all("rank" in rec and "candidate" in rec for rec in result.report["recommendations"])


+
+class _MetadataBadLLM:
+    def __init__(self) -> None:
+        self.call_count = 0
+        self.last_response_metadata: dict[str, Any] | None = None
+
+    def complete(
+        self,
+        prompt: str,
+        model: str = "",
+        config: dict | None = None,
+    ) -> str:
+        self.call_count += 1
+        self.last_response_metadata = {
+            "finish_reason": "length",
+            "usage": {"input_tokens": 1100, "output_tokens": 1200},
+        }
+        return ("x" * 9000) + "{"
+
+
+def test_invalid_report_preserves_response_metadata_and_long_preview():
+    llm = _MetadataBadLLM()
+    instr = _instr(
+        id="daily-triage-report",
+        prompt="Report.",
+        trusted_fields=[],
+        report_sinks=[{"type": "working-memory", "path": "/tmp"}],
+    )
+
+    result = execute_instruction_with_audit(instr, _Event(), {}, llm)
+
+    assert llm.call_count == 2
+    assert result.output_validated is False
+    assert result.llm_response_metadata == {
+        "finish_reason": "length",
+        "usage": {"input_tokens": 1100, "output_tokens": 1200},
+    }
+    assert result.report["llm_response_metadata"] == result.llm_response_metadata
+    assert len(result.report["raw_output_preview"]) > 4000
+
+
 def test_execute_instruction_with_audit_preserves_invalid_report_with_sinks(
    tmp_path,
    monkeypatch,
--- a/tests/test_automation_status.py
+++ b/tests/test_automation_status.py
@@ -0,0 +1,184 @@
+from __future__ import annotations
+
+from datetime import datetime
+from pathlib import Path
+from zoneinfo import ZoneInfo
+
+from activity_core import automation_status as status
+
+ACTIVITY_ID = "00000000-0000-0000-0000-000000000123"
+
+
+def _window():
+    return status.resolve_window(
+        "2026-06-26",
+        "2026-06-29",
+        "Europe/Berlin",
+    )
+
+
+def _definition(enabled: bool = True):
+    return {
+        "id": ACTIVITY_ID,
+        "name": "Daily Check",
+        "enabled": enabled,
+        "trigger_type": "cron",
+        "trigger_config": {
+            "trigger_type": "cron",
+            "cron_expression": "0 9 * * *",
+            "timezone": "Europe/Berlin",
+            "misfire_policy": "skip",
+        },
+        "source": "test",
+    }
+
+
+def test_friday_shortcut_resolves_to_previous_friday_start() -> None:
+    now = datetime(2026, 6, 29, 12, 0, tzinfo=ZoneInfo("Europe/Berlin"))
+
+    window = status.resolve_window("friday", None, "Europe/Berlin", now=now)
+
+    assert window["since"].isoformat() == "2026-06-26T00:00:00+02:00"
+    assert window["until"].isoformat() == "2026-06-29T12:00:00+02:00"
+
+
+def test_expected_fires_for_simple_cron_window() -> None:
+    fires = status.expected_fires(_definition(), _window())
+
+    assert fires == [
+        "2026-06-26T09:00:00+02:00",
+        "2026-06-27T09:00:00+02:00",
+        "2026-06-28T09:00:00+02:00",
+        "2026-06-29T09:00:00+02:00",
+    ]
+
+
+def test_completed_when_expected_run_exists() -> None:
+    run = {
+        "run_id": "run-1",
+        "activity_id": ACTIVITY_ID,
+        "scheduled_for": "2026-06-26T07:00:00+00:00",
+        "fired_at": "2026-06-26T07:00:10+00:00",
+        "tasks_spawned": 1,
+    }
+
+    report = status.classify_activity(
+        _definition(),
+        _window(),
+        [run],
+        [{"source": "state_hub_progress", "run_id": "run-1", "output_validated": True}],
+        None,
+        ["2026-06-26T09:00:00+02:00"],
+        runs_available=True,
+    )
+
+    assert report["status"] == "completed"
+
+
+def test_validation_failure_wins_over_completed_run() -> None:
+    run = {"run_id": "run-1", "activity_id": ACTIVITY_ID, "scheduled_for": None, "fired_at": "2026-06-26T07:00:10+00:00"}
+
+    report = status.classify_activity(
+        _definition(),
+        _window(),
+        [run],
+        [{"source": "working_memory", "run_id": "run-1", "output_validated": False}],
+        None,
+        ["2026-06-26T09:00:00+02:00"],
+        runs_available=True,
+    )
+
+    assert report["status"] == "validation_failed"
+
+
+def test_missed_when_expected_fire_has_no_run_and_runs_available() -> None:
+    report = status.classify_activity(
+        _definition(),
+        _window(),
+        [],
+        [],
+        None,
+        ["2026-06-26T09:00:00+02:00"],
+        runs_available=True,
+    )
+
+    assert report["status"] == "missed"
+
+
+def test_disabled_schedule_is_not_counted_as_missed() -> None:
+    report = status.classify_activity(
+        _definition(enabled=False),
+        _window(),
+        [],
+        [],
+        None,
+        ["2026-06-26T09:00:00+02:00"],
+        runs_available=True,
+    )
+
+    assert report["status"] == "disabled"
+
+
+def test_scheduled_definition_reports_one_shot_schedule_id() -> None:
+    definition = {
+        "id": ACTIVITY_ID,
+        "name": "One Shot",
+        "enabled": True,
+        "trigger_type": "scheduled",
+        "trigger_config": {
+            "trigger_type": "scheduled",
+            "at": "2026-06-26T09:00:00+02:00",
+            "timezone": "Europe/Berlin",
+        },
+        "source": "test",
+    }
+
+    report = status.classify_activity(
+        definition,
+        _window(),
+        [],
+        [],
+        None,
+        ["2026-06-26T09:00:00+02:00"],
+        runs_available=False,
+    )
+
+    assert status.automation_schedule_id(_definition()) == f"activity-schedule-{ACTIVITY_ID}"
+    assert report["schedule_id"] == f"activity-schedule-{ACTIVITY_ID}-once"
+
+
+def test_partial_source_availability_is_unknown_not_missed() -> None:
+    report = status.classify_activity(
+        _definition(),
+        _window(),
+        [],
+        [],
+        None,
+        ["2026-06-26T09:00:00+02:00"],
+        runs_available=False,
+    )
+
+    assert report["status"] == "unknown"
+    assert "missed-run verdict is unknown" in report["warnings"][0]
+
+
+def test_working_memory_frontmatter_evidence(tmp_path: Path) -> None:
+    note = tmp_path / "daily-triage-2026-06-26-run.md"
+    note.write_text(
+        "---\n"
+        "source: activity-core\n"
+        f"activity_id: {ACTIVITY_ID}\n"
+        "activity_core_run_id: run-1\n"
+        "scheduled_for: 2026-06-26T07:00:00+00:00\n"
+        "output_validated: false\n"
+        "created: 2026-06-26T07:01:00+00:00\n"
+        "---\n"
+        "body\n",
+        encoding="utf-8",
+    )
+
+    evidence, source = status.load_working_memory_evidence(str(tmp_path), _window())
+
+    assert source["status"] == "ok"
+    assert evidence[0]["run_id"] == "run-1"
+    assert evidence[0]["output_validated"] is False
--- a/tests/test_llm_client.py
+++ b/tests/test_llm_client.py
@@ -13,7 +13,12 @@ def test_llm_connect_client_forwards_run_config(monkeypatch) -> None:
            pass

        def json(self) -> dict:
-            return {"content": '{"summary":"ok","recommendations":[]}'}
+            return {
+                "content": '{"summary":"ok","recommendations":[]}',
+                "finish_reason": "stop",
+                "usage": {"input_tokens": 10, "output_tokens": 20},
+                "raw_response": {"provider_blob": "not persisted"},
+            }

    def fake_post(url: str, json: dict, timeout: float) -> Response:
        captured["url"] = url
@@ -50,3 +55,7 @@ def test_llm_connect_client_forwards_run_config(monkeypatch) -> None:
            "timeout_seconds": 42,
        },
    }
+    assert client.last_response_metadata == {
+        "finish_reason": "stop",
+        "usage": {"input_tokens": 10, "output_tokens": 20},
+    }
--- a/tests/test_railiance_ops_inventory_wiring.py
+++ b/tests/test_railiance_ops_inventory_wiring.py
@@ -93,12 +93,21 @@ def test_external_configmap_projects_enabled_daily_wsjf_definition(tmp_path) ->
    assert definition.trigger_config["cron_expression"] == "20 7 * * *"
    assert definition.trigger_config["timezone"] == "Europe/Berlin"
    assert instruction["id"] == "daily-triage-report"
+    assert instruction["max_tokens"] == 1800
+    assert "most 7 recommendations" in instruction["prompt"]
+    assert "fewer well-formed" in instruction["prompt"]
    assert instruction["output_schema"] == (
        "/etc/activity-core/schemas/daily-triage-report.json"
    )
    assert instruction["report_sinks"][0]["type"] == "working-memory"
    assert instruction["report_sinks"][1]["event_type"] == "daily_triage"

+    schema = _by_kind_name("ConfigMap", "actcore-report-schemas")
+    daily_schema = yaml.safe_load(schema["data"]["daily-triage-report.json"])
+    recommendations = daily_schema["properties"]["recommendations"]
+    assert recommendations["maxItems"] == 7
+    assert recommendations["items"]["properties"]["rank"]["maximum"] == 7
+

 def test_ops_inventory_configmap_contains_probeable_inventory() -> None:
    config = _by_kind_name("ConfigMap", "actcore-ops-service-inventory")
--- a/tests/test_report_sinks.py
+++ b/tests/test_report_sinks.py
@@ -37,6 +37,10 @@ def _payload(sinks: list[dict[str, Any]]) -> dict[str, Any]:
                "output_validated": True,
                "review_required": False,
                "validation_error": None,
+                "llm_response_metadata": {
+                    "finish_reason": "stop",
+                    "usage": {"output_tokens": 50},
+                },
            }
        ],
    }
@@ -62,6 +66,8 @@ def test_working_memory_sink_writes_idempotently(tmp_path) -> None:
    assert "output_validated: true" in text
    assert "review_required: false" in text
    assert "model: test-model" in text
+    assert "LLM response metadata:" in text
+    assert '"finish_reason": "stop"' in text
    assert "State Hub has loose ends." in text


@@ -113,6 +119,10 @@ def test_state_hub_progress_sink_posts(monkeypatch) -> None:
    assert posts[0]["json"]["detail"]["activity_core_run_id"] == payload_run_id()
    assert posts[0]["json"]["detail"]["output_validated"] is True
    assert posts[0]["json"]["detail"]["review_required"] is False
+    assert posts[0]["json"]["detail"]["llm_response_metadata"] == {
+        "finish_reason": "stop",
+        "usage": {"output_tokens": 50},
+    }


 def test_state_hub_progress_includes_prior_working_memory_path(
--- a/workplans/ACTIVITY-WP-0006-post-triage-operational-hardening.md
+++ b/workplans/ACTIVITY-WP-0006-post-triage-operational-hardening.md
@@ -4,11 +4,11 @@ type: workplan
 title: "Post-triage operational hardening"
 domain: custodian
 repo: activity-core
-status: active
+status: finished
 owner: codex
 topic_slug: custodian
 created: "2026-06-03"
-updated: "2026-06-27"
+updated: "2026-06-30"
 state_hub_workstream_id: "5646e13a-13af-4724-bca6-3c0d86f96733"
 ---

@@ -104,7 +104,7 @@ and emitted a validated `daily_triage` report plus working-memory note.

 ```task
 id: ACTIVITY-WP-0006-T03
-status: wait
+status: done
 priority: medium
 state_hub_task_id: "7cbf0a35-71a1-47ac-afc2-f51ad2180fd0"
 ```
@@ -203,6 +203,27 @@ ACTIVITY-WP-0016 output-robustness bundle and runtime prompt/token changes, not
 a missing schedule. T03 stays wait until a post-deployment smoke passes and three
 new clean scheduled runs are collected.

+2026-06-30 early checkpoint: two new clean scheduled runs exist after the
+validation failures. State Hub daily_triage progress shows 2026-06-28
+05:20:51Z run `6a44d6dd-3f02-53f2-a5d8-d42b76b0ef98` and 2026-06-29
+05:20:49Z run `1dfb47c9-07bf-551b-b778-1d21a40bd95c`, both with
+`output_validated=true` and working-memory notes written. The current local time
+was 2026-06-30 01:37 Europe/Berlin, before the expected 07:20 Berlin scheduled
+fire, so the three-clean-run gate cannot close yet. Recheck after 2026-06-30
+05:20Z; if that scheduled run validates, the clean streak is 06-28 / 06-29 /
+06-30 and T03 can close with calibration feedback.
+
+2026-06-30 closeout: the 07:20 Berlin scheduled run fired at 05:20:50Z as run
+`ac3d71a0-2f8f-50df-b3ce-7c60c2abb5c5` with `output_validated=true` and a
+working-memory note written. The post-failure clean streak is now complete:
+2026-06-28 (`6a44d6dd`), 2026-06-29 (`1dfb47c9`), and 2026-06-30 (`ac3d71a0`).
+Calibration feedback: the scheduler, worker, llm-connect route, State Hub sink,
+and working-memory sink are stable again; the recommendations were operationally
+useful but too dense at 10 items, repeatedly emphasizing human-dependency and
+infrastructure-unblock work. ACTIVITY-WP-0016 now owns the density/contract fix:
+Railiance runtime projection was aligned to a top-7 contract so the next live
+run can prove the bounded output posture. T03 is done.
+
 ## Rule Action Contract Documentation

 ```task
--- a/workplans/ACTIVITY-WP-0016-llm-output-robustness-trust-boundary.md
+++ b/workplans/ACTIVITY-WP-0016-llm-output-robustness-trust-boundary.md
@@ -8,7 +8,7 @@ status: active
 owner: codex
 topic_slug: custodian
 created: "2026-06-26"
-updated: "2026-06-27"
+updated: "2026-06-30"
 state_hub_workstream_id: "4ef0d53b-1777-41ae-80c6-1b69fdb34726"
 ---

@@ -144,11 +144,21 @@ Done when:
  `tests/fixtures/wp0016/daily_triage_2026-06-26_validation_failure.partial.json`
  (the 4000-char preview + validation error; full payload pending the remote pull).

+2026-06-30 local retention hardening: activity-core now preserves future
+llm-connect diagnostic metadata instead of dropping it at the client boundary.
+`LLMConnectClient.complete()` still returns the content string for compatibility,
+but records safe non-secret response fields such as `finish_reason` and `usage`
+on `last_response_metadata`; the executor copies that into report artifacts,
+State Hub progress detail, and working-memory notes. Invalid report raw previews
+were raised from 4000 to 12000 chars. This does not recover the historical
+06-26 full payload or producer-side `finish_reason`, so T01 remains wait on the
+remote llm-connect log pull, but the retention gap is closed for future failures.
+
 ## Schema + Prompt Redesign For Error Locality

 ```task
 id: ACTIVITY-WP-0016-T02
-status: progress
+status: done
 priority: high
 state_hub_task_id: "ae67ca8c-ee01-4a8d-9e8a-a0a36c999758"
 ```
@@ -209,6 +219,21 @@ Apply there:
 4. State the value vocabularies (`action`, `confidence`) the T04 guardrails will
   check.

+2026-06-30 live evidence check: the 2026-06-28 and 2026-06-29 scheduled
+`daily_triage` events validated successfully, which shows the runtime is no
+longer failing every day. However, the preserved State Hub reports still contain
+10 recommendations, not the requested bounded top-N of 7 / framed item contract.
+Treat that as evidence that the runtime-projected prompt/schema/max-token bundle
+has not fully absorbed the T02 handoff yet.
+
+2026-06-30 source projection closeout: patched `k8s/railiance/20-runtime.yaml`
+so the projected `daily-statehub-wsjf-triage.md` prompt now says at most 7
+recommendations and instructs the model to emit fewer well-formed items rather
+than more. The projected `daily-triage-report.json` now has `maxItems: 7` and
+`rank.maximum: 7`, aligned with the repo schema. `max_tokens: 1800` remains as
+headroom for the bounded report. T02 is done in source; live deployment and an
+observed <=7 recommendation run remain under T05.
+
 ## Boundary Parser — Verify & Mitigate (Posture B)

 ```task
@@ -368,6 +393,19 @@ Done when:
  is cluster/operator work outside this repo's SCOPE. T05 therefore stays
  `progress` until that live run exists; the in-repo deliverables are done.

+2026-06-30 follow-up: added forward-looking diagnostics so future validation
+failures carry llm-connect response metadata and a larger bounded raw-output
+preview in activity-core-owned evidence. Focused verification passed:
+`uv run pytest tests/test_llm_client.py tests/rules/test_executor.py tests/test_report_sinks.py -q`
+=> 39 passed. This improves future root-cause ability but does not replace the
+required live smoke proving graceful degradation on railiance01.
+
+2026-06-30 projection follow-up: local source projection now enforces the top-7
+prompt/schema contract. Remaining T05 proof is operational: deploy or sync the
+updated `k8s/railiance/20-runtime.yaml`, run `actcore-sync`/schedule smoke or wait
+for the next 07:20 Berlin fire, then confirm State Hub `daily_triage` evidence is
+`output_validated=true` with no more than 7 recommendations.
+
 ## Relationships

 - **Blocks / feeds:** `ACTIVITY-WP-0006-T03` (three clean scheduled runs) and
--- a/workplans/ACTIVITY-WP-0018-own-infra-automation-status.md
+++ b/workplans/ACTIVITY-WP-0018-own-infra-automation-status.md
@@ -0,0 +1,248 @@
+---
+id: ACTIVITY-WP-0018
+type: workplan
+title: "Own-infrastructure automation status surface"
+domain: infotech
+repo: activity-core
+status: finished
+owner: codex
+topic_slug: automation-observability
+created: "2026-06-29"
+updated: "2026-06-29"
+state_hub_workstream_id: "0220b38b-7c73-4601-9601-5f2c1a5b29e8"
+---
+
+# Own-infrastructure automation status surface
+
+## Goal
+
+Make activity-core's own scheduling and evidence infrastructure the explicit
+operating preference for durable automations, independent of any coding
+assistant-provided scheduler or reminder system.
+
+An operator should be able to answer a question like "How did our automations go
+since Friday?" with a repo-native command that does not require an LLM. Coding
+assistants may inspect or summarize that command's output, but they must not be
+the source of truth for scheduled execution, run history, or operational
+evidence.
+
+## Review notes
+
+The repo already owns the correct infrastructure direction:
+
+- `SCOPE.md` defines activity-core as the org-wide event bridge for cron,
+  one-off scheduled datetime, and event-triggered automation.
+- `Makefile` exposes sync and service targets, but no operator status target for
+  recent automation outcomes.
+- `docs/runbook.md` documents daily-triage verification through
+  `scripts/verify_daily_triage.py`, but that helper is activity-specific and
+  still reads like a checklist rather than the baseline answer surface for all
+  automations.
+- Existing workplan evidence shows the status question is operationally common:
+  2026-06-24 and 2026-06-25 daily triage runs were clean, while 2026-06-26 and
+  2026-06-27 fired on schedule but failed output validation. That distinction is
+  exactly what the baseline command must make obvious.
+
+## Task: Codify the own-infra scheduling preference
+
+```task
+id: ACTIVITY-WP-0018-T01
+status: done
+priority: high
+state_hub_task_id: "00127678-5ce4-4cb3-b81c-f42e04407c73"
+```
+
+Record the repository preference that durable automation scheduling, execution
+history, and run evidence belong to activity-core's own infrastructure: Temporal
+Schedules, NATS JetStream, activity-core run records, State Hub progress, and
+working-memory/report sinks.
+
+Acceptance:
+
+- `AGENTS.md` repo-specific instructions say not to use coding
+  assistant-provided automation tooling as the execution or evidence source for
+  activity-core automations.
+- `SCOPE.md` and `docs/runbook.md` describe coding assistants as callers or
+  summarizers of repo-native automation commands, not as schedulers.
+- The preference distinguishes durable automation from harmless local session
+  reminders: production/operational recurrence belongs to activity-core.
+- The text names the authoritative evidence sources and avoids tying the policy
+  to any one assistant product.
+
+2026-06-29 progress: Added the immediate repo-agent instruction in AGENTS.md
+that durable activity-core automations must use repo-owned infrastructure, not
+coding assistant automation/reminder/heartbeat tooling, as the execution or
+evidence source. Remaining T01 work is to carry the same preference into
+SCOPE.md and docs/runbook.md.
+
+## Task: Define the automation status evidence contract
+
+```task
+id: ACTIVITY-WP-0018-T02
+status: done
+priority: high
+state_hub_task_id: "17e6bb87-d4bf-4ef3-b91c-4bdfe2fe3492"
+```
+
+Define a small, deterministic report contract for answering recent automation
+status questions across all ActivityDefinitions.
+
+Acceptance:
+
+- The contract covers schedule state, expected fires in the requested window,
+  observed workflow runs, `activity_runs` rows, State Hub progress events,
+  working-memory/report sink evidence, and known validation or sink failures.
+- It defines normalized statuses such as `completed`, `running`, `retrying`,
+  `validation_failed`, `sink_failed`, `missed`, `disabled`, and `unknown`.
+- Partial data is explicit: if Temporal, Postgres, State Hub, or a sink path is
+  unavailable, the report includes warnings rather than silently passing or
+  failing the whole check.
+- The contract is safe for operator logs: no secrets, prompts, raw model output,
+  or credential-bearing URLs.
+- The contract can be emitted as JSON for scripts and rendered as concise text
+  for humans.
+
+## Task: Implement the non-LLM automation status CLI
+
+```task
+id: ACTIVITY-WP-0018-T03
+status: done
+priority: high
+state_hub_task_id: "7831f2fc-8b76-48fe-aa34-9dcc11ee84db"
+```
+
+Add a deterministic CLI, likely under `scripts/automation_status.py` or an
+`activity_core` module, that answers recent automation status questions without
+calling an LLM.
+
+Acceptance:
+
+- Supports `--since`, `--until`, activity name/id filters, JSON output, and a
+  concise human summary.
+- Accepts simple operator dates, including absolute dates and a documented
+  `friday`/`last-friday` style shortcut, resolving them to concrete dates in the
+  configured timezone.
+- Inspects all enabled scheduled ActivityDefinitions by default, not just daily
+  triage.
+- Uses live sources when configured: Postgres `activity_definitions` /
+  `activity_runs`, Temporal schedule and workflow visibility, State Hub
+  progress, and configured local report sink paths.
+- Degrades usefully when a source is unavailable and exits non-zero only for
+  real status failures or invalid input, not for optional evidence gaps that are
+  clearly reported.
+- Includes focused unit tests with fixture data for clean runs, validation
+  failures, missed runs, disabled schedules, and partial-source availability.
+
+## Task: Add the Make target baseline
+
+```task
+id: ACTIVITY-WP-0018-T04
+status: done
+priority: high
+state_hub_task_id: "451bdf62-b619-4ace-9262-46d20b912781"
+```
+
+Expose the CLI through a Make target that is easy for an operator or any coding
+assistant to run before attempting a prose summary.
+
+Acceptance:
+
+- `make automation-status SINCE=2026-06-26` prints the human-readable baseline.
+- `make automation-status SINCE=friday` is supported or documented with the
+  exact accepted shortcut.
+- A JSON form is available, either through `FORMAT=json` or a separate target
+  such as `make automation-status-json`.
+- The target does not require LLM credentials, coding assistant automation
+  tooling, or interactive prompts.
+- `make help` lists the target with a clear one-line description.
+
+## Task: Update operator docs and examples
+
+```task
+id: ACTIVITY-WP-0018-T05
+status: done
+priority: medium
+state_hub_task_id: "233659aa-e14a-4b3d-b156-d04f0fa16db6"
+```
+
+Update the runbook so "How did automations go since Friday?" has an obvious
+operator recipe.
+
+Acceptance:
+
+- `docs/runbook.md` has a short "Automation status" section near the scheduling
+  operations.
+- The docs include example output or a compact sample for the known daily
+  triage distinction: fired on time versus completed successfully versus output
+  validation failure.
+- The docs clarify that LLM summaries are optional convenience only; the Make
+  target output is the baseline evidence.
+- The daily-triage-specific helper is either kept as a lower-level diagnostic or
+  folded into the generalized status command.
+
+## Task: Verify against recent scheduled-run evidence
+
+```task
+id: ACTIVITY-WP-0018-T06
+status: done
+priority: medium
+state_hub_task_id: "24efbe9f-dfff-482f-9edc-456379c9a2aa"
+```
+
+Prove the new surface against the recent evidence that motivated this workplan.
+
+Acceptance:
+
+- Running the status command over the window starting Friday, 2026-06-26 shows
+  that the daily triage schedule fired on 2026-06-26 and 2026-06-27 but did not
+  produce clean validated reports.
+- The command distinguishes scheduling health from output/schema validation
+  failure.
+- Disabled or waiting schedules, such as the weekly coding retro gate when its
+  upstream read model is not available, are reported without being counted as
+  missed runs.
+- Verification results are recorded in this workplan and as a State Hub progress
+  note once the implementation lands.
+
+## Implementation Result
+
+Completed 2026-06-29: implemented the own-infrastructure automation status
+surface and codified the scheduling preference.
+
+Delivered:
+
+- `AGENTS.md` now states that durable activity-core automations use repo-owned
+  infrastructure, not coding assistant automation/reminder/heartbeat tooling, as
+  execution or evidence authority.
+- `SCOPE.md` and `docs/runbook.md` describe the deterministic status surface and
+  assistant boundary.
+- `src/activity_core/automation_status.py` and `scripts/automation_status.py`
+  provide the non-LLM CLI.
+- `make automation-status SINCE=...` and `make automation-status-json` expose the
+  baseline operator commands.
+- `tests/test_automation_status.py` covers date shortcuts, cron fire estimation,
+  completed runs, validation failures, missed runs, disabled schedules, partial
+  source availability, and working-memory evidence parsing.
+
+Verification:
+
+```bash
+python3 -m py_compile src/activity_core/automation_status.py scripts/automation_status.py tests/test_automation_status.py
+/home/worsch/.local/bin/uv run pytest tests/test_automation_status.py tests/test_daily_triage_verifier.py -q
+/home/worsch/.local/bin/uv run python scripts/automation_status.py \
+  --since 2026-06-26 --until 2026-06-27 --db-url '' \
+  --progress-event-type daily_triage --timeout-seconds 10 \
+  --working-memory-dir /tmp --format json
+```
+
+Results:
+
+- focused tests: `11 passed`;
+- `make help` lists `automation-status` and `automation-status-json`;
+- the 2026-06-26 through 2026-06-27 status run exited `1` as expected because
+  State Hub evidence classified daily triage activity
+  `6fca51fa-387a-4fd0-bc4e-d62c29eb859a` as `validation_failed` with two
+  non-secret evidence records: 2026-06-26 `Expecting ',' delimiter` and
+  2026-06-27 `Unterminated string`;
+- the same report classified the gated weekly coding retro as `disabled`, not
+  `missed`.
--- a/workplans/ACTIVITY-WP-0019-automation-schedule-inventory-targets.md
+++ b/workplans/ACTIVITY-WP-0019-automation-schedule-inventory-targets.md
@@ -0,0 +1,164 @@
+---
+id: ACTIVITY-WP-0019
+type: workplan
+title: "Automation schedule inventory Make targets"
+domain: infotech
+repo: activity-core
+status: ready
+owner: codex
+topic_slug: automation-inventory
+created: "2026-06-29"
+updated: "2026-06-29"
+state_hub_workstream_id: "21c73763-9adc-42f6-8fd2-1b8b33c2c770"
+---
+
+# Automation schedule inventory Make targets
+
+## Goal
+
+Provide a repo-native, non-LLM way to list every scheduled automation that
+activity-core knows about.
+
+`ACTIVITY-WP-0018` added the status surface for questions like "How did our
+automations go since Friday?". The next operator question is the inventory
+baseline: "What automations are scheduled at all?" That should be answerable
+through Make targets backed by activity-core's own ActivityDefinitions,
+database, and Temporal schedule metadata when available, independent of any
+coding assistant automation infrastructure.
+
+## Review notes
+
+- `Makefile` currently exposes `automation-status` and
+  `automation-status-json`, but no dedicated inventory/list target.
+- `scripts/automation_status.py` and `src/activity_core/automation_status.py`
+  already load scheduled ActivityDefinitions and compute their Temporal schedule
+  ids. The inventory target should reuse that parsing/loading posture where it
+  fits rather than creating a second discovery path.
+- `make sync-schedules` reconciles Temporal schedules from the
+  `activity_definitions` database, but it is an action target, not a read-only
+  operator inventory command.
+- The inventory command should remain useful in degraded local mode: file-backed
+  definitions are enough to list configured scheduled automations, while live
+  DB and Temporal visibility can enrich the output.
+
+## Task: Define the automation inventory contract
+
+```task
+id: ACTIVITY-WP-0019-T01
+status: todo
+priority: high
+state_hub_task_id: "8de24590-f9ee-4d0e-8692-b7ada9f232ed"
+```
+
+Define the fields and source precedence for a deterministic scheduled
+automation inventory report.
+
+Acceptance:
+
+- The report includes every ActivityDefinition with `trigger_type` of `cron` or
+  `scheduled`, including disabled definitions.
+- Each row includes id, name, enabled/disabled state, trigger type, schedule
+  expression or one-shot datetime, timezone, overlap/catchup policy when known,
+  and the derived Temporal schedule id.
+- The report identifies its source for each row: database, repo definition file,
+  Temporal visibility, or a combination.
+- If Temporal is reachable, the report adds paused/missing/drift hints without
+  mutating schedules.
+- Missing optional sources produce warnings, not silent omissions.
+- The JSON shape is stable enough for scripts and tests.
+
+## Task: Implement a non-mutating inventory CLI
+
+```task
+id: ACTIVITY-WP-0019-T02
+status: todo
+priority: high
+state_hub_task_id: "538cb9a5-48f3-470c-8518-29ee66c96678"
+```
+
+Add a deterministic CLI path for listing scheduled automations without requiring
+LLM credentials or coding assistant tooling.
+
+Acceptance:
+
+- A script or module command, likely sharing code with
+  `activity_core.automation_status`, supports human and JSON output.
+- The command is read-only: it does not call `sync-schedules`, upsert schedules,
+  delete schedules, enqueue workflows, or write State Hub evidence.
+- It supports filters by activity id, activity name, enabled state, and trigger
+  type.
+- It loads from the database when configured and falls back to repo definition
+  files when the database is unavailable or explicitly disabled.
+- It optionally enriches rows from Temporal when `TEMPORAL_HOST` is configured,
+  with bounded timeouts so an unreachable service does not hang the command.
+- Unit tests cover DB rows, file fallback, disabled definitions, Temporal
+  enrichment unavailable, and JSON output.
+
+## Task: Add Make targets
+
+```task
+id: ACTIVITY-WP-0019-T03
+status: todo
+priority: high
+state_hub_task_id: "f2001721-07f3-42f5-a15e-0c7d1b0ed801"
+```
+
+Expose the inventory command through Make targets that are easy for humans,
+scripts, and coding assistants to run before asking for a prose summary.
+
+Acceptance:
+
+- `make automation-list` prints a concise human-readable inventory.
+- `make automation-list-json` emits the same inventory as JSON.
+- Optional Make variables pass through cleanly, for example `ENABLED=true`,
+  `TRIGGER=cron`, `ACTIVITY_ID=<uuid>`, or `FORMAT=json`.
+- `make help` lists both targets with clear one-line descriptions.
+- The targets do not require LLM access, Codex automation tooling, or
+  interactive prompts.
+
+## Task: Document the inventory workflow
+
+```task
+id: ACTIVITY-WP-0019-T04
+status: todo
+priority: medium
+state_hub_task_id: "f687743b-3936-413e-ae50-d35484ae9a81"
+```
+
+Update operator documentation so the scheduled automation inventory path is
+discoverable next to the status path.
+
+Acceptance:
+
+- `docs/runbook.md` documents `make automation-list` and
+  `make automation-list-json`.
+- The docs distinguish inventory from status: inventory answers what is
+  configured; status answers what happened in a time window.
+- The docs state that the command is read-only and uses activity-core-owned
+  scheduling evidence.
+- The docs include a compact example of the expected human output.
+
+## Task: Verify against current repo and live/degraded sources
+
+```task
+id: ACTIVITY-WP-0019-T05
+status: todo
+priority: medium
+state_hub_task_id: "5317b532-5cef-4eff-b6d8-3e85bbca8e8a"
+```
+
+Prove the target against the current scheduled automation definitions and
+degraded local conditions.
+
+Acceptance:
+
+- `make automation-list` shows the current scheduled automations, including
+  daily triage and weekly scheduled definitions when present in the selected
+  source.
+- JSON output is valid and includes the same rows.
+- A DB-unavailable run falls back to repo definition files or reports a clear
+  warning if no definitions are discoverable.
+- A Temporal-unavailable run exits successfully with Temporal warnings rather
+  than hanging.
+- Focused tests pass and the result is recorded in this workplan before the
+  workplan is moved to `finished`.