Files

Test / test (push) Has been cancelled

Details

feat(P7): IHF Phase 7 complete — advanced observability and operational integration

T01 schema: friction_scores, bottleneck_records, hub_health_snapshots,
cross_hub_propagations + migration 1743552000.

T02 Widget Pain Heatmap: computeFrictionScore (formula documented), RecomputeFriction
action, colour-coded grid view (green/yellow/amber/red).

T03 Workflow Bottleneck Analysis: detectBottlenecks across 4 pipeline stages
(candidate 30d, requirement 60d, decision 30d, observation 14d), idempotent,
severity from age ratio, resolve action.

T04 Hub Health Correlation: computeHubHealth (deduction table documented),
append-only HubHealthSnapshot, health history view, badge on hub Show page.

T05 Cross-Hub Propagation: annotation_cluster + widget_type_friction heuristics,
idempotent detection, acknowledge/resolve lifecycle.

T06 Operational Review Board: 4-panel AutoRefresh global dashboard — health matrix,
top-10 friction, bottleneck stage counts, open propagations.

T07 gate: 5 describe blocks in Test/Integration.hs; SCOPE.md updated Phase 7
complete; docs/phase7-summary.md written.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-03-29 21:49:22 +00:00

17 KiB

Raw Blame History

active, iteration, session_id, max_iterations, completion_promise, workplan_id, workplan_file, started_at

active	iteration	session_id	max_iterations	completion_promise	workplan_id	workplan_file	started_at
true	1		20	HEUREKA	IHUB-WP-0007	workplans/IHUB-WP-0007-ihf-phase7-advanced-observability-and-operational-integration.md	2026-03-29T21:41:13Z

Workplan Status Check — Do This First, Every Iteration

Read the workplan file at: workplans/IHUB-WP-0007-ihf-phase7-advanced-observability-and-operational-integration.md

Count the task blocks (fenced code blocks with language tag task):

How many tasks exist in total?
How many have status: done?

If every task has status: done AND the frontmatter status is done: The workplan is complete. Output exactly: HEUREKA Do nothing else. Stop here.

Otherwise: continue with the implementation below.

Workplan: IHUB-WP-0007 — IHF Phase 7 — Advanced Observability and Operational Integration

File: workplans/IHUB-WP-0007-ihf-phase7-advanced-observability-and-operational-integration.md

IHF Phase 7 — Advanced Observability and Operational Integration

Goal

Integrate interaction governance with broader operational intelligence. Phase 6 established cross-framework widget participation. Phase 7 turns the accumulated interaction data into operational intelligence: friction heatmaps, pipeline bottleneck detection, per-hub health scores, and cross-hub pattern propagation. The capstone is an Operational Review Board dashboard that gives hub leaders a unified view across all hubs.

Background

Phases 1–6 are complete. The IHF core (widget registry, interaction events, annotations, requirements, decisions, outcomes, agent assistance, cross-framework adapters) is stable and extensible.

The spec (§Phase 7) calls for:

Hub health correlation
Policy violation correlation
Workflow bottleneck analysis
Interaction pain heatmaps
Queue and job linkage
Cross-hub issue propagation analysis

Artifacts introduced: FrictionScore, BottleneckRecord, HubHealthSnapshot, CrossHubPropagation.

Reference: specs/InteractionHubFrameworkSpecification_v0.1.md §Phase 7, docs/phase6-summary.md, docs/ihp-controllers-views-forms.md.

Phase 7 Exit Criteria (from IHF spec §Phase 7)

Interaction data informs operational decision-making
Hub leaders can inspect systemic friction patterns
The platform supports cross-domain learning

Data Artifacts Introduced (Phase 7)

FrictionScore, BottleneckRecord, HubHealthSnapshot, CrossHubPropagation

Tasks

T01 — Schema: FrictionScore, BottleneckRecord, HubHealthSnapshot, CrossHubPropagation

id: IHUB-WP-0007-T01
status: todo
priority: high
state_hub_task_id: "86e31f8b-62a3-4176-9b10-2fe7a8dbcc23"

Add Phase 7 tables to Application/Schema.sql and write migration:

-- Aggregated pain score per widget, recomputed on demand or scheduled.
CREATE TABLE friction_scores (
    id UUID DEFAULT uuid_generate_v4() PRIMARY KEY NOT NULL,
    widget_id UUID NOT NULL REFERENCES widgets(id),
    score INTEGER NOT NULL DEFAULT 0,
    -- 0–100; higher = more friction
    annotation_count INTEGER NOT NULL DEFAULT 0,
    error_event_count INTEGER NOT NULL DEFAULT 0,
    regression_flag BOOLEAN NOT NULL DEFAULT FALSE,
    stale_candidate_count INTEGER NOT NULL DEFAULT 0,
    last_computed_at TIMESTAMP WITH TIME ZONE DEFAULT now() NOT NULL,
    UNIQUE (widget_id)
);

CREATE INDEX friction_scores_widget_id_idx ON friction_scores (widget_id);
CREATE INDEX friction_scores_score_idx ON friction_scores (score DESC);

-- Detected stalls at specific pipeline stages.
CREATE TABLE bottleneck_records (
    id UUID DEFAULT uuid_generate_v4() PRIMARY KEY NOT NULL,
    hub_id UUID NOT NULL REFERENCES hubs(id),
    stage TEXT NOT NULL,
    -- 'candidate' | 'requirement' | 'decision' | 'observation'
    subject_type TEXT NOT NULL,
    -- 'RequirementCandidate' | 'Requirement' | 'DecisionRecord' | 'DeploymentRecord'
    subject_id UUID NOT NULL,
    stalled_since TIMESTAMP WITH TIME ZONE NOT NULL,
    severity TEXT NOT NULL DEFAULT 'medium',
    -- 'low' | 'medium' | 'high' | 'critical'
    resolved_at TIMESTAMP WITH TIME ZONE,
    notes TEXT,
    created_at TIMESTAMP WITH TIME ZONE DEFAULT now() NOT NULL
);

CREATE INDEX bottleneck_records_hub_id_idx ON bottleneck_records (hub_id);
CREATE INDEX bottleneck_records_stage_idx ON bottleneck_records (stage);
CREATE INDEX bottleneck_records_resolved_idx ON bottleneck_records (resolved_at)
    WHERE resolved_at IS NULL;

-- Periodic health snapshots for trend tracking.
CREATE TABLE hub_health_snapshots (
    id UUID DEFAULT uuid_generate_v4() PRIMARY KEY NOT NULL,
    hub_id UUID NOT NULL REFERENCES hubs(id),
    health_score INTEGER NOT NULL,
    -- 0–100
    open_candidates INTEGER NOT NULL DEFAULT 0,
    regressed_widgets INTEGER NOT NULL DEFAULT 0,
    stale_decisions INTEGER NOT NULL DEFAULT 0,
    active_bottlenecks INTEGER NOT NULL DEFAULT 0,
    computed_at TIMESTAMP WITH TIME ZONE DEFAULT now() NOT NULL
);

CREATE INDEX hub_health_snapshots_hub_id_idx ON hub_health_snapshots (hub_id);
CREATE INDEX hub_health_snapshots_computed_at_idx
    ON hub_health_snapshots (hub_id, computed_at DESC);

-- Patterns detected across multiple hubs.
CREATE TABLE cross_hub_propagations (
    id UUID DEFAULT uuid_generate_v4() PRIMARY KEY NOT NULL,
    pattern_type TEXT NOT NULL,
    -- 'annotation_cluster' | 'widget_type_friction'
    source_hub_id UUID REFERENCES hubs(id),
    affected_hub_ids JSONB NOT NULL DEFAULT '[]',
    -- array of hub UUIDs
    summary TEXT NOT NULL,
    detected_at TIMESTAMP WITH TIME ZONE DEFAULT now() NOT NULL,
    status TEXT NOT NULL DEFAULT 'open',
    -- 'open' | 'acknowledged' | 'resolved'
    notes TEXT
);

CREATE INDEX cross_hub_propagations_status_idx ON cross_hub_propagations (status);
CREATE INDEX cross_hub_propagations_pattern_idx ON cross_hub_propagations (pattern_type);

Exit criteria: migrate runs cleanly; all Phase 7 types available in GHCi.

id: IHUB-WP-0007-T02
status: todo
priority: high
state_hub_task_id: "3a5ecd28-17c2-4258-bfd9-b3eaecf52135"

Add Application/Helper/FrictionScore.hs with computeFrictionScore:
- annotation_count — total annotations for widget
- error_event_count — events with event_type = 'errored'
- regression_flag — True if widget appears in regressedWidgetIds
- stale_candidate_count — open candidates older than 30 days
- Score formula (documented in module header):
```
score = min 100 $
    annotationCount * 5
  + errorEventCount * 10
  + (if regressionFlag then 20 else 0)
  + staleCandidateCount * 8
```
- Upserts into friction_scores (UPDATE if exists, INSERT otherwise)
Add RecomputeFrictionAction { hubId } to HubsController:
- Recomputes scores for all widgets in the hub
- Redirects back to heatmap view
Add FrictionHeatmapAction { hubId } view:
- Grid of widget cards, colour-coded by score band:
  - 0–19: green (bg-green-100)
  - 20–39: yellow (bg-yellow-100)
  - 40–59: amber (bg-orange-100)
  - 60+: red (bg-red-100)
- Each card: widget name, score, link to widget show
- "Recompute" button triggers RecomputeFrictionAction
Link from hub Show page as "Friction Heatmap"

Exit criteria: Scores compute correctly for test fixtures; heatmap renders with correct colour bands; recompute updates scores.

T03 — Workflow Bottleneck Analysis: stall detection across the pipeline

id: IHUB-WP-0007-T03
status: todo
priority: high
state_hub_task_id: "ada0347a-880b-454e-843f-4a9135ea8739"

Add Application/Helper/BottleneckDetector.hs with detectBottlenecks:
- Stage 1 — candidate: RequirementCandidate with status='open' and created_at < now() - interval '30 days'
- Stage 2 — requirement: Requirement with no linked DecisionRecord and created_at < now() - interval '60 days'
- Stage 3 — decision: DecisionRecord with no linked DeploymentRecord and decided_at < now() - interval '30 days'
- Stage 4 — observation: DeploymentRecord with no linked OutcomeSignal and deployed_at < now() - interval '14 days'
- Severity: critical if age > 2× threshold, else high if > 1.5×, else medium
- Upserts BottleneckRecord (skip if already exists for same subject)
Add DetectBottlenecksAction { hubId } — runs detector, redirects to dashboard
Add BottleneckDashboardAction { hubId } view:
- Table grouped by pipeline stage
- Columns: subject (linked), stalled since, age, severity badge
- "Resolve" button → ResolveBottleneckAction { bottleneckRecordId }
- "Detect" button triggers fresh detection
Link from hub Show page as "Bottlenecks"

Exit criteria: Stale candidates create bottleneck records; dashboard renders and groups correctly; resolve marks resolved_at.

T04 — Hub Health Correlation: composite health score and history

id: IHUB-WP-0007-T04
status: todo
priority: high
state_hub_task_id: "b0c932c5-fdb7-47b6-adc7-b4f8ed5555e6"

Add Application/Helper/HubHealth.hs with computeHubHealth:

Deduction table (documented in module):

-5  per open RequirementCandidate
-10 per regressed widget
-8  per stale DecisionRecord (decided > 30 days, no deployment)
-12 per active critical BottleneckRecord
-6  per active high BottleneckRecord
floor at 0

Inserts new HubHealthSnapshot (never updates — history is append-only)

Add SnapshotHubHealthAction { hubId } — computes and redirects to history
Add HubHealthHistoryAction { hubId } view:
- Table of snapshots: timestamp, score (colour-coded), component breakdown
- Latest score shown prominently at top
Show health score badge on hub Show page (next to dashboard links):
- Fetch latest snapshot; display colour-coded score pill
- If no snapshot: "–" with link to take first snapshot

Exit criteria: Snapshot computes correct score against test fixtures; history table renders in order; badge appears on hub Show page.

T05 — Cross-Hub Propagation Analysis: pattern detection across hubs

id: IHUB-WP-0007-T05
status: todo
priority: medium
state_hub_task_id: "7a860b9f-a835-47d6-96d8-2964ae37b12d"

Add Application/Helper/CrossHubPropagation.hs with detectPropagations:
- Annotation cluster heuristic: for each annotation category, count distinct hubs with ≥3 annotations in that category in the last 14 days. If ≥2 hubs qualify, emit a CrossHubPropagation with pattern_type='annotation_cluster' and a generated summary.
- Widget type friction heuristic: for each widget_type, count hubs where the max FrictionScore for that type is ≥40. If ≥2 hubs qualify, emit pattern_type='widget_type_friction'.
- Skip if a matching open/acknowledged propagation already exists (idempotent detection)
Add DetectPropagationsAction (global, no hubId) — runs detector
Add CrossHubPropagationsAction view (global):
- Table: pattern type, source hub, affected hubs (comma list), summary, detected at, status badge
- "Acknowledge" and "Resolve" actions
Link from global nav (alongside "Adapters", "Ops Review")

Exit criteria: Detection creates propagation records for qualifying patterns; duplicate runs are idempotent; acknowledge/resolve transitions work.

T06 — Operational Review Board Dashboard: cross-hub unified view

id: IHUB-WP-0007-T06
status: todo
priority: medium
state_hub_task_id: "ffabc4d1-c166-4b7d-8bec-55365cbe0666"

Add OperationalReviewBoardAction to a new OperationsController (or HubsController as a global action — no hubId parameter):
- Panel 1 — Hub health matrix: all hubs, latest health score (or "–"), colour-coded row, link to hub and to health history
- Panel 2 — Top friction widgets: top 10 across all hubs by FrictionScore.score DESC; columns: widget name, hub, score band, link
- Panel 3 — Active bottlenecks by stage: count of unresolved bottlenecks per stage across all hubs; click-through to hub bottleneck dashboard
- Panel 4 — Open cross-hub propagations: list of open/acknowledged propagation events with pattern type and affected hub count
autoRefresh — live-updates
Link from global nav as "Ops Review"
Link from global nav cross-hub propagation count badge if > 0

Exit criteria: Dashboard renders all four panels; health matrix shows all hubs; top friction list is correctly sorted; live-updates on data change.

T07 — Phase 7 gate: tests, consistency, docs

id: IHUB-WP-0007-T07
status: todo
priority: high
state_hub_task_id: "a14b94f8-3b27-4f0c-9949-60fb65a57a05"

Integration tests (Test/):
- FrictionScore compute formula: widget with known annotation count → expected score
- BottleneckRecord create + resolve: stale candidate → bottleneck detected; resolve sets resolved_at
- HubHealthSnapshot compute: hub with known candidates/regressions → expected score; history fetch returns in order
- CrossHubPropagation create + acknowledge + resolve
- OperationalReviewBoard action: compiles, fetches all hubs, returns counts
Consistency sync via State Hub MCP: check_repo_consistency(repo_slug="inter-hub", fix=True)
Documentation updates:
- Update SCOPE.md current state section: Phase 7 complete
- Write docs/phase7-summary.md: what was built, scoring formulae, bottleneck thresholds, cross-hub heuristics, known limitations, Phase 8 readiness
Smoke test checklist:
- Create two hubs with widgets and annotations; run friction recompute; verify heatmap colours
- Age a candidate by force-setting created_at; run detect bottlenecks; verify record appears
- Snapshot health for both hubs; verify Ops Review Board health matrix
- Trigger cross-hub propagation detection; verify propagation record
- Open Ops Review Board; confirm all four panels populate

Exit criteria: All tests pass; consistency sync reports no errors; smoke test completed; SCOPE.md updated.

Phase 7 Dependencies

Phases 1–6 schema stable (widget registry, interaction events, annotations, requirements, decisions, outcomes, agent proposals, adapter specs)
friction_scores requires widgets (T01 before T02)
bottleneck_records requires hubs, candidates, requirements, decisions, deployments (T01 before T03)
hub_health_snapshots requires hubs and reads from bottleneck_records (T03 before T04)
cross_hub_propagations requires hub friction scores (T02 before T05)
Operational Review Board aggregates all Phase 7 data (T02–T05 before T06)
All feature tasks (T01–T06) before gate (T07)

Notes

Friction scores are recomputed, not append-only. Each widget has at most one FrictionScore row (unique constraint on widget_id). Historical trend is not tracked at the friction level — use HubHealthSnapshot for trends.
Bottleneck detection is idempotent. Re-running the detector skips records where an unresolved bottleneck already exists for the same subject.
Health snapshots are append-only. Every SnapshotHubHealthAction call inserts a new row. This preserves the health history for trend analysis.
Cross-hub detection requires FrictionScores to be current. Run RecomputeFrictionAction for all hubs before DetectPropagationsAction.
No scheduled jobs in Phase 7. Detection and recomputation are triggered manually via UI or curl. Phase 8 can layer on a cron/job system.
Severity thresholds and score weights are constants in the helper modules. They are intentionally not stored in the DB to avoid config drift — change them in code and recompute.

How to Work

Stay strictly within the scope of the workplan above
Work through tasks in priority order (high → medium → low)
Use TDD where applicable: write a failing test, make it pass, then refactor
Use whatever test runner, linter, and build tools this repository already uses
Consult existing documentation (README, docs/, wiki/, specs/) for context
Document significant architecture decisions as ADRs if the project uses them

Updating Task Status

As you complete each task, edit the workplan file to update its status:

status: todo       →   status: in_progress   (when you start it)
status: in_progress →  status: done          (when it is verified complete)

When every task is done, also update the workplan frontmatter:

status: active  →  status: done

Success Criteria

Before marking the workplan done and outputting <promise>HEUREKA</promise>, verify all of the following are true:

Every task block in workplans/IHUB-WP-0007-ihf-phase7-advanced-observability-and-operational-integration.md has status: done
The workplan frontmatter status is done
The full test suite passes with no failures
The codebase passes the project's standard code-quality checks (linting, type checking, formatting — whatever applies to this project)
Documentation reflects the implemented behaviour

Output <promise>HEUREKA</promise> only when all five are genuinely true.

17 KiB Raw Blame History Unescape Escape