generated from coulomb/repo-seed
Agentic memory profile
This commit is contained in:
97
docs/agentic-memory-profile-pilot.md
Normal file
97
docs/agentic-memory-profile-pilot.md
Normal file
@@ -0,0 +1,97 @@
|
||||
# Agentic Memory Profile Pilot
|
||||
|
||||
Date: 2026-05-15
|
||||
Workplan: IB-WP-0017
|
||||
|
||||
## Purpose
|
||||
|
||||
This pilot validates agentic memory profile fixtures against concrete
|
||||
infospace work. It does not add reusable memory runtime infrastructure to
|
||||
`infospace-bench`.
|
||||
|
||||
## Pilot Selection
|
||||
|
||||
The selected corpus is `infospaces/wealth-vsm-legacy-slice`. It is bounded,
|
||||
reviewable, and already contains a source, entities, relation, evaluation,
|
||||
metrics, history, and an engine sync plan. That makes it a better pilot than a
|
||||
new synthetic corpus because the memory package can be evaluated against a real
|
||||
restart task: resume review of the Wealth/VSM entity and relation neighborhood.
|
||||
|
||||
## Memory Question Matrix
|
||||
|
||||
| Memory Question | Pilot Evidence | Acceptance Threshold |
|
||||
| --- | --- | --- |
|
||||
| Which reasoning decisions should become durable memory? | `decision.file-backed-pilot` and `constraint.no-durable-runtime` | A restart package explains ownership boundaries without rereading Workplan 17. |
|
||||
| Which conversation or workflow events are useful later? | `trace.entity-review-restart` and `event.workflow-restart-trace` | Events explain why a package item exists and what task it supports. |
|
||||
| Which knowledge graph neighborhoods improve review? | Wealth/VSM source and entity nodes | The package includes the active artifact neighborhood, not only planning notes. |
|
||||
| Which context package shapes help agents? | `restart-context-selection.yaml` | Eight or fewer items, source spans preserved, no live LLM required. |
|
||||
| Which profile parameters are too abstract or misplaced? | `context-package-evaluation.yaml` | Contract feedback is routed to Markitect or the engine, not hidden in this repo. |
|
||||
|
||||
## Fixture Contracts
|
||||
|
||||
The checked-in pilot uses Markitect contract versions:
|
||||
|
||||
- `markitect.memory.profile.v1`
|
||||
- `markitect.memory.graph.v1`
|
||||
- `markitect.memory.selection.v1`
|
||||
|
||||
The default test suite validates the profile and graph through
|
||||
`markitect_tool.memory.graph`, compiles the selection to a context package, and
|
||||
checks the deterministic fields against
|
||||
`restart-context-package.expected.yaml`.
|
||||
|
||||
## Context Package Evaluation
|
||||
|
||||
The restart package is considered useful when it:
|
||||
|
||||
- contains the boundary decision, no-runtime constraint, package plan, review
|
||||
gate, and active Wealth/VSM artifact neighborhood
|
||||
- preserves provenance for all selected nodes or synthetic Markitect event spans
|
||||
- remains under the declared 1200-token package budget
|
||||
- keeps runtime writes review-gated and fixture-only
|
||||
|
||||
The first pilot snapshot scores restart quality at `4.2/5.0` and provenance
|
||||
coverage at `1.0`.
|
||||
|
||||
## Engine Integration Plan
|
||||
|
||||
File-backed in this pilot:
|
||||
|
||||
- selected corpus and infospace manifest
|
||||
- Markitect memory profile, graph, and selection fixtures
|
||||
- expected package shape and evaluation metrics
|
||||
- workflow trace examples and review notes
|
||||
|
||||
Engine-backed later:
|
||||
|
||||
- durable memory node, edge, event, and audit storage
|
||||
- permission-aware query and activation behavior
|
||||
- retention, refresh, compaction, and policy decisions
|
||||
- dry-run and apply plans for durable memory writes
|
||||
|
||||
The first integration should mirror this fixture into `kontextual-engine` as an
|
||||
imported Markitect graph. Dry run should report creates, updates, denied writes,
|
||||
and policy reasons. Apply should require an explicit review gate and record an
|
||||
engine audit event separately from Markitect contract events.
|
||||
|
||||
## Architecture Feedback
|
||||
|
||||
Markitect contract feedback:
|
||||
|
||||
- Add a timestamp-stable context package output mode for golden fixtures.
|
||||
- Document when selected events should become package items versus metadata.
|
||||
- Make package provenance for implied edges easy to inspect.
|
||||
|
||||
Kontextual engine feedback:
|
||||
|
||||
- Import Markitect graph/profile envelopes without redefining node vocabulary.
|
||||
- Persist runtime audit events separately from Markitect memory events.
|
||||
- Keep durable memory updates review-gated and export Markitect-compatible
|
||||
package inputs.
|
||||
|
||||
Infospace-bench boundary:
|
||||
|
||||
- Keep corpus selection, applied metrics, evaluation history, workflow traces,
|
||||
and practical package-quality evidence here.
|
||||
- Do not store credentials, durable user memory, or general graph/event
|
||||
persistence inside an infospace.
|
||||
Reference in New Issue
Block a user