State-hub connect and initial workplans

2026-05-03 20:43:56 +02:00
parent ebb241dab6
commit f25bd2cf84
14 changed files with 657 additions and 2 deletions
--- a/docs/markitect-main-scope-assessment.md
+++ b/docs/markitect-main-scope-assessment.md
@@ -0,0 +1,116 @@
+# MarkiTect Main Scope Assessment
+
+Date: 2026-05-03
+
+## Purpose
+
+This assessment compares `/home/worsch/markitect-main` against the
+`infospace-bench` intent, PRD, and FRS. It identifies which ideas should be
+migrated, reimplemented, referenced, or deliberately left behind.
+
+## Summary
+
+`markitect-main` already contains substantial infospace work, but it is bundled
+with responsibilities that this repo explicitly excludes: markdown parsing,
+schema tooling, rendering, asset management, GraphQL experiments, prompt
+runtime infrastructure, and broad platform behavior.
+
+`infospace-bench` should therefore become a focused successor for the
+application-layer infospace lifecycle. The right move is selective
+reimplementation and migration of concepts, examples, tests, and documentation,
+not a package copy.
+
+## Source Areas Reviewed
+
+| MarkiTect area | Assessment | Successor action |
+| --- | --- | --- |
+| `markitect/infospace/` | Closest match. Contains entity metadata, config, evaluation, collection checks, history, composition, relation parsing, graph export, and CLI glue. | Reimplement as the first functional baseline, adapting boundaries to lower-layer deps. |
+| `examples/infospace-with-history/` | Strong reference experiment with `infospace.yaml`, source corpus, pipeline stages, viability thresholds, outputs, and metrics methodology. | Migrate as a reference pilot after pruning generated bulk and documenting provenance. |
+| `docs/infospace-primitives.md` | Useful conceptual reference for topic, discipline, entity, evaluation, viability, checks, history, and composition. | Rewrite into this repo's docs with updated architecture terms. |
+| `markitect/prompts/` | Provides generation workflow, dependency resolution, quality gates, traceability, and batch execution. | Reimplement only workflow-facing contracts; runtime/provider mechanics belong outside or behind adapters. |
+| `markitect/analysis/graph.py` and relationship exports | Relevant for inspection and visualization. | Keep graph/relationship inspection as an application feature; rely on lower layers for reusable primitives where possible. |
+| `markitect/schema/` and legacy schema modules | Useful as input constraints, but lower-level schema tooling is out of scope. | Reference or depend on schema tooling; do not copy wholesale. |
+| `markitect/packaging/transclusion/` and `markitect/spaces/transclusion/` | Useful conceptually for composed artifacts, but implementation is syntax/tooling heavy. | Treat as dependency or later integration point, not first migration. |
+| `markitect/assets/`, rendering plugins, static assets | Mostly outside current PRD/FRS except exported artifacts. | Leave behind unless a concrete infospace export needs assets. |
+| GraphQL/server experiments | Platform/API layer not validated for this repo. | Defer. Start with CLI/service contracts only after the domain model is stable. |
+
+## In-Scope Concepts To Carry Forward
+
+- `infospace.yaml` as a declarative project artifact
+- Topic, discipline binding, entity, relation, evaluation, and viability
+- Entity parsing into structured metadata
+- Collection checks: redundancy, coverage, coherence, consistency, granularity
+- Metrics history and snapshot diffs
+- Viability thresholds as explicit acceptance criteria
+- Relationship graph export and inspection
+- Reference corpus workflow: sources -> analyses -> entities -> mappings ->
+  evaluations -> reports
+- Pipeline definitions that are inspectable and reproducible
+
+## Reimplementation Boundaries
+
+The successor should reimplement interfaces around these concepts, but with
+cleaner responsibility boundaries:
+
+- Markdown parsing should come from `markitect-tool` or a thin adapter.
+- Persistence and long-running workflow orchestration should come from
+  `kontextual-engine`.
+- LLM calls should use `llm-connect` or an equivalent provider adapter.
+- This repo should own concrete configuration, project layout, reports,
+  fixtures, workflow definitions, and application-level glue.
+
+## Initial Architecture Recommendation
+
+Start with a file-backed baseline before adding a service:
+
+```text
+infospaces/<slug>/infospace.yaml
+infospaces/<slug>/artifacts/
+infospaces/<slug>/output/
+workflows/
+reports/
+docs/
+```
+
+Then add a small CLI around the FRS lifecycle:
+
+- `create`
+- `load` / `inspect`
+- `add-artifact`
+- `evaluate`
+- `check`
+- `export`
+
+Only introduce a service/API when there is a concrete need for long-running
+state, external consumers, or integration with `kontextual-engine`.
+
+## Migration Priority
+
+1. Documentation and repo shape
+2. Infospace config and artifact model
+3. Evaluation output and metrics history
+4. Collection checks and relationship inspection
+5. Reference infospace pilot
+6. Workflow execution and AI-assisted generation
+7. Service/API layer
+
+## Non-Migration List
+
+Do not migrate these directly into `infospace-bench`:
+
+- General markdown parser/serializer modules
+- Schema generator/validator libraries as reusable tooling
+- Asset management subsystem
+- Rendering/plugin infrastructure
+- GraphQL platform experiments
+- Database and orchestration infrastructure
+- Legacy compatibility layers
+
+## Open Questions
+
+- Should `infospace-bench` depend directly on local sibling repos during early
+  development, or use published/package boundaries from the start?
+- Which reference infospace should be the first maintained pilot: the existing
+  Wealth of Nations/VSM experiment, or a smaller purpose-built corpus?
+- Should State Hub track each infospace as its own workstream once the repo
+  model exists?