markitect-main

Author	SHA1	Message	Date
tegwick	df1fdf1842	feat(pipeline): per-stage max_tokens, LLM provenance, processing log - PipelineStage now supports max_tokens to override the 4096 default - SourcePipeline records provider/model on each entity file as HTML comment - output/processing-log.yaml tracks tokens, cost, duration, retries, errors - _call_llm returns (content, metadata) for downstream traceability - _http.py wraps JSON parse errors with body preview for debugging - infospace.yaml stages: extract/map=6000 tokens, synthesize=3000 tokens Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-19 14:50:49 +01:00
tegwick	5ede1de4b8	fix(pipeline): retry on 0-entity response, save raw debug, improve template - SourcePipeline: retry split_entities stage once when 0 entity delimiters are found (free-tier models intermittently return short non-formatted responses); save raw LLM response to <stage>-raw.md alongside prompts - Return None (pause pipeline) rather than writing empty view file when no entities found after max retries - _http.py: wrap json.JSONDecodeError in LLMAPIError with body preview - extract-entities.md: add explicit H2-heading format example to Output Format section to prevent models from using inline "Section:" format Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-19 14:26:28 +01:00
tegwick	72d9904485	feat(infospace): add process command for batch source file processing - Extend PipelineStage with name, output_dir, output_macro, split_entities, and macros fields for declarative pipeline config - Add SourcePipeline class (pipeline.py) using simple @{macro} substitution — no SQLite dependency, skip-if-exists per stage, LLM retry on rate limits, git commit per source - Add `markitect infospace process [GLOB_PATTERN]` CLI command with --all, --provider, --model, --check-after-each, --no-commit flags - Update infospace.yaml with output_dir, output_macro, split_entities, and macros for each pipeline stage in the WoN example Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-19 13:29:50 +01:00
tegwick	77dd3fee6d	fix(example): standardise domain enum and source chapter format in schema/rules Two root causes of metric fragmentation observed in collection checks: 1. Schema's Economic Domain used free-form examples ("labour economics, trade theory") which overrode the enum in extraction-rules.md, causing the LLM to produce multi-domain strings and non-canonical values. Fix: schema now specifies the exact 7-value enum with descriptions. 2. Source Chapter had no format constraint, producing 9 different formats for 7 chapters (full titles, mixed Roman/Arabic numerals, asterisks). Fix: extraction-rules now mandate "Book [Roman], Chapter [n]" exactly. These fixes are prerequisites for clean reprocessing (S3.2 continuation). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-19 13:02:05 +01:00
tegwick	715ef19d1c	infospace: remove example output — will replay chapter by chapter This commit clears the tangled example output so each chapter can be re-committed cleanly via S3.2.	2026-02-19 09:22:55 +01:00
tegwick	3ac8447c10	feat(example): add baseline metrics snapshot from collection checks run Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details Initial metrics from S2.4 checks on 85 entities (7 of 35 chapters): coverage_ratio=0.361, redundancy=0.0, coherence_components=0.0, consistency_cycles=0.0, granularity_entropy=2.69 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-19 07:44:01 +01:00
tegwick	94cb2063af	feat(example): migrate to infospace config with tooling integration (S3.1) Add infospace.yaml declaring topic, disciplines, schemas, viability thresholds. Integrate infospace tooling into process_chapters.py with --infospace-status, --infospace-check, and --infospace-viability flags. Initial check: 85 entities, 4/5 viable (coverage 0.36 < 0.50 — only 7/35 chapters processed so far). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 02:29:53 +01:00
tegwick	d1c6e53754	docs: add infospace primitives reference (S2.7) Reference document covering all infospace tooling primitives: config, entity metadata, schema validation, per-entity evaluation, collection checks, metrics history, viability, composition, and CLI commands. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 02:05:09 +01:00
tegwick	b76d6d38c1	feat(infospace): add composition model for discipline binding (S2.6) Discipline resolution, viability checking, entity access, stale mapping detection, and binding management. CLI commands: bind-discipline, disciplines, stale-mappings. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 02:03:54 +01:00
tegwick	ce7f78d57d	feat(infospace): add metrics history and viability tracking (S2.5) History module with snapshot creation from check results, metrics file I/O, auto-append to history after checks, date-based snapshot lookup, and metric trend extraction. CLI commands: history, history-diff. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 02:01:00 +01:00
tegwick	11585e6968	feat(infospace): add collection-level quality checks C1–C5 (S2.4) Five concern checks: Redundancy (embedding/word overlap), Coverage (FCA gap analysis), Coherence (graph connectivity), Consistency (cycle detection), Granularity (Shannon entropy). Orchestrator runs all or selected checks, CLI `markitect infospace check` command added. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 01:54:22 +01:00
tegwick	3461d2f354	feat(infospace): add per-entity evaluation pipeline and CLI command (S2.3) Evaluation pipeline builds prompts from entity metadata, delegates to BatchEvaluator, parses structured LLM responses into ScoreEntry objects, and writes evaluation files. CLI: 'markitect infospace evaluate' with --provider, --entity, --chapter filters. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 01:48:34 +01:00
tegwick	3726503adb	feat(infospace): add lifecycle CLI commands — init, status, entities, viability (S2.2) Adds 'markitect infospace' command group with init (create config), status (entity count/domains/disciplines), entities (list with sort), and viability (threshold dashboard with pass/fail). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 01:46:54 +01:00
tegwick	b20fe4db68	feat(infospace): add infospace configuration model and state (S2.1) InfospaceConfig (topic, disciplines, schemas, competency questions, viability thresholds, pipeline) with YAML load/save and directory discovery. InfospaceState aggregates entities, evaluations, and viability checks for status reporting. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 01:44:14 +01:00
tegwick	144a88c0c2	feat(prompts): add batch LLM evaluation orchestrator (S1.6) BatchEvaluator runs evaluation prompts across item batches with incremental evaluation (skip unchanged via content digest), per-item error isolation, progress callbacks, and aggregate token usage tracking. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 01:40:13 +01:00
tegwick	dc22017b7c	feat(analysis): add Formal Concept Analysis for coverage gap detection (S1.7) Pure-Python FCA implementation: FormalContext (entity × attribute binary relation with extent/intent/closure), ConceptLattice via NextClosure algorithm, find_gap_concepts() for structural coverage gaps, and find_empty_cells() for cross-tabulation analysis. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 01:38:35 +01:00
tegwick	f8c9ab33f0	feat(infospace): add structured evaluation output with history and diffing (S1.5) Add data models (ScoreEntry, EntityEvaluation, EvaluationSnapshot, SnapshotDiff) and I/O utilities for YAML frontmatter evaluation files, snapshot persistence, history append, and snapshot diffing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 01:35:22 +01:00
tegwick	bad01e32bd	feat(analysis): add graph analysis utilities with networkx (S1.4) Add connected components, betweenness centrality, Louvain community detection, modularity scoring, degree distribution, and cohesion/coupling computation. Wraps DependencyGraph via networkx (optional dependency) for downstream collection-level coherence metrics. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 01:34:53 +01:00
tegwick	267368eb60	feat(llm): add embedding adapter with cache and similarity utils (S1.3) Add OpenAI-compatible embedding support (works with both OpenAI and OpenRouter), file-based embedding cache with content-digest invalidation, and pure-Python cosine similarity utilities for downstream redundancy detection. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 01:22:21 +01:00
tegwick	9031e1162c	feat(infospace): add schema compliance validator (S1.2) Deterministic validation of EntityMeta against declarative schemas: section presence/word counts, heading format, domain enum values. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 00:48:57 +01:00
tegwick	03c6c5e8de	feat(infospace): add entity metadata parser (S1.1) Extract section-tree algorithm from SchemaGenerator into standalone core/section_tree.py and build markitect/infospace/ package with EntityMeta dataclass and parse_entity_file/parse_entity_directory. Foundation for schema compliance, coverage, and granularity metrics. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 00:27:45 +01:00
tegwick	b5e994b014	docs: preliminary introduction to Viable Information Spaces Conceptual overview of infospaces as structured, evaluable, composable knowledge collections. Establishes the vocabulary (topic, discipline, entity, viability), the build cycle (extract, map, evaluate, refine), the five collection quality concerns, and the composition model (hierarchical, networked, swarm). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 23:54:53 +01:00
tegwick	4ce856d4d0	docs: metrics methodology, collection-level tasks, and infospace tooling roadmap Add METRICS-METHODOLOGY.md documenting the theoretical frameworks (SEQUAL, OntoClean, OOPS!, OntoQA, FCA, DSL principles) adapted for two-layer evaluation (LLM-Eval + deterministic aggregation) across five collection concerns: redundancy, coverage, coherence, consistency, and granularity balance. Extend INFRA-TASKS.md with assignment assessment (tasks 4-7), per-concept metrics (tasks 8-12), and collection-level metrics (tasks 13-19). Add roadmap/infospace-tooling/PLAN.md defining terminology (infospace, topic, discipline, entity, evaluation, viability) and a three-stage implementation plan: Stage 1 platform additions, Stage 2 infospace tooling layer, Stage 3 example revision. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 23:53:21 +01:00
tegwick	2f0989f9bf	docs(infospace): document infospace.db and add to .gitignore The SQLite artifact database is a derived cache regenerable from committed files — no LLM calls needed. Added tutorial section explaining why it is excluded and how to rebuild it after a fresh clone. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 22:27:08 +01:00
tegwick	60f33443ae	feat(schema): add semantic schema generation as default mode Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details schema-generate now builds content-aware schemas from the document's section hierarchy instead of counting markdown syntax elements. Detects key-value tables, data tables, link lists, and mixed content patterns to produce schemas that reflect the actual document outline. Old behavior preserved via --mode syntactic. Validator and visualization tools pinned to syntactic mode for compatibility. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-16 18:49:50 +01:00
tegwick	120ed89780	fix(proxy): catch markitdown missing-dependency errors with clean hint When markitdown is installed but a format-specific sub-dependency is missing (e.g. pdfminer-six for PDF), translate the raw traceback into a DependencyMissingError with the correct install command. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 21:00:51 +01:00
tegwick	9fa239c140	fix(proxy): register markitdown extractor unconditionally Always register MarkitdownExtractor so it overrides specialized extractors for all its extensions. When markitdown-no-magika is not installed, users now see the correct install hint instead of the old pymupdf4llm message. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 20:52:07 +01:00
tegwick	e4fbba8a57	feat(proxy): add markitdown as default proxy backend Uses markitdown-no-magika (lighter fork without magika/onnxruntime) to handle PDF, HTML, DOCX, PPTX, XLSX, XLS, CSV, JSON, and XML files. Specialized extractors (pymupdf4llm, markdownify) remain as fallbacks when markitdown is not installed. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 20:48:47 +01:00
tegwick	ac334c679d	feat(proxy): add proxy file system for non-markdown source conversion Introduces a new `markitect/proxy/` module with pluggable extractors that convert non-markdown sources (PDF, HTML) into tracked markdown proxy files. Proxy files preserve origin metadata (path, checksum, timestamp) so they can be kept in sync when the original changes. CLI commands: `proxy create`, `proxy update`, `proxy status`, `proxy extractors`. Built-in extractors: PDF (pymupdf4llm), HTML (markdownify), Markdown (built-in). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 19:06:09 +01:00
tegwick	69aea1ada7	refactor(version): separate version and release commands `markitect version` now prints a clean version string (Unix style), with -v for commit/branch/dirty. `markitect release` shows detailed development status: commits since tag, local changes, upstream divergence. No overlap between the two commands. Replaces get_version_info()/get_release_info() with get_version() and get_release_status(). Drops yaml output format from release (json + text sufficient). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 17:49:14 +01:00
tegwick	be3b4e3aae	fix(version): resolve version dynamically from git in dev checkouts When running from a git repo, use setuptools-scm at runtime to derive the version from tags. Falls back to the static _version.py only when not in a git repo (e.g. installed from wheel). This ensures `markitect version` stays correct without requiring `pip install -e .` after every tag. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 17:22:38 +01:00
tegwick	ad23bb0b86	fix(version): normalize release info for CLI release command Add _normalize_release_info() to ensure get_release_info() returns keys expected by the CLI release command regardless of whether the release-management capability is available. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 16:37:40 +01:00
tegwick	5085c44de3	feat(llm): add llm-default and llm-preference commands, switch hardcoded default to gemini Add TOML-based config resolution with 7-level priority chain: CLI flags > env var > user preference > directory preference > directory default > user default > hardcoded fallback. New commands: llm-default (view/set/clear defaults), llm-preference (view/set/clear preferences). Each shows only its own scope. llm-check now displays source attribution for resolved provider/model. Existing commands (llm-helper, llm-check) refactored to use resolve_llm() instead of manual resolution. Hardcoded fallback changed from openrouter/aurora-alpha to gemini/gemini-2.5-flash due to persistent OpenRouter 502 errors. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 16:35:44 +01:00
tegwick	4631a9f794	feat(llm): add qwen3-coder-next to catalog and Known Models column Register qwen/qwen3-coder-next under the openrouter provider and extend llm-catalog with a "Known Models" column so all cataloged models are discoverable. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 00:17:57 +01:00
tegwick	269184f7a1	feat(llm): add llm-catalog and llm-check commands, rename helper → llm-helper Consistent llm-* naming scheme for all LLM CLI commands. llm-catalog shows provider metadata and key status; llm-check sends a minimal prompt to verify connectivity. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 00:12:50 +01:00
tegwick	69e2ec25ff	feat(helper): add interactive Q&A helper command Add `markitect helper <QUESTION>` CLI command that answers questions about markitect using its own documentation as LLM context. Uses OpenRouter with openrouter/aurora-alpha by default; model is configurable via --model flag or MARKITECT_HELPER_MODEL env var. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-12 23:28:20 +01:00
tegwick	41773f1320	feat(llm): add OpenAI adapter, entity archive policy, process chapters 5-7 Add OpenAIAdapter for the OpenAI chat completions API (apikey-chatgpt.txt or OPENAI_API_KEY). Set default model to arcee-ai/trinity-large-preview:free for the infospace pipeline and increase max_tokens from 4096 to 8192. Reprocess chapter 05 with Trinity Large (was Gemini: 1 truncated entity, now 19 complete entities). Process chapters 06 (Aurora Alpha, 10 entities) and 07 (Trinity Large, 15 entities including regenerated violent-policy.md). Canonical set now at 85 unique entities. Add entity archive policy: entities are never silently deleted. Retired entities move to output/entities/archive/ with a dated reason header. New CLI option: --archive-entity <slug> --reason "...". The --list output shows the archive count alongside the canonical set. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 23:39:44 +01:00
tegwick	880c1d1374	feat(llm): add Gemini adapter and process book-1-chapter-05 Add GeminiAdapter calling Google's Generative Language REST API (default model: gemini-2.5-flash). Register "gemini" as third provider in the factory and CLI. Add rate-limit retry with exponential backoff to the pipeline's _call_llm helper. Increase default max_tokens from 2000 to 4096. Process book-1-chapter-05 via Gemini free tier — 1 new entity extracted (necessaries-conveniencies-and-amusements-of-life), 41 existing entities correctly skipped by dedup. Canonical set now at 42 unique entities. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 22:54:37 +01:00
tegwick	2d1282a61e	feat(infospace): flat canonical entity set with cross-chapter deduplication Restructure entity storage from per-chapter subdirectories to a flat canonical set in output/entities/. Each entity exists as a single file; duplicates across chapters are detected by slug collision and skipped (first occurrence wins). Chapter views use {{ include }} transclusion to reference shared entity files. Add @{existing_entities} macro to extract-entities template so the LLM knows which entities already exist and focuses on genuinely new ones. Refactor _call_llm() from _execute_llm() for callers that handle their own file I/O. 41 unique entities from 4 chapters (2 duplicates removed). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 22:24:20 +01:00
tegwick	706981c39f	fix(prompts): fix three infrastructure bugs in prompt dependency resolution - ContentMacro: add __post_init__ to auto-derive raw_text when built programmatically, preventing str.replace("", X) corruption - MacroParser: add @{target} shorthand syntax support mapped to REQUIRED kind, updating parse, has_macros, count_macros, and find_macro_positions - Artifact: store content in model and SQLite DB, replace resolver placeholder with actual artifact content, add migration for existing databases Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 20:53:02 +01:00
tegwick	01b9596ce6	docs(examples): add infospace-with-history tutorial Comprehensive walkthrough covering schema design, prompt templates, artifact population, pipeline usage, LLM integration, git history tracking, metrics, and how to complete the remaining 31 chapters. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 01:50:49 +01:00
tegwick	ad84dd3a41	infospace: process book-1-chapter-04 via OpenRouter All 3 stages (entities, mappings, analysis) auto-generated. 1m53s wall time, 9,478 tokens (real), ~$0.07 est. cost. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 01:42:05 +01:00
tegwick	e806a701ca	infospace: process book-1-chapter-03 with LLM integration Auto-generated mappings and analysis via Claude Code CLI adapter. Entities were already present from a previous session. Stats: 5m04s wall time, ~51K estimated tokens, ~$0.35 estimated cost. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 01:32:24 +01:00
tegwick	fecc2fd4fa	feat(llm): add LLM integration module with OpenRouter and Claude Code adapters Implements markitect/llm/ package with concrete LLMAdapter implementations: - OpenRouterAdapter: HTTP via urllib with retry/backoff on 429/5xx - ClaudeCodeAdapter: subprocess-based Claude CLI with stdin piping - Factory pattern: create_adapter("openrouter") or create_adapter("claude-code") - API key resolution chain: constructor > env var > project-root key file - 42 unit tests, 2 integration tests (gated on API key / CLI availability) Also adds the infospace-with-history example with Wealth of Nations VSM analysis pipeline, templates, schemas, source chapters, and processed output for chapters 1-2. process_chapters.py now supports --provider and --model flags for automatic LLM-driven processing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 01:17:58 +01:00
tegwick	360c3b1de2	feat(examples): add content-generator example demonstrating Prompt Dependency Resolution Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details This example demonstrates the full workflow of generating InfoTech primers using MarkiTect's Prompt Dependency Resolution infrastructure. Features demonstrated: - Artifact creation and storage with content-based addressing - PromptTemplate with @{macro} resolution across multiple spaces - Automatic dependency tracking and graph construction - Provenance tracing from outputs back to inputs - Visualization export (Mermaid format) - Incremental execution with change detection Files added: - generate_primers.py: Complete working example - README.md: Quick start guide and architecture overview - TUTORIAL.md: Comprehensive 500+ line tutorial - templates/generate-primer.md: Template with macros - artifacts/topics/: ETL and Microservices topic definitions - artifacts/guidelines/: Authoring rules and research protocol - prepdr/: Original manual system (preserved for reference) Example output: - Generates 2 primers (ETL, Microservices) - Creates 8 artifacts across 4 information spaces - Records 8 dependency edges in SQLite database - Exports dependency graph visualization Run with: cd examples/content-generator && python generate_primers.py Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-09 23:50:07 +01:00
tegwick	8f54a5509e	Included run manifest schema to check prompt dependency resolution against later Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details	2026-02-09 20:38:07 +01:00
tegwick	7b4bd461c9	feat(prompts): implement Phase 8 - Observability & Traceability (FR-11) Complete implementation of Phase 8, the final phase of prompt dependency resolution infrastructure, adding full observability and traceability. ## Features (FR-11) ### FR-11.1: Complete Artifact Provenance Tracing - TraceabilityService: composition layer for full artifact lineage - Trace any artifact to producing PromptTemplate, input artifacts, generator runs, and quality validation results - ProvenanceTrace model with complete dependency chain reconstruction - RunSummary and ArtifactLineage models for structured trace output ### FR-11.2: Recomputation Query Infrastructure - PromptQueryService: cross-service complex queries - Run history queries with template and status filters - Stale artifact detection via impact debt analysis - Dependency graph statistics (nodes, edges, cycles, roots, leaves) - Content-based artifact lookups by digest ### Visualization Support - GraphExporter: DOT (Graphviz) and Mermaid format export - Supports all edge types (requires, generates, includes) - Handles isolated nodes, linear chains, diamonds, and complex graphs ### CLI Commands (prompt group) - `prompt trace <artifact_id>` - Full provenance trace as JSON - `prompt graph <artifact_id>` - Dependency graph (DOT/Mermaid) - `prompt runs` - List execution runs with filters - `prompt debt` - Show impact debt and stale artifacts - `prompt stats` - Dependency graph statistics ## Implementation Source files (8): - markitect/prompts/traceability/models.py - Trace data models - markitect/prompts/traceability/service.py - TraceabilityService - markitect/prompts/visualization/graph.py - Graph export - markitect/prompts/queries/operations.py - PromptQueryService - markitect/prompts/cli.py - Click CLI commands - Package __init__.py files (3) Tests (64 total, all passing): - tests/unit/prompts/test_traceability_service.py (21 tests) - tests/unit/prompts/test_visualization.py (14 tests) - tests/unit/prompts/test_query_operations.py (12 tests) - tests/integration/prompts/test_traceability_workflow.py (7 tests) - tests/integration/prompts/test_prompt_cli.py (10 tests) ## Architecture TraceabilityService is a composition layer that delegates to: - DependencyQueryService (transitive dependency lookups) - QualityValidator (validation history) - IncrementalExecutionEngine (impact debt queries) - Direct repository access (artifacts, edges) No duplicate data storage - all data comes from existing Phase 1-7 infrastructure (artifact repo, dependency repo, validation DB, debt DB). ## Verification All 2250 tests pass with 0 regressions. Phase 8 completes the full 8-phase implementation roadmap. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-09 20:32:18 +01:00
tegwick	704272644c	feat(prompts): implement Phase 7 - Quality & Validation (FR-9, FR-10) Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details Add quality gate framework with schema validation (JSON Schema via jsonschema library), pattern validation (regex-based), multi-gate QualityValidator with SQLite persistence, HaltingPolicyEngine with budget/iteration/improvement checks, and RefinementLoop for iterative execute-validate-halt cycles. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 13:31:37 +01:00
tegwick	bd1d05ba79	feat(prompts): implement Phase 6 - Incremental Execution (FR-7, FR-8) Add change detection, structural diff-based impact analysis, configurable-depth incremental recomputation with circular suppression, and impact debt tracking. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 13:18:27 +01:00
tegwick	9ce157400e	feat(prompts): implement Phase 5 - Dependency Tracking (FR-6) Add directed dependency graph with cycle detection, topological sort, and query service for finding dependents/dependencies transitively. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 13:18:18 +01:00

1 2 3 4 5 ...

565 Commits