markitect-main

Author	SHA1	Message	Date
tegwick	77dd3fee6d	fix(example): standardise domain enum and source chapter format in schema/rules Two root causes of metric fragmentation observed in collection checks: 1. Schema's Economic Domain used free-form examples ("labour economics, trade theory") which overrode the enum in extraction-rules.md, causing the LLM to produce multi-domain strings and non-canonical values. Fix: schema now specifies the exact 7-value enum with descriptions. 2. Source Chapter had no format constraint, producing 9 different formats for 7 chapters (full titles, mixed Roman/Arabic numerals, asterisks). Fix: extraction-rules now mandate "Book [Roman], Chapter [n]" exactly. These fixes are prerequisites for clean reprocessing (S3.2 continuation). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-19 13:02:05 +01:00
tegwick	715ef19d1c	infospace: remove example output — will replay chapter by chapter This commit clears the tangled example output so each chapter can be re-committed cleanly via S3.2.	2026-02-19 09:22:55 +01:00
tegwick	3ac8447c10	feat(example): add baseline metrics snapshot from collection checks run Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details Initial metrics from S2.4 checks on 85 entities (7 of 35 chapters): coverage_ratio=0.361, redundancy=0.0, coherence_components=0.0, consistency_cycles=0.0, granularity_entropy=2.69 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-19 07:44:01 +01:00
tegwick	94cb2063af	feat(example): migrate to infospace config with tooling integration (S3.1) Add infospace.yaml declaring topic, disciplines, schemas, viability thresholds. Integrate infospace tooling into process_chapters.py with --infospace-status, --infospace-check, and --infospace-viability flags. Initial check: 85 entities, 4/5 viable (coverage 0.36 < 0.50 — only 7/35 chapters processed so far). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-19 02:29:53 +01:00
tegwick	4ce856d4d0	docs: metrics methodology, collection-level tasks, and infospace tooling roadmap Add METRICS-METHODOLOGY.md documenting the theoretical frameworks (SEQUAL, OntoClean, OOPS!, OntoQA, FCA, DSL principles) adapted for two-layer evaluation (LLM-Eval + deterministic aggregation) across five collection concerns: redundancy, coverage, coherence, consistency, and granularity balance. Extend INFRA-TASKS.md with assignment assessment (tasks 4-7), per-concept metrics (tasks 8-12), and collection-level metrics (tasks 13-19). Add roadmap/infospace-tooling/PLAN.md defining terminology (infospace, topic, discipline, entity, evaluation, viability) and a three-stage implementation plan: Stage 1 platform additions, Stage 2 infospace tooling layer, Stage 3 example revision. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 23:53:21 +01:00
tegwick	2f0989f9bf	docs(infospace): document infospace.db and add to .gitignore The SQLite artifact database is a derived cache regenerable from committed files — no LLM calls needed. Added tutorial section explaining why it is excluded and how to rebuild it after a fresh clone. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 22:27:08 +01:00
tegwick	41773f1320	feat(llm): add OpenAI adapter, entity archive policy, process chapters 5-7 Add OpenAIAdapter for the OpenAI chat completions API (apikey-chatgpt.txt or OPENAI_API_KEY). Set default model to arcee-ai/trinity-large-preview:free for the infospace pipeline and increase max_tokens from 4096 to 8192. Reprocess chapter 05 with Trinity Large (was Gemini: 1 truncated entity, now 19 complete entities). Process chapters 06 (Aurora Alpha, 10 entities) and 07 (Trinity Large, 15 entities including regenerated violent-policy.md). Canonical set now at 85 unique entities. Add entity archive policy: entities are never silently deleted. Retired entities move to output/entities/archive/ with a dated reason header. New CLI option: --archive-entity <slug> --reason "...". The --list output shows the archive count alongside the canonical set. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 23:39:44 +01:00
tegwick	880c1d1374	feat(llm): add Gemini adapter and process book-1-chapter-05 Add GeminiAdapter calling Google's Generative Language REST API (default model: gemini-2.5-flash). Register "gemini" as third provider in the factory and CLI. Add rate-limit retry with exponential backoff to the pipeline's _call_llm helper. Increase default max_tokens from 2000 to 4096. Process book-1-chapter-05 via Gemini free tier — 1 new entity extracted (necessaries-conveniencies-and-amusements-of-life), 41 existing entities correctly skipped by dedup. Canonical set now at 42 unique entities. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 22:54:37 +01:00
tegwick	2d1282a61e	feat(infospace): flat canonical entity set with cross-chapter deduplication Restructure entity storage from per-chapter subdirectories to a flat canonical set in output/entities/. Each entity exists as a single file; duplicates across chapters are detected by slug collision and skipped (first occurrence wins). Chapter views use {{ include }} transclusion to reference shared entity files. Add @{existing_entities} macro to extract-entities template so the LLM knows which entities already exist and focuses on genuinely new ones. Refactor _call_llm() from _execute_llm() for callers that handle their own file I/O. 41 unique entities from 4 chapters (2 duplicates removed). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 22:24:20 +01:00
tegwick	01b9596ce6	docs(examples): add infospace-with-history tutorial Comprehensive walkthrough covering schema design, prompt templates, artifact population, pipeline usage, LLM integration, git history tracking, metrics, and how to complete the remaining 31 chapters. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 01:50:49 +01:00
tegwick	ad84dd3a41	infospace: process book-1-chapter-04 via OpenRouter All 3 stages (entities, mappings, analysis) auto-generated. 1m53s wall time, 9,478 tokens (real), ~$0.07 est. cost. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 01:42:05 +01:00
tegwick	e806a701ca	infospace: process book-1-chapter-03 with LLM integration Auto-generated mappings and analysis via Claude Code CLI adapter. Entities were already present from a previous session. Stats: 5m04s wall time, ~51K estimated tokens, ~$0.35 estimated cost. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 01:32:24 +01:00
tegwick	fecc2fd4fa	feat(llm): add LLM integration module with OpenRouter and Claude Code adapters Implements markitect/llm/ package with concrete LLMAdapter implementations: - OpenRouterAdapter: HTTP via urllib with retry/backoff on 429/5xx - ClaudeCodeAdapter: subprocess-based Claude CLI with stdin piping - Factory pattern: create_adapter("openrouter") or create_adapter("claude-code") - API key resolution chain: constructor > env var > project-root key file - 42 unit tests, 2 integration tests (gated on API key / CLI availability) Also adds the infospace-with-history example with Wealth of Nations VSM analysis pipeline, templates, schemas, source chapters, and processed output for chapters 1-2. process_chapters.py now supports --provider and --model flags for automatic LLM-driven processing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 01:17:58 +01:00
tegwick	360c3b1de2	feat(examples): add content-generator example demonstrating Prompt Dependency Resolution Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details This example demonstrates the full workflow of generating InfoTech primers using MarkiTect's Prompt Dependency Resolution infrastructure. Features demonstrated: - Artifact creation and storage with content-based addressing - PromptTemplate with @{macro} resolution across multiple spaces - Automatic dependency tracking and graph construction - Provenance tracing from outputs back to inputs - Visualization export (Mermaid format) - Incremental execution with change detection Files added: - generate_primers.py: Complete working example - README.md: Quick start guide and architecture overview - TUTORIAL.md: Comprehensive 500+ line tutorial - templates/generate-primer.md: Template with macros - artifacts/topics/: ETL and Microservices topic definitions - artifacts/guidelines/: Authoring rules and research protocol - prepdr/: Original manual system (preserved for reference) Example output: - Generates 2 primers (ETL, Microservices) - Creates 8 artifacts across 4 information spaces - Records 8 dependency edges in SQLite database - Exports dependency graph visualization Run with: cd examples/content-generator && python generate_primers.py Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-09 23:50:07 +01:00
tegwick	5e3646fdff	feat: complete schema-evolution topic with ADR schema and markdown support This commit closes the schema-evolution topic (260105) by adding the final deliverable (ADR schema) and fixing markdown schema support across commands. ADR Schema Created: - Comprehensive Architecture Decision Record validation schema - 12 section classifications (7 required, 2 recommended, 2 optional, 3 improper/discouraged) - Content pattern validation for ADR formatting rules (status dates, decision statements, rationale structure) - Quality metrics for completeness (word counts, sentence counts) - Follows title case naming convention (Status, Context, Decision, etc.) Markdown Schema Support Fixed: - Fixed `markitect validate` command to support .md schemas - Added load_schema_from_path() for both .json and .md files - Updated structural and semantic validation to use schema dict - Fixed `markitect generate-stub` command to support .md schemas - Uses load_schema_from_path() instead of direct JSON loading - Created DocumentWrapper class in semantic_validator.py - Extracts headings from AST tokens (heading_open, inline) - Provides get_headings_by_level() interface expected by validators - Enables section validation to work with real documents Topic Closure: - Updated SCHEMA_EVOLUTION_WORKPLAN.md with completion summary - Phases 1-3: 100% complete (via Schema-of-Schemas and Semantic Validation) - Phase 4: Deferred as future enhancement (15-20 sessions) - Phase 5: 70% complete (docs done, CI/CD templates deferred) - Created DONE.md with comprehensive task checklist - Generated ADR template stub (examples/templates/adr-template.md) - Moved topic from roadmap/ to history/260105-schema-evolution/ Files Changed: - markitect/cli.py: Added markdown schema support to validate and generate-stub - markitect/semantic_validator.py: Added DocumentWrapper class for AST parsing - markitect/schemas/adr-schema-v1.0.md: New ADR validation schema (560 lines) - examples/templates/adr-template.md: Generated ADR template stub - history/260105-schema-evolution/: Moved completed topic to history Status: Schema evolution topic successfully closed with ADR schema as final deliverable. All schema commands now support markdown schemas. Section validation working correctly. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-06 12:32:38 +01:00
tegwick	d32dc41315	docs: update manpage and terminology examples to schema-of-schemas standard Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details Updated example documentation to use the new schema-of-schemas standard with markdown schema format and multi-schema validation commands. Manpage Example Updates: - Changed schema reference from markdown-manpage-schema.json to manpage-schema-v1.0.md - Updated all commands to use new multi-schema validation syntax - Added examples of number-based validation (markitect schema-validate 2) - Added examples of batch validation (--all, ranges, lists) - Updated integration examples (CI/CD, pre-commit hooks, Makefile) - Documented schema registry workflow Terminology Example Updates: - Changed schema reference from terminology-schema.json to terminology-schema-v1.0.md - Updated all validation commands to use new CLI syntax - Added examples of schema-list and numbered selection - Added batch validation examples - Updated GitHub Actions and pre-commit hook examples - Documented schema registry access methods Key Changes: - All schema filenames now follow {domain}-schema-v{major}.{minor}.md convention - Commands use schema registry with numbered or filename selection - Batch validation examples added throughout - Integration examples updated to new standard - Documentation reflects markdown-first schema format All schemas validated successfully against metaschema. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-05 13:13:24 +01:00
tegwick	b6f95066a3	chore: establish schema-of-schemas workplan and reorganize roadmap This commit sets up the comprehensive workplan for implementing a markdown-first schema management system with naming conventions, versioning, and self-validation capabilities. ## Directory Reorganization - Renamed `todo/` → `roadmap/` for better organization - Created `roadmap/schema-of-schemas/` subdirectory - Moved schema management planning artifacts to dedicated directory ## Planning Artifacts Created ### Workplan & Documentation - WORKPLAN.md (19KB) - Comprehensive 6-phase implementation plan - SCHEMA_MANAGEMENT_PROPOSAL.md - Full analysis with 4 options - SCHEMA_MANAGEMENT_SUMMARY.md - Executive summary - README.md - Quick reference guide ### Example Schema - examples/schemas/manpage-schema-v1.md - Demonstrates markdown format ## Schema Management System Design ### Naming Convention Format: `{domain}-schema-v{major}.{minor}.md` Examples: - `manpage-schema-v1.0.md` - `terminology-schema-v1.0.md` - `api-documentation-schema-v1.0.md` ### Markdown-First Format Schemas will be markdown files with: - YAML frontmatter for metadata - Rich documentation sections - Embedded JSON schema in code block - Version history and examples ### Implementation Phases (8-10 days) Phase 0: Planning & Setup ✅ (0.5 days) - COMPLETE Phase 1: Filename Convention (1 day) - NEXT Phase 2: Markdown Loader (2-3 days) Phase 3: Schema-for-Schemas (2 days) Phase 4: Schema Migration (1-2 days) Phase 5: CLI & Documentation (1 day) Phase 6: Testing & Validation (1 day) ### Goals 1. ✅ Establish naming convention 2. ⏳ Implement filename validation 3. ⏳ Create markdown schema loader 4. ⏳ Build schema-for-schemas metaschema 5. ⏳ Migrate 5 existing schemas (remove 2 duplicates) 6. ⏳ Update CLI and documentation ## Updated Tracking ### TODO.md - Added Schema-of-Schemas as active work item - Documented Phase 1 tasks and timeline - Paused capability extraction work ### CHANGELOG.md - Added schema management system to [Unreleased] - Documented directory reorganization - Added "In Progress" section for current work ## Next Steps Begin Phase 1: 1. Implement schema_naming.py with validation 2. Add unit tests 3. Update CLI schema-ingest command 4. Create naming specification document ## Files Changed - CHANGELOG.md - Added unreleased schema management features - TODO.md - Updated active work tracking - roadmap/ - Reorganized from todo/ - roadmap/schema-of-schemas/ - New planning directory - examples/schemas/ - Example markdown schema 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-04 23:47:02 +01:00
tegwick	6df9b5df05	feat: add terminology schema example and improve schema-list command This commit completes Phase 2 of schema evolution work and establishes a new example demonstrating schema usage for terminology documents. ## New Features ### Terminology Validation Example (examples/terminology/) - Complete example terminology document with proper structure - JSON schema with MarkiTect extensions for validation - Demonstrates schema usage beyond manpages (glossaries, lexicons) - Validates term structure: Definition, Synonyms, Related Terms, Examples - Includes content control and quality validation rules - Full documentation with usage examples and best practices ### Schema Registration System - Registered terminology schema in markitect database - Created schema catalog (markitect/schemas/schema-catalog.yaml) - Copied schema to official location (markitect/schemas/) - Provides metadata, features, and usage info for all schemas ### Improved schema-list Command - Now displays creation timestamps in default output - Table format includes Created/Updated columns - Cleaner timestamp formatting (removed microseconds) - Better visibility into when schemas were added ## Files Changed Added: - examples/terminology/README.md - Complete documentation - examples/terminology/terminology-example.md - Example glossary - examples/terminology/terminology-schema.json - Validation schema - markitect/schemas/terminology-schema.json - Registered schema - markitect/schemas/schema-catalog.yaml - Schema registry Modified: - markitect/cli.py - Enhanced schema-list with timestamps - TODO.md - Documented Phase 2 completion and new example Moved: - SCHEMA_EVOLUTION_WORKPLAN.md → todo/ directory ## Schema Features Demonstrated - Heading hierarchy validation (H1 → H2 → H3) - Term structure validation with required/optional fields - Content quality metrics (word counts, readability targets) - MarkiTect extensions (x-markitect-sections, x-markitect-content-control) - Classification system (required/recommended/optional/discouraged/improper) ## Usage ```bash # List schemas with timestamps markitect schema-list # Validate terminology document markitect validate glossary.md --schema terminology-schema.json # View in table format markitect schema-list --format table ``` 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-04 23:07:36 +01:00
tegwick	82c1a3ab65	docs: add OPTIONS section to schema validation manpage Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details Added comprehensive OPTIONS section with 18 command-line options organized into 4 categories: 1. Validation Options (5 options) - --schema, --schema-json, --detailed-errors, --error-format, --quiet 2. Schema Generation Options (3 options) - --output, --style, --title 3. Schema Management Options (4 options) - --schema-list, --schema-info, --schema-delete, --confirm 4. Phase 2 Schema Refinement Options (6 options) - --verbose, --dry-run, --interactive, --loosen-counts, --round-numbers, --migrate-deprecated This addresses the schema recommendation: - Before: OPTIONS section missing (recommended but not present) - After: OPTIONS section present with 424 words, 22 documented options The manpage now fully complies with all schema recommendations: ✅ All required sections present (SYNOPSIS, DESCRIPTION) ✅ All recommended sections present (OPTIONS, EXAMPLES, SEE ALSO, COPYRIGHT) ✅ Document still validates successfully 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-04 21:49:03 +01:00
tegwick	c46d9f7a0b	docs: update schema validation manual with Phase 1 features Comprehensively document the new classification system and content control features added in Phase 1. ## Documentation Updates ### New Content Added 1. Updated MarkiTect Extensions Section - Replaced deprecated x-markitect-required/recommended-sections - Documented x-markitect-sections with five classification levels - Documented x-markitect-content-control for content validation 2. Added Section Classification System (150+ lines) - Detailed explanation of all five classification levels: - required: Missing = ERROR - recommended: Missing = WARNING - optional: No validation impact - discouraged: Present = WARNING - improper: Present = ERROR - Validation behavior for each classification - JSON examples for each level 3. Added Content Control Documentation - Pattern validation (required/discouraged/forbidden) - Content quality metrics (word count, readability targets) - Content instructions for authors - Complete examples with explanations 4. Updated Schema Design Best Practices - Replaced old extension examples with new classification system - Added guidance on choosing appropriate classifications - Examples showing required, recommended, optional, discouraged, improper 5. Added Classification System Example - Complete working schema demonstrating all features - Validation scenarios showing different outcomes - Integration of sections and content-control extensions ## Changes Summary Lines Added: ~200 lines of new documentation Sections Updated: 4 major sections Examples Added: 8 new code examples Key Topics Covered: - Five-level classification system (required → improper) - Content pattern validation - Quality metrics and readability targets - Content instructions for document authors - Validation behavior for each classification - Complete working examples ## Validation ✅ Manual validates against improved markdown-manpage-schema.json ✅ All new features documented with examples ✅ Backward compatibility maintained ✅ Self-documenting: manual uses the features it documents The manual now comprehensively documents the Phase 1 enhanced schema system while itself validating against a schema using those features. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-04 21:20:27 +01:00
tegwick	2b687a4ca8	refactor: upgrade manpage schema to use new classification system Modernize the original markdown-manpage-schema.json to leverage Phase 1 classification features for improved flexibility and content guidance. ## Changes Replaced old extension format: ```json "x-markitect-required-sections": ["SYNOPSIS", "DESCRIPTION"], "x-markitect-recommended-sections": ["OPTIONS", "EXAMPLES"], "x-markitect-optional-sections": ["COMMANDS", "FILES"] ``` With new classification system: ```json "x-markitect-sections": { "SYNOPSIS": { "classification": "required", "heading_level": 2, "content_instruction": "...", "error_message": "..." } } ``` ## New Features Added Section Classifications: - 2 required: SYNOPSIS, DESCRIPTION - 4 recommended: OPTIONS, EXAMPLES, SEE ALSO, COPYRIGHT - 7 optional: COMMANDS, CONFIGURATION, FILES, EXIT STATUS, ENVIRONMENT, BUGS, AUTHORS Content Control: - Synopsis: Required patterns for command syntax, discouraged TODO/FIXME - Description: Quality metrics (50-1000 words), forbidden credential patterns - Examples: Required code blocks and comments Enhanced Guidance: - Per-section content instructions for authors - Custom error/warning messages - Alternative section names (e.g., OPTIONS \| GLOBAL OPTIONS \| FLAGS) - Content quality targets (word count, readability level) ## Validation ✅ Tested: markdown-schema-validation.1.md still validates successfully ✅ Backward compatible: Existing validation behavior preserved ✅ Enhanced: Now provides content guidance and flexible classifications This demonstrates the practical value of Phase 1 enhancements - the same schema now offers much richer validation and authoring guidance. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-04 21:09:34 +01:00
tegwick	d68e762612	feat: implement Phase 1 - Enhanced Schema Format with Classifications Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details Complete Phase 1 of Schema Evolution Workplan implementing flexible content control and section classification system. ## New Features ### 1. x-markitect-sections Extension - Five classification levels: required, recommended, optional, discouraged, improper - Per-section content constraints (paragraphs, code blocks, lists) - Position hints for section ordering - Custom error/warning messages - Alternative section names support - Content instructions for authors ### 2. x-markitect-content-control Extension - Required/discouraged/forbidden pattern matching - Content quality metrics (word count, readability target, sentence count) - Content instruction arrays - Link validation configuration ### 3. Metaschema Validation - Updated markitect-metaschema.json with complete validation rules - Enhanced metaschema.py with validation methods for both extensions - Comprehensive validation of all extension properties - Clear error messages for invalid schemas ### 4. Documentation & Examples - Complete specification in docs/specifications/schema-extensions-spec.md - Enhanced manpage schema demonstrating all 5 classification levels - API documentation schema showing alternative patterns - Detailed usage examples and validation behavior ## Implementation Details Files Modified: - markitect/schemas/markitect-metaschema.json: Added extension definitions - markitect/metaschema.py: Added _validate_sections() and _validate_content_control() Files Created: - docs/specifications/schema-extensions-spec.md: Complete specification (v1.0) - examples/manpages/enhanced-manpage-schema.json: Demonstrates all classifications - examples/manpages/api-documentation-schema.json: Shows API doc patterns ## Validation Behavior Classification Levels: - required: Missing = ERROR (validation fails) - recommended: Missing = WARNING (validation succeeds with warnings) - optional: No validation impact - discouraged: Present = WARNING (validation succeeds with warnings) - improper: Present = ERROR (validation fails) ## Next Steps Phase 2: Schema Refinement Tools (schema-analyze, schema-refine, schema-compose) Phase 3: Enhanced Validation Engine (classification-aware validation, quality metrics) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-04 21:02:51 +01:00
tegwick	b51999582e	feat: add manpages example demonstrating schema validation Add comprehensive example showcasing schema validation with self-documenting manpage system: - markdown-manpage-schema.json: Reusable schema for Unix manpage structure - markdown-schema-validation.1.md: Complete manual about schema validation - README.md: Usage guide, integration examples, and best practices - SCHEMA_EVOLUTION_WORKPLAN.md: Roadmap for enhanced schema system The manual validates against its own schema, demonstrating dogfooding principle. Workplan outlines 5-phase evolution from rigid structural validation to flexible content control with blueprints. Key features demonstrated: - Schema-driven documentation structure - Self-validating documentation - Reusable validation patterns - Classification system design (required/recommended/optional/discouraged/improper) This sets foundation for Phase 1 implementation: enhanced schema format with section classification and content control. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-04 20:58:05 +01:00
tegwick	2e6f292e48	docs: Add design pattern examples and update submodule Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details Add Design Pattern Documentation: - Add CopyFirstMigration.md - Documents the copy-first migration principle used in the TestDrive-JSUI capability migration - Add DontRepeatYourself.md - Documents the DRY principle - Add DesignPrincipleSchema.json - JSON schema for design pattern documentation Update Submodule: - Update testdrive-jsui submodule pointer to include Phase 4 documentation (migration completion with legacy file cleanup) Context: These design pattern examples document the principles applied during the successful TestDrive-JSUI migration, which serves as a reference implementation of the copy-first migration pattern. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-16 17:00:31 +01:00
tegwick	3a353b4d4f	feat: implement comprehensive asset shipping for md-render command Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details Add automatic asset copying when rendering markdown to different output directories with intelligent defaults and full user control. Key Features: - Environment variable support: MARKITECT_OUTPUT_DIR sets default output directory - Smart defaults: auto-ship assets for directory output, disabled for file output - CLI control flags: --ship-assets and --no-ship-assets for explicit control - Timestamp-based copying: only copies when source newer than destination - Path preservation: maintains relative directory structure in output - Graceful error handling: missing assets logged as warnings, not failures Technical Implementation: - Enhanced asset discovery in markitect/assets/discovery.py with discover_assets_from_markdown() - Added environment variable priority: CLI --output > MARKITECT_OUTPUT_DIR > input directory - Comprehensive asset shipping logic with _ship_assets() function - Directory vs file output detection for intelligent default behavior Examples and Testing: - Added image-assets example directory with 6 sample images and comprehensive README - Created comprehensive TDD test suite with 10 tests covering all functionality - Tests validate environment variables, CLI flags, asset discovery, shipping logic, timestamp handling, missing assets, path preservation, and default behaviors Usage: markitect md-render file.md -o /output/dir/ # Auto-ships assets markitect md-render file.md --no-ship-assets # Suppresses shipping MARKITECT_OUTPUT_DIR=/docs markitect md-render file.md # Uses env var 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-29 23:12:44 +01:00
tegwick	ed33766c91	refactor: reorganize examples directory with topic-based subdirectories Reorganize examples directory into logical topic-based subdirectories with comprehensive documentation: - templates/: ISO/ARC42 documentation templates - asset-management/: Asset management prototypes and demos - essays/: Long-form content examples - invoicing/: Invoice generation examples - plugins/: Plugin development examples - issue-demos/: Issue prevention demonstrations - design-patterns/: Design pattern examples Each subdirectory includes a README.txt file with topic description and contributor signatures based on file creation timestamps. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-29 22:31:52 +01:00
tegwick	81d3da5fe7	feat: comprehensive asset management system and testing improvements Some checks failed Test Suite / unit-tests (3.11) (push) Has been cancelled Details Test Suite / unit-tests (3.12) (push) Has been cancelled Details Test Suite / integration-tests (push) Has been cancelled Details Test Suite / e2e-tests (push) Has been cancelled Details Test Suite / performance-tests (push) Has been cancelled Details Test Suite / code-quality (push) Has been cancelled Details Test Suite / security-scan (push) Has been cancelled Details Test Suite / test-summary (push) Has been cancelled Details Asset Management System (Issue #142): - Add complete asset management framework with deduplication - Implement AssetManager, AssetRegistry, and AssetDeduplicator classes - Add AssetPackager for markdown document packaging - Create comprehensive test suite for all asset management components - Add asset constants and custom exceptions for robust error handling Markdown Processing Enhancements: - Update markdown_commands.py with improved functionality - Enhanced parsing and content aggregation capabilities - Improved filename encoding/decoding for special characters Test Suite Improvements: - Add comprehensive tests for Issue #138 markdown parsing - Enhance Issue #139 content aggregation and end-to-end testing - Complete test coverage for new asset management features Examples and Documentation: - Update BildungsKanonJon.md example with enhanced content - Generate corresponding HTML output for documentation - Add asset registry configuration Development Tools: - Add install script for simplified setup This commit represents a major enhancement to MarkiTect's asset handling capabilities with full test coverage and improved markdown processing. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-12 19:57:31 +02:00
tegwick	ed9325f5ab	chore: added missing suffix	2025-10-08 10:24:50 +02:00
tegwick	2f878a7138	chore: commit examples and some cleanup	2025-10-08 10:14:51 +02:00
tegwick	5e0e6c395e	feat: complete Issue #141 asset management concepts with working prototypes Comprehensive analysis and implementation concepts for handling images and file includes with automatic deduplication based on MarkdownPackageFormats wiki study. ## Two Complete Concepts Delivered ### Concept A: Hash-Based Asset Store - Content-addressable storage using SHA-256 hashes - SQLite database for virtual name mapping and metadata - Perfect deduplication regardless of filename - Hash-based directory structure for optimal storage - Working prototype with 47 KB of implementation code ### Concept B: Package + Symlinks System (RECOMMENDED) - ZIP-based .mdpkg packages following wiki standards - Symlink-based deduplication in shared asset library - Compatible with standard tools and workflows - Visual transparency and tool integration - Working prototype with 51 KB of implementation code ## Key Features Demonstrated - ✅ Content deduplication: Same image content → single storage - ✅ Multiple names: Different filenames for identical content - ✅ Database integration: Asset metadata queryable and indexed - ✅ Package portability: ZIP-based distribution format - ✅ Working demos: Both concepts fully functional ## Analysis Results - Perfect Deduplication: Both concepts eliminate duplicate content storage - Implementation Complexity: Concept B more approachable, Concept A more efficient - Platform Compatibility: Concept A universal, Concept B symlink-dependent - User Experience: Concept B familiar workflows, Concept A requires tooling ## Technical Approach - Based on MarkdownPackageFormats wiki standards (.mdpkg, .mdz formats) - Python standard library (hashlib, sqlite3, zipfile, pathlib) - Content-addressable storage patterns for efficiency - Manifest-based metadata for package integrity ## Recommendations 1. Start with Concept B for rapid prototyping and user acceptance 2. Evolve to hybrid approach incorporating Concept A's hash-based efficiency 3. Follow .mdpkg standards for interoperability with emerging ecosystem 4. Implement CLI integration for seamless markitect workflow Both concepts solve the core requirements with working prototypes and clear trade-offs. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-08 01:51:54 +02:00
tegwick	b0de32d083	feat: implement comprehensive plugin architecture and extensions system (issue #19 ) Complete plugin system implementation providing extensible architecture for MarkiTect: 🏗️ Core Plugin Architecture: - BasePlugin abstract class with lifecycle management (initialize/cleanup) - Specialized plugin types: ProcessorPlugin, FormatterPlugin, ValidatorPlugin, ExporterPlugin, CommandPlugin - PluginMetadata system with version, dependencies, and type information - Plugin initialization and configuration validation 🔍 Plugin Discovery & Management: - PluginManager with automatic discovery from built-in modules and directories - PluginRegistry for centralized plugin registration and lifecycle management - Support for plugin loading, unloading, and reloading with configuration - Plugin discovery from multiple sources (built-in, directories, packages) 🛠️ CLI Integration: - markitect plugin-list: List all available plugins with metadata - markitect plugin-load: Load plugins with optional configuration - markitect plugin-unload: Unload plugins and cleanup resources - markitect plugin-info: Show detailed plugin information - markitect plugin-discover: Discover and refresh plugin catalog 📦 Built-in Plugins: - JSON/YAML/Table formatters for output formatting - Markdown/Text processors for content processing - Auto-registered via @register_plugin decorator - Comprehensive configuration options 🔧 Developer Experience: - @register_plugin decorator for easy plugin registration - Plugin configuration validation and error handling - Comprehensive API documentation with examples - Plugin development guide and best practices 📋 Example Plugins: - Advanced text processor with case conversion and pattern replacement - XML/CSV formatters demonstrating custom output formats - Complete examples showing plugin development patterns 🧪 Test Coverage: - 59 comprehensive tests covering all plugin functionality - Tests for plugin lifecycle, registration, discovery, and CLI integration - Error handling and edge case coverage - Built-in plugin validation Technical Implementation: - Plugin types: processor, formatter, validator, exporter, generator, importer, transformer, extension, backend, command - Configuration-driven plugin management with YAML/JSON support - Graceful error handling and plugin isolation - Plugin dependency validation and compatibility checking 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-03 11:23:32 +02:00
tegwick	935cae67e5	docs: added templates for usecase experiments	2025-10-03 00:39:10 +02:00
tegwick	27f4f6b1b1	feat: Add practical use case examples and comprehensive gap analysis - Created invoice template demonstrating business document requirements - Added design pattern example showing knowledge management use case - Included sample data file for template + data scenarios - Comprehensive gap analysis identifying 6 critical tooling limitations - Documented 3-phase development roadmap for enhanced capabilities - Based on Issue #63 use case brainstorming requirements Key gaps identified: 1. Template engine for dynamic document generation 2. Calculation system for mathematical operations 3. Batch processing for multi-document workflows 4. External data integration capabilities 5. Cross-document relationship management 6. Advanced output format support Ready for requirements engineering and epic decomposition. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-02 10:16:16 +02:00
tegwick	3af6fb9935	feat: Integrate Requirements Engineering Agent and fix Issue #59 test failures ## Major Integration - ✅ Integrated Requirements Engineering Agent into development workflow - ✅ Enhanced Makefile with requirements validation targets - ✅ Added pre-commit validation with mock compatibility checking - ✅ Enhanced TDD workflow to include foundation analysis ## Test Fixes - ✅ Fixed GiteaPlugin missing _add_comment_async method - ✅ Fixed LocalPlugin config.yml file not found errors in tests - ✅ Enhanced mock objects in CLI tests with proper domain model attributes - ✅ All Issue #59 tests now passing (38/38 tests pass) ## New Capabilities - `make validate-requirements` - Foundation analysis before development - `make check-interface-compatibility INTERFACE=Name` - Interface compatibility checking - `make generate-dev-checklist FEATURE='Name'` - Development checklist generation - `make validate-mocks` - Mock object compatibility validation - `make pre-commit-validate` - Complete pre-commit validation workflow ## Problem Prevention This integration prevents the exact interface compatibility issues and mock object mismatches that caused hours of debugging in Issue #59. The Requirements Engineering Agent provides proactive foundation analysis and catches problems before they occur. 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-02 00:45:06 +02:00

34 Commits