markitect-main/markitect at main

coulomb/markitect-main

tegwick e3e5b8ecc1

Test Suite / unit-tests (3.11) (push) Has been cancelled
Test Suite / unit-tests (3.12) (push) Has been cancelled
Test Suite / integration-tests (push) Has been cancelled
Test Suite / e2e-tests (push) Has been cancelled
Test Suite / performance-tests (push) Has been cancelled
Test Suite / code-quality (push) Has been cancelled
Test Suite / security-scan (push) Has been cancelled
Test Suite / test-summary (push) Has been cancelled

feat(infospace): systematic long-text processing — rich commit bodies, per-source eval/classify, chapters view

Three coordinated changes that let the pipeline produce a clean
chapter-by-chapter git history on long texts without archaeology after
the fact.

1. Richer commit messages. `SourcePipeline._git_commit` now diffs the
   staged changes, buckets added files by output subdirectory (entities,
   evaluations, classifications, mappings, analyses, metrics, logs), and
   includes counts in the commit body. So `git log` reads "entities:
   +23, evaluations: +23" per chapter instead of the same generic blurb
   on every commit. Zero behaviour change when no output changed; falls
   back to the original message if the diff query fails.

2. `--eval-after-source` / `--classify-after-source` on `infospace process`.
   After a source's stages succeed, the pipeline identifies which entity
   files are *new* (set diff of entity slugs before vs after), loads
   their EntityMeta, and runs per-entity evaluation and/or
   classification scoped to just those slugs before the per-source git
   commit lands. Result: each chapter's commit is self-contained —
   extraction + evaluation + classification in one atomic unit. Gated
   behind explicit flags because the cost is real (LLM latency per
   chapter rather than amortised across one bulk batch).

3. `markitect infospace chapters` subcommand. Lists source files in
   canonical order with entity count, evaluated count, classified
   count, and mean per-entity score per source. Text or JSON output.
   Natural triage surface for long-text infospaces — spot chapters that
   under-extracted or evaluated poorly.
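
The bucket counting from item 1 can be sketched roughly as follows. This is a minimal illustration, not the code in `SourcePipeline._git_commit`: the function names `bucket_counts` and `commit_body` are hypothetical, and so is the assumption that a bucket is recognised by any path component matching one of the subdirectory names listed above. The list of added paths would have to come from the staged diff, for example from `git diff --cached --name-only --diff-filter=A`; how the real pipeline queries it is not stated here.

```python
from collections import Counter
from pathlib import PurePosixPath

# Bucket names are taken from the commit message. Matching on any path
# component is an assumption about the output layout.
BUCKETS = ("entities", "evaluations", "classifications",
           "mappings", "analyses", "metrics", "logs")


def bucket_counts(added_paths):
    """Count added files per known output subdirectory (hypothetical helper)."""
    counts = Counter()
    for path in added_paths:
        parts = PurePosixPath(path).parts
        # Pick the first path component that names a known bucket, if any.
        bucket = next((p for p in parts if p in BUCKETS), None)
        if bucket is not None:
            counts[bucket] += 1
    return counts


def commit_body(added_paths):
    """Render e.g. 'entities: +2, evaluations: +1'; empty if nothing matched."""
    counts = bucket_counts(added_paths)
    return ", ".join(f"{b}: +{counts[b]}" for b in BUCKETS if counts[b])
```

An empty result from `commit_body` would correspond to the "no output changed" case, where the caller keeps the original message.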

Also: `docs/advanced-usage.md` gets a new "Systematic processing of
long texts" section with the recommended flag combo and the tradeoff
note on cost.
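
The "new entity" detection that `--eval-after-source` and `--classify-after-source` rely on is described above as a set diff of entity slugs before versus after a source runs. A minimal sketch of that idea follows. It assumes that a slug is the file stem of a `*.md` file in a single entities directory, and the helper names `entity_slugs` and `new_slugs` are hypothetical rather than taken from the codebase.

```python
from pathlib import Path


def entity_slugs(entities_dir):
    """Return the set of entity slugs in a directory (assumed: stems of *.md files)."""
    root = Path(entities_dir)
    if not root.is_dir():
        return set()
    return {p.stem for p in root.glob("*.md")}


def new_slugs(before, after):
    """Slugs present after the source ran but not before, sorted for stable output."""
    return sorted(set(after) - set(before))
```

A caller would snapshot `entity_slugs(...)` before the source's stages, snapshot again afterwards, and pass only `new_slugs(before, after)` to evaluation and classification, so the extra LLM cost scales with what the chapter actually added.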

11 new unit tests cover the chapters command (text/json/no-sources),
the process flag wiring (help + provider requirement), and the
commit-body bucket logic. Full infospace+llm unit suite (315 tests)
green; 3 pre-existing infospace failures unchanged.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

2026-04-22 08:24:26 +02:00

feat(analysis): add Formal Concept Analysis for coverage gap detection (S1.7)

2026-02-19 01:38:35 +01:00

refactor: failed attempt at edit mode recovery and robustness implementation

2025-11-12 00:19:03 +01:00

feat(infospace): add entity metadata parser (S1.1)

2026-02-19 00:27:45 +01:00

explode_variants

feat: complete Issue #150 - Advanced Packaging Features (.mdz, .mdt)

2025-10-13 23:09:18 +02:00

feat: reorganize tests by capability with separate test targets

2025-10-25 02:37:45 +02:00

feat(spaces): implement Phase 6 API Layer

2026-02-08 12:29:11 +01:00

feat(infospace,llm): agent ergonomics — entity lookup, model fallback, better errors

2026-04-22 01:07:25 +02:00

feat(infospace): systematic long-text processing — rich commit bodies, per-source eval/classify, chapters view

2026-04-22 08:24:26 +02:00

feat: Complete Issue #39 - Database CLI Reorganization with Comprehensive Legacy Compatibility System

2025-09-30 17:28:39 +02:00

feat(infospace,llm): stabilize free-tier eval workflow

2026-04-22 00:51:00 +02:00

matter_contentmatter

feat: Complete Issue #38 - Full MarkdownMatters CLI implementation with TDD8 methodology

2025-10-02 09:14:24 +02:00

matter_frontmatter

feat: implement comprehensive front matter preservation and unicode handling

2025-10-13 20:26:08 +02:00

matter_tailmatter

feat: Complete Issue #38 - Full MarkdownMatters CLI implementation with TDD8 methodology

2025-10-02 09:14:24 +02:00

feat: complete Issue #150 - Advanced Packaging Features (.mdz, .mdt)

2025-10-13 23:09:18 +02:00

feat: consolidate testdrive-jsui to capabilities and implement plugin self-declaration

2025-12-15 23:42:54 +01:00

feat: complete Issue #145 - Phase 4: Production Readiness and Release

2025-10-14 18:15:26 +02:00

feat: implement comprehensive User Profile Management System (issue #107)

2025-10-04 01:53:31 +02:00

feat(llm): extract adapter layer for standalone llm-connect package (S1+S2)

2026-02-27 08:04:50 +01:00

fix(proxy): catch markitdown missing-dependency errors with clean hint

2026-02-13 21:00:51 +01:00

query_paradigms

feat: reorganize tests by capability with separate test targets

2025-10-25 02:37:45 +02:00

feat(schema): add semantic schema generation as default mode

2026-02-16 18:49:50 +01:00

feat: add changelog schema for Keep a Changelog validation

2026-01-06 13:31:02 +01:00

feat(spaces): implement Phase 8 Git History Tracking

2026-02-08 18:03:35 +01:00

feat: Complete Phase 4 - Remove legacy JavaScript files

2025-12-16 10:27:05 +01:00

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00

feat: Complete Issue #65 Template Engine Foundation + Fix CLI Regression

2025-10-02 15:33:32 +02:00

fix: CSS injection and theme application bugs

2025-12-17 12:02:42 +01:00

feat: implement modular theme system with file-based theme organization

2025-11-10 11:43:25 +01:00

feat: add LinkValidator for semantic link validation (Phase 3)

2026-01-06 03:41:03 +01:00

__init__.py

chore: gitignore and repo cleanup

2025-09-16 03:04:18 +02:00

__version__.py

refactor(version): separate version and release commands

2026-02-13 17:49:14 +01:00

asset_commands.py

feat: implement Issue #143 - CLI integration and user experience for asset management

2025-10-14 13:46:34 +02:00

associated_files.py

feat: Complete Issue #40 - Associated Files Management with Interactive vs Automation Mode System

2025-09-30 13:09:37 +02:00

ast_cache.py

feat: Strategic pivot to CLI implementation with comprehensive foundation

2025-09-24 01:14:27 +02:00

ast_service.py

fix: Improve AST display content visibility for Issue #15

2025-09-27 09:31:47 +02:00

batch_processor.py

feat: implement batch processing and recursive operations (issue #17)

2025-10-03 10:45:43 +02:00

cache_service.py

feat: Complete logging standardization with context-aware system

2025-09-27 08:28:10 +02:00

clean_document_manager.py

refactor(version): separate version and release commands

2026-02-13 17:49:14 +01:00

cli_utils.py

feat: implement Issue #143 - CLI integration and user experience for asset management

2025-10-14 13:46:34 +02:00

cli.py

feat(cli): add 'helper' alias for markitect helper command

2026-02-23 05:40:11 +01:00

config_manager.py

feat: optimize code quality with pylint analysis and critical fixes - Issue #130

2025-10-06 03:03:38 +02:00

database.py

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00

document_manager.py

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00

draft_generator.py

refactor: enhance draft generator documentation and code quality

2025-10-03 10:35:16 +02:00

exceptions.py

feat: optimize code quality with pylint analysis and critical fixes - Issue #130

2025-10-06 03:03:38 +02:00

frontmatter.py

feat: optimize code quality with pylint analysis and critical fixes - Issue #130

2025-10-06 03:03:38 +02:00

legacy_compat.py

feat: optimize code quality with pylint analysis and critical fixes - Issue #130

2025-10-06 03:03:38 +02:00

legacy_integration_example.py

feat: Complete Issue #39 - Database CLI Reorganization with Comprehensive Legacy Compatibility System

2025-09-30 17:28:39 +02:00

metaschema.py

feat: implement Phase 1 - Enhanced Schema Format with Classifications

2026-01-04 21:02:51 +01:00

parser.py

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00

performance_tracker.py

feat: optimize code quality with pylint analysis and critical fixes - Issue #130

2025-10-06 03:03:38 +02:00

schema_analyzer.py

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00

schema_generator.py

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00

schema_loader.py

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00

schema_naming.py

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00

schema_refiner.py

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00

schema_validator.py

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00

semantic_validator.py

feat: complete schema-evolution topic with ADR schema and markdown support

2026-01-06 12:32:38 +01:00

serializer.py

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00

stub_generator.py

feat: optimize code quality with pylint analysis and critical fixes - Issue #130

2025-10-06 03:03:38 +02:00

validation_error.py

feat: Complete Issue #8 - Detailed Validation Error Reporting and CLI Enhancements

2025-09-29 21:21:21 +02:00

workspace.py

feat(spaces): implement Phase 0-1 of Information Space Service

2026-02-08 02:02:46 +01:00