coulomb/markitect-main

Fork 0

Go to file

tegwick e3e5b8ecc1

Test Suite / unit-tests (3.11) (push) Has been cancelled

Details

Test Suite / unit-tests (3.12) (push) Has been cancelled

Details

Test Suite / integration-tests (push) Has been cancelled

Details

Test Suite / e2e-tests (push) Has been cancelled

Details

Test Suite / performance-tests (push) Has been cancelled

Details

Test Suite / code-quality (push) Has been cancelled

Details

Test Suite / security-scan (push) Has been cancelled

Details

Test Suite / test-summary (push) Has been cancelled

Details

feat(infospace): systematic long-text processing — rich commit bodies, per-source eval/classify, chapters view

Three coordinated changes that let the pipeline produce a clean
chapter-by-chapter git history on long texts without archaeology after
the fact.

1. Richer commit messages. `SourcePipeline._git_commit` now diffs the
   staged changes, buckets added files by output subdirectory (entities,
   evaluations, classifications, mappings, analyses, metrics, logs), and
   includes counts in the commit body. So `git log` reads "entities:
   +23, evaluations: +23" per chapter instead of the same generic blurb
   on every commit. Zero behaviour change when no output changed; falls
   back to the original message if the diff query fails.

2. --eval-after-source / --classify-after-source on `infospace process`.
   After a source's stages succeed, the pipeline identifies which entity
   files are *new* (set diff of entity slugs before vs after), loads
   their EntityMeta, and runs per-entity evaluation and/or
   classification scoped to just those slugs before the per-source git
   commit lands. Result: each chapter's commit is self-contained —
   extraction + evaluation + classification in one atomic unit. Gated
   behind explicit flags because the cost is real (LLM latency per
   chapter rather than amortised across one bulk batch).

3. `markitect infospace chapters` subcommand. Lists source files in
   canonical order with entity count, evaluated count, classified
   count, and mean per-entity score per source. Text or JSON output.
   Natural triage surface for long-text infospaces — spot chapters that
   under-extracted or evaluated poorly.

Also: `docs/advanced-usage.md` gets a new "Systematic processing of
long texts" section with the recommended flag combo and the tradeoff
note on cost.

11 new unit tests cover the chapters command (text/json/no-sources),
the process flag wiring (help + provider requirement), and the
commit-body bucket logic. Full infospace+llm unit suite (315 tests)
green; 3 pre-existing infospace failures unchanged.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

2026-04-22 08:24:26 +02:00

_issue-tracking

chore: follow subrepo

2025-12-17 23:08:02 +01:00

.claude

agent: improved capability integration

2025-12-17 19:38:06 +01:00

.github/workflows

feat: Implement comprehensive Testing Architecture Enhancement

2025-09-26 22:36:35 +02:00

.issues

feat: Integrate Requirements Engineering Agent and fix Issue #59 test failures

2025-10-02 00:45:06 +02:00

.venv_old

chore: Keep old venv from ubuntu 18.04 for now just in case.

2025-09-23 01:16:12 +02:00

agents

chore: rename markitect_project to markitect-main across project

2026-04-21 01:57:35 +02:00

application

feat: Implement domain logic separation with clean architecture

2025-09-26 22:15:45 +02:00

assets

fix: exclude assets.db from version control

2025-11-10 12:14:12 +01:00

capabilities

chore: rename markitect_project to markitect-main across project

2026-04-21 01:57:35 +02:00

config

feat: Implement unified configuration management system

2025-09-26 17:45:56 +02:00

cost_notes

feat: add Reset All button to EditControl panel

2025-11-14 15:25:29 +01:00

docs

docs(infospace): add advanced-usage, composition guide, and performance notes (C.4/C.5/C.6)

2026-04-21 07:02:46 +02:00

domain

fix: Eliminate all 111 test warnings by fixing root causes

2025-09-27 20:14:22 +02:00

examples

feat(infospace): systematic long-text processing — rich commit bodies, per-source eval/classify, chapters view

2026-04-22 08:24:26 +02:00

guides

chore: history cleanup

2025-10-03 03:39:43 +02:00

history

chore: moved information-space service to history

2026-02-08 21:26:54 +01:00

infrastructure

refactor: remove obsolete issue management system in favor of issue-facade

2025-10-24 21:25:04 +02:00

issue_tracker/cli

feat: complete issue-facade capability enhancement and project cleanup

2025-11-10 10:53:37 +01:00

markitect

feat(infospace): systematic long-text processing — rich commit bodies, per-source eval/classify, chapters view

2026-04-22 08:24:26 +02:00

migrations/prompts

feat(prompts): implement Phase 7 - Quality & Validation (FR-9, FR-10)

2026-02-09 13:31:37 +01:00

node_modules

refactor: Still trying to reorganize edit mode to be more robust

2025-11-04 21:59:22 +01:00

reports

feat: complete Issue #146 - Asset Management Implementation Milestone

2025-10-14 18:29:37 +02:00

roadmap

docs(roadmap): close out infospace tooling S3 and parent roadmap

2026-04-22 07:08:43 +02:00

scripts

feat: implement Phase 4 - Schema Migration

2026-01-05 09:38:43 +01:00

services

refactor: remove obsolete issue management system in favor of issue-facade

2025-10-24 21:25:04 +02:00

src

chore: update project state and prepare for image support development

2025-10-26 08:06:22 +01:00

testdata

chore: history cleanup

2025-10-03 03:39:43 +02:00

tests

feat(infospace): systematic long-text processing — rich commit bodies, per-source eval/classify, chapters view

2026-04-22 08:24:26 +02:00

tools

feat(schema): add semantic schema generation as default mode

2026-02-16 18:49:50 +01:00

wiki @ 8818df03d3

chore: commit examples and some cleanup

2025-10-08 10:14:51 +02:00

.clinerules

chore: rename markitect_project to markitect-main across project

2026-04-21 01:57:35 +02:00

.custodian-brief.md

chore(consistency): sync task status from DB [auto]

2026-04-22 00:28:46 +02:00

.gitignore

docs(infospace): document infospace.db and add to .gitignore

2026-02-18 22:27:08 +01:00

.gitmodules

chore: rename markitect_project to markitect-main across project

2026-04-21 01:57:35 +02:00

aliases.sh

feat: implement plugin-based architecture with md- command prefixes - Issue #44

2025-10-06 16:46:26 +02:00

asset_registry.json

refactor: delegate version management to release-management capability

2025-11-09 10:41:28 +01:00

CHANGELOG.md

docs: prepare CHANGELOG for v0.11.0 release

2026-01-06 22:29:02 +01:00

CLAUDE.md

docs(claude): expand CLAUDE.md with commands and architecture

2026-03-04 23:28:03 +01:00

demo_plugin_integration.py

feat: implement plugin infrastructure for rendering engines

2025-11-14 06:49:41 +01:00

DEPENDENCIES.md

chore: rename markitect_project to markitect-main across project

2026-04-21 01:57:35 +02:00

GUARDRAILS.md

feat: implement unified DocumentNavigator with lazy loading for all modes

2025-11-10 19:39:46 +01:00

install

feat: comprehensive asset management system and testing improvements

2025-10-12 19:57:31 +02:00

install.py

feat: implement markitect installer with version/release commands (issue #80 )

2025-10-03 05:47:02 +02:00

install.sh

feat: implement markitect installer with version/release commands (issue #80 )

2025-10-03 05:47:02 +02:00

INTRODUCTION.md

docs: add comprehensive INTRODUCTION.md

2026-02-08 18:29:14 +01:00

Makefile

refactor: clean up JavaScript development files and enhance automated testing

2025-11-09 23:16:47 +01:00

package-lock.json

chore: rename markitect_project to markitect-main across project

2026-04-21 01:57:35 +02:00

package.json

chore: rename markitect_project to markitect-main across project

2026-04-21 01:57:35 +02:00

pyproject.toml

feat(llm): extract adapter layer for standalone llm-connect package (S1+S2)

2026-02-27 08:04:50 +01:00

pytest-timeout.ini

feat: Implement test timeout infrastructure and fix failing tests

2025-10-01 18:07:05 +02:00

pytest.ini

fix: eliminate all test suite warnings - Issue #129

2025-10-06 02:11:28 +02:00

SCOPE.md

updated SCOPE file

2026-03-25 00:11:46 +01:00

test_asset_deployment.py

feat: complete asset deployment for plugin engines

2025-11-14 09:20:37 +01:00

test_browser_ready.py

fix: resolve JavaScript const redeclaration and MarkitectMain issues

2025-11-14 09:25:00 +01:00

test_cli_integration.py

feat: complete CLI integration with plugin system

2025-11-14 08:47:30 +01:00

test_cli_plugin.md

feat: complete CLI integration with plugin system

2025-11-14 08:47:30 +01:00

test_cli_simple.py

feat: complete CLI integration with plugin system

2025-11-14 08:47:30 +01:00

test_cli_with_assets.py

feat: complete asset deployment for plugin engines

2025-11-14 09:20:37 +01:00

test_complete_integration.py

feat: complete CLI integration with plugin system

2025-11-14 08:47:30 +01:00

test_integration.md

refactor: failed attempt at edit mode recovery and robustness implementation

2025-11-12 00:19:03 +01:00

test_plugin_discovery.py

feat: complete CLI integration with plugin system

2025-11-14 08:47:30 +01:00

test_strict_mode.html

refactor: failed attempt at edit mode recovery and robustness implementation

2025-11-12 00:19:03 +01:00

TODO.md

chore: updated header comments for TODO and CHANGELOG

2026-01-05 22:32:37 +01:00

docs/README.md

MarkiTect Documentation

Welcome to the MarkiTect documentation. This directory contains comprehensive documentation for developers, users, and contributors.

Documentation Structure

📐 Architecture Documentation (`architecture/`)

Deep technical documentation about system design, performance, and implementation details.

Capabilities Architecture - Critical: How capabilities work as independent git submodules and separation of concerns
Caching System - Why and how MarkiTect's AST caching delivers 60-85% performance improvements
Coming soon: Database Schema, CLI Architecture

👥 User Guides (`user-guides/`)

End-user documentation for working with MarkiTect CLI and features.

Coming soon: Getting Started, Command Reference, Best Practices

🔧 Development Documentation (`development/`)

Documentation for contributors and developers extending MarkiTect.

Coming soon: Contributing Guide, Testing Strategy, Release Process

Quick Links

For Users

Installation & Setup
Command Reference (coming soon)
Performance Guide (coming soon)

For Developers

Architecture Overview - System design and component relationships
Development Setup - Local development environment
API Documentation (coming soon)

Project Management

Project Status - Current development status
Roadmap - Strategic development plan
Current Tasks - Task management using Keep a Todofile format

Key Concepts

Core Architecture Principles

Parse Once, Use Many Times - AST caching for 60-85% performance improvement
Convention Over Configuration - Sensible defaults with minimal setup
Schema-Driven Processing - Structured markdown with validation
Relational Metadata - Database-powered document relationships

Performance Philosophy

MarkiTect treats markdown documents as structured, queryable data rather than plain text. This approach enables:

Lightning-fast document processing through intelligent caching
Complex querying and relationship management
Schema validation and consistency enforcement
Scalable performance that grows with your content

Contributing to Documentation

Documentation follows the same quality standards as code:

Clear Structure - Logical organization and navigation
Practical Examples - Real-world usage patterns
Performance Context - Why architectural decisions matter
User-Focused - Written for the intended audience

Documentation Standards

Use clear, concise language
Include practical examples
Explain the "why" behind design decisions
Keep technical accuracy as the highest priority
Update docs when changing functionality

This documentation is maintained alongside the codebase. For the most current information, always refer to the latest version in the repository.

Releases 1

MarkiTect 0.8.0 Latest

2025-11-08 20:34:42 +00:00

Languages

Python 84.7%

JavaScript 8%

HTML 5.6%

Makefile 1.3%

Shell 0.2%

Other 0.1%

docs/README.md

MarkiTect Documentation

Documentation Structure

📐 Architecture Documentation (architecture/)

👥 User Guides (user-guides/)

🔧 Development Documentation (development/)

Quick Links

For Users

For Developers

Project Management

Key Concepts

Core Architecture Principles

Performance Philosophy

Contributing to Documentation

Documentation Standards

📐 Architecture Documentation (`architecture/`)

👥 User Guides (`user-guides/`)

🔧 Development Documentation (`development/`)