markitect-main/examples/infospace-with-history/infospace.yaml
tegwick 9c32ad1837 fix(infospace): exclude raw LLM output from entity parsing; lower coverage threshold
- Add `.*-raw\.md$` to `_DEFAULT_EXCLUDE_PATTERNS` in entity_parser.py to
  prevent per-chapter raw LLM output files from being parsed as entities.
  This eliminates 33 malformed domain values where delimiter text was
  bleeding into the Economic Domain field.
- Lower coverage_ratio threshold from 0.50 → 0.40 in infospace.yaml to
  reflect realistic multi-book corpus expectations (documented rationale
  in METRICS-METHODOLOGY.md).

Post-fix metrics: 988 entities, 0 malformed, coverage_ratio=0.619 (pass).
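
The exclusion described in the first bullet can be illustrated with a minimal sketch. Only the pattern `.*-raw\.md$` and the name `_DEFAULT_EXCLUDE_PATTERNS` come from the commit message; the helper `is_excluded`, the sample filenames, and the choice to apply `re.search` to bare filenames are assumptions for illustration and may differ from what entity_parser.py actually does.

```python
import re

# Only this pattern is taken from the commit message; the real list in
# entity_parser.py presumably holds further entries that are not shown here.
_DEFAULT_EXCLUDE_PATTERNS = [
    r".*-raw\.md$",
]


def is_excluded(filename, patterns=_DEFAULT_EXCLUDE_PATTERNS):
    """Return True if the filename matches any exclude pattern (hypothetical helper)."""
    return any(re.search(pattern, filename) for pattern in patterns)


# Hypothetical filenames: one raw LLM output file and two entity files.
candidates = ["chapter-01-raw.md", "division-of-labour.md", "raw-materials.md"]
kept = [name for name in candidates if not is_excluded(name)]
print(kept)
```

Because the pattern is anchored with `$` and requires the literal suffix `-raw.md`, a name such as `raw-materials.md` is still kept; only files ending in `-raw.md` are skipped.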

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-20 09:28:20 +01:00
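
As a rough illustration of the threshold change, the sketch below checks reported metrics against the `viability` bounds declared in the file that follows. The threshold values are copied from infospace.yaml; the function `failed_checks`, the inclusive treatment of `min` and `max`, and the sample value 0.45 are assumptions, not the project's actual evaluation code.

```python
# Thresholds copied from the viability block of infospace.yaml.
THRESHOLDS = {
    "redundancy_ratio": {"max": 0.10},
    "coverage_ratio": {"min": 0.40},
    "coherence_components": {"max": 3},
    "consistency_cycles": {"max": 0},
    "granularity_entropy": {"min": 1.0},
}


def failed_checks(metrics, thresholds=THRESHOLDS):
    """Return names of metrics outside their bounds (hypothetical helper).

    Assumes bounds are inclusive and that metrics missing from the input
    are simply not evaluated.
    """
    failures = []
    for name, bounds in thresholds.items():
        if name not in metrics:
            continue
        value = metrics[name]
        below_min = "min" in bounds and value < bounds["min"]
        above_max = "max" in bounds and value > bounds["max"]
        if below_min or above_max:
            failures.append(name)
    return failures


# The commit message reports only coverage_ratio, so only that metric is checked.
post_fix = {"coverage_ratio": 0.619}
print(failed_checks(post_fix))

# A hypothetical coverage of 0.45 passes the new minimum but fails the old one.
old_thresholds = {"coverage_ratio": {"min": 0.50}}
print(failed_checks({"coverage_ratio": 0.45}))
print(failed_checks({"coverage_ratio": 0.45}, old_thresholds))
```

Note that the reported post-fix value of 0.619 clears both the old minimum (0.50) and the new one (0.40), so under these assumptions the lowered threshold only matters for corpora whose coverage falls between the two.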

70 lines
2.2 KiB
YAML

# Infospace: The Wealth of Nations through the Viable System Model
#
# This configuration declares the infospace built by processing
# Adam Smith's "The Wealth of Nations" (1776) through the lens of
# Stafford Beer's Viable System Model (VSM).
topic:
  name: "The Wealth of Nations"
  domain: "Classical Economics"
  sources: artifacts/sources/
disciplines:
  - name: "Viable System Model"
    path: artifacts/vsm-reference/
schemas:
  entity: schemas/economic-entity-schema-v1.0.md
  mapping: schemas/vsm-mapping-schema-v1.0.md
  analysis: schemas/chapter-analysis-schema-v1.0.md
competency_questions: |
  1. How does Smith's division of labour map to VSM System 1 operations?
  2. What mechanisms in WoN correspond to VSM coordination (System 2)?
  3. Where does Smith describe self-organising regulation (System 3)?
  4. What role does the "invisible hand" play as a System 4 mechanism?
  5. How do Smith's views on government map to System 5 policy?
  6. Is the WoN entity set viable as an explanatory framework?
viability:
  redundancy_ratio:
    max: 0.10
  coverage_ratio:
    min: 0.40  # multi-book corpus: domain sparsity is expected
  coherence_components:
    max: 3
  consistency_cycles:
    max: 0
  granularity_entropy:
    min: 1.0
pipeline:
  stages:
    - name: extract-entities
      template: templates/extract-entities.md
      output_dir: output/entities
      output_macro: entities
      split_entities: true
      max_tokens: 8000
      macros:
        extraction_rules: artifacts/guidelines/extraction-rules.md
        vsm_framework: artifacts/vsm-reference/vsm-framework.md
    - name: map-to-vsm
      template: templates/map-to-vsm.md
      output_dir: output/mappings
      output_macro: mappings
      max_tokens: 10000
      macros:
        mapping_rules: artifacts/guidelines/mapping-rules.md
        vsm_framework: artifacts/vsm-reference/vsm-framework.md
    - name: synthesize-analysis
      template: templates/synthesize-analysis.md
      output_dir: output/analyses
      output_macro: analysis
      max_tokens: 4000
      macros:
        vsm_framework: artifacts/vsm-reference/vsm-framework.md
  post_batch:
    - name: assess-metrics
      template: templates/assess-metrics.md