- Add `.*-raw\.md$` to `_DEFAULT_EXCLUDE_PATTERNS` in entity_parser.py to prevent per-chapter raw LLM output files from being parsed as entities. This eliminates 33 malformed domain values where delimiter text was bleeding into the Economic Domain field.
- Lower coverage_ratio threshold from 0.50 → 0.40 in infospace.yaml to reflect realistic multi-book corpus expectations (documented rationale in METRICS-METHODOLOGY.md).

Post-fix metrics: 988 entities, 0 malformed, coverage_ratio=0.619 (pass).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
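The exclusion rule described in the commit message can be illustrated with a minimal sketch. Only the pattern `.*-raw\.md$` and the name `_DEFAULT_EXCLUDE_PATTERNS` come from the commit; the helper `is_excluded`, the use of `re.match`, and the sample file names are assumptions made for illustration and may not reflect how entity_parser.py actually applies its patterns.

```python
import re

# Hypothetical stand-in for the list in entity_parser.py; only this one
# pattern is taken from the commit message.
_DEFAULT_EXCLUDE_PATTERNS = [r".*-raw\.md$"]


def is_excluded(filename, patterns=_DEFAULT_EXCLUDE_PATTERNS):
    """Return True if the file name matches any exclude pattern."""
    return any(re.match(pattern, filename) for pattern in patterns)


# Illustrative file names (not taken from the repository).
files = ["chapter-01-raw.md", "division-of-labour.md", "raw-materials.md"]

# Keep only files that should be parsed as entities.
entity_files = [name for name in files if not is_excluded(name)]
```

Because the pattern is anchored with `$` and requires the literal suffix `-raw.md`, a name such as `raw-materials.md` is still parsed, while per-chapter raw output files are skipped.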
70 lines
2.2 KiB
YAML
# Infospace: The Wealth of Nations through the Viable System Model
#
# This configuration declares the infospace built by processing
# Adam Smith's "The Wealth of Nations" (1776) through the lens of
# Stafford Beer's Viable System Model (VSM).

topic:
  name: "The Wealth of Nations"
  domain: "Classical Economics"
  sources: artifacts/sources/

disciplines:
  - name: "Viable System Model"
    path: artifacts/vsm-reference/

schemas:
  entity: schemas/economic-entity-schema-v1.0.md
  mapping: schemas/vsm-mapping-schema-v1.0.md
  analysis: schemas/chapter-analysis-schema-v1.0.md

competency_questions: |
  1. How does Smith's division of labour map to VSM System 1 operations?
  2. What mechanisms in WoN correspond to VSM coordination (System 2)?
  3. Where does Smith describe self-organising regulation (System 3)?
  4. What role does the "invisible hand" play as a System 4 mechanism?
  5. How do Smith's views on government map to System 5 policy?
  6. Is the WoN entity set viable as an explanatory framework?

viability:
  redundancy_ratio:
    max: 0.10
  coverage_ratio:
    min: 0.40  # multi-book corpus: domain sparsity is expected
  coherence_components:
    max: 3
  consistency_cycles:
    max: 0
  granularity_entropy:
    min: 1.0

pipeline:
  stages:
    - name: extract-entities
      template: templates/extract-entities.md
      output_dir: output/entities
      output_macro: entities
      split_entities: true
      max_tokens: 8000
      macros:
        extraction_rules: artifacts/guidelines/extraction-rules.md
        vsm_framework: artifacts/vsm-reference/vsm-framework.md
    - name: map-to-vsm
      template: templates/map-to-vsm.md
      output_dir: output/mappings
      output_macro: mappings
      max_tokens: 10000
      macros:
        mapping_rules: artifacts/guidelines/mapping-rules.md
        vsm_framework: artifacts/vsm-reference/vsm-framework.md
    - name: synthesize-analysis
      template: templates/synthesize-analysis.md
      output_dir: output/analyses
      output_macro: analysis
      max_tokens: 4000
      macros:
        vsm_framework: artifacts/vsm-reference/vsm-framework.md
  post_batch:
    - name: assess-metrics
      template: templates/assess-metrics.md