Files
markitect-main/examples/infospace-with-history/output/evaluations/economic_system_management.md
tegwick a9ca0adfcf feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)
Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-23 09:36:46 +01:00

3.2 KiB

entity_slug, evaluator, evaluated_at, overall_score, scores
entity_slug evaluator evaluated_at overall_score scores
economic_system_management null 2026-02-23T05:18:32.062456 3.0
name value max_value rationale
definition_precision 2.0 5.0 The definition is overly broad and umbrella-like, encompassing "policies, institutions, and practices" without clear boundaries. It conflates the role of statesmen/legislators with the broader concept of economic system management, making it imprecise and somewhat circular.
name value max_value rationale
source_grounding 3.0 5.0 While Smith does discuss political economy as "the science of the statesman or legislator" in Book IV, the entity extrapolates this into a broader concept of "economic system management" that goes beyond what Smith explicitly articulates. The source grounding is partial but not fully supported.
name value max_value rationale
domain_placement 4.0 5.0 The "Regulation" domain assignment is appropriate given the focus on oversight, administration, and the role of legislators in economic arrangements. This aligns well with Smith's discussion of government's role in economic affairs.
name value max_value rationale
vsm_relevance 4.0 5.0 This entity maps well to VSM System 5 (identity/policy) as it deals with high-level governance and policy direction of economic systems. It also has some relevance to S3 (internal regulation/audit) regarding oversight functions.
name value max_value rationale
explanatory_value 2.0 5.0 The entity primarily names a general phenomenon rather than illuminating specific mechanisms or structural relations. It lacks the analytical depth to explain how economic system management actually works or what makes it effective.

Evaluation: Economic System Management

definition_precision — 2.0 / 5.0

The definition is overly broad and umbrella-like, encompassing "policies, institutions, and practices" without clear boundaries. It conflates the role of statesmen/legislators with the broader concept of economic system management, making it imprecise and somewhat circular.

source_grounding — 3.0 / 5.0

While Smith does discuss political economy as "the science of the statesman or legislator" in Book IV, the entity extrapolates this into a broader concept of "economic system management" that goes beyond what Smith explicitly articulates. The source grounding is partial but not fully supported.

domain_placement — 4.0 / 5.0

The "Regulation" domain assignment is appropriate given the focus on oversight, administration, and the role of legislators in economic arrangements. This aligns well with Smith's discussion of government's role in economic affairs.

vsm_relevance — 4.0 / 5.0

This entity maps well to VSM System 5 (identity/policy) as it deals with high-level governance and policy direction of economic systems. It also has some relevance to S3 (internal regulation/audit) regarding oversight functions.

explanatory_value — 2.0 / 5.0

The entity primarily names a general phenomenon rather than illuminating specific mechanisms or structural relations. It lacks the analytical depth to explain how economic system management actually works or what makes it effective.