Files
markitect-main/examples/infospace-with-history/output/evaluations/economic_system_evaluation_criteria.md
tegwick a9ca0adfcf feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)
Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-23 09:36:46 +01:00

3.7 KiB

entity_slug, evaluator, evaluated_at, overall_score, scores
entity_slug evaluator evaluated_at overall_score scores
economic_system_evaluation_criteria null 2026-02-23T05:15:20.028943 3.0
name value max_value rationale
definition_precision 3.0 5.0 The definition captures a coherent concept about evaluation standards for economic systems, but it's somewhat broad and could apply to many different types of assessment frameworks. The connection to Smith's specific objectives (subsistence and revenue) provides some precision but the overall concept remains fairly general.
name value max_value rationale
source_grounding 2.0 5.0 While Smith does discuss the objectives of political economy (providing subsistence and revenue), he doesn't explicitly articulate a systematic framework of "evaluation criteria" as a distinct concept. This appears to be an analytical abstraction imposed on the text rather than something Smith directly presents.
name value max_value rationale
domain_placement 4.0 5.0 "General Theory" is appropriate since this represents meta-level analytical framework that would apply across different economic arrangements. The domain placement correctly identifies this as a theoretical construct rather than a specific policy or mechanism.
name value max_value rationale
vsm_relevance 4.0 5.0 This entity maps well to S3 (internal regulation/audit) and S5 (identity/policy) functions, as evaluation criteria are essential for monitoring system performance and maintaining coherent organizational identity. The concept has clear VSM relevance for understanding how economic systems assess and maintain their viability.
name value max_value rationale
explanatory_value 2.0 5.0 While the concept might be useful for comparative analysis, it doesn't illuminate specific mechanisms or structural relations that Smith describes. It's more of a meta-analytical tool than something that explains how economic systems actually function or why they succeed or fail.

Evaluation: Economic System Evaluation Criteria

definition_precision — 3.0 / 5.0

The definition captures a coherent concept about evaluation standards for economic systems, but it's somewhat broad and could apply to many different types of assessment frameworks. The connection to Smith's specific objectives (subsistence and revenue) provides some precision but the overall concept remains fairly general.

source_grounding — 2.0 / 5.0

While Smith does discuss the objectives of political economy (providing subsistence and revenue), he doesn't explicitly articulate a systematic framework of "evaluation criteria" as a distinct concept. This appears to be an analytical abstraction imposed on the text rather than something Smith directly presents.

domain_placement — 4.0 / 5.0

"General Theory" is appropriate since this represents meta-level analytical framework that would apply across different economic arrangements. The domain placement correctly identifies this as a theoretical construct rather than a specific policy or mechanism.

vsm_relevance — 4.0 / 5.0

This entity maps well to S3 (internal regulation/audit) and S5 (identity/policy) functions, as evaluation criteria are essential for monitoring system performance and maintaining coherent organizational identity. The concept has clear VSM relevance for understanding how economic systems assess and maintain their viability.

explanatory_value — 2.0 / 5.0

While the concept might be useful for comparative analysis, it doesn't illuminate specific mechanisms or structural relations that Smith describes. It's more of a meta-analytical tool than something that explains how economic systems actually function or why they succeed or fail.