Files
markitect-main/examples/infospace-with-history/output/evaluations/stock_of_the_country.md
tegwick a9ca0adfcf feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)
Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-23 09:36:46 +01:00

3.3 KiB

entity_slug, evaluator, evaluated_at, overall_score, scores
entity_slug evaluator evaluated_at overall_score scores
stock_of_the_country null 2026-02-23T06:26:01.733242 1.4
name value max_value rationale
definition_precision 1.0 5.0 There is no definition provided at all, making this entity completely imprecise. Without any definitional content, it's impossible to assess whether the concept captures something distinct or is merely a vague umbrella term.
name value max_value rationale
source_grounding 2.0 5.0 While "stock of the country" appears to be terminology that could plausibly come from Smith's work (given his focus on national wealth and capital), the complete absence of source chapter information and context makes it impossible to verify actual grounding in the text. This could be a legitimate concept from the source or an interpretive addition.
name value max_value rationale
domain_placement 1.0 5.0 With no specified domain and no definition or context, there's no basis for evaluating whether the entity is correctly categorized. The economic/thematic domain assignment is entirely absent rather than correct or incorrect.
name value max_value rationale
vsm_relevance 2.0 5.0 The phrase "stock of the country" could potentially relate to S1 (operational resources) or S3 (internal state assessment) in VSM terms, but without definition or context, any VSM mapping would be pure speculation. The entity is too underdeveloped to meaningfully assess VSM relevance.
name value max_value rationale
explanatory_value 1.0 5.0 An entity with no definition, context, or domain specification provides zero explanatory power. It neither illuminates mechanisms nor structural relations, functioning merely as an empty label without substance.

Evaluation: Stock Of The Country

definition_precision — 1.0 / 5.0

There is no definition provided at all, making this entity completely imprecise. Without any definitional content, it's impossible to assess whether the concept captures something distinct or is merely a vague umbrella term.

source_grounding — 2.0 / 5.0

While "stock of the country" appears to be terminology that could plausibly come from Smith's work (given his focus on national wealth and capital), the complete absence of source chapter information and context makes it impossible to verify actual grounding in the text. This could be a legitimate concept from the source or an interpretive addition.

domain_placement — 1.0 / 5.0

With no specified domain and no definition or context, there's no basis for evaluating whether the entity is correctly categorized. The economic/thematic domain assignment is entirely absent rather than correct or incorrect.

vsm_relevance — 2.0 / 5.0

The phrase "stock of the country" could potentially relate to S1 (operational resources) or S3 (internal state assessment) in VSM terms, but without definition or context, any VSM mapping would be pure speculation. The entity is too underdeveloped to meaningfully assess VSM relevance.

explanatory_value — 1.0 / 5.0

An entity with no definition, context, or domain specification provides zero explanatory power. It neither illuminates mechanisms nor structural relations, functioning merely as an empty label without substance.