feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)

Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-02-23 09:36:46 +01:00
parent 81a4c8796a
commit a9ca0adfcf
986 changed files with 63216 additions and 1 deletions

View File

@@ -0,0 +1,65 @@
---
entity_slug: home_made_commodities
evaluator: null
evaluated_at: '2026-02-23T05:34:36.333162'
overall_score: 4.2
scores:
- name: definition_precision
value: 4.0
max_value: 5.0
rationale: The definition clearly distinguishes home made commodities from imported
goods and captures a distinct economic concept. It's precise and non-circular,
though it could be slightly more specific about what constitutes "local industry
and manufacturing."
- name: source_grounding
value: 5.0
max_value: 5.0
rationale: This entity is directly grounded in Smith's text from Book IV, Chapter
5, where he explicitly discusses how bounties on exported corn affect the prices
of "home made commodities." The concept and terminology come straight from the
source material.
- name: domain_placement
value: 5.0
max_value: 5.0
rationale: The "Production" domain assignment is entirely appropriate, as home made
commodities are fundamentally about domestic production processes and manufacturing.
This clearly belongs in the production category rather than trade, consumption,
or other domains.
- name: vsm_relevance
value: 3.0
max_value: 5.0
rationale: This entity maps most naturally to S1 (primary operations) as it represents
the actual productive activities of an economic system. However, it's somewhat
abstract and could also relate to S4 (intelligence) when considering domestic
vs. foreign production decisions, making the VSM placement useful but not definitively
clear.
- name: explanatory_value
value: 4.0
max_value: 5.0
rationale: The entity provides genuine explanatory value by illuminating the mechanism
through which export bounties affect domestic price structures via the corn-price
regulator effect. It reveals an important structural relationship between export
policies and domestic commodity pricing rather than merely naming a surface phenomenon.
---
# Evaluation: Home Made Commodities
## definition_precision — 4.0 / 5.0
The definition clearly distinguishes home made commodities from imported goods and captures a distinct economic concept. It's precise and non-circular, though it could be slightly more specific about what constitutes "local industry and manufacturing."
## source_grounding — 5.0 / 5.0
This entity is directly grounded in Smith's text from Book IV, Chapter 5, where he explicitly discusses how bounties on exported corn affect the prices of "home made commodities." The concept and terminology come straight from the source material.
## domain_placement — 5.0 / 5.0
The "Production" domain assignment is entirely appropriate, as home made commodities are fundamentally about domestic production processes and manufacturing. This clearly belongs in the production category rather than trade, consumption, or other domains.
## vsm_relevance — 3.0 / 5.0
This entity maps most naturally to S1 (primary operations) as it represents the actual productive activities of an economic system. However, it's somewhat abstract and could also relate to S4 (intelligence) when considering domestic vs. foreign production decisions, making the VSM placement useful but not definitively clear.
## explanatory_value — 4.0 / 5.0
The entity provides genuine explanatory value by illuminating the mechanism through which export bounties affect domestic price structures via the corn-price regulator effect. It reveals an important structural relationship between export policies and domestic commodity pricing rather than merely naming a surface phenomenon.