feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)

Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-02-23 09:36:46 +01:00
parent 81a4c8796a
commit a9ca0adfcf
986 changed files with 63216 additions and 1 deletions

View File

@@ -0,0 +1,65 @@
---
entity_slug: foreign_commodities
evaluator: null
evaluated_at: '2026-02-23T05:29:16.267357'
overall_score: 4.4
scores:
- name: definition_precision
value: 4.0
max_value: 5.0
rationale: The definition clearly distinguishes foreign commodities as goods produced
in other countries and imported for domestic consumption, with the key distinction
being their competition with local products. This is precise and non-circular,
though it could be slightly more specific about what constitutes "competition."
- name: source_grounding
value: 5.0
max_value: 5.0
rationale: The entity is directly grounded in Smith's text from Book IV, Chapter
5, where he explicitly discusses foreign commodities in relation to bounties and
their effects on silver degradation. The context provided accurately reflects
Smith's actual argument about bounties' differential effects on foreign versus
domestic goods.
- name: domain_placement
value: 5.0
max_value: 5.0
rationale: Placement in the "Exchange" domain is highly appropriate since foreign
commodities are fundamentally about trade relationships and market exchanges between
nations. This concept sits at the heart of international commerce and domestic
market dynamics.
- name: vsm_relevance
value: 4.0
max_value: 5.0
rationale: Foreign commodities map well to S4 (intelligence/environmental adaptation)
as they represent external environmental factors that domestic systems must monitor
and respond to. They also relate to S1 (primary operations) as inputs to domestic
economic processes.
- name: explanatory_value
value: 4.0
max_value: 5.0
rationale: This entity illuminates important structural mechanisms in Smith's analysis
of how international trade affects domestic markets and pricing. It helps explain
the differential impacts of economic policies like bounties on various types of
goods, providing genuine insight into trade dynamics.
---
# Evaluation: Foreign Commodities
## definition_precision — 4.0 / 5.0
The definition clearly distinguishes foreign commodities as goods produced in other countries and imported for domestic consumption, with the key distinction being their competition with local products. This is precise and non-circular, though it could be slightly more specific about what constitutes "competition."
## source_grounding — 5.0 / 5.0
The entity is directly grounded in Smith's text from Book IV, Chapter 5, where he explicitly discusses foreign commodities in relation to bounties and their effects on silver degradation. The context provided accurately reflects Smith's actual argument about bounties' differential effects on foreign versus domestic goods.
## domain_placement — 5.0 / 5.0
Placement in the "Exchange" domain is highly appropriate since foreign commodities are fundamentally about trade relationships and market exchanges between nations. This concept sits at the heart of international commerce and domestic market dynamics.
## vsm_relevance — 4.0 / 5.0
Foreign commodities map well to S4 (intelligence/environmental adaptation) as they represent external environmental factors that domestic systems must monitor and respond to. They also relate to S1 (primary operations) as inputs to domestic economic processes.
## explanatory_value — 4.0 / 5.0
This entity illuminates important structural mechanisms in Smith's analysis of how international trade affects domestic markets and pricing. It helps explain the differential impacts of economic policies like bounties on various types of goods, providing genuine insight into trade dynamics.