feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)

Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-02-23 09:36:46 +01:00
parent 81a4c8796a
commit a9ca0adfcf
986 changed files with 63216 additions and 1 deletions

View File

@@ -0,0 +1,62 @@
---
entity_slug: regulated_proportion
evaluator: null
evaluated_at: '2026-02-23T06:16:58.804289'
overall_score: 4.6
scores:
- name: definition_precision
value: 4.0
max_value: 5.0
rationale: The definition clearly identifies a specific concept - the officially
established ratio between precious metals in currency systems. It avoids circularity
and captures a distinct monetary mechanism rather than a vague umbrella term.
- name: source_grounding
value: 5.0
max_value: 5.0
rationale: This concept is directly grounded in Smith's actual discussion in Book
I, Chapter 5, where he examines the legal ratios between gold and silver and their
effects on monetary circulation. Smith explicitly analyzes how these regulated
proportions influence which metals remain in circulation.
- name: domain_placement
value: 5.0
max_value: 5.0
rationale: The "Regulation" domain is perfectly appropriate, as this entity describes
a specific form of government monetary regulation. It represents a clear regulatory
mechanism within Smith's broader analysis of currency systems.
- name: vsm_relevance
value: 4.0
max_value: 5.0
rationale: This entity maps well to S3 (internal regulation) as it represents a
control mechanism that maintains monetary system stability through established
ratios. It could also relate to S2 (coordination) in preventing oscillations between
different metals in circulation.
- name: explanatory_value
value: 5.0
max_value: 5.0
rationale: This entity illuminates a crucial mechanism in monetary systems - how
legal ratios between metals affect their circulation patterns and relative values.
It explains the structural relationship between regulation and monetary stability
rather than merely naming a surface phenomenon.
---
# Evaluation: Regulated Proportion
## definition_precision — 4.0 / 5.0
The definition clearly identifies a specific concept - the officially established ratio between precious metals in currency systems. It avoids circularity and captures a distinct monetary mechanism rather than a vague umbrella term.
## source_grounding — 5.0 / 5.0
This concept is directly grounded in Smith's actual discussion in Book I, Chapter 5, where he examines the legal ratios between gold and silver and their effects on monetary circulation. Smith explicitly analyzes how these regulated proportions influence which metals remain in circulation.
## domain_placement — 5.0 / 5.0
The "Regulation" domain is perfectly appropriate, as this entity describes a specific form of government monetary regulation. It represents a clear regulatory mechanism within Smith's broader analysis of currency systems.
## vsm_relevance — 4.0 / 5.0
This entity maps well to S3 (internal regulation) as it represents a control mechanism that maintains monetary system stability through established ratios. It could also relate to S2 (coordination) in preventing oscillations between different metals in circulation.
## explanatory_value — 5.0 / 5.0
This entity illuminates a crucial mechanism in monetary systems - how legal ratios between metals affect their circulation patterns and relative values. It explains the structural relationship between regulation and monetary stability rather than merely naming a surface phenomenon.