feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)

Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-02-23 09:36:46 +01:00
parent 81a4c8796a
commit a9ca0adfcf
986 changed files with 63216 additions and 1 deletions

View File

@@ -0,0 +1,65 @@
---
entity_slug: value_of_silver
evaluator: null
evaluated_at: '2026-02-23T06:36:54.485676'
overall_score: 4.2
scores:
- name: definition_precision
value: 4.0
max_value: 5.0
rationale: The definition clearly distinguishes the value of silver (its purchasing
power in terms of labor/commodities) from silver as a physical commodity, and
specifies that this value varies due to identifiable factors like mine productivity
and market conditions. It avoids circularity and captures a distinct economic
concept.
- name: source_grounding
value: 5.0
max_value: 5.0
rationale: This entity is directly grounded in Smith's extensive analysis in Book
I, Chapter 5, where he examines historical changes in silver's purchasing power
and its effects on prices and money rents. Smith uses detailed historical examples
to demonstrate how silver's value has fluctuated over time.
- name: domain_placement
value: 5.0
max_value: 5.0
rationale: "The \"Exchange\" domain is perfectly appropriate since the value of\
\ silver fundamentally concerns exchange relationships\u2014how much labor or\
\ commodities silver can command in market transactions. This is a core exchange\
\ mechanism rather than production, distribution, or consumption."
- name: vsm_relevance
value: 3.0
max_value: 5.0
rationale: This entity has moderate VSM relevance, primarily mapping to S4 (intelligence/environmental
adaptation) as it represents information about changing external conditions that
affect the economic system's operations. However, it's somewhat abstract and doesn't
clearly embody the cybernetic control functions that make VSM mapping most valuable.
- name: explanatory_value
value: 4.0
max_value: 5.0
rationale: This entity provides significant explanatory power by illuminating how
monetary value itself can change over time due to supply-side factors, which helps
explain price movements and the real value of fixed payments. It reveals an important
mechanism underlying monetary economics rather than just naming a surface phenomenon.
---
# Evaluation: Value Of Silver
## definition_precision — 4.0 / 5.0
The definition clearly distinguishes the value of silver (its purchasing power in terms of labor/commodities) from silver as a physical commodity, and specifies that this value varies due to identifiable factors like mine productivity and market conditions. It avoids circularity and captures a distinct economic concept.
## source_grounding — 5.0 / 5.0
This entity is directly grounded in Smith's extensive analysis in Book I, Chapter 5, where he examines historical changes in silver's purchasing power and its effects on prices and money rents. Smith uses detailed historical examples to demonstrate how silver's value has fluctuated over time.
## domain_placement — 5.0 / 5.0
The "Exchange" domain is perfectly appropriate since the value of silver fundamentally concerns exchange relationships—how much labor or commodities silver can command in market transactions. This is a core exchange mechanism rather than production, distribution, or consumption.
## vsm_relevance — 3.0 / 5.0
This entity has moderate VSM relevance, primarily mapping to S4 (intelligence/environmental adaptation) as it represents information about changing external conditions that affect the economic system's operations. However, it's somewhat abstract and doesn't clearly embody the cybernetic control functions that make VSM mapping most valuable.
## explanatory_value — 4.0 / 5.0
This entity provides significant explanatory power by illuminating how monetary value itself can change over time due to supply-side factors, which helps explain price movements and the real value of fixed payments. It reveals an important mechanism underlying monetary economics rather than just naming a surface phenomenon.