feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)

Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-02-23 09:36:46 +01:00
parent 81a4c8796a
commit a9ca0adfcf
986 changed files with 63216 additions and 1 deletions

View File

@@ -0,0 +1,65 @@
---
entity_slug: combination_of_workmen
evaluator: null
evaluated_at: '2026-02-23T04:56:59.981537'
overall_score: 4.2
scores:
- name: definition_precision
value: 4.0
max_value: 5.0
rationale: The definition is quite precise, clearly describing collective action
by laborers to raise wages through coordinated efforts. It captures a distinct
concept with specific characteristics (violence, clamor, outcomes) rather than
being a vague umbrella term.
- name: source_grounding
value: 5.0
max_value: 5.0
rationale: This entity is directly grounded in Smith's text from Book I, Chapter
8, where he explicitly discusses combinations of workmen and their characteristics.
The description accurately reflects Smith's observations about their frequency,
methods, and typical outcomes.
- name: domain_placement
value: 5.0
max_value: 5.0
rationale: The "Regulation" domain assignment is correct, as combinations of workmen
represent attempts to regulate wage levels through collective action. This fits
perfectly within the broader regulatory framework of labor market dynamics that
Smith analyzes.
- name: vsm_relevance
value: 3.0
max_value: 5.0
rationale: This entity has moderate VSM relevance, potentially mapping to S2 (coordination
among workers) or S4 (environmental adaptation to wage pressures). However, it's
primarily a reactive phenomenon rather than a systematic organizational function,
making the VSM mapping somewhat forced.
- name: explanatory_value
value: 4.0
max_value: 5.0
rationale: The entity provides good explanatory value by illuminating the structural
power imbalance between workers and employers and the mechanisms workers use to
attempt wage increases. It reveals important dynamics about collective action
and market regulation, though it's more descriptive of outcomes than underlying
mechanisms.
---
# Evaluation: Combination Of Workmen
## definition_precision — 4.0 / 5.0
The definition is quite precise, clearly describing collective action by laborers to raise wages through coordinated efforts. It captures a distinct concept with specific characteristics (violence, clamor, outcomes) rather than being a vague umbrella term.
## source_grounding — 5.0 / 5.0
This entity is directly grounded in Smith's text from Book I, Chapter 8, where he explicitly discusses combinations of workmen and their characteristics. The description accurately reflects Smith's observations about their frequency, methods, and typical outcomes.
## domain_placement — 5.0 / 5.0
The "Regulation" domain assignment is correct, as combinations of workmen represent attempts to regulate wage levels through collective action. This fits perfectly within the broader regulatory framework of labor market dynamics that Smith analyzes.
## vsm_relevance — 3.0 / 5.0
This entity has moderate VSM relevance, potentially mapping to S2 (coordination among workers) or S4 (environmental adaptation to wage pressures). However, it's primarily a reactive phenomenon rather than a systematic organizational function, making the VSM mapping somewhat forced.
## explanatory_value — 4.0 / 5.0
The entity provides good explanatory value by illuminating the structural power imbalance between workers and employers and the mechanisms workers use to attempt wage increases. It reveals important dynamics about collective action and market regulation, though it's more descriptive of outcomes than underlying mechanisms.