feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)

Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-02-23 09:36:46 +01:00
parent 81a4c8796a
commit a9ca0adfcf
986 changed files with 63216 additions and 1 deletions

View File

@@ -0,0 +1,63 @@
---
entity_slug: progress_of_opulence
evaluator: null
evaluated_at: '2026-02-23T06:10:48.749140'
overall_score: 4.0
scores:
- name: definition_precision
value: 3.0
max_value: 5.0
rationale: The definition captures a distinct concept about differential economic
development across time and geography, but uses somewhat vague language like "distinct
systems" and "varying rates" without specifying the mechanisms or criteria that
define this progress.
- name: source_grounding
value: 4.0
max_value: 5.0
rationale: This concept is well-grounded in Smith's actual framework, as he explicitly
discusses how different nations and historical periods have developed varying
approaches to political economy based on their economic circumstances. The connection
to Book IV's comparative analysis of economic systems is authentic.
- name: domain_placement
value: 5.0
max_value: 5.0
rationale: '"General Theory" is the correct domain placement, as this concept operates
at the meta-level of explaining why different economic systems emerge, rather
than describing specific mechanisms or policies within those systems.'
- name: vsm_relevance
value: 4.0
max_value: 5.0
rationale: This entity maps well to S4 (intelligence/environmental adaptation) as
it describes how economic systems adapt and evolve in response to different temporal
and geographical environments. It could also relate to S5 in terms of how nations
develop distinct economic identities.
- name: explanatory_value
value: 4.0
max_value: 5.0
rationale: This entity provides genuine explanatory power by offering a framework
for understanding why different economic systems emerge historically and geographically,
rather than merely describing surface-level differences between nations' wealth
levels.
---
# Evaluation: Progress Of Opulence
## definition_precision — 3.0 / 5.0
The definition captures a distinct concept about differential economic development across time and geography, but uses somewhat vague language like "distinct systems" and "varying rates" without specifying the mechanisms or criteria that define this progress.
## source_grounding — 4.0 / 5.0
This concept is well-grounded in Smith's actual framework, as he explicitly discusses how different nations and historical periods have developed varying approaches to political economy based on their economic circumstances. The connection to Book IV's comparative analysis of economic systems is authentic.
## domain_placement — 5.0 / 5.0
"General Theory" is the correct domain placement, as this concept operates at the meta-level of explaining why different economic systems emerge, rather than describing specific mechanisms or policies within those systems.
## vsm_relevance — 4.0 / 5.0
This entity maps well to S4 (intelligence/environmental adaptation) as it describes how economic systems adapt and evolve in response to different temporal and geographical environments. It could also relate to S5 in terms of how nations develop distinct economic identities.
## explanatory_value — 4.0 / 5.0
This entity provides genuine explanatory power by offering a framework for understanding why different economic systems emerge historically and geographically, rather than merely describing surface-level differences between nations' wealth levels.