feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)

Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-02-23 09:36:46 +01:00
parent 81a4c8796a
commit a9ca0adfcf
986 changed files with 63216 additions and 1 deletions

View File

@@ -0,0 +1,64 @@
---
entity_slug: economic_system_effectiveness_evaluation
evaluator: null
evaluated_at: '2026-02-23T05:14:54.347867'
overall_score: 3.8
scores:
- name: definition_precision
value: 3.0
max_value: 5.0
rationale: The definition captures a coherent concept about evaluating economic
systems, but it's somewhat broad and could apply to many forms of economic analysis.
While it specifies criteria like national prosperity and consumer welfare, it
lacks the precision to distinguish this from general economic policy analysis.
- name: source_grounding
value: 4.0
max_value: 5.0
rationale: This is well-grounded in Smith's actual methodology in Book IV, Chapter
3, where he systematically evaluates mercantilist policies against criteria of
economic efficiency and public benefit. The entity accurately reflects Smith's
comparative approach to assessing different economic arrangements.
- name: domain_placement
value: 4.0
max_value: 5.0
rationale: '"General Theory" is appropriate since this represents Smith''s broader
methodological approach to economic evaluation rather than a specific policy or
mechanism. It captures the meta-level framework Smith uses to assess economic
systems.'
- name: vsm_relevance
value: 4.0
max_value: 5.0
rationale: This maps well to S4 (intelligence/environmental adaptation) as it represents
the evaluative framework for assessing how well economic systems adapt to and
serve their environment. It could also relate to S5 (identity/policy) in terms
of establishing criteria for systemic evaluation.
- name: explanatory_value
value: 4.0
max_value: 5.0
rationale: This entity illuminates an important structural aspect of Smith's analysis
- his systematic approach to comparing economic arrangements using consistent
criteria. It helps explain how Smith moves beyond mere description to principled
evaluation of economic policies and systems.
---
# Evaluation: Economic System Effectiveness Evaluation
## definition_precision — 3.0 / 5.0
The definition captures a coherent concept about evaluating economic systems, but it's somewhat broad and could apply to many forms of economic analysis. While it specifies criteria like national prosperity and consumer welfare, it lacks the precision to distinguish this from general economic policy analysis.
## source_grounding — 4.0 / 5.0
This is well-grounded in Smith's actual methodology in Book IV, Chapter 3, where he systematically evaluates mercantilist policies against criteria of economic efficiency and public benefit. The entity accurately reflects Smith's comparative approach to assessing different economic arrangements.
## domain_placement — 4.0 / 5.0
"General Theory" is appropriate since this represents Smith's broader methodological approach to economic evaluation rather than a specific policy or mechanism. It captures the meta-level framework Smith uses to assess economic systems.
## vsm_relevance — 4.0 / 5.0
This maps well to S4 (intelligence/environmental adaptation) as it represents the evaluative framework for assessing how well economic systems adapt to and serve their environment. It could also relate to S5 (identity/policy) in terms of establishing criteria for systemic evaluation.
## explanatory_value — 4.0 / 5.0
This entity illuminates an important structural aspect of Smith's analysis - his systematic approach to comparing economic arrangements using consistent criteria. It helps explain how Smith moves beyond mere description to principled evaluation of economic policies and systems.