feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)

Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-02-23 09:36:46 +01:00
parent 81a4c8796a
commit a9ca0adfcf
986 changed files with 63216 additions and 1 deletions

View File

@@ -0,0 +1,64 @@
---
entity_slug: economic_system_evolution
evaluator: null
evaluated_at: '2026-02-23T05:15:37.605382'
overall_score: 3.0
scores:
- name: definition_precision
value: 3.0
max_value: 5.0
rationale: The definition captures a coherent concept about economic change over
time, but uses somewhat vague terms like "various factors" and "circumstances"
without specifying mechanisms. It's reasonably distinct but could be more precise
about what drives evolutionary processes.
- name: source_grounding
value: 2.0
max_value: 5.0
rationale: While Smith does distinguish between ancient and modern systems, the
entity extrapolates this into a broader theory of "economic system evolution"
that goes well beyond what Smith explicitly discusses. The source reference to
"Book IV, Chapter 0" is also problematic as Chapter 0 doesn't exist.
- name: domain_placement
value: 4.0
max_value: 5.0
rationale: '"General Theory" is appropriate for this broad conceptual entity about
systemic change patterns. The concept spans multiple economic domains and represents
a meta-level theoretical framework rather than belonging to a specific economic
sector.'
- name: vsm_relevance
value: 4.0
max_value: 5.0
rationale: This entity maps well to S4 (intelligence/environmental adaptation) as
it describes how economic systems adapt and evolve in response to changing conditions.
It could also relate to S5 regarding fundamental identity changes in economic
arrangements.
- name: explanatory_value
value: 2.0
max_value: 5.0
rationale: The entity names a phenomenon (systems change over time) but provides
little explanatory power about specific mechanisms, drivers, or patterns of evolution.
It remains at a high level of abstraction without illuminating particular causal
relationships or structural dynamics.
---
# Evaluation: Economic System Evolution
## definition_precision — 3.0 / 5.0
The definition captures a coherent concept about economic change over time, but uses somewhat vague terms like "various factors" and "circumstances" without specifying mechanisms. It's reasonably distinct but could be more precise about what drives evolutionary processes.
## source_grounding — 2.0 / 5.0
While Smith does distinguish between ancient and modern systems, the entity extrapolates this into a broader theory of "economic system evolution" that goes well beyond what Smith explicitly discusses. The source reference to "Book IV, Chapter 0" is also problematic as Chapter 0 doesn't exist.
## domain_placement — 4.0 / 5.0
"General Theory" is appropriate for this broad conceptual entity about systemic change patterns. The concept spans multiple economic domains and represents a meta-level theoretical framework rather than belonging to a specific economic sector.
## vsm_relevance — 4.0 / 5.0
This entity maps well to S4 (intelligence/environmental adaptation) as it describes how economic systems adapt and evolve in response to changing conditions. It could also relate to S5 regarding fundamental identity changes in economic arrangements.
## explanatory_value — 2.0 / 5.0
The entity names a phenomenon (systems change over time) but provides little explanatory power about specific mechanisms, drivers, or patterns of evolution. It remains at a high level of abstraction without illuminating particular causal relationships or structural dynamics.