feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)

Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-02-23 09:36:46 +01:00
parent 81a4c8796a
commit a9ca0adfcf
986 changed files with 63216 additions and 1 deletions

View File

@@ -0,0 +1,65 @@
---
entity_slug: economic_system_learning_process
evaluator: null
evaluated_at: '2026-02-23T05:18:15.097881'
overall_score: 2.6
scores:
- name: definition_precision
value: 2.0
max_value: 5.0
rationale: The definition is quite vague and umbrella-like, encompassing broad concepts
like "ways societies acquire knowledge" without clearly delineating what specifically
constitutes this learning process. It reads more like a general description of
social learning rather than a precise economic concept.
- name: source_grounding
value: 2.0
max_value: 5.0
rationale: The entity claims to derive from "Book IV, Chapter 0" which doesn't exist
(Book IV doesn't have a Chapter 0), and the context provided is extremely thin,
suggesting Smith's "approach to explaining different systems implies" this concept
rather than explicitly discussing it. This appears to be an inference rather than
a clearly stated concept in the source.
- name: domain_placement
value: 3.0
max_value: 5.0
rationale: '"General Theory" is appropriate given the broad, abstract nature of
the concept, though the entity might be better categorized under methodology or
epistemology rather than economic theory per se. The domain placement reflects
the vague nature of the concept itself.'
- name: vsm_relevance
value: 4.0
max_value: 5.0
rationale: This entity maps well to S4 (intelligence/environmental adaptation) as
it describes how economic systems gather information and adapt based on learning
and experience. The learning process is fundamentally about intelligence gathering
and environmental scanning.
- name: explanatory_value
value: 2.0
max_value: 5.0
rationale: The entity names a general phenomenon (learning about economics) but
doesn't illuminate any specific mechanism or provide insight into how economic
systems actually function or improve. It's too abstract to offer genuine explanatory
power about economic structures or processes.
---
# Evaluation: Economic System Learning Process
## definition_precision — 2.0 / 5.0
The definition is quite vague and umbrella-like, encompassing broad concepts like "ways societies acquire knowledge" without clearly delineating what specifically constitutes this learning process. It reads more like a general description of social learning rather than a precise economic concept.
## source_grounding — 2.0 / 5.0
The entity claims to derive from "Book IV, Chapter 0" which doesn't exist (Book IV doesn't have a Chapter 0), and the context provided is extremely thin, suggesting Smith's "approach to explaining different systems implies" this concept rather than explicitly discussing it. This appears to be an inference rather than a clearly stated concept in the source.
## domain_placement — 3.0 / 5.0
"General Theory" is appropriate given the broad, abstract nature of the concept, though the entity might be better categorized under methodology or epistemology rather than economic theory per se. The domain placement reflects the vague nature of the concept itself.
## vsm_relevance — 4.0 / 5.0
This entity maps well to S4 (intelligence/environmental adaptation) as it describes how economic systems gather information and adapt based on learning and experience. The learning process is fundamentally about intelligence gathering and environmental scanning.
## explanatory_value — 2.0 / 5.0
The entity names a general phenomenon (learning about economics) but doesn't illuminate any specific mechanism or provide insight into how economic systems actually function or improve. It's too abstract to offer genuine explanatory power about economic structures or processes.