feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)

Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on first
pass; 3 failed (network errors). eval-summary --update-metrics written with
per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio     0.0061 (max 0.10)
  coverage_ratio       0.6190 (min 0.40)
  coherence_comps      0.0000 (max 3)
  consistency_cycles   0.0000 (max 0)
  granularity_entropy  2.6748 (min 1.0)
  per_entity_mean      3.9556 (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision 3.62
  source_grounding     4.36
  domain_placement     4.56
  vsm_relevance        3.31
  explanatory_value    3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@@ -0,0 +1,59 @@
---
entity_slug: security_preference_capital
evaluator: null
evaluated_at: '2026-02-23T06:20:24.504228'
overall_score: 1.0
scores:
- name: definition_precision
  value: 1.0
  max_value: 5.0
  rationale: There is no definition provided at all, making it impossible to assess
    precision or conceptual distinctness. The term "Security Preference Capital" appears
    to be a compound phrase without clear meaning or boundaries.
- name: source_grounding
  value: 1.0
  max_value: 5.0
  rationale: With no definition, context, or source chapter specified, there is no
    evidence this entity is grounded in Smith's actual text. The terminology does
    not align with Smith's 18th-century economic vocabulary or conceptual framework.
- name: domain_placement
  value: 1.0
  max_value: 5.0
  rationale: The domain is listed as "unspecified," and without a definition, it's
    impossible to determine what economic or thematic category this entity should
    occupy. The entity appears to be floating without conceptual anchoring.
- name: vsm_relevance
  value: 1.0
  max_value: 5.0
  rationale: Without any definition or context, this entity cannot be meaningfully
    mapped to any VSM system (S1-S5). It's unclear whether it represents an operational
    process, coordination mechanism, regulatory function, or strategic element.
- name: explanatory_value
  value: 1.0
  max_value: 5.0
  rationale: An undefined entity with no context provides zero explanatory power about
    economic mechanisms or structural relations. It neither illuminates Smith's concepts
    nor adds analytical value to understanding "The Wealth of Nations."
---

# Evaluation: Security Preference Capital

## definition_precision — 1.0 / 5.0

There is no definition provided at all, making it impossible to assess precision or conceptual distinctness. The term "Security Preference Capital" appears to be a compound phrase without clear meaning or boundaries.

## source_grounding — 1.0 / 5.0

With no definition, context, or source chapter specified, there is no evidence this entity is grounded in Smith's actual text. The terminology does not align with Smith's 18th-century economic vocabulary or conceptual framework.

## domain_placement — 1.0 / 5.0

The domain is listed as "unspecified," and without a definition, it's impossible to determine what economic or thematic category this entity should occupy. The entity appears to be floating without conceptual anchoring.

## vsm_relevance — 1.0 / 5.0

Without any definition or context, this entity cannot be meaningfully mapped to any VSM system (S1-S5). It's unclear whether it represents an operational process, coordination mechanism, regulatory function, or strategic element.

## explanatory_value — 1.0 / 5.0

An undefined entity with no context provides zero explanatory power about economic mechanisms or structural relations. It neither illuminates Smith's concepts nor adds analytical value to understanding "The Wealth of Nations."