feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)
Batch evaluation of all 988 entities via OpenRouter. 985 succeeded on first
pass; 3 failed (network errors). eval-summary --update-metrics written with
per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio     0.0061 (max 0.10)
  coverage_ratio       0.6190 (min 0.40)
  coherence_comps      0.0000 (max 3)
  consistency_cycles   0.0000 (max 0)
  granularity_entropy  2.6748 (min 1.0)
  per_entity_mean      3.9556 (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision 3.62
  source_grounding     4.36
  domain_placement     4.56
  vsm_relevance        3.31
  explanatory_value    3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
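For readers who want to re-check the dashboard figures, the pass/fail logic can be sketched as below. This is a hypothetical illustration, not the project's code: the names METRICS, passes and dashboard are invented here, and the bounds are assumed to be inclusive, which is what a PASS for consistency_cycles (0.0000 against max 0) implies.

```python
# Hypothetical sketch: names and structure are assumptions, not the project's code.
# Each entry is (observed value, bound kind, bound), copied from the commit message.
METRICS = {
    "redundancy_ratio": (0.0061, "max", 0.10),
    "coverage_ratio": (0.6190, "min", 0.40),
    "coherence_comps": (0.0000, "max", 3),
    "consistency_cycles": (0.0000, "max", 0),
    "granularity_entropy": (2.6748, "min", 1.0),
    "per_entity_mean": (3.9556, "min", 3.5),
}


def passes(value, kind, bound):
    """Return True when value respects an inclusive max or min bound."""
    if kind == "max":
        return value <= bound
    return value >= bound


def dashboard(metrics):
    """Evaluate every metric and return (per-metric results, number passing)."""
    results = {name: passes(*spec) for name, spec in metrics.items()}
    return results, sum(results.values())


if __name__ == "__main__":
    results, n_pass = dashboard(METRICS)
    print(f"{n_pass}/{len(results)} PASS")
```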
@@ -0,0 +1,64 @@
---
entity_slug: value_in_use
evaluator: null
evaluated_at: '2026-02-23T06:36:35.869655'
overall_score: 4.0
scores:
- name: definition_precision
  value: 4.0
  max_value: 5.0
  rationale: The definition clearly distinguishes value in use from exchange value
    and provides a specific meaning - utility for satisfying human wants/needs. It
    avoids circularity and captures Smith's distinct concept, though it could be slightly
    more precise about what constitutes "utility."
- name: source_grounding
  value: 5.0
  max_value: 5.0
  rationale: This concept is directly and explicitly introduced by Smith in Book I,
    Chapter 4 as one of the two fundamental meanings of "value." The water-diamond
    paradox example is accurately referenced and represents Smith's own illustration
    of the concept.
- name: domain_placement
  value: 5.0
  max_value: 5.0
  rationale: '"Consumption" is the correct domain placement since value in use relates
    to how consumers derive utility from goods to satisfy their wants and needs. This
    is fundamentally about the consumption side of economic activity rather than production
    or exchange.'
- name: vsm_relevance
  value: 2.0
  max_value: 5.0
  rationale: This concept is too abstract and philosophical to map naturally to specific
    VSM systems. While it might relate broadly to S4 (understanding environmental
    needs) or S5 (value judgments), it doesn't represent an operational mechanism
    that fits cleanly into the VSM framework.
- name: explanatory_value
  value: 4.0
  max_value: 5.0
  rationale: The concept provides significant explanatory power by establishing the
    foundational distinction between utility and market value, which is crucial for
    understanding Smith's value theory and the paradox of why useful things can be
    cheap. It illuminates a key structural relationship in economic thinking.
---
# Evaluation: Value In Use
## definition_precision — 4.0 / 5.0
The definition clearly distinguishes value in use from exchange value and provides a specific meaning - utility for satisfying human wants/needs. It avoids circularity and captures Smith's distinct concept, though it could be slightly more precise about what constitutes "utility."
## source_grounding — 5.0 / 5.0
This concept is directly and explicitly introduced by Smith in Book I, Chapter 4 as one of the two fundamental meanings of "value." The water-diamond paradox example is accurately referenced and represents Smith's own illustration of the concept.
## domain_placement — 5.0 / 5.0
"Consumption" is the correct domain placement since value in use relates to how consumers derive utility from goods to satisfy their wants and needs. This is fundamentally about the consumption side of economic activity rather than production or exchange.
## vsm_relevance — 2.0 / 5.0
This concept is too abstract and philosophical to map naturally to specific VSM systems. While it might relate broadly to S4 (understanding environmental needs) or S5 (value judgments), it doesn't represent an operational mechanism that fits cleanly into the VSM framework.
## explanatory_value — 4.0 / 5.0
The concept provides significant explanatory power by establishing the foundational distinction between utility and market value, which is crucial for understanding Smith's value theory and the paradox of why useful things can be cheap. It illuminates a key structural relationship in economic thinking.
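
The `overall_score` of 4.0 in the front matter is consistent with an unweighted mean of the five dimension values above: (4 + 5 + 5 + 2 + 4) / 5 = 4.0. Whether the evaluator actually aggregates this way is an assumption. The sketch below only reproduces the arithmetic, and `DIMENSION_SCORES` and `overall_score` are hypothetical names.

```python
from statistics import mean

# Dimension values copied from this evaluation's front matter.
DIMENSION_SCORES = {
    "definition_precision": 4.0,
    "source_grounding": 5.0,
    "domain_placement": 5.0,
    "vsm_relevance": 2.0,
    "explanatory_value": 4.0,
}


def overall_score(scores):
    """Unweighted mean of the dimension values, rounded to one decimal.

    Assumption: equal weights. The real aggregation rule is not stated
    in this file.
    """
    return round(mean(scores.values()), 1)


if __name__ == "__main__":
    print(overall_score(DIMENSION_SCORES))
```

Under a weighted scheme the low `vsm_relevance` score could shift the result, so the match with 4.0 supports the equal-weight reading without confirming it.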