Batch evaluation of all 988 entities via OpenRouter.

984 succeeded on first pass; 3 failed (network errors).
eval-summary --update-metrics written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio     0.0061 (max 0.10)
  coverage_ratio       0.6190 (min 0.40)
  coherence_comps      0.0000 (max 3)
  consistency_cycles   0.0000 (max 0)
  granularity_entropy  2.6748 (min 1.0)
  per_entity_mean      3.9556 (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision 3.62
  source_grounding     4.36
  domain_placement     4.56
  vsm_relevance        3.31
  explanatory_value    3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
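The 6/6 PASS line in the viability dashboard can be reproduced from the reported values and bounds. The following is a minimal sketch, not the project's actual check: the `DASHBOARD` dictionary and the `passes` helper are hypothetical names, and the bounds are assumed to be inclusive (which is what `consistency_cycles 0.0000 (max 0)` passing suggests).

```python
# Values and bounds copied from the commit message's viability dashboard.
# The data layout and helper below are illustrative assumptions.
DASHBOARD = {
    "redundancy_ratio": (0.0061, "max", 0.10),
    "coverage_ratio": (0.6190, "min", 0.40),
    "coherence_comps": (0.0000, "max", 3),
    "consistency_cycles": (0.0000, "max", 0),
    "granularity_entropy": (2.6748, "min", 1.0),
    "per_entity_mean": (3.9556, "min", 3.5),
}


def passes(value, kind, bound):
    """Return True if value respects an inclusive max or min bound."""
    return value <= bound if kind == "max" else value >= bound


# Evaluate every metric and build the summary line.
results = {name: passes(*spec) for name, spec in DASHBOARD.items()}
summary = f"{sum(results.values())}/{len(results)} PASS"
print(summary)
```

Keeping the bound type next to each value makes it explicit which metrics are ceilings and which are floors.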
| entity_slug | evaluator | evaluated_at | overall_score | scores |
|---|---|---|---|---|
| economic_system_implementation | null | 2026-02-23T05:16:47.168866 | 2.4 | |
Evaluation: Economic System Implementation
definition_precision — 2.0 / 5.0
The definition is overly broad and vague, essentially describing "how economic systems work in practice" without identifying any specific mechanisms or distinct conceptual boundaries. It reads more like a general description of policy implementation than a precise economic concept.
source_grounding — 1.0 / 5.0
The entity explicitly admits that Smith does not discuss this concept in the referenced chapter, stating that it is "implied" in his discussion; this is a significant inferential leap rather than grounding in the actual source text. The cited location, Book IV, Chapter 0, is also unusual, since chapters typically start at Chapter 1.
domain_placement — 3.0 / 5.0
While "Regulation" is a reasonable domain for implementation processes, this concept is so broad it could equally belong in institutional economics, political economy, or policy studies. The domain assignment captures one aspect but doesn't reflect the entity's expansive scope.
vsm_relevance — 4.0 / 5.0
This entity maps well to multiple VSM systems: S1 (operational implementation), S3 (internal regulation and audit of implementation), and S4 (adaptation of implementation to the environment). The implementation focus gives it clear VSM applicability across several systems.
explanatory_value — 2.0 / 5.0
The entity merely labels the general phenomenon of "putting economic ideas into practice" without illuminating any specific mechanisms, structural relationships, or causal processes. It adds descriptive terminology but little analytical insight into how economic systems actually function.
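As a consistency check, the overall_score of 2.4 recorded for this entity equals the unweighted mean of its five dimension scores. The sketch below assumes that this is how the evaluator aggregates, which the source does not state; the `scores` dictionary is an illustrative layout, not the stored format.

```python
from statistics import mean

# Dimension scores copied from the evaluation above (each out of 5.0).
scores = {
    "definition_precision": 2.0,
    "source_grounding": 1.0,
    "domain_placement": 3.0,
    "vsm_relevance": 4.0,
    "explanatory_value": 2.0,
}

# Assumption: overall_score is the plain average, rounded to one decimal.
overall = round(mean(scores.values()), 1)
print(overall)
```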