Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on first pass; 3 failed (network errors). eval-summary --update-metrics written with per_entity_mean=3.9556. Viability dashboard: 6/6 PASS redundancy_ratio 0.0061 (max 0.10) coverage_ratio 0.6190 (min 0.40) coherence_comps 0.0000 (max 3) consistency_cycles 0.0000 (max 0) granularity_entropy 2.6748 (min 1.0) per_entity_mean 3.9556 (min 3.5) Dimension breakdown (mean across 985 entities): definition_precision 3.62 source_grounding 4.36 domain_placement 4.56 vsm_relevance 3.31 explanatory_value 3.94 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
3.7 KiB
entity_slug, evaluator, evaluated_at, overall_score, scores
| entity_slug | evaluator | evaluated_at | overall_score | scores | |||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| economic_system_evaluation_criteria | null | 2026-02-23T05:15:20.028943 | 3.0 |
|
Evaluation: Economic System Evaluation Criteria
definition_precision — 3.0 / 5.0
The definition captures a coherent concept about evaluation standards for economic systems, but it's somewhat broad and could apply to many different types of assessment frameworks. The connection to Smith's specific objectives (subsistence and revenue) provides some precision but the overall concept remains fairly general.
source_grounding — 2.0 / 5.0
While Smith does discuss the objectives of political economy (providing subsistence and revenue), he doesn't explicitly articulate a systematic framework of "evaluation criteria" as a distinct concept. This appears to be an analytical abstraction imposed on the text rather than something Smith directly presents.
domain_placement — 4.0 / 5.0
"General Theory" is appropriate since this represents meta-level analytical framework that would apply across different economic arrangements. The domain placement correctly identifies this as a theoretical construct rather than a specific policy or mechanism.
vsm_relevance — 4.0 / 5.0
This entity maps well to S3 (internal regulation/audit) and S5 (identity/policy) functions, as evaluation criteria are essential for monitoring system performance and maintaining coherent organizational identity. The concept has clear VSM relevance for understanding how economic systems assess and maintain their viability.
explanatory_value — 2.0 / 5.0
While the concept might be useful for comparative analysis, it doesn't illuminate specific mechanisms or structural relations that Smith describes. It's more of a meta-analytical tool than something that explains how economic systems actually function or why they succeed or fail.