Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on first pass; 3 failed (network errors). eval-summary --update-metrics written with per_entity_mean=3.9556. Viability dashboard: 6/6 PASS redundancy_ratio 0.0061 (max 0.10) coverage_ratio 0.6190 (min 0.40) coherence_comps 0.0000 (max 3) consistency_cycles 0.0000 (max 0) granularity_entropy 2.6748 (min 1.0) per_entity_mean 3.9556 (min 3.5) Dimension breakdown (mean across 985 entities): definition_precision 3.62 source_grounding 4.36 domain_placement 4.56 vsm_relevance 3.31 explanatory_value 3.94 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
3.6 KiB
entity_slug, evaluator, evaluated_at, overall_score, scores
| entity_slug | evaluator | evaluated_at | overall_score | scores | |||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| colonial_economic_system_evaluation | null | 2026-02-23T04:48:29.343479 | 3.8 |
|
Evaluation: Colonial Economic System Evaluation
definition_precision — 3.0 / 5.0
The definition captures a coherent concept about systematic assessment of colonial policies, but uses somewhat vague terms like "efficiency" and "mutual benefits" without precise criteria. The core idea of comparative evaluation is clear, though the specific methodology remains underspecified.
source_grounding — 4.0 / 5.0
Smith does indeed conduct systematic comparisons of different colonial arrangements throughout Book IV, Chapter 7, consistently arguing that less restrictive systems produce better outcomes than monopolistic controls. The entity accurately reflects this comparative analytical approach present in the source text.
domain_placement — 4.0 / 5.0
"General Theory" is appropriate since this represents Smith's broader methodological approach to policy analysis rather than a specific economic mechanism. The evaluative framework transcends particular colonial arrangements to establish general principles about economic organization.
vsm_relevance — 4.0 / 5.0
This maps well to S4 (intelligence/environmental adaptation) as it represents the systematic gathering and analysis of information about different policy approaches to inform better decision-making. The comparative evaluation function is central to organizational intelligence systems.
explanatory_value — 4.0 / 5.0
This entity illuminates Smith's analytical methodology and explains how he arrives at his policy conclusions through systematic comparison rather than theoretical speculation alone. It reveals the empirical foundation underlying his arguments about economic organization.