Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on first pass; 3 failed (network errors). eval-summary --update-metrics written with per_entity_mean=3.9556. Viability dashboard: 6/6 PASS redundancy_ratio 0.0061 (max 0.10) coverage_ratio 0.6190 (min 0.40) coherence_comps 0.0000 (max 3) consistency_cycles 0.0000 (max 0) granularity_entropy 2.6748 (min 1.0) per_entity_mean 3.9556 (min 3.5) Dimension breakdown (mean across 985 entities): definition_precision 3.62 source_grounding 4.36 domain_placement 4.56 vsm_relevance 3.31 explanatory_value 3.94 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
3.5 KiB
entity_slug, evaluator, evaluated_at, overall_score, scores
| entity_slug | evaluator | evaluated_at | overall_score | scores | |||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| colony_economic_system_evaluation | null | 2026-02-23T04:53:52.626581 | 4.0 |
|
Evaluation: Colony Economic System Evaluation
definition_precision — 3.0 / 5.0
The definition captures a distinct evaluative process but uses somewhat vague terms like "sustainable development" and "mutual benefit" that may not precisely reflect Smith's 18th-century framework. The concept is reasonably well-bounded but could be more specific about evaluation criteria.
source_grounding — 4.0 / 5.0
This entity is well-grounded in Book V, Chapter 3, where Smith extensively critiques colonial policies and their economic effects. Smith does indeed evaluate colonial systems against principles of effective economic organization, though he wouldn't have used modern terminology like "sustainable development."
domain_placement — 5.0 / 5.0
The "Regulation" domain is perfectly appropriate, as Smith's analysis of colonial economic systems focuses heavily on regulatory frameworks, trade restrictions, and governmental policies. This fits squarely within his broader examination of regulatory effectiveness.
vsm_relevance — 4.0 / 5.0
This entity maps well to S3 (internal regulation/audit) as it represents the mother country's assessment of colonial economic performance and policy effectiveness. It also touches on S4 (intelligence) through environmental scanning of colonial conditions and outcomes.
explanatory_value — 4.0 / 5.0
The entity illuminates Smith's systematic approach to analyzing colonial policies and reveals the structural relationship between regulatory frameworks and economic outcomes. It captures a genuine analytical mechanism rather than just describing surface phenomena.