Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on first pass; 3 failed (network errors). eval-summary --update-metrics written with per_entity_mean=3.9556. Viability dashboard: 6/6 PASS redundancy_ratio 0.0061 (max 0.10) coverage_ratio 0.6190 (min 0.40) coherence_comps 0.0000 (max 3) consistency_cycles 0.0000 (max 0) granularity_entropy 2.6748 (min 1.0) per_entity_mean 3.9556 (min 3.5) Dimension breakdown (mean across 985 entities): definition_precision 3.62 source_grounding 4.36 domain_placement 4.56 vsm_relevance 3.31 explanatory_value 3.94 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
3.2 KiB
entity_slug, evaluator, evaluated_at, overall_score, scores
| entity_slug | evaluator | evaluated_at | overall_score | scores | |||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| sterling_mark | null | 2026-02-23T06:25:44.702735 | 4.4 |
|
Evaluation: Sterling Mark
definition_precision — 4.0 / 5.0
The definition clearly describes a specific certification mechanism for silver quality, distinguishing it from weight certification and drawing appropriate parallels to modern hallmarks. The concept is well-bounded and non-circular.
source_grounding — 5.0 / 5.0
This entity is directly grounded in Smith's text from Book I, Chapter 4, where he explicitly discusses sterling marks as examples of official quality certification stamps. The definition accurately reflects Smith's usage and context.
domain_placement — 5.0 / 5.0
The "Regulation" domain is perfectly appropriate, as sterling marks represent a form of official regulatory oversight that establishes standards and provides quality assurance in markets. This fits squarely within regulatory mechanisms that facilitate commerce.
vsm_relevance — 4.0 / 5.0
Sterling marks map well to S3 (internal regulation/audit) as they represent standardized quality control mechanisms, and potentially to S2 (coordination) as they reduce transaction friction by providing trusted information. The regulatory certification function has clear VSM relevance.
explanatory_value — 4.0 / 5.0
This entity illuminates an important mechanism for reducing information asymmetries and transaction costs in markets through official quality certification. It demonstrates how regulatory standards can facilitate trust and efficiency in commercial exchange.