Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on first pass; 3 failed (network errors). eval-summary --update-metrics written with per_entity_mean=3.9556. Viability dashboard: 6/6 PASS redundancy_ratio 0.0061 (max 0.10) coverage_ratio 0.6190 (min 0.40) coherence_comps 0.0000 (max 3) consistency_cycles 0.0000 (max 0) granularity_entropy 2.6748 (min 1.0) per_entity_mean 3.9556 (min 3.5) Dimension breakdown (mean across 985 entities): definition_precision 3.62 source_grounding 4.36 domain_placement 4.56 vsm_relevance 3.31 explanatory_value 3.94 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
3.6 KiB
entity_slug, evaluator, evaluated_at, overall_score, scores
| entity_slug | evaluator | evaluated_at | overall_score | scores | |||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| non_standard_metal | null | 2026-02-23T06:02:45.522631 | 2.6 |
|
Evaluation: Non Standard Metal
definition_precision — 2.0 / 5.0
The definition is vague and circular, defining non-standard metals as "precious metals that are not the primary standard" without clearly explaining what distinguishes them functionally or how they operate differently from standard metals. It lacks specificity about their actual role or characteristics within monetary systems.
source_grounding — 3.0 / 5.0
While Smith does discuss different metals in monetary systems in Book I, Chapter 5, the specific term "non-standard metal" and this particular conceptual framing appears to be an interpretive overlay rather than Smith's own terminology or explicit categorization. The concept may be implicit in his discussion but isn't clearly articulated as presented.
domain_placement — 4.0 / 5.0
The "Regulation" domain placement is appropriate since this concept deals with how monetary systems organize and standardize different metals for exchange. This fits well within regulatory frameworks for currency and value measurement.
vsm_relevance — 2.0 / 5.0
This entity doesn't map clearly to any specific VSM system and appears too abstract to have a natural VSM home. It describes a category of monetary instruments rather than a functional system component that would operate within the VSM framework.
explanatory_value — 2.0 / 5.0
The entity provides limited explanatory power, merely creating a category without illuminating the actual mechanisms of how these metals function or their structural relationships within monetary systems. It names a phenomenon but doesn't explain the underlying economic dynamics Smith describes.