Files
markitect-main/examples/infospace-with-history/output/evaluations/sterling_mark.md
tegwick a9ca0adfcf feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)
Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-23 09:36:46 +01:00

3.2 KiB

entity_slug, evaluator, evaluated_at, overall_score, scores
entity_slug evaluator evaluated_at overall_score scores
sterling_mark null 2026-02-23T06:25:44.702735 4.4
name value max_value rationale
definition_precision 4.0 5.0 The definition clearly describes a specific certification mechanism for silver quality, distinguishing it from weight certification and drawing appropriate parallels to modern hallmarks. The concept is well-bounded and non-circular.
name value max_value rationale
source_grounding 5.0 5.0 This entity is directly grounded in Smith's text from Book I, Chapter 4, where he explicitly discusses sterling marks as examples of official quality certification stamps. The definition accurately reflects Smith's usage and context.
name value max_value rationale
domain_placement 5.0 5.0 The "Regulation" domain is perfectly appropriate, as sterling marks represent a form of official regulatory oversight that establishes standards and provides quality assurance in markets. This fits squarely within regulatory mechanisms that facilitate commerce.
name value max_value rationale
vsm_relevance 4.0 5.0 Sterling marks map well to S3 (internal regulation/audit) as they represent standardized quality control mechanisms, and potentially to S2 (coordination) as they reduce transaction friction by providing trusted information. The regulatory certification function has clear VSM relevance.
name value max_value rationale
explanatory_value 4.0 5.0 This entity illuminates an important mechanism for reducing information asymmetries and transaction costs in markets through official quality certification. It demonstrates how regulatory standards can facilitate trust and efficiency in commercial exchange.

Evaluation: Sterling Mark

definition_precision — 4.0 / 5.0

The definition clearly describes a specific certification mechanism for silver quality, distinguishing it from weight certification and drawing appropriate parallels to modern hallmarks. The concept is well-bounded and non-circular.

source_grounding — 5.0 / 5.0

This entity is directly grounded in Smith's text from Book I, Chapter 4, where he explicitly discusses sterling marks as examples of official quality certification stamps. The definition accurately reflects Smith's usage and context.

domain_placement — 5.0 / 5.0

The "Regulation" domain is perfectly appropriate, as sterling marks represent a form of official regulatory oversight that establishes standards and provides quality assurance in markets. This fits squarely within regulatory mechanisms that facilitate commerce.

vsm_relevance — 4.0 / 5.0

Sterling marks map well to S3 (internal regulation/audit) as they represent standardized quality control mechanisms, and potentially to S2 (coordination) as they reduce transaction friction by providing trusted information. The regulatory certification function has clear VSM relevance.

explanatory_value — 4.0 / 5.0

This entity illuminates an important mechanism for reducing information asymmetries and transaction costs in markets through official quality certification. It demonstrates how regulatory standards can facilitate trust and efficiency in commercial exchange.