Files
markitect-main/examples/infospace-with-history/output/evaluations/stamp_masters.md
tegwick a9ca0adfcf feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)
Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-23 09:36:46 +01:00

3.7 KiB

entity_slug, evaluator, evaluated_at, overall_score, scores
entity_slug evaluator evaluated_at overall_score scores
stamp_masters null 2026-02-23T06:24:37.392130 4.4
name value max_value rationale
definition_precision 4.0 5.0 The definition is precise and clearly distinguishes stamp-masters as officials who certify linen cloth quality, drawing clear parallels to aulnagers (woollen cloth) and mint officials (metal currency). The concept is distinct and well-bounded within the standardization framework.
name value max_value rationale
source_grounding 5.0 5.0 This entity is directly grounded in Smith's text from Book I, Chapter 4, where he explicitly mentions stamp-masters alongside other certification officials as examples of public institutions providing quality assurance. The definition accurately reflects Smith's usage without adding external interpretations.
name value max_value rationale
domain_placement 5.0 5.0 The "Regulation" domain assignment is perfectly appropriate, as stamp-masters represent a clear regulatory function involving official oversight and standardization of commercial goods. This fits squarely within the broader category of market regulation and quality control mechanisms.
name value max_value rationale
vsm_relevance 4.0 5.0 Stamp-masters map naturally to S3 (internal regulation/audit) as they perform monitoring and control functions to ensure quality standards within the commercial system. They could also relate to S2 (coordination) by reducing market oscillations through standardization, giving them clear VSM relevance.
name value max_value rationale
explanatory_value 4.0 5.0 This entity illuminates an important structural mechanism in Smith's analysis of how commercial societies develop institutional solutions for quality assurance and standardization. It demonstrates the principle that market efficiency requires trusted certification systems, adding genuine explanatory power to understanding commercial development.

Evaluation: Stamp Masters

definition_precision — 4.0 / 5.0

The definition is precise and clearly distinguishes stamp-masters as officials who certify linen cloth quality, drawing clear parallels to aulnagers (woollen cloth) and mint officials (metal currency). The concept is distinct and well-bounded within the standardization framework.

source_grounding — 5.0 / 5.0

This entity is directly grounded in Smith's text from Book I, Chapter 4, where he explicitly mentions stamp-masters alongside other certification officials as examples of public institutions providing quality assurance. The definition accurately reflects Smith's usage without adding external interpretations.

domain_placement — 5.0 / 5.0

The "Regulation" domain assignment is perfectly appropriate, as stamp-masters represent a clear regulatory function involving official oversight and standardization of commercial goods. This fits squarely within the broader category of market regulation and quality control mechanisms.

vsm_relevance — 4.0 / 5.0

Stamp-masters map naturally to S3 (internal regulation/audit) as they perform monitoring and control functions to ensure quality standards within the commercial system. They could also relate to S2 (coordination) by reducing market oscillations through standardization, giving them clear VSM relevance.

explanatory_value — 4.0 / 5.0

This entity illuminates an important structural mechanism in Smith's analysis of how commercial societies develop institutional solutions for quality assurance and standardization. It demonstrates the principle that market efficiency requires trusted certification systems, adding genuine explanatory power to understanding commercial development.