Files
markitect-main/examples/infospace-with-history/output/evaluations/sterling_mark.md
tegwick a9ca0adfcf feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)
Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-23 09:36:46 +01:00

63 lines
3.2 KiB
Markdown

---
entity_slug: sterling_mark
evaluator: null
evaluated_at: '2026-02-23T06:25:44.702735'
overall_score: 4.4
scores:
- name: definition_precision
value: 4.0
max_value: 5.0
rationale: The definition clearly describes a specific certification mechanism for
silver quality, distinguishing it from weight certification and drawing appropriate
parallels to modern hallmarks. The concept is well-bounded and non-circular.
- name: source_grounding
value: 5.0
max_value: 5.0
rationale: This entity is directly grounded in Smith's text from Book I, Chapter
4, where he explicitly discusses sterling marks as examples of official quality
certification stamps. The definition accurately reflects Smith's usage and context.
- name: domain_placement
value: 5.0
max_value: 5.0
rationale: The "Regulation" domain is perfectly appropriate, as sterling marks represent
a form of official regulatory oversight that establishes standards and provides
quality assurance in markets. This fits squarely within regulatory mechanisms
that facilitate commerce.
- name: vsm_relevance
value: 4.0
max_value: 5.0
rationale: Sterling marks map well to S3 (internal regulation/audit) as they represent
standardized quality control mechanisms, and potentially to S2 (coordination)
as they reduce transaction friction by providing trusted information. The regulatory
certification function has clear VSM relevance.
- name: explanatory_value
value: 4.0
max_value: 5.0
rationale: This entity illuminates an important mechanism for reducing information
asymmetries and transaction costs in markets through official quality certification.
It demonstrates how regulatory standards can facilitate trust and efficiency in
commercial exchange.
---
# Evaluation: Sterling Mark
## definition_precision — 4.0 / 5.0
The definition clearly describes a specific certification mechanism for silver quality, distinguishing it from weight certification and drawing appropriate parallels to modern hallmarks. The concept is well-bounded and non-circular.
## source_grounding — 5.0 / 5.0
This entity is directly grounded in Smith's text from Book I, Chapter 4, where he explicitly discusses sterling marks as examples of official quality certification stamps. The definition accurately reflects Smith's usage and context.
## domain_placement — 5.0 / 5.0
The "Regulation" domain is perfectly appropriate, as sterling marks represent a form of official regulatory oversight that establishes standards and provides quality assurance in markets. This fits squarely within regulatory mechanisms that facilitate commerce.
## vsm_relevance — 4.0 / 5.0
Sterling marks map well to S3 (internal regulation/audit) as they represent standardized quality control mechanisms, and potentially to S2 (coordination) as they reduce transaction friction by providing trusted information. The regulatory certification function has clear VSM relevance.
## explanatory_value — 4.0 / 5.0
This entity illuminates an important mechanism for reducing information asymmetries and transaction costs in markets through official quality certification. It demonstrates how regulatory standards can facilitate trust and efficiency in commercial exchange.