Files
markitect-main/examples/infospace-with-history/output/evaluations/drawback.md
tegwick a9ca0adfcf feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)
Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-23 09:36:46 +01:00

3.1 KiB

entity_slug, evaluator, evaluated_at, overall_score, scores
entity_slug evaluator evaluated_at overall_score scores
drawback null 2026-02-23T05:07:48.137058 4.6
name value max_value rationale
definition_precision 5.0 5.0 The definition is highly precise and non-circular, clearly distinguishing drawbacks as refunds of previously paid duties upon re-export. It captures a distinct fiscal mechanism rather than a vague concept.
name value max_value rationale
source_grounding 5.0 5.0 This entity is directly grounded in Smith's text from Book IV, Chapter 5, where he explicitly discusses drawbacks and their distinction from bounties. The definition accurately reflects Smith's treatment of this specific trade policy instrument.
name value max_value rationale
domain_placement 5.0 5.0 The "Regulation" domain assignment is perfectly appropriate, as drawbacks are a specific regulatory mechanism governing trade duties and customs. This fits squarely within Smith's analysis of commercial policy and government intervention.
name value max_value rationale
vsm_relevance 4.0 5.0 This entity maps well to S3 (internal regulation) as a control mechanism managing duty flows, and potentially S2 (coordination) for managing trade oscillations. It represents a concrete regulatory instrument rather than an abstract concept.
name value max_value rationale
explanatory_value 4.0 5.0 The entity illuminates an important mechanism in Smith's analysis of trade policy, showing how governments can encourage re-export without providing net subsidies. It reveals the structural logic behind a specific form of commercial regulation that differs meaningfully from bounties.

Evaluation: Drawback

definition_precision — 5.0 / 5.0

The definition is highly precise and non-circular, clearly distinguishing drawbacks as refunds of previously paid duties upon re-export. It captures a distinct fiscal mechanism rather than a vague concept.

source_grounding — 5.0 / 5.0

This entity is directly grounded in Smith's text from Book IV, Chapter 5, where he explicitly discusses drawbacks and their distinction from bounties. The definition accurately reflects Smith's treatment of this specific trade policy instrument.

domain_placement — 5.0 / 5.0

The "Regulation" domain assignment is perfectly appropriate, as drawbacks are a specific regulatory mechanism governing trade duties and customs. This fits squarely within Smith's analysis of commercial policy and government intervention.

vsm_relevance — 4.0 / 5.0

This entity maps well to S3 (internal regulation) as a control mechanism managing duty flows, and potentially S2 (coordination) for managing trade oscillations. It represents a concrete regulatory instrument rather than an abstract concept.

explanatory_value — 4.0 / 5.0

The entity illuminates an important mechanism in Smith's analysis of trade policy, showing how governments can encourage re-export without providing net subsidies. It reveals the structural logic behind a specific form of commercial regulation that differs meaningfully from bounties.