feat(example): add per-entity LLM evaluations for 985 WoN entities (S3.3)

Batch evaluation of all 988 entities via OpenRouter. 984 succeeded on
first pass; 3 failed (network errors). eval-summary --update-metrics
written with per_entity_mean=3.9556.

Viability dashboard: 6/6 PASS
  redundancy_ratio   0.0061  (max 0.10)
  coverage_ratio     0.6190  (min 0.40)
  coherence_comps    0.0000  (max 3)
  consistency_cycles 0.0000  (max 0)
  granularity_entropy 2.6748 (min 1.0)
  per_entity_mean    3.9556  (min 3.5)

Dimension breakdown (mean across 985 entities):
  definition_precision  3.62
  source_grounding      4.36
  domain_placement      4.56
  vsm_relevance         3.31
  explanatory_value     3.94

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-02-23 09:36:46 +01:00
parent 81a4c8796a
commit a9ca0adfcf
986 changed files with 63216 additions and 1 deletions

View File

@@ -0,0 +1,65 @@
---
entity_slug: excise_duty_drawback
evaluator: null
evaluated_at: '2026-02-23T05:24:54.872330'
overall_score: 4.6
scores:
- name: definition_precision
value: 4.0
max_value: 5.0
rationale: The definition is precise and clearly distinguishes excise duty drawback
from general drawbacks by specifying it applies to domestically produced goods
and excise duties specifically. It captures a distinct mechanism rather than a
vague concept, though it could be slightly more concise.
- name: source_grounding
value: 5.0
max_value: 5.0
rationale: This entity is well-grounded in Smith's actual discussion of drawbacks
in Book IV, Chapter 4, where he explicitly analyzes how drawbacks on excise duties
function and their role in maintaining natural economic balance. The concept directly
reflects Smith's own analysis rather than imposing external interpretations.
- name: domain_placement
value: 5.0
max_value: 5.0
rationale: The "Regulation" domain assignment is entirely appropriate, as excise
duty drawbacks are a specific regulatory mechanism that government implements
to counteract the distortive effects of its own taxation policies. This fits perfectly
within Smith's broader discussion of commercial regulations.
- name: vsm_relevance
value: 4.0
max_value: 5.0
rationale: This entity maps well to S2 (coordination/anti-oscillation) as it represents
a regulatory mechanism designed to prevent artificial distortions in capital allocation
and maintain economic balance. It also has some S3 (internal regulation) characteristics
as a corrective policy tool.
- name: explanatory_value
value: 5.0
max_value: 5.0
rationale: This entity provides excellent explanatory value by illuminating the
specific mechanism through which governments can neutralize the competitive disadvantages
their own taxation creates for domestic producers in export markets. It demonstrates
Smith's nuanced understanding of how regulatory interventions can sometimes preserve
rather than distort natural economic processes.
---
# Evaluation: Excise Duty Drawback
## definition_precision — 4.0 / 5.0
The definition is precise and clearly distinguishes excise duty drawback from general drawbacks by specifying it applies to domestically produced goods and excise duties specifically. It captures a distinct mechanism rather than a vague concept, though it could be slightly more concise.
## source_grounding — 5.0 / 5.0
This entity is well-grounded in Smith's actual discussion of drawbacks in Book IV, Chapter 4, where he explicitly analyzes how drawbacks on excise duties function and their role in maintaining natural economic balance. The concept directly reflects Smith's own analysis rather than imposing external interpretations.
## domain_placement — 5.0 / 5.0
The "Regulation" domain assignment is entirely appropriate, as excise duty drawbacks are a specific regulatory mechanism that government implements to counteract the distortive effects of its own taxation policies. This fits perfectly within Smith's broader discussion of commercial regulations.
## vsm_relevance — 4.0 / 5.0
This entity maps well to S2 (coordination/anti-oscillation) as it represents a regulatory mechanism designed to prevent artificial distortions in capital allocation and maintain economic balance. It also has some S3 (internal regulation) characteristics as a corrective policy tool.
## explanatory_value — 5.0 / 5.0
This entity provides excellent explanatory value by illuminating the specific mechanism through which governments can neutralize the competitive disadvantages their own taxation creates for domestic producers in export markets. It demonstrates Smith's nuanced understanding of how regulatory interventions can sometimes preserve rather than distort natural economic processes.