--- id: IDENTITY-WP-0003 type: workplan title: "Research corpus backfill and model refinement" domain: infotech repo: identity-canon status: finished owner: codex topic_slug: canon created: "2026-06-19" updated: "2026-06-21" state_hub_workstream_id: "af85e0a3-ccb8-4cfd-8859-d0794769e3e2" --- # Research corpus backfill and model refinement This workplan follows up on `ResearchProposal.md` Immediate Next Steps 4 and 10. The seeded source notes under `research/` still contain `TODO` placeholders; the terminology, canon, model, and scenario artifacts from `IDENTITY-WP-0002` are candidate hypotheses until the corpus is backfilled. Priority order follows `OpenQuestions.md` corpus guidance: start with record and subject semantics (provisioning and federation), then authorization projections, then social graph, verifiable claims, and entity-resolution sources. ## Backfill Provisioning And Federation Source Notes ```task id: IDENTITY-WP-0003-T01 status: done priority: high state_hub_task_id: "fb88b28d-da4d-4f78-90d7-30e1ef49d6b6" ``` Backfill the nine source notes in `research/identity-provisioning/` and `research/authentication-federation/`. Replace template placeholders with source-backed terminology extracts, modeling assumptions, conflicts, candidate canonical mappings, and open questions. Use the structure defined in `research/CorpusIndex.md`. ## Backfill Authorization And Social Graph Source Notes ```task id: IDENTITY-WP-0003-T02 status: done priority: high state_hub_task_id: "bc684b78-6af4-4988-927c-67520d81bdb1" ``` Backfill the eight source notes in `research/authorization-relationships/` and `research/social-community-graphs/`. Capture how each source models principals, subjects, actors, groups, organizations, tenants, and relationships without collapsing them into a single overloaded term. ## Backfill Verifiable Claims And Entity-Resolution Source Notes ```task id: IDENTITY-WP-0003-T03 status: done priority: medium state_hub_task_id: "d73ac6a7-2744-40b2-984c-4c86ef7493cf" ``` Backfill the six source notes in `research/verifiable-claims/` and `research/entity-resolution-privacy/`. Document synonymity, identifier, pseudonymity, and privacy semantics that affect canonical account-linking and relationship modeling. ## Refresh Terminology Artifacts From Corpus ```task id: IDENTITY-WP-0003-T04 status: done priority: high state_hub_task_id: "dc1ca5f0-b511-499c-aa47-76f88d2a20a6" ``` Update `terminology/TerminologyInventory.md` and `terminology/TerminologyConflictMap.md` using extracted terminology from the backfilled source notes. Keep mappings marked as candidate until conflicts are reviewed against `canon/CanonicalGlossary.md`. ## Refine Conceptual Model Against Scenario Tests ```task id: IDENTITY-WP-0003-T05 status: done priority: high state_hub_task_id: "989b1e68-dceb-4877-87f3-c7c93ddf076d" ``` Revise `model/ConceptualModel.md` so each scenario in `scenarios/ScenarioTests.md` has an explicit representation path. Record any scenario that the current model cannot satisfy without glossary or principle changes. ## Reconcile Canon Surfaces And Open Questions ```task id: IDENTITY-WP-0003-T06 status: done priority: medium state_hub_task_id: "5e05f41d-07d1-4566-b2ec-deda56c3727a" ``` Update `canon/DesignPrinciples.md`, `canon/CanonicalGlossary.md`, `OpenQuestions.md`, and `DownstreamRecommendations.md` to reflect corpus findings. Resolve questions where the source evidence is sufficient; otherwise leave them open with citations to the relevant source notes.