shard-wiki/research/260614-jupyter-deep-dive/README.md
tegwick 25a714efa7 research: Jupyter Notebooks deep dive; UC-84 (SHARD-WP-0004 T3)
.ipynb JSON cells + embedded computed outputs with fragile execution
provenance; derived output stored inside the source. Non-Markdown/lossy;
kernel = capability, default = present snapshot + static render.
Enriches UC-54/55/59/35; links UC-32/83/79. Marks T3 done.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-14 23:08:13 +02:00


# 260614 — Jupyter Notebooks deep dive
Date: 2026-06-14 · Source: **SHARD-WP-0004 T3**
## What this is
A deep dive into **Jupyter Notebooks**: the **`.ipynb` JSON** document (ordered cells:
markdown / code+outputs / raw), **kernels**, **embedded computed outputs** (MIME bundles),
and **execution-count provenance**. Jupyter is the dominant modern computational document
format and the concrete case where the **derived output is captured and stored *inside* the
source** with real-but-fragile provenance.
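
To make the document shape concrete, here is a minimal sketch of an nbformat-4-style notebook and a walk over its cells. The sample notebook and the helper name `summarize_cells` are illustrative assumptions, not taken from `findings.md`; only the standard library is used.

```python
import json

# Hand-written sample in the nbformat 4 shape (illustrative, not a real notebook).
NOTEBOOK_JSON = """
{
  "nbformat": 4,
  "nbformat_minor": 4,
  "metadata": {"kernelspec": {"name": "python3", "display_name": "Python 3"}},
  "cells": [
    {"cell_type": "markdown", "metadata": {}, "source": ["# Title"]},
    {"cell_type": "code", "execution_count": 2, "metadata": {},
     "source": ["1 + 1"],
     "outputs": [{"output_type": "execute_result", "execution_count": 2,
                  "metadata": {}, "data": {"text/plain": ["2"]}}]},
    {"cell_type": "raw", "metadata": {}, "source": ["raw text"]}
  ]
}
"""


def summarize_cells(nb):
    """Return (cell_type, execution_count, MIME types of embedded outputs) per cell."""
    rows = []
    for cell in nb["cells"]:
        mime_types = []
        # Only code cells carry "outputs"; rich outputs keep a MIME bundle under "data".
        for out in cell.get("outputs", []):
            mime_types.extend(sorted(out.get("data", {})))
        rows.append((cell["cell_type"], cell.get("execution_count"), mime_types))
    return rows


nb = json.loads(NOTEBOOK_JSON)
summary = summarize_cells(nb)
for row in summary:
    print(row)
```

The code cell is the only one with an execution count and a stored result, which is the "derived output inside the source" case described above.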
## Why it matters
- Tests the T1 **replication- vs derivation-projection** split on the dominant real artifact
and adds the wrinkle that **outputs are stored back inside the source** — the source/
projection line runs *through* the document.
- The page model (T12) must carry a **notebook shape**: ordered cells with code cells owning
embedded computed outputs that have **weak execution provenance** (run order is only partly
recoverable from execution counts; the environment is not captured).
- Non-Markdown + lossy translation (T15): JSON+MIME bundles; nbconvert→Markdown is lossy and
one-way. Raw JSON diffs are noisy, which motivates Jupytext text-pairing / nbdime (T13).
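
The "weak execution provenance" point can be illustrated with a small check over execution counts. This is a sketch under assumptions: `execution_order_issues` and the sample notebook are hypothetical, and the heuristic only compares stored counts against document order.

```python
def execution_order_issues(nb):
    """Flag code cells whose stored execution count gives fragile provenance."""
    issues = []
    highest_seen = 0
    for index, cell in enumerate(nb["cells"]):
        if cell.get("cell_type") != "code":
            continue
        count = cell.get("execution_count")
        if count is None:
            # A null count with outputs means the outputs cannot be tied to any run.
            if cell.get("outputs"):
                issues.append((index, "outputs without execution count"))
            else:
                issues.append((index, "never executed"))
        elif count <= highest_seen:
            # An earlier cell in the document ran later than this one.
            issues.append((index, "executed out of document order"))
        else:
            highest_seen = count
    return issues


# Illustrative notebook: the first code cell was re-run after the second one.
sample = {
    "cells": [
        {"cell_type": "code", "execution_count": 3, "source": "x = 2", "outputs": []},
        {"cell_type": "markdown", "source": "notes"},
        {"cell_type": "code", "execution_count": 1, "source": "print(x)",
         "outputs": [{"output_type": "stream", "name": "stdout", "text": "1\n"}]},
        {"cell_type": "code", "execution_count": None, "source": "y = x", "outputs": []},
    ]
}

issues = execution_order_issues(sample)
print(issues)
```

In the sample, the stored output of the second code cell may predate the last run of the first one, and nothing in the file records which value of `x` it actually saw. That is the sense in which the provenance is real but fragile.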
## Yield
- **UC-84** (new): attach/project a computational notebook preserving cell structure +
embedded outputs, surfacing outputs as **snapshots with weak execution provenance**;
re-execution is **capability-gated**, and the default is to present the stored snapshot as a
static render.
- Enrich **UC-54, UC-55, UC-59, UC-35**; links **UC-32, UC-83, UC-79**.
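
One possible reading of UC-84 as code, offered as a sketch rather than a design: every name here (`CellProjection`, `project_notebook`, the `kernel_available` flag, the `"weak"` / `"static"` labels) is a hypothetical illustration, and nothing is ever executed.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class CellProjection:
    index: int
    cell_type: str
    source: str
    execution_count: Optional[int] = None
    output_kinds: List[str] = field(default_factory=list)
    provenance: str = "n/a"   # "weak" when stored outputs exist
    mode: str = "static"      # "re-executable" only when the capability is present


def project_notebook(nb, kernel_available=False):
    """Project cells in order, keeping embedded outputs as snapshots."""
    projections = []
    for index, cell in enumerate(nb["cells"]):
        source = cell.get("source", "")
        if isinstance(source, list):  # nbformat allows a string or a list of lines
            source = "".join(source)
        proj = CellProjection(index, cell["cell_type"], source)
        if cell["cell_type"] == "code":
            proj.execution_count = cell.get("execution_count")
            for out in cell.get("outputs", []):
                if "data" in out:
                    proj.output_kinds.extend(sorted(out["data"]))  # MIME bundle keys
                else:
                    proj.output_kinds.append(out["output_type"])   # stream / error
            proj.provenance = "weak" if cell.get("outputs") else "none"
            if kernel_available:
                proj.mode = "re-executable"
        projections.append(proj)
    return projections


demo = {
    "cells": [
        {"cell_type": "markdown", "metadata": {}, "source": ["# Plot\n", "Some notes"]},
        {"cell_type": "code", "execution_count": 5, "metadata": {},
         "source": "print('hi')",
         "outputs": [
             {"output_type": "stream", "name": "stdout", "text": ["hi\n"]},
             {"output_type": "display_data", "metadata": {},
              "data": {"text/plain": ["<Figure>"], "image/png": "iVBORw0KGgo="}},
         ]},
    ]
}

default_view = project_notebook(demo)
gated_view = project_notebook(demo, kernel_available=True)
print(default_view[1])
```

With no kernel capability, every cell stays in `static` mode and the stored outputs are shown as snapshots. Passing `kernel_available=True` only marks code cells as eligible for re-execution.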
## Contents
| Path | Role |
|------|------|
| `findings.md` | `.ipynb` model, kernels/execution-count fragility, ecosystem (nbconvert/Jupytext/papermill/nbdime/nbstripout), capability profile, INTENT mapping, UC seed, architecture notes, open questions |