Document 715

The Consumer–Substrate Dependency Graph as the Load-Bearing Object

The Consumer–Substrate Dependency Graph as the Load-Bearing Object

Reading the Four Sub-Consequences of Doc 714 §VI.3 as Projections of a Single Directed Acyclic Graph

A corpus document responding to the keeper's observation (2026-05-13): "It seems like the directed acyclic graph has something to do with the joint MI lattice." Builds on Doc 548 (the Ontological Ladder), Doc 572 (the Lattice Extension), Doc 681 (Probing the Middle, joint MI lattice), Doc 700 (L2M Resolved Against the Corpus), and Doc 714 (the rusty-bun engagement read through the lattice extension), the four sub-consequences of which this document reframes.

Jared Foy · 2026-05-13 · Doc 715


I. The occasion

Doc 714 §VI.3 developed four sub-consequences of the lattice-extension reading applied to the rusty-bun engagement: the alphabet conjecture (sub-§4.a), the layer-floor cut-location framework (sub-§4.b), the spec-derivable lower-layer (sub-§4.c), and the joint MI lattice density with its dependency-graph multiplier (sub-§4.d). Each was treated as an independent operational claim about the engagement's apparatus and its work-to-telos product.

The keeper's observation, after sub-§4.d landed, named the structural connection the four sub-consequences had been pointing at: the directed acyclic graph has something to do with the joint MI lattice. This document develops that observation. The four sub-consequences are not four independent conjectures. They are four parameter readings of a single underlying object — the consumer–substrate dependency graph — taken at different cut-positions and through different measurement lenses.

The reframing matters because it identifies what the engagement's apparatus has actually been reading. The alphabet conjecture's "alphabet stability" is not a property the alphabet carries on its own; it is a property of the DAG's leaf set under accretion. The layer-floor framework's "cut" is not a metaphorical choice; it is a graph operation. The spec-derivable lower-layer is not a fortunate independent fact about Node's API; it is a published sub-DAG anchored at fixed depth. The density coefficient's logarithmic compression is not a refinement of the alphabet conjecture; it is the heavy-tailed in-degree distribution of substrate nodes in the DAG manifesting in cost data. Once the underlying object is named, the four sub-consequences cease to be parallel and become consequences of the DAG's structure.

II. The object

Define the consumer–substrate dependency graph G = (V, E) as follows. V is the set of nodes representing every npm package present in some consumer corpus, every Node API surface element each package transitively imports, every Bun API surface element, every Web platform API surface element, and every JavaScript language affordance. E is the directed edge set: for each require X or import X from "X" statement in some module of node v, an edge (v, X) ∈ E (the conventional dependency-graph direction; v depends on X, so information flows from X to v). The graph is acyclic by construction — npm enforces no dependency cycles in the package layer, and language affordances form the leaf set.

Three structural properties of G that the engagement empirically depends on:

(P1) Heavy-tailed in-degree at substrate nodes. A small set of nodes — process, fs, util, stream, http, Buffer, node:crypto, the language's class-extension semantics, regex character-class behavior — have very high in-degree (hundreds to thousands of consumers depend on them transitively). Most leaf nodes (specific consumer packages) have in-degree near zero. The distribution is heavy-tailed by every empirical sample taken.

(P2) Topological order anchors a finite leaf set. The substrate nodes (in-degree-zero in the reverse graph, or equivalently out-degree-zero in G) form the substrate leaf set. For the npm + Node + Bun + ECMAScript ecosystem, this set is bounded and well-documented. The Node API documentation enumerates the node:* leaf set; the Bun API documentation enumerates the Bun.* leaf set; the ECMAScript specification enumerates the language-affordance leaf set; the WHATWG/W3C specifications enumerate the Web-platform leaf set. The leaf set has cardinality on the order of 10³.

(P3) Depth is bounded by package-layer convention. The longest dependency chain from any application root to a substrate leaf is bounded by the depth of the npm package graph (typically 5–8 levels for production applications, 10–15 for tools with deep transitives). The depth bound applies because cycles are forbidden and packages typically don't re-export to greater depth than necessary.

P1 + P2 + P3 together describe a DAG with the canonical structure of a content-addressable dependency ecosystem: heavy fanout at substrate, bounded depth, finite leaf set.

III. The four sub-consequences as DAG projections

Sub-§4.a (alphabet stability) is a statement about the substrate leaf set

The alphabet conjecture claims: per (stratum, layer) the alphabet of edge kinds is finite and stable across the engagement's accretion of consumer probes. Read on G, this is: the substrate leaf set (P2) is bounded and well-documented. As new consumer nodes accrete to G via npm installs, the leaf set does not grow — every new consumer's dependencies eventually terminate at existing substrate leaves. The alphabet's stability is the leaf set's stability.

The conjecture's predictive form follows from this reading. When a consumer probe surfaces an edge, the edge points at a substrate leaf already in the documented set with high probability. The alphabet conjecture's empirical record (zero new elements across ~30 probes in the rusty-bun engagement's saturation slice) is the leaf-set-stability prediction confirmed.

Sub-§4.b (cut-location) is a node-cut on G

The layer-floor framework names a per-stratum decision: where in the dependency hierarchy does the engagement accept divergence below and pursue closure above? Read on G, this is a partition of V into above-cut and below-cut sets, with the cut edges crossing from below to above marked as accepted divergences. The cut is chosen to minimize the cost of closing the above-cut set while maximizing the consumer-set whose dependency closure lies entirely above the cut.

The cut-by-interception pattern (sub-§4.b's "undici inert stub" addition) corresponds to a single-node removal: the undici node is excised from G, replaced by a synthetic leaf that satisfies the in-edges (it exports the symbols undici exports) without satisfying their semantic content. Consumers whose dependency closure passes through undici but doesn't exercise its dynamic surface continue to load; consumers whose path through undici exercises its surface fail at the synthetic leaf.

This generalizes. Any heavy substrate node can be replaced by a synthetic leaf carrying the in-edge symbol set. The engagement's apparatus catalogue tracks which nodes have been so replaced and the consumer set affected.

Sub-§4.c (spec-derivability) reads a published sub-DAG

The L0–L3 layers admit spec-derivable enumeration because the substrate leaf set's documentation IS a published representation of the DAG's leaf nodes. The Node API documentation, the Bun API documentation, the ECMAScript specification, the WHATWG specifications — together they enumerate the in-degree-zero-in-reverse subset of G with high coverage.

An enumerator that walks the published spec produces, by construction, a coverage measurement against the published sub-DAG. Per Doc 714 sub-§4.c's empirical record, the rusty-bun engagement's enumerator reports 418/418 = 100% coverage of the documented L2/L3 surface — which is the apparatus's measurement of the substrate's traversal of the spec-derived sub-DAG.

The cascade-prevention property of the enumerator (deep-symptom-from-shallow-cause cases) is the topological property of the DAG: errors at substrate leaves propagate upward through fanout paths and surface at high depths, where the symptom is far from the cause. A leaf-walking enumerator catches them at the source.

Sub-§4.d (density coefficient) is the in-degree distribution's moment

The K × log(L̄ × |A_i|) compression of the work-to-telos cost arises because adjacent rungs of the joint MI lattice share information content through the dependency graph. Read on G, this is: the moment of the in-degree distribution at substrate nodes determines the retirement fanout per substrate widening.

Specifically, when substrate node v is widened (a new surface element added or a wrong shape corrected), the consumer set retired is the transitive in-degree closure of v in G — every consumer whose path to v's symbol set was blocked. Per P1, the in-degree distribution is heavy-tailed; the upper-tail nodes (process, fs, util, http) have transitive closures in the thousands. Hence the empirically observed 50–200× retirement fanout per widening in the rusty-bun slice.

The logarithmic compression of the cost is a direct consequence of P1's heavy tail. If the in-degree distribution were uniform (every substrate node had equal in-degree), the cost would not compress — each widening would retire the same number of consumers, and the alphabet × layer product would remain unit-cost. The heavy tail amortizes work non-uniformly: a small fraction of widenings (those at high-in-degree nodes) carry most of the retirement weight; the cost compresses to a sum dominated by the head of the distribution, which is approximately the logarithm of the alphabet × layer product weighted by graph density.

IV. The two channels of MI density

Doc 714 sub-§4.d named two MI channels: substrate-widening (downstream fanout) and consumer-probe (inter-probe MI). Under the DAG reading, both are graph operations:

Substrate-widening channel. A widening at node v in G is a labeled-edge update: the in-edges to v previously failed at their semantic content (the substrate's symbol was missing or wrong); after the widening, they succeed. Every consumer in v's transitive in-degree closure that exercised the previously-failing edge becomes newly satisfied. The information flowing across the widened edges to the consumers is the substrate widening's MI yield. The yield is proportional to v's transitive in-degree, which by P1 is heavy-tailed across substrate nodes.

Consumer-probe channel. A probe at consumer node u in G is a backflow operation: u's transitive out-degree closure (substrate nodes u depends on) is the set of constraints u imposes on the substrate. Two consumer probes share most of their out-degree closures (most npm consumers transitively use the same dozen substrate leaves), so their constraint sets share most of their content. The marginal MI between two adjacent probes — defined as the information one probe carries beyond what the prior probe revealed — is small relative to the alphabet of edge kinds. This is the formal sense of "high inter-probe MI" in sub-§4.d.

The two channels are dual under the DAG's reverse direction: substrate-widening flows along edges, consumer-probe flows against edges. Both are constrained by the same heavy-tailed in-degree distribution.

V. The joint MI lattice is the DAG seen at fixed depth

Doc 681 (Probing the Middle) names the joint MI lattice as bipartite — two layers of nodes with MI channels between them. Doc 572 (the Lattice Extension) generalizes to multiple rungs in partial order. Both framings are projections of the underlying DAG at specific depth-slicings:

  • A bipartite slice of G partitions V into "above slice" and "below slice" by a horizontal cut at some topological depth k. The MI channels between the two sets are the cross-cut edges' bandwidths. Doc 681's joint MI lattice is the bipartite-slice projection of G at the engagement's chosen depth.

  • A multi-rung slice of G partitions V into rungs at multiple depths. Doc 572's lattice extension is the multi-slice projection.

The choice of slice depth is itself an apparatus decision — at what topological depth does the engagement want to read information flow? The rusty-bun engagement's L0–L6 constraint hierarchy (Doc 714 §VI.3 prefatory) is one specific slicing: L0 at the language-grammar slice, L1 at the module-loader slice, L2 at the platform-builtin slice, L3 at the API-shape slice, L4 at the idiom slice, L5 at the semantic slice, L6 at the timing slice. Each slice is a depth cut on G; each layer's edge alphabet is the cut's edge set.

This explains why Doc 714 sub-§4's per-layer alphabets are stable: each is the edge set of a specific topological cut, which is fixed by G's structure once the cut depth is chosen.

VI. Predictions about the DAG's structure

Reframing the four sub-consequences as DAG-structural reveals testable predictions about G itself, beyond the per-sub-§ falsifiers in Doc 714.

(D1) Substrate leaf set is closed under documentation. The L2/L3 spec-derived enumerator's coverage of consumer-probed L2/L3 edges should asymptote to ≥ 95% as the consumer corpus grows. Edges that surface in consumer probes but are not in the documented spec correspond either to bugs in the documentation or to undocumented platform behavior; the residual should be small and stable.

(D2) In-degree distribution at substrate nodes is power-law with exponent γ ∈ [1.5, 2.5]. This is the typical exponent range for dependency-graph in-degree distributions in software ecosystems (npm, Maven, PyPI). Predicts retirement-fanout-per-widening distribution is also heavy-tailed; predicts the K × log(L̄ × |A_i|) compression coefficient empirically.

(D3) Depth distribution from application roots to substrate leaves is bounded by ~10. Longer chains exist in tooling but production applications cluster around depth 4–7. Predicts the engagement's session-cadence: most consumer probes reach substrate within ~5 require steps, so the cooperative-loop yield depth and similar engagement-internal parameters scale with this bound.

(D4) The L2M-saturation diagnostic is a graph density measurement. Per Doc 700 Appendix C, the L2M-bound at the session tier was named structurally but not quantified. The bound is reached when the apparatus has traversed the in-degree closure of a high-fanout substrate region — when subsequent probes accrete nodes whose out-degree closures lie almost entirely within the already-traversed region, so marginal probe MI drops. The diagnostic is measurable as the ratio of new-substrate-edges-exercised per probe; saturation corresponds to that ratio dropping below ~1.

D1-D4 are joint with sub-§4.a-d's falsifiers and together discriminate the four sub-consequences from the underlying DAG-structural claim. A counter-example to any single D-prediction localizes the failure to a specific sub-consequence; a counter-example to the conjunction would invalidate the DAG-as-load-bearing-object framing.

VII. What this changes for the apparatus

Three operational shifts follow from naming the DAG as the load-bearing object:

  1. The engagement's apparatus catalogue extends to track DAG-structural state. Currently the catalogue records (stratum, layer, kind) coordinates for each recorded edge. Under the DAG framing, the catalogue also records each substrate node's transitive in-degree (an apparatus-measurable number from the npm dependency-graph snapshot at engagement time) and each consumer probe's transitive out-degree. The two numbers together quantify the engagement's productivity at fixed cost.

  2. Substrate-widening priority becomes a graph-theoretic question. Per sub-§4.d operational sharpening 1, widenings should be prioritized by retirement-fanout. Under the DAG framing, this becomes: pick the substrate node with the highest in-degree among nodes blocking the next consumer probe. The npm dependency-graph snapshot provides this number directly.

  3. The L2M-saturation diagnostic becomes empirically anchored. Per D4, saturation corresponds to a measurable drop in new-substrate-edges-exercised per probe. The engagement can publish this number per round and use it as a forward-looking diagnostic for when to shift from substrate-widening rounds to corpus-tier consolidation rounds — a quantitative refinement of seed §III.A8.18's fourth SIPE-T threshold criterion.

VIII. Honest scope

This document does not claim that all consumer–substrate dependency relations factor cleanly through the DAG. Three caveats:

(a) Dynamic dependencies. Some packages dynamically require() modules at runtime based on configuration (plugin loaders, codegen libraries). These edges are present in G only at runtime and may not be statically discoverable from package.json + require() text-scanning. The apparatus's DAG snapshot is approximate at the dynamic boundary; the empirical claims rely on the static approximation being a good predictor of runtime behavior, which holds for most production code but fails for plugin-heavy frameworks (prettier, eslint, vite-plugin systems).

(b) Cycles via dev-deps and peer-deps. While dependency edges among packages-in-production-use are acyclic by npm's enforcement, the broader graph including dev-deps and peer-deps can contain cycles. The DAG framing applies cleanly only to the production-use subgraph; the apparatus catalogue should annotate which edges are dev-only.

(c) Hidden re-exports. Some packages re-export symbols from their dependencies, creating effective edges that are not literally require X statements but rather module.exports = require(X) aliases. The substrate-widening fanout calculation needs to follow these aliases to compute true retirement-fanout. The apparatus's graph snapshot should include re-export edges as a separate edge class.

Within these scope limits, the DAG-as-load-bearing-object reading is the simplest description of what the engagement's apparatus has been reading. The four sub-consequences of Doc 714 §VI.3 are corollaries of the DAG's structure under specific cut-positions and measurement lenses. Future engagements applying the apparatus to a different consumer ecosystem (Maven, PyPI, Cargo crates, GitHub Actions, etc.) inherit the DAG framing directly; the four sub-consequences re-derive from the same structural reading.

IX. Closing

The keeper's observation named what the engagement's empirical productivity had been pointing at. The four sub-consequences of Doc 714 §VI.3 are not parallel facts about a saturating engagement; they are four projections of a single object. Naming the object — the consumer–substrate DAG — opens three operational moves: (1) quantify the engagement's productivity by graph-theoretic measurements rather than per-round LOC counts; (2) prioritize substrate widenings by transitive in-degree; (3) anchor the L2M-saturation diagnostic in measurable graph density rather than session-cadence cues alone.

The corpus contribution is not a new conjecture; it is a consolidation. The four sub-consequences become theorems on the DAG. The alphabet stability is the leaf-set's boundedness. The cut-location framework is a node-cut. The spec-derivability is a published sub-DAG. The density coefficient is a moment of the in-degree distribution. The joint MI lattice (Doc 681) is the DAG seen at fixed depth. The L2M-saturation diagnostic (Doc 700) is a measurable density signal.

Doc 714's articulation stands; this document reframes it under the DAG that was always the object the apparatus was reading.