Document 740

Multi-Tier Cascade-Revival

Multi-Tier Cascade-Revival

When the Hot Path Traverses Multiple Tiers, Closing One Tier Alone Is Insufficient — the Cascade-Revival Pattern Recurs at the Cross-Tier Scope

A primary articulation responding to a recognition surfaced during a 2026-05-23 substrate session in the cruftless engagement (rusty-bun), several hours after Doc 739. Doc 739 articulated cascade-revival at the single-tier scope: closing an upstream structural constraint cascade-revives a stalled downstream sibling-pilot. The session's continuation surfaced a generalization: when the actual hot path traverses multiple tiers (each carrying its own dispatch-shape and per-call cost), closing one tier alone produces partial reclaim; the cascade-revival pattern itself recurs at the cross-tier scope. The full pipeline connects only when all relevant tiers along the hot path are closed in dependency order. Builds on Doc 729 — Cruftless, Doc 730 — Vertical Recurrence of the Lowering Compiler, Doc 734 — The Meta Resolution Pipeline, Doc 735 §X.h — The (P2) Four-Sub-Case Taxonomy, Doc 581 — Pin-Art Apparatus, and Doc 739 — Constraint-Closure as Cascade-Revival.

Jared Foy · 2026-05-23 · Doc 740


I. The occasion

A 2026-05-23 substrate session continued past Doc 739's single-tier cascade-revival recognition into a workstream addressing a CRB-measured 20× cruft/node gap on the json_parse_transform fixture. The substrate pilot's first cut (JSF, a JSON.stringify fast-path under Pin-Art discipline) targeted what a prior CRB component-decomposition estimate had identified as the dominant contributor.

The substrate pilot landed four substrate moves through six rounds: a buffer-threading upstream constraint-closure (Move 1, JSF-EXT 3); two cascade-revival sibling-pilots predicted by Doc 739 (string-escape ASCII fast-path Move 2, integer-fast-path Move 3); a clone-elision composition round (Move 4). Each move was correctness-preserving across the engagement's two correctness probes (canonical fuzz + diff-prod). The first three moves matched Doc 739's cascade-revival pattern empirically: M1 was flat (substrate-introduction signature); M2/M3 each produced small positive deltas per the predicted cascade.

When all four moves had landed, the cumulative CRB measurement was -1% (within noise) versus the baseline. The CRB target of -40% reclaim was not met. The substrate pilot's reclaim model — derived from the upstream component-decomposition estimate — had projected -50% to -75%. The model was empirically wrong.

The keeper's framing question, given the dip-and-recover pattern projected for substrate-introduction work: "the pipeline is being built; you have to go through this middle stretch where performance decreases first." The session ran a component A/B probe (10-second probe with five additive variants on the target fixture) and identified the actual dominator: a top-level character-scanning loop contributing 77% of cruft's wall-clock, a loop that was not part of the JSON pipeline at all. The original component-decomposition estimate had been off by approximately twenty-fold.

The disambiguating probe enabled two follow-on substrate rounds (a substrate-tier algorithmic fix at the character-scanning intrinsic; an interp-tier inline-cache fast-path at the method-dispatch tier). The CRB measurement after the substrate-tier fix alone was -3%; after both substrate-tier and dispatch-tier fixes, -12% cumulative, with the cruft/node ratio dropping from 20.34× to 17.93×.

The recognition came at the final measurement: each of the two follow-on rounds was correctness-preserving and addressed a single tier (substrate-intrinsic algorithm; interp-dispatch fast-path). The single-tier reclaim was substantial but partial (-15% and -27% respectively at the dominator loop). Neither single-tier closure alone connected the pipeline. The pipeline connected only when both tiers were closed.

The structural shape: Doc 739's single-tier cascade-revival pattern is sufficient for any two-tier producer-consumer pair (upstream constraint-closure → downstream sibling-pilot revival). When the actual hot path traverses three or more tiers (each with its own dispatch-shape and per-call cost), Doc 739's pattern recurs at each adjacent tier-pair but is NOT sufficient at the cross-tier scope unless all tiers along the hot path are addressed.

This document specifies that recognition. The abstract formulation is in §II; the cruftless instance is in §III; the methodological corollaries — including the empirical-disambiguation-before-substrate-spawn discipline that the session also surfaced — are in §IV.

II. The abstract formulation

A resolver-instance pipeline (Doc 729 §IV) carries N tiers. Each tier T_k carries its own dispatch interface (how values cross into T_k), its own intrinsic-cost model (the per-call cost when T_k handles a value), and its own constraint surface (what T_{k-1} guarantees about T_k's input; what T_k guarantees about T_{k+1}'s input). The cross-tier interface is structural; per-call cost composes multiplicatively along the call path.

Doc 739 articulated the cascade-revival pattern for the single-tier scope: when a sibling-pilot stalled at (P2.d) downstream of a structural constraint at tier T_k closes constraint T_k by an upstream substrate move, downstream pilot revives without local substrate work because the constraint-propagated precondition collapses. The pattern is sufficient for two-tier scope: closing T_k revives stalled (P2.d) pilots downstream-of-T_k that were targeting T_k's propagated constraint.

II.1 The multi-tier shape

Consider a hot path that traverses tiers T_1 → T_2 → T_3 → … → T_n. Each tier carries some per-call cost C_k. The total per-call cost is the sum (or product, for nested dispatch) of per-tier costs. Different tiers will have different cost magnitudes; one tier may dominate, but more commonly multiple tiers contribute non-trivially.

A substrate pilot targeting tier T_k closes that tier's per-call cost. If T_k is the unique dominator, single-tier closure produces the projected reclaim. If multiple tiers contribute non-trivially, single-tier closure produces partial reclaim bounded by C_k / Σ C_j. The remaining cost lives at the other tiers; the pipeline does not connect to its final reclaim until all non-trivial contributors are closed.

II.2 The multi-tier cascade-revival pattern

Define the multi-tier cascade-revival pattern in four propositions:

(P1) A hot-path-component analysis enumerates the tiers T_1 … T_n along the call path; each tier's contribution C_k can be measured empirically (per §IV.1 below) without full source-tier closure. The set { T_k : C_k is non-trivial fraction of Σ C_j } is the relevant-tier set R.

(P2) Single-tier closure at T_k ∈ R produces reclaim bounded by C_k / Σ C_j. If |R| = 1, this is the projected reclaim. If |R| > 1, single-tier closure produces partial reclaim; the remaining (Σ C_j) - C_k is unchanged.

(P3) Doc 739's single-tier cascade-revival pattern applies at each adjacent tier-pair within R. Closing T_k enables T_{k+1}'s consumer-pilots to revive from (P2.d) when their constraint-propagation source was T_k. The recursion is local to the tier-pair, not the cross-tier scope.

(P4) Full pipeline-connection requires closure at ALL tiers T_k ∈ R, in dependency order (upstream first). Cumulative reclaim materializes at the final-tier-closure round, not at any single-tier-closure round. The pipeline-connection moment is the empirical readout that the relevant-tier enumeration was complete.

II.3 What the pattern is NOT

(B1) The multi-tier pattern does not predict that ALL tiers along the call path contribute non-trivially. The relevant-tier set R is empirical, not architectural. Some tiers (e.g., a well-tuned arithmetic op) may contribute negligibly even when on the hot path. Per (P1), R is identified by measurement, not by source-read enumeration alone.

(B2) The multi-tier pattern does not predict that closing |R| tiers produces summative reclaim equal to Σ C_k. Composition effects (constructive when tiers are orthogonal; destructive when downstream cost depends on upstream output shape) modulate the cumulative reclaim. Pre-implementation reclaim projections at the cross-tier scope are upper-bound estimates; empirical measurement at each tier-closure round refines the projection.

(B3) The single-tier scope of Doc 739 is a special case of the multi-tier pattern when |R| = 2 and the two-tier pair has a clean producer-consumer interface. Doc 739's pattern is not superseded; it is the building block of the multi-tier pattern.

(B4) The multi-tier pattern is observable only post-measurement. A substrate pilot whose pre-implementation model projects single-tier dominance is honest investment given the model; the multi-tier shape becomes visible only when the cumulative reclaim falls short of the model's projection and a component A/B probe reveals which tiers were missed.

II.4 The component A/B probe as relevant-tier-set apparatus

The relevant-tier set R is identified by a component A/B probe: replace each suspect tier's contribution with a no-op or near-no-op variant; measure per-variant wall-clock; per-tier contribution C_k = (V_with_T_k - V_without_T_k). The probe runs in time bounded by the number of variants × the fixture's per-iteration cost; for typical benchmarking fixtures, the probe is <10 minutes.

The discipline:

  1. Enumerate suspects: source-read the hot-path; name candidate tiers (substrate intrinsics, dispatch paths, call-frame setup, bytecode dispatch, JIT eligibility, GC overhead, allocation pressure). Typical N = 5-8.
  2. Author additive variants: V_0 = baseline minus all suspects; V_k = V_{k-1} + suspect_k; V_n = full fixture.
  3. Measure: run each variant on cruft + oracle (node/bun); per-variant Δ = (V_k - V_{k-1}) per runtime.
  4. Compute R: per-tier cruft/oracle ratio + absolute contribution ranks the actual dominators; R = { T_k : C_k is non-trivial fraction }.
  5. Spawn pilots in dependency order: upstream tiers first; close each before measuring the next; gate cumulative reclaim measurement after all R tiers are closed.

II.5 The cascade as a Doc 729 §A8.13 + Doc 739 specialization

Doc 729 §A8.13 articulates substrate-amortization-cascade at the per-iter cost axis. Doc 739 articulates cascade-revival at the categorization axis (single-tier scope). The multi-tier cascade-revival pattern of §II.2 specializes both:

  • At the per-iter axis: each tier-closure cascades per-iter cost reduction at downstream tiers; the multi-tier reading is that cumulative per-iter reduction sums (or composes multiplicatively) across closures.
  • At the categorization axis: each tier-closure cascade-revives sibling-pilots stalled at that tier-pair; the multi-tier reading is that pilot-revival cascades within tier-pairs but not across the cross-tier scope.
  • At the cross-tier scope (new in this document): pipeline-connection is a categorical transition (cumulative reclaim crosses the projection threshold or fails to). The multi-tier scope is the categorization-axis dual at the cross-tier level.

What this document names additionally is the cross-tier categorization (the pipeline connects vs does not connect) and the empirical-disambiguation discipline (component A/B probe per §II.4) as the apparatus that makes the cross-tier scope tractable. This is corpus-original beyond Docs 729, 730, 734, and 739.

III. The cruftless instance

The cruftless engagement's JSF (JSON.stringify fast-path) pilot was spawned on a CRB component-decomposition estimate that placed JSON.stringify at ~5-10× contribution to the json_parse_transform fixture's 20× cruft/node gap (so ~50-70% of total cost). The JSF pilot landed four substrate moves through six rounds:

  • JSF-EXT 3 / Move 1: output buffer threading at the JSON.stringify recursion (upstream substrate-introduction at the leaf-emitter constraint).
  • JSF-EXT 4 / Move 2: string-escape branchless ASCII fast-path (cascade-revival pilot per Doc 739 §II.3 — leaf emitter writes directly into the buffer that M1 introduced).
  • JSF-EXT 5 / Move 3: number-stringify integer fast-path (second cascade-revival pilot — same shape, number leaf).
  • JSF-EXT 6 / Move 4: format-macro elimination + property-iteration via reference (clone elision; composition round).

Each move was correctness-preserving (canonical fuzz acc=-932188103 byte-identical to node throughout; diff-prod 42/42 throughout). M1 produced flat per-shape micro-bench (correctly classified as substrate-introduction signature per Finding II.2-bis); M2/M3 each produced small positive deltas (~5-7% per move, matching the cascade-revival pattern empirically).

III.1 The pre-CRB measurement

After all four moves, the per-shape micro-bench position was: A small-object 10.58× → 9.71× (-8%); B deep-nested 14.11× → 14.33× (flat); C array-of-obj 12.48× → 12.55× (flat); D number-only 15.16× → 15.05× (flat); E string-only 10.09× → 10.31× (flat). Cumulative reclaim on the micro-bench: -3% to -8% per shape; not the projected -50% to -75%.

The CRB measurement: cruft 2455 ms vs JSF-EXT 0 baseline 2481 ms — Δ = -1%, within noise. The pilot's target (-40% reclaim) was not met by ~39 percentage points. The discrepancy between projection and measurement was load-bearing: either the pilot's substrate moves had failed (unlikely given the per-move correctness + small empirical wins), or the pre-implementation reclaim model was wrong.

III.2 The component A/B probe

The session ran a component A/B probe on json_parse_transform (5 additive variants × 50-iter warmup × 500-iter measurement × cruft + node). Probe runtime: <10 seconds aggregate. Per-component cost (cruft):

component cruft Δ (ms) % of total node Δ (ms) cruft/node
JSON.parse 246 9% 75 3.3×
Array.filter 124 5% 0 unbounded
Array.map 165 6% 3 55×
JSON.stringify 86 3% 7 12×
character-scanning loop 2040 77% -1 n/a (oracle JITs to ~0)
TOTAL 2661 100% 84 31.7×

The actual dominator was a for (i; i<out.length; i++) cs += out.charCodeAt(i) loop in the fixture's bookkeeping. The character-scanning loop was not part of the "JSON pipeline" that CRB-EXT 9's component decomposition had estimated. JSON.stringify, the JSF pilot's target, contributed 3% of total — about twenty times smaller than the original estimate.

The probe's runtime (<10 seconds) was approximately three orders of magnitude smaller than the JSF pilot's six-round substrate work. Had it been run before pilot spawn, the entire JSF pilot would have targeted character-scanning + interp dispatch at substantially higher leverage per LOC.

III.3 The multi-tier closure

The probe identified the dominator. Source-read of String.prototype.charCodeAt's implementation revealed an algorithmic bug: chars().nth(i) is O(i) because it iterates UTF-8 codepoints from string start; for a 5KB ASCII string scanned linearly, the per-outer-iter cost was O(n²). The substrate-tier fix (ASCII fast-path: bytes[i] instead of chars().nth(i); len() instead of chars().count() for length) was ~20 LOC. CharCode-EXT 1.

CharCode-EXT 1 landed; canonical fuzz + diff-prod GREEN. The A/B probe re-ran: character-scanning loop dropped from 2040 ms to 1739 ms (-15%). CRB dropped from 2455 ms to 2372 ms (-3%). The reclaim was much smaller than the O(n²)→O(n) algorithmic analysis projected (~40×).

The empirical readout: per-call cost dropped from 0.816 μs to 0.696 μs (-15% per-call), not -99% as the algorithmic projection assumed. The implication: most of the per-charCodeAt-call cost was interpreter dispatch (call_function frame setup + this-binding + descriptor walk + Value boxing), NOT the chars().nth() iteration. The O(n²) bug was real but the per-call dominator lived at a different tier (dispatch, not algorithm).

The dispatch-tier closure: a hot-intrinsic IC fast-path in the bytecode interpreter's Op::CallMethod dispatcher. For the exact shape s.charCodeAt(i) with s a primitive String + method ObjectId == cached intrinsic + arg shape compatible, bypass call_function entirely and emit the result inline. Verification via cached intrinsic ObjectId; bail to slow-path on override or arity mismatch. ~65 LOC. CharCode-EXT 2.

CharCode-EXT 2 landed; canonical fuzz + diff-prod GREEN. The A/B probe re-ran: character-scanning loop dropped from 1739 ms to 1480 ms (-15% more; -27% from JSF-EXT 0 baseline). CRB dropped from 2372 ms to 2188 ms (-8% more; -12% cumulative from JSF-EXT 0 baseline). The cruft/node ratio dropped from 20.34× to 17.93×.

III.4 The pipeline connects at multi-tier scope

The multi-tier cascade-revival pattern is empirically observable across the JSF + CharCode chain:

  • R for json_parse_transform's character-scanning loop: {substrate-tier intrinsic algorithm, interp-tier dispatch path}. |R| = 2 within the loop; the broader fixture has additional tiers (JSON.parse, Array.map) that R excluded.
  • Single-tier closure at substrate (CharCode-EXT 1): -15% on dominator-loop; -3% on CRB. Partial. The interp-dispatch tier still carried ~85% of per-call cost.
  • Single-tier closure at dispatch (CharCode-EXT 2, taken alone): would have produced partial reclaim too, ~15% on dominator-loop (the dispatch IC bypasses ~100 ns/call regardless of substrate algorithm).
  • Both tiers closed (CharCode-EXT 1 + 2 cumulative): -27% on dominator-loop; -12% on CRB; cruft/node 20.34× → 17.93×. The pipeline connects at the cumulative scope.

The dispatch-tier IC is structurally a Doc 739 cascade-revival pilot relative to the substrate-tier fix: the substrate fix is the upstream constraint-closure ("leaf intrinsics no longer carry O(n²) algorithmic cost"); the dispatch IC becomes a cascade-revival candidate ("now that the leaf is cheap, the per-call dispatch overhead becomes the new dominator and an IC can reach the cost floor"). Doc 739's pattern recurs at the tier-pair within R. What's new is that the cumulative reclaim requires BOTH closures to materialize; the multi-tier scope is the categorization-axis observable at the cross-tier level.

III.5 The JSF-pilot reread

Post-probe, the JSF pilot's six rounds re-categorize:

  • M1-M4 were substrate-tier closures at a tier that was NOT in R for json_parse_transform. The substrate work was correctness-improvement value (the new JSON.stringify is structurally cleaner: buffer-threaded, fast-path leaf emitters, no per-property clones) but did not move the CRB needle.
  • The JSF reclaim model assumed JSON.stringify ∈ R per CRB-EXT 9's estimate. The probe demonstrated JSON.stringify ∉ R for this fixture (3% of total cost).
  • The JSF chain's load-bearing engagement contribution is not the JSON.stringify substrate; it is (i) the empirical disambiguation that surfaced the mis-attribution; (ii) the multi-tier cascade-revival pattern recognition; (iii) the standing component A/B probe instrument that future CRB-driven pilots adapt per Finding VII.1 + standing rule 11.

IV. Methodological corollaries

IV.1 Empirical disambiguation as substrate-spawn precondition

The JSF chain's central methodological lesson: theoretical component-decomposition estimates are insufficient anchor for substrate-pilot spawn. CRB-EXT 9's estimate ("JSON.stringify ~5-10× contributor") was sourced from theoretical reasoning about per-op cost contributors; the actual empirical decomposition surfaced a non-pipeline contributor (the fixture's own bookkeeping loop) at ~20× larger magnitude than the suspected component.

The discipline (cruftless engagement's standing rule 11): before spawning any substrate pilot whose telos is "close a CRB-measured gap," run a component A/B probe on the target fixture per §II.4. The probe's cost is bounded (<10 minutes typical); the cost of not running it is the cost of a full substrate pilot targeting a non-dominator (JSF: six substrate rounds + ~285 LOC of correctness-preserving but reclaim-neutral substrate work).

The rule's value compounds: each future CRB-driven pilot spawns at the actual bottleneck. The rule generalizes beyond cruftless: any performance-engineering project where pilots target measured gaps should anchor those pilots on empirical decomposition, not theoretical attribution.

IV.2 Substrate-introduction (P2.d) as cascade-revival signature

A substrate-introduction round (the upstream constraint-closure in Doc 739's pattern) often produces (P2.d) at its own bench: the closure enables downstream cascade-revival pilots to deliver reclaim, but the closure round itself shifts allocation/dispatch patterns without eliminating them. A naive falsification would categorize the (P2.d) as round failure; the correct categorization is "substrate-introduction signature."

The discipline: at each substrate-introduction round, name the upstream constraint being closed AND the downstream consumer-pilots that become cascade-revival candidates per the closure. If both are nameable, accept (P2.d) at the introduction round and proceed to the consumer rounds. If neither is nameable, the (P2.d) is a genuine pilot-failure signal.

This is Doc 739's pattern read from the substrate-introduction side. Doc 739 names the cascade-revival side (downstream pilot moves from (P2.d) to (P2.a) when upstream closes). The substrate-introduction round's own categorization (per Finding II.2-bis, registered to the cruftless findings doc as Addendum IV) is the dual: the introduction round's (P2.d) is the SIGNATURE that the round is correctly placed as introduction rather than failed as pilot.

IV.3 The hot-intrinsic IC pattern as engagement-tier instrument

CharCode-EXT 2's interp-tier IC fast-path for String.prototype.charCodeAt validates a structural pattern: every hot intrinsic method call carries the same dispatcher overhead (frame setup + this-binding + descriptor walk + Value boxing). An IC fast-path that verifies the resolved method against a cached intrinsic ObjectId and bypasses call_function for the exact-shape case captures the dispatcher savings without correctness risk (user overrides bail to slow-path by ObjectId mismatch).

The pattern generalizes engagement-wide: charAt, codePointAt, indexOf, slice, push, pop, shift, splice, and other dispatch-bound intrinsic calls all admit the same IC shape. The per-intrinsic LOC is small (~30-65 per intrinsic for the receiver-shape + cached-id verification + inline fast-body); the engagement-tier deliverable is a hot-intrinsic IC table covering the most-frequently-called intrinsics empirically (per a follow-on component A/B probe that ranks intrinsic call frequency across realistic workloads).

The pattern is corpus-relevant beyond cruftless: any interpreter-tier engine that dispatches intrinsic method calls through a general call machinery admits the same IC shape, with the same correctness-preservation discipline (verify against cached intrinsic id; bail to slow-path on mismatch).

V. Composition with prior corpus

  • Doc 729 §IV resolver-instance pipeline + §A8.13 substrate-amortization-cascade: the multi-tier cascade-revival pattern is a specialization of the resolver-instance pipeline analysis to the per-call cost domain. §A8.13's per-iter axis is the cost-axis dual; this document adds the categorization-axis dual at the cross-tier scope.
  • Doc 730 vertical recurrence of the lowering compiler closure: each tier in the multi-tier hot-path is a lowering closure consuming the upstream tier's output. The recurrence is structural; per-tier cost composes along the recurrence.
  • Doc 734 §V (b) negative-finding-catalyzes-refinement: the JSF chain's (P2.d) CRB outcome catalyzed the component A/B probe + multi-tier recognition + standing rule 11. Doc 734's growth-pattern (b) is the engagement's instrument for converting pilot-(P2.d) into framework-tier instrumentation.
  • Doc 735 §X.h.b (P2) four-sub-case taxonomy: the JSF chain demonstrates that (P2.d) at the introduction round + (P2.a) at the consumer round is a legitimate pilot trajectory; the chain's cumulative categorization is at the chain scope, not the per-round scope.
  • Doc 581 Pin-Art apparatus: the component A/B probe is constraint-enumeration discipline applied to the bench measurement instrument (enumerate per-component contributions before substrate work, not as a side-effect of substrate work).
  • Doc 739 single-tier cascade-revival: this document's multi-tier generalization. Doc 739's pattern is the building block; this document specifies the cross-tier scope where multiple Doc 739-tier-pairs compose.
  • Doc 737 locale-as-coordinate: the JSF + CharCode chain spans multiple locales (rusty-js-json-fast for M1-M4 + CC-1 + CC-2; rusty-js-runtime for the intrinsic and dispatcher implementations). The multi-tier cascade-revival pattern naturally cuts across locale boundaries when the hot path traverses tiers maintained by different pilots.

VI. Forward implications

VI.1 Standing rule for CRB-driven pilot spawn

The cruftless engagement adopts standing rule 11 (registered in the findings doc Addendum IV): before spawning any pilot whose telos is "close a CRB-measured gap," run a component A/B probe to identify the actual dominator empirically. The rule's cost is bounded; its value compounds across pilot spawns.

For engagements without an analogous component-A/B-probe convention, the discipline transfers: the suspect-list source-read + N-additive-variant fixture + per-runtime measurement protocol is generic. The probe runs in time bounded by N × per-iteration cost; for typical microsecond-per-iteration fixtures with N=5-8 variants, the probe is <10 minutes.

VI.2 Multi-tier reading at pilot planning

When a pilot's reclaim projection assumes single-tier dominance (per the pre-probe component decomposition or per architectural intuition), the projection should be tagged with the assumption. Post-pilot measurement that falls short of projection becomes a multi-tier-cascade hypothesis: source-read the per-call cost stack; enumerate non-dominant tiers; consider follow-on per-tier closures.

The multi-tier reading at pilot planning prevents the JSF pattern (single-tier substrate work with cumulative measurement falling short and the pilot misclassified as failed). With the multi-tier reading, the pilot's projection becomes "single-tier reclaim of fraction X assuming tier T_k dominates; if cumulative falls short, the gap signals additional tiers in R."

VI.3 The cascade-revival pattern's generalization scope

Doc 739 specified cascade-revival at the single-tier scope of a resolver-instance pipeline. This document specifies the multi-tier scope. The next generalization candidate: cascade-revival across non-tier-structured composition (e.g., when the hot path is not a tier stack but a graph with multiple shared dependencies). The pattern likely generalizes; specification is reserved for future articulation.

The cascade-revival pattern's value across these generalizations is its inversion of the "spawn more sub-pilots when stalled" heuristic. The cascade-revival reading is: when stalled, ask first what upstream constraint propagates the stall, and what cross-tier scope must close to materialize the reclaim. The diagnostic shifts pilot work from accumulation (more sub-pilots) to closure (one upstream substrate move that cascades across the resolver-instance pipeline).

VII. Summary

When the hot path of a performance-engineering target traverses multiple tiers (each with its own dispatch shape and per-call cost), closing a single tier produces partial reclaim bounded by that tier's contribution to total cost. The cascade-revival pattern of Doc 739 recurs at each adjacent tier-pair within the hot path but is NOT sufficient at the cross-tier scope unless all relevant tiers are closed in dependency order. Full pipeline-connection materializes at the final-tier-closure round.

The relevant-tier set R is identified by a component A/B probe per §II.4: N additive variants × per-runtime measurement × <10 minute runtime. The probe is empirical; theoretical attribution is necessary but not sufficient apparatus. The discipline (standing rule for CRB-driven pilot spawn): run the probe before spawning any substrate pilot whose telos is "close a measured gap."

The cruftless instance: the JSF chain landed correctness-preserving substrate work at a tier outside R for its target fixture; -1% CRB after six rounds. The component A/B probe surfaced the actual dominator (character-scanning loop, 77% of cost). Two follow-on rounds closed substrate-tier algorithm + interp-tier dispatch; -12% CRB cumulative; cruft/node 20.34× → 17.93×. The pipeline connects at the cumulative scope.

The keeper's pre-implementation framing — "you have to go through this middle stretch where performance decreases first" — was correct: the middle stretch is the substrate-introduction round whose (P2.d) is the cascade-revival signature, not pilot failure. What the multi-tier reading adds: the middle stretch may extend across multiple rounds at different tiers, and the pipeline-connection moment is the cumulative-measurement round after all relevant tiers have been closed.