SD-033: PFC Subdivision Architecture
Claim ID: SD-033 (parent) + SD-033a–e (subdivisions) Subject: pfc.subdivision_architecture Status: candidate, v3_pending (pre-implementation; SD-033c/d are graph-consolidation only) Registered: 2026-04-19 GAP-9 graph-consolidation complete: 2026-06-09 — SD-033c subsumption closed bidirectionally; SD-033d design doc written + linked (sd_033d_premotor_sma_analog.md); SD-033e V3/V4 boundary made explicit (substrate stays V4). See commitment_closure_plan.md node commitment_closure:GAP-9. Depends on: SD-032a, MECH-094, MECH-261, ARC-035, MECH-116, MECH-151, MECH-152, MECH-235 Paired with: SD-032 (cingulate integration substrate) — together form the V3 cognitive-control backbone
Problem
MECH-261 (mode-conditioned write gating) commits to writing into “a PFC-analog policy/rule-level substrate,” but ree-v3 has no registered substrate that does what primate / human lateral PFC actually does: hold stimulus-abstracted rule and goal representations across delays, persist them through distractor events, and emerge through training. The same audit turns up two other structural gaps. OFC is under-registered: existing claims lump OFC function under vmPFC (ARC-035 / MECH-151/152/235), losing the Rudebeck & Murray 2014 and Stalnaker et al 2015 dissociation between specific-outcome prediction and scalar-value integration. And nothing in ree-v3 reserves a substrate address for premotor/SMA sequence-execution (already implicit in E3) or frontopolar branching (V4 scope).
The risk is the same lumping failure mode that produced SD-003 → SD-010 → SD-011 on the nociceptive side: implement a monolithic “PFC-analog” module now and pay the cost of re-splitting later. The fix is to register the PFC as a five-subdivision cluster up front, using primate biology as the template, and scope implementation work to the subset that is actually load-bearing for V3.
The lit-pull (targeted_review_pfc_subdivision_architecture, 7 entries, mean confidence 0.80) supports this across three prongs: (A) lateral-PFC rule representation is a biologically distinct substrate (Miller & Cohen 2001; Badre & Nee 2018; Mansouri 2020); (B) OFC and vmPFC are dissociable by selective-lesion and cognitive-map evidence (Rudebeck & Murray 2014; Stalnaker et al 2015); (C) the lateral-PFC → pre-SMA → SMA → dorsal premotor gradient (Tanji & Hoshi 2008) locates sequence execution downstream of rule-holding, and frontopolar cortex (Koechlin & Summerfield 2007) carries the branching / nested-control function at the rostral end.
Subdivisions
| ID | Subdivision | Biological analogue | ree-v3 function | V3 implementation status |
|---|---|---|---|---|
| SD-033a | Lateral-PFC-analog (mid-lateral rule/goal) | Mid-lateral PFC (Badre & Nee 2018 level) | Holds stimulus-abstracted rules/goals; projects top-down bias into E3 | IMPLEMENTED 2026-04-20 (see sd_033a_lateral_pfc_analog.md) |
| SD-033b | OFC-analog (specific-outcome / task-structure) | OFC (Rudebeck & Murray 2014; Stalnaker 2015) | Specific-outcome prediction + cognitive map of task structure; E2’s substrate address | Build (second priority) |
| SD-033c | vmPFC-analog (subjective value integration) | vmPFC (existing ARC-035 hub) | Scalar value integration across goals / needs / options | Register-only (graph consolidation) |
| SD-033d | Premotor/SMA-analog (sequence execution) | Pre-SMA + SMA + dorsal premotor (Tanji & Hoshi 2008) | Sequence proposal + execution (already inside E3) | Register-only (graph consolidation) |
| SD-033e | Frontopolar-analog (branching / nested control) | Frontopolar cortex / BA 10 (Koechlin & Summerfield 2007) | Counterfactual branching, suspended-operation return | Defer (V4 scope) — hooks only in V3 |
SD-033a — Lateral-PFC-analog (mid-lateral rule/goal substrate)
The V3 implementation priority. Until SD-033a exists, MECH-261’s “write gating on a PFC-analog” is vacuous — there is no real target for the gate to protect.
Must satisfy four functional signatures:
- Stimulus-abstracted format. Representation encodes the rule that applies, not the specific stimulus the rule operates on (Mansouri 2020). Signature: transfer to novel stimuli under the same rule produces similar representational activity.
- Distractor-resistant persistence. Rule-selective activity persists through simulated distractor stimuli, internal_replay events, and MECH-092 micro-quiescence cycles without being overwritten. This is what MECH-261’s write-suppression-during-replay gate is protecting.
- Top-down bias into E3. Projects a bias signal into E3’s trajectory-selection machinery (multiplicative gain on trajectory scores or additive shift on rule-matched candidates — design choice during implementation). Miller & Cohen 2001 framing: rule-as-top-down-bias, not rule-as-direct-action-control.
- Training-dependent emergence. Representations develop over many task exposures, driven by slow consolidation writes from hippocampal-cortical dialogue during offline_consolidation mode (Peyrache et al 2009; Frankland & Bontempi 2005).
MECH-261 gating profile. Writes licensed during external_task and internal_planning; suppressed during internal_replay (protect held rule); consolidative during offline_consolidation.
SD-033b — OFC-analog (specific-outcome prediction / task-structure map)
Co-present representational content (MECH-263):
- Specific-outcome prediction (Rudebeck & Murray 2014 “oracle” framing). Given state s and action a, produce a prediction of the specific outcome — its identity, modality, expected magnitude — not just a scalar value. Devaluation sensitivity is the signature: if the value of outcome o changes while the state-action → outcome mapping does not, behaviour changes appropriately.
- Cognitive map of task structure (Stalnaker/Cooch/Schoenbaum 2015). Structured representation of what states the agent can be in, what transitions are possible, and what structural role each state plays — including discriminating between states that differ only in their latent structural role (same sensory input, same reward, different position in the task).
The two framings are not alternatives; they are complementary. The cognitive map carries structured outcome predictions, and specific-outcome queries are resolved against it. ree-v3’s E2 is already a specific-outcome predictor computationally; SD-033b gives it a biological substrate address and makes its representational content (state-space structure) explicit.
MECH-261 gating profile. Speculative writes during internal_planning (counterfactual outcome queries); suppressed during internal_replay (do not overwrite the learned task model from imagined content); consolidative during offline_consolidation (slow updating from accumulated real-outcome experience — the systems-consolidation target); active read during external_task.
SD-033c — vmPFC-analog (subjective value integration)
Registration-only. Consolidation complete (GAP-9, 2026-06-09). The computational function already exists in ree-v3 under ARC-035 / MECH-151 / MECH-152 / MECH-235. SD-033c registers it as a named subdivision of SD-033 so the PFC substrate graph is complete. The consolidation is now a closed bidirectional edge: SD-033c carries a subsumes: list of the four source claims and each source carries the reciprocal instantiates: SD-033c (claims.yaml), so the subsumption is queryable from either direction. consolidation_status: complete on the claim record. No new implementation, no status change.
Reads specific-outcome predictions from SD-033b, affective / harm signals from SD-032, and homeostatic drive from SD-012; produces the unified subjective-value signal downstream selection machinery consults. Scalar value rides on top of SD-033b’s structured representation — SD-033c does not itself represent specific outcomes or task structure (per Rudebeck & Murray 2014 and Stalnaker et al 2015 dissociations).
MECH-261 gating profile. Writes during external_task (value updates from realised outcomes); reduced-gain writes during internal_replay (replayed trajectories must not directly overwrite value estimates — this is what forces replay to propagate via SD-033a / SD-033b); consolidative during offline_consolidation.
SD-033d — Premotor/SMA-analog (sequence execution)
Registration-only. Design doc written (GAP-9, 2026-06-09): sd_033d_premotor_sma_analog.md (the design_doc field on the claim was null until this pass). Function already implemented inside E3’s trajectory-proposal and selection machinery + the MECH-090 commitment loop. Registered as a named subdivision so the claim graph mirrors the biology rather than implying a premotor-less architecture.
Corresponds to the Tanji & Hoshi 2008 pre-SMA (sequence planning, set-switching) + SMA (sequence execution) + dorsal premotor (stimulus-action binding) gradient, downstream of SD-033a. The design doc gives the full machinery mapping: PMd→propose_trajectories / action_object; pre-SMA→E3TrajectorySelector.select; SMA→MECH-090 committed-step stepping (_committed_step_idx + bistable BetaGate); sequence-end→ARC-028/MECH-105 completion release. Finer-grained premotor subdivision is not registered here — if future evidence shows E3’s trajectory machinery inadequately covers the pre-SMA / SMA / premotor distinctions, SD-033d can be split into SD-033d-i / d-ii / d-iii (the split trigger is recorded in the design doc).
MECH-261 gating profile. Sequence-level writes are hypothesis-tagged via MECH-094 (the specific case): licensed during external_task and internal_planning; suppressed during internal_replay unless the hypothesis tag is explicitly set; consolidated during offline_consolidation.
SD-033e — Frontopolar-analog (branching / nested control)
V4 scope. Do not implement in V3. Registered pre-emptively to (i) signal the architectural intent so V3 design choices stay forward-compatible, and (ii) prevent the SD-010/SD-011-style late-split failure mode from recurring at the V3/V4 boundary.
V3/V4 boundary (GAP-9, 2026-06-09 — made explicit). GAP-9 formally defers the substrate and keeps implementation_phase: v4 (it is not pulled to v3): SD-033e’s only reverse-dependents are MECH-264/MECH-265, both v4, and nothing v3 depends on it, so there is no genuine v3 dependency to reclassify. The split is clean:
- V3-scope (present + complete): the forward-compatibility hook only — the reserved
parallel_goal_deliberationoperating-mode name, MECH-261’s keyed-dict gate logic, and the no-opfrontopolar_analog.pystub reserving the module /sd_033ewrite-gate / MECH-264-MECH-265 head surface (bit-identical-OFF). This is all V3 owes; GAP-9 closes it. - V4-scope (deferred): the actual counterfactual-value substrate (MECH-264 head + MECH-265 monitor), its prerequisites (MECH-163 multi-step planning; a dual-active-goal env), the frontopolar lit-pull, and the SD-033e design doc — all V4-entry work, not GAP-9 deliverables.
The substrate REE’s V4 long-horizon ethical reasoning will eventually depend on — explicit counterfactual weighing of parallel policies is what Frankfurt-style reflective endorsement of first-order motivations requires, and Koechlin & Summerfield 2007 identify frontopolar cortex as the anatomical substrate for this function.
V3 forward-compatibility requirement (GAP-9: present + complete). The MECH-261 operating_mode vocabulary reserves the future mode name parallel_goal_deliberation (renamed 2026-04-19 from the earlier deliberative_branching placeholder after the Prong D lit-pull), and MECH-261’s gate logic is implemented as a dictionary keyed on mode names (not a fixed 4-tuple) so V4 can be added without disruptive schema changes. A no-op stub module (ree-v3/ree_core/pfc/frontopolar_analog.py) reserves the module surface, the sd_033e write-gate target, and the MECH-264/MECH-265 head interface; it is bit-identical-OFF under the default use_frontopolar_analog=False. This is the entirety of the V3-scope work for SD-033e — the substrate itself is V4 (see “V3/V4 boundary” below).
V4 prerequisite. A follow-up lit-pull on frontopolar function (Burgess; Boorman; Christoff; Mansouri et al 2015 frontopolar review) is required before implementation. The Koechlin & Summerfield 2007 entry is the entry point, not the complete survey.
Information flow
SD-032a (salience-network coordinator)
|
| operating_mode vector
v
+------------------------------------------+
| MECH-261 mode-conditioned gates |
+---+--------+---------+----------+--------+
| | | |
external internal internal offline
task planning replay consolidation
| | | |
v v v v
+----------------+ writes (soft-boundary, per-mode weight)
| | |
| SD-033a <-----+--------+<----- consolidation writes
| (lateral PFC, | from HC/cortical dialogue
| rule/goal) | (Peyrache 2009; Frankland
| | & Bontempi 2005)
| top-down bias |
| v |
| E3 ------> SD-033d (premotor/SMA; inside E3)
| ^ |
| | v
| | actions
| |
| | queries (state, action)
| v
| SD-033b <----- writes: speculative (planning), consolidative (offline)
| (OFC, specific-outcome suppressed during internal_replay
| + task-structure map)
| |
| | structured outcome predictions
| v
| SD-033c <----- value writes: external_task (realised),
| (vmPFC, reduced-gain internal_replay
| scalar value)
| |
| | unified value signal
| v
| E3 trajectory scoring (combined with SD-033a bias)
+----+
|
| (SD-033e frontopolar: deferred; hook = reserved mode name only)
.
.
SD-032 (cingulate) <---- cross-cluster: SD-032b adaptive-control
writes to striatal-analog, bypassing SD-033c
for the affective-pain -> action-value channel.
Dopamine-analog credit assignment shapes
the ACC->striatum weight.
The interface contract with SD-032. SD-032a emits operating_mode; MECH-261 translates that vector into per-substrate gate weights; SD-033 subdivisions consume the gates. SD-033a is MECH-261’s primary write target under external_task and internal_planning. SD-033b is the secondary target under internal_planning and offline_consolidation. SD-033c is gated with reduced-gain during internal_replay to force replay to propagate through SD-033a / SD-033b rather than directly overwriting value. SD-033d inherits MECH-094 hypothesis-tag gating as a special case.
The within-cluster gradient. SD-033a → SD-033d captures the Tanji & Hoshi lateral-PFC → pre-SMA → SMA → dorsal premotor executive-control gradient: rule at the rostral end, sequence execution at the caudal end. SD-033b and SD-033c sit off-axis as the specific-outcome and scalar-value substrates feeding E3’s trajectory evaluation.
Minimum-viable V3 implementation path
Ordered. Do not skip ahead.
- SD-033a (lateral-PFC-analog). Minimum-viable substrate. Build as a persistent rule/goal-holding module with (i) stimulus-abstracted representation, (ii) distractor-resistant persistence, (iii) top-down bias projection into E3, (iv) MECH-261-gated write interface. Design-level choices to settle during implementation: format of rule representation (population code vs distributed embedding vs explicit symbolic), persistence mechanism (recurrent activity vs synaptic hold), bias-signal projection (multiplicative gain vs additive shift into E3 trajectory scores). Test MECH-262’s three signatures (generalisation, distractor-resistance, training-dependent emergence) before declaring complete.
- SD-033b (OFC-analog). Second priority. Give E2 a proper substrate address. Current E2 state carries both sensory prediction and harm prediction in an unstructured latent; SD-033b should carry the structured outcome representation explicitly and serve as the substrate E2’s harm-prediction queries hit. Verify MECH-263 signatures (devaluation sensitivity; discrimination of same-value / different-outcome-identity states).
- SD-033c (vmPFC-analog). Register-only. No new implementation — existing ARC-035 / MECH-151 / MECH-152 / MECH-235 code is the substrate. Update cross-references so those claims point to SD-033c as their substrate address.
- SD-033d (premotor/SMA-analog). Register-only. No new implementation — existing E3 trajectory machinery is the substrate. Update cross-references. Finer-grained split is a follow-up if evidence demands.
- SD-033e (frontopolar-analog). Defer to V4. In V3, the only change is the MECH-261 schema forward-compatibility: operating_mode as a keyed dict (not a fixed tuple), and reserve the
parallel_goal_deliberationmode name. GAP-9 (2026-06-09): this V3-scope hook is present and complete; the substrate stays V4.
Steps 1–2 are the V3 implementation targets. Steps 3–4 are graph consolidation. Step 5 is a schema-only commitment with V4 follow-up lit-pull.
Falsification signatures
Substrate-level. Each signature distinguishes a specific subdivision failure from downstream wiring issues.
SD-033a (lateral-PFC-analog) is over-specified if: ree-v3’s existing E3 sequence-selection machinery reproduces — without an explicit rule/goal substrate — all three MECH-262 signatures: rule-abstracted generalisation (transfer to novel stimuli under the same rule), rule-selective persistence across distractor events, and training-dependent rule emergence. Collapses SD-033a back into E3 and makes MECH-261’s “write target” vacuous in a different way (no substrate because none is needed).
SD-033a failure at source if: the implementation shows stimulus-bound persistence (fires only for trained stimuli, fails to generalise) or distractor-fragile persistence (representation collapses when internal_replay events intervene). MECH-262 is violated — SD-033a does not meet its functional spec.
SD-033b (OFC-analog) is over-specified if: E2’s harm-prediction behaviour and rule-conditioned outcome-generalisation can be reproduced without distinguishing a state-structure substrate from a scalar-value substrate. The OFC/vmPFC functional dissociation does not carry at REE’s level of description; SD-033b collapses back into SD-033c.
MECH-263 failure at source if: SD-033b activity encodes scalar value without discriminating between same-value / different-outcome-identity states; or fails the devaluation-sensitivity test. SD-033b is functionally indistinguishable from SD-033c.
SD-033c / SD-033d registration failures are not directly falsifiable at the substrate level — these subdivisions are registration-only consolidations of existing function. The load-bearing falsifications for their claimed function live on the source claims (ARC-035 / MECH-151 / MECH-152 / MECH-235 for SD-033c; E3 trajectory experiments for SD-033d).
MECH-261 cross-mode gating over-specification (primary SD-032 × SD-033 interaction test): if a ree-v3 implementation using purely local per-substrate gating (no shared operating_mode vector from SD-032a) reproduces the post-quiescence action-selection bias toward recently replayed trajectories, the coordinator overlay is wrong — forward propagation would be coordinated locally rather than by the salience network. See EXP-0148.
Rule protection failure (SD-033a × MECH-261 interaction): SD-033a rule representations should remain stable across internal_replay events even when replay content is rule-incompatible, because MECH-261 suppresses writes into SD-033a during replay. Failure signature — replay events overwrite the held rule, rule-selective persistence collapses — falsifies either SD-033a’s persistence mechanism or MECH-261’s replay-suppression gate. Use a replay-content-controlled ablation to distinguish the two.
Cross-cluster interaction with SD-032
SD-032 produces operating_mode. SD-033 holds the substrates operating_mode gates writes into. See sd_032_cingulate_integration_substrate.md for the cingulate-side design and for the full MECH-261 write-gating profile table.
Two concrete interaction signatures expected to be visible in V3:
-
Forward propagation bias (MECH-261 primary behavioural signature). Content active in internal_replay produces a measurable bias on subsequent external_task action selection, mediated by writes into SD-033a during replay (Tambini & Davachi 2019). This is both a design requirement and MECH-261’s primary falsification target.
-
Cingulate → PFC bypass asymmetry. SD-032b (dACC-analog) writes to a striatal-analog action-value target bypassing SD-033c (vmPFC value integration) — this is the Baliki 2010 ACC-NAc pathway. Predicts: affective-pain-driven action-value shifts are not reflected in SD-033c activity in the same way that reward-driven shifts are. The cingulate cluster has its own write-path into action selection that does not route through PFC value integration.
Related claims
- SD-033, SD-033a–e — this cluster
- MECH-262 — rule-selective persistence in SD-033a (three signatures)
- MECH-263 — OFC-analog state-space / specific-outcome representation
- SD-032, SD-032a–e — paired cluster (coordinator + adaptive control)
- MECH-261 — mode-conditioned write gating (primary consumer of SD-033)
- MECH-094 — hypothesis tag (special case, specifically on SD-033d sequence commits)
- ARC-035 — vmPFC value-regulation hub (subsumed by SD-033c; bidirectional edge closed GAP-9:
SD-033c.subsumes↔ARC-035.instantiates: SD-033c) - MECH-151, MECH-152, MECH-235 — vmPFC value integration (folded into SD-033c; each carries
instantiates: SD-033c, listed inSD-033c.subsumes) - SD-033d design doc —
sd_033d_premotor_sma_analog.md(E3/MECH-090 → premotor/SMA mapping, GAP-9) - MECH-116 — dlPFC LSTM working memory (adjacent to SD-033a)
- ARC-038 — viability map (MECH-261 write target alongside SD-033)
- ARC-039 — entorhinal grid loop (offline consolidation target alongside SD-033)
- INV-049 — sleep necessity (coordinates offline_consolidation mode for SD-033 writes)
References
Primary lit-pull: evidence/literature/targeted_review_pfc_subdivision_architecture/synthesis.md (7 entries across three prongs, mean confidence 0.80, 2026-04-19).
Systems-consolidation grounding for MECH-261 offline writes into SD-033a/b: evidence/literature/targeted_review_systems_consolidation_waking_propagation/synthesis.md (5 entries, aggregate literature_confidence 0.883, 2026-04-19).
Key entries:
Prong A (lateral-PFC rule representation):
- Miller & Cohen 2001 (Annu Rev Neurosci): PFC as active maintenance of task-relevant rules, biasing posterior processing.
- Badre & Nee 2018 (Trends Cogn Sci): caudal-rostral abstraction gradient within lateral PFC.
- Mansouri, Freedman & Buckley 2020 (Nat Rev Neurosci): stimulus-abstracted rule-selective neurons in primate lateral PFC; persistence across delays; training-dependent emergence.
Prong B (OFC / vmPFC dissociation):
- Rudebeck & Murray 2014 (Neuron): selective fiber-sparing lesions dissociate OFC (specific-outcome prediction / credit assignment) from vmPFC (subjective value integration).
- Stalnaker, Cooch & Schoenbaum 2015 (Nat Neurosci): OFC represents cognitive maps of task structure, not scalar value.
Prong C (premotor/SMA + frontopolar V3/V4 scoping):
- Tanji & Hoshi 2008 (Physiol Rev): graded executive control across lateral PFC → pre-SMA → SMA → dorsal premotor.
- Koechlin & Summerfield 2007 (Trends Cogn Sci): frontopolar cortex as substrate for branching and nested control (entry point; full V4 survey deferred).
Systems-consolidation supporting MECH-261 writes into SD-033a/b:
- Peyrache et al 2009 (Nat Neurosci): SWR-coupled rule-learning replay in mPFC.
- Frankland & Bontempi 2005 (Nat Rev Neurosci): systems consolidation from hippocampus to neocortex.
- Carr, Jadhav & Frank 2011; Tambini & Davachi 2019; Rothschild/Eban/Frank 2017 (soft-boundary mode mixing; forward-propagation bias; cortical-hippocampal-cortical loop).