Threshold Supervisor: Scattered Mechanisms Survey
Registered: 2026-05-07
Status: research-anchor survey supporting Q-041
Purpose
REE has accumulated several adaptive-threshold and adaptive-baseline mechanisms, each scoped to one substrate. The brain dynamically manages thresholds for commitment, release, surprise, urgency, and pass/fail across multiple timescales – and this is a stability lever, not just a tuning knob. This document inventories what is already adaptive in REE, the timescale each operates on, and what a unified meta-level “threshold supervisor” would have to add that is not already covered.
This is a registration anchor for Q-041, not a design doc. The experiments referenced from Q-041 are meant to resolve the substrate question.
Existing scattered mechanisms
| Locus | Adaptive quantity | Update timescale | Read by | Notes |
|---|---|---|---|---|
| ARC-016 (E3 dynamic precision) | precision = f(running variance of E3 prediction error) | per-step EMA, ~10-100 steps | E3 score weighting; commit threshold via relative ratio | Internal to E3. Does not adapt cross-substrate thresholds. EXQ-018b PASS validated the variance-driven relative-threshold path; EXQ-396a/b and EXQ-454 were reclassified as non_contributory under the V_s-monostrategy substrate gap. |
| MECH-258 / SD-032b (dACC pe_cap, FIFO action history) | precision-weighted PE normaliser, recency action-history window | window = 8 actions (dacc_suppression_memory) | dACC bundle -> E3 score_bias | Partial volatility tracking. NOT a Behrens-2007-style explicit volatility estimator; the pe_cap is a fixed scale parameter, not learned. |
| SD-032c (AIC-analog interoceptive baseline EMA) | drive/fatigue baseline against which z_harm_a salience is computed | EMA, alpha set per AICConfig | aic_salience, harm_s_gain | Floats with z_harm_a – urgency is computed against a moving baseline rather than a fixed threshold. Drive-dependent by construction. |
| SD-032d (PCC-analog stability scalar) | scalar in [0,1] from success EMA + drive_level + steps-since-last-offline | success_alpha=0.02 (~34-step half-life, ~50-step time constant); offline_recency_window=500 | SalienceCoordinator effective_threshold | Modulates the SD-032a switch threshold. Ties offline cadence to threshold sensitivity. |
| SD-032e (pACC drive bias EMA) | bounded drive_bias added to SD-012 drive_level | alpha=0.002 (~347-step half-life) | GoalState, SalienceCoordinator, AIC, PCC, dACC bundle | Slowest of the cingulate cluster. The meaning of “drive=0.7” drifts under chronic load. Closest existing analogue to neuromodulator-setpoint chronic shift. |
| MECH-040 (safety baseline / volatility) | dual control channels (provisional, no genuine evidence) | unspecified | control_plane | Registered but never validated on real substrate. |
| MECH-204 (REM zero-point capture) | precision_at_rem_entry snapshot for sleep recalibration | per-sleep-bout | (capture-only; no writeback) | Captures the recalibration target but does not act on it. The sleep-side writeback is unimplemented – this is the most obvious gap. |
The timescale spread of the EMA-based loci is wide, from ~10 steps (ARC-016 EMA) up to ~350 steps (SD-032e pACC drive bias half-life), but no mechanism acts at the cross-substrate / sleep-bout / multi-day timescale at which neuromodulator setpoints recalibrate biologically. MECH-204 samples once per sleep bout, but it only captures and never writes back.
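The step counts attached to each alpha in the table can be checked from the EMA coefficient. The sketch below is a minimal illustration of that arithmetic. It assumes each locus uses the standard update `ema <- (1 - alpha) * ema + alpha * x`, which the table does not state; the function names are hypothetical.

```python
import math


def ema_half_life(alpha: float) -> float:
    """Steps until an old sample's weight (1 - alpha)**n has fallen to 0.5."""
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must be in (0, 1)")
    return math.log(0.5) / math.log(1.0 - alpha)


def ema_time_constant(alpha: float) -> float:
    """Steps until the weight has fallen to 1/e; close to 1/alpha for small alpha."""
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must be in (0, 1)")
    return -1.0 / math.log(1.0 - alpha)


# Alpha values taken from the SD-032d and SD-032e rows of the table.
for name, alpha in [("SD-032d success_alpha", 0.02), ("SD-032e alpha", 0.002)]:
    print(
        f"{name}={alpha}: half-life {ema_half_life(alpha):.1f} steps, "
        f"time constant {ema_time_constant(alpha):.1f} steps"
    )
```

Under that assumption, alpha=0.02 gives a half-life of roughly 34 steps and a 1/e time constant of roughly 50 steps; alpha=0.002 gives roughly 346 and 500 steps. The two conventions differ by a factor of ln 2, so a quoted step count should state which one it uses.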
What is missing
A meta-level threshold supervisor would adapt pass/fail or commit/release thresholds across substrates based on system-wide instability metrics. Specifically:
- Cross-substrate volatility tracking. Each adaptive locus above tracks variance internal to its own signal. Nothing aggregates across loci. A genuine volatility estimator (Behrens et al. 2007, Nat Neurosci) would read multiple PE streams jointly and emit a system-level learning-rate signal that downstream thresholds consume.
- Sleep-mediated writeback. MECH-204 captures a precision zero-point at REM entry but the writeback path that uses it to recalibrate waking thresholds is absent. The brain treats sleep as global recalibration (Tononi & Cirelli 2014, Neuron, SHY); REE captures the snapshot and discards it.
- Setpoint drift under chronic load. SD-032e’s pACC drive bias is the closest existing analogue but is scoped to a single substrate (drive_level write-back from z_harm_a). Chronic-stress 5-HT/DA setpoint shifts in biology recalibrate many downstream thresholds simultaneously (commit, release, surprise) – REE has no joint mechanism.
- Coherence guarantee. Independent EMAs at different timescales can drift into incoherent regimes (e.g., high pACC drive_bias with low PCC stability scalar). A supervisor would either enforce a coherence constraint or expose the divergence as an instability signal in its own right.
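To make the first gap concrete, here is a rough sketch of what "read multiple PE streams jointly and emit a system-level learning-rate signal" could look like. It is not the Bayesian estimator of Behrens et al. 2007. It swaps in a much cruder fast/slow EMA change detector, and every name and constant in it (`JointVolatilitySketch`, the alphas, the learning-rate bounds, the stream names) is a hypothetical choice made for illustration, not part of REE.

```python
from dataclasses import dataclass
from typing import Mapping


@dataclass
class JointVolatilitySketch:
    """Heuristic stand-in for a cross-substrate volatility estimator."""

    fast_alpha: float = 0.1   # assumed: tracks the recent joint |PE|
    slow_alpha: float = 0.01  # assumed: tracks the long-run joint |PE|
    lr_min: float = 0.01      # assumed lower bound on the emitted learning rate
    lr_max: float = 0.5       # assumed upper bound on the emitted learning rate
    fast: float = 0.0
    slow: float = 0.0
    initialised: bool = False

    def update(self, prediction_errors: Mapping[str, float]) -> float:
        """Consume one PE per stream; return a system-level learning rate."""
        if not prediction_errors:
            raise ValueError("need at least one PE stream")
        # Aggregate across loci: mean absolute PE over all streams this step.
        joint = sum(abs(pe) for pe in prediction_errors.values()) / len(prediction_errors)
        if not self.initialised:
            self.fast = self.slow = joint
            self.initialised = True
        else:
            self.fast += self.fast_alpha * (joint - self.fast)
            self.slow += self.slow_alpha * (joint - self.slow)
        # Volatility proxy: how far the recent level sits from the long-run level.
        volatility = abs(self.fast - self.slow) / (self.slow + 1e-8)
        squashed = volatility / (1.0 + volatility)  # maps [0, inf) into [0, 1)
        return self.lr_min + (self.lr_max - self.lr_min) * squashed


estimator = JointVolatilitySketch()
streams = ("e3", "dacc", "aic")  # hypothetical stream names
calm = [estimator.update({s: 0.1 for s in streams}) for _ in range(200)]
shock = [estimator.update({s: 1.0 for s in streams}) for _ in range(5)]
print(f"calm lr={calm[-1]:.3f}, lr after shock={shock[-1]:.3f}")
```

While the joint PE level is steady, the emitted rate sits at `lr_min`; it rises when all streams jump together. The point of the sketch is only the data flow (many PE streams in, one shared signal out), which none of the loci in the table currently provides.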
Why this is a question, not a substrate
The natural temptation is to write this as an SD (“SD-NNN: threshold supervisor module that reads all adaptive loci and emits joint recalibration signals”). I have resisted that for two reasons:
- Premature commitment. Whether the supervisor is needed at all is an open empirical question. The scattered mechanisms may already produce coherent meta-stability under realistic load – the V_s-monostrategy substrate gap currently masks the dependent behaviour, so we do not yet know.
- V3-vs-V4 placement is genuinely uncertain. The simplest threshold supervisor, a slow EMA of EMAs, could land in V3. The full sleep-mediated writeback supervisor requires MECH-204 to act on its snapshot, which depends on the sleep substrate enrichment (MECH-285, MECH-286, INV-049 implementation). That is a V4 commitment.
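For scale, the "slow EMA of EMAs" variant is small enough to sketch in a few lines. This is an illustration under stated assumptions, not a proposed module: the class names, the locus names, and `supervisor_alpha=0.0005` are hypothetical, the locus readings are assumed to share a common [0,1] scale, and only the two locus alphas (0.02 and 0.002) come from the table.

```python
from typing import Dict, Mapping


class Ema:
    """Exponential moving average: value <- value + alpha * (x - value)."""

    def __init__(self, alpha: float, value: float = 0.0) -> None:
        self.alpha = alpha
        self.value = value

    def update(self, x: float) -> float:
        self.value += self.alpha * (x - self.value)
        return self.value


class EmaOfEmasSketch:
    """A slow EMA whose input is the mean of faster per-locus EMAs."""

    def __init__(self, locus_alphas: Mapping[str, float], supervisor_alpha: float) -> None:
        if any(supervisor_alpha >= a for a in locus_alphas.values()):
            raise ValueError("supervisor must be slower than every locus it reads")
        self.loci: Dict[str, Ema] = {name: Ema(a) for name, a in locus_alphas.items()}
        self.setpoint = Ema(supervisor_alpha)

    def update(self, readings: Mapping[str, float]) -> float:
        """Advance each locus EMA, then the shared slow setpoint."""
        for name, x in readings.items():
            self.loci[name].update(x)
        mean_locus = sum(e.value for e in self.loci.values()) / len(self.loci)
        return self.setpoint.update(mean_locus)


supervisor = EmaOfEmasSketch(
    {"pcc_success": 0.02, "pacc_drive_bias": 0.002},  # alphas from the table
    supervisor_alpha=0.0005,                          # assumed, slower than both
)
for _ in range(100):
    slow_setpoint = supervisor.update({"pcc_success": 1.0, "pacc_drive_bias": 1.0})
print(f"setpoint after 100 steps of sustained input: {slow_setpoint:.4f}")
```

After 100 steps of sustained input the shared setpoint has barely moved while the loci have, which is the intended separation of timescales. Nothing here touches sleep or MECH-204, which is why this variant is the one that could plausibly sit in V3.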
Q-041 registers the question. If experimental evidence licenses a substrate-level commitment, the cluster (SD + supporting MECH) follows.
Anchor literature (registration-time, pre-lit-pull)
- Behrens et al. 2007. Learning the value of information in an uncertain world. Nat Neurosci 10:1214-1221. – dACC tracks volatility and adjusts learning rate accordingly.
- Friston & Adams 2013 / Adams, Shipp & Friston 2013. Predictions not commands: active inference in the motor system. – precision-weighted PE as the canonical adaptive-control variable.
- Tononi & Cirelli 2014. Sleep and the price of plasticity. Neuron 81:12-34. – SHY: sleep as global synaptic recalibration. Bears directly on the MECH-204 writeback gap.
A targeted lit-pull on “adaptive learning rate dACC volatility” + “neuromodulator setpoint chronic stress recalibration” + “synaptic homeostasis SHY recalibration” is the natural Q-041 successor; staged here as a follow-on rather than gating the registration.
Observable signature for a working supervisor
Under sustained drive_level=0.9 for ~1000 steps, an agent equipped with a meta-level threshold supervisor should show a coherent shift across substrates: the effective commit threshold, effective beta-gate release threshold, AIC switch threshold, and dACC pe_cap normalisation should move together along a single low-dimensional trajectory consistent with a shared volatility/setpoint signal. Without a supervisor, the adaptive loci move independently, and the trajectory is high-dimensional and substrate-specific.
This is the signature the Q-041 diagnostic experiment proposal targets.
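One way to score "low-dimensional versus high-dimensional" on logged threshold trajectories is the share of variance carried by the leading principal component. The sketch below assumes that metric (the source does not name one) and runs it on synthetic data. The loadings, noise levels, and the function name `first_component_share` are hypothetical, and the four columns merely stand in for the four thresholds listed above.

```python
import math
import random


def first_component_share(trajectory):
    """Share of variance on the leading principal component of z-scored loci.

    trajectory: list of time steps, each a sequence with one value per locus.
    A value near 1.0 means the loci move along a single shared direction.
    """
    n_steps = len(trajectory)
    n_loci = len(trajectory[0])
    z_columns = []
    for column in zip(*trajectory):
        mean = sum(column) / n_steps
        variance = sum((v - mean) ** 2 for v in column) / n_steps
        if variance == 0.0:
            raise ValueError("a locus that never moves has no defined correlation")
        sd = math.sqrt(variance)
        z_columns.append([(v - mean) / sd for v in column])
    corr = [[sum(a * b for a, b in zip(zi, zj)) / n_steps for zj in z_columns]
            for zi in z_columns]
    # Power iteration for the largest eigenvalue. The uneven start vector makes
    # it unlikely to begin orthogonal to the leading eigenvector.
    vec = [1.0 / (k + 1) for k in range(n_loci)]
    for _ in range(500):
        nxt = [sum(c * v for c, v in zip(row, vec)) for row in corr]
        norm = math.sqrt(sum(v * v for v in nxt))
        if norm == 0.0:
            break
        vec = [v / norm for v in nxt]
    cv = [sum(c * v for c, v in zip(row, vec)) for row in corr]
    rayleigh = sum(a * b for a, b in zip(vec, cv)) / sum(v * v for v in vec)
    return rayleigh / n_loci  # the trace of a correlation matrix equals n_loci


rng = random.Random(0)
n_steps = 1000
loadings = [1.0, -0.5, 0.8, 0.3]  # hypothetical per-threshold sensitivities

# With a supervisor (assumed): one shared drift drives every threshold.
coupled = [[w * (t / n_steps) + rng.gauss(0.0, 0.02) for w in loadings]
           for t in range(n_steps)]
# Without one (assumed): each threshold fluctuates on its own.
independent = [[rng.gauss(0.0, 1.0) for _ in loadings] for _ in range(n_steps)]

coupled_share = first_component_share(coupled)
independent_share = first_component_share(independent)
print(f"coupled: {coupled_share:.2f}, independent: {independent_share:.2f}")
```

Columns are z-scored so thresholds in different units are comparable, and opposite-signed loadings still count as coherent movement. Real loci that drift slowly can correlate spuriously over a single run, so a diagnostic built on this metric would need a no-supervisor baseline rather than a fixed cutoff.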