Threshold Supervisor: Scattered Mechanisms Survey

Registered: 2026-05-07 Status: research-anchor survey supporting Q-041

Purpose

REE has accumulated several adaptive-threshold and adaptive-baseline mechanisms, each scoped to one substrate. The brain dynamically manages thresholds for commitment, release, surprise, urgency, and pass/fail across multiple timescales – and this is a stability lever, not just a tuning knob. This document inventories what is already adaptive in REE, the timescale each operates on, and what a unified meta-level “threshold supervisor” would have to add that is not already covered.

This is a registration anchor for Q-041. It is not a design doc. The substrate question is what the experiments referenced from Q-041 are meant to resolve.

Existing scattered mechanisms

Locus	Adaptive quantity	Update timescale	Read by	Notes
ARC-016 (E3 dynamic precision)	precision = f(running variance of E3 prediction error)	per-step EMA, ~10-100 steps	E3 score weighting; commit threshold via relative ratio	Internal to E3. Does not adapt cross-substrate thresholds. EXQ-018b PASS validated the variance-driven relative-threshold path; EXQ-396a/b and EXQ-454 reclassified non_contributory under the V_s-monostrategy substrate gap.
MECH-258 / SD-032b (dACC pe_cap, FIFO action history)	precision-weighted PE normaliser, recency action-history window	window = 8 actions (dacc_suppression_memory)	dACC bundle -> E3 score_bias	Partial volatility tracking. NOT a Behrens-2007-style explicit volatility estimator; the pe_cap is a fixed scale parameter, not learned.
SD-032c (AIC-analog interoceptive baseline EMA)	drive/fatigue baseline against which z_harm_a salience is computed	EMA, alpha set per AICConfig	aic_salience, harm_s_gain	Floats with z_harm_a – urgency is computed against a moving baseline rather than a fixed threshold. Drive-dependent by construction.
SD-032d (PCC-analog stability scalar)	scalar in [0,1] from success EMA + drive_level + steps-since-last-offline	success_alpha=0.02 (~50-step half-life); offline_recency_window=500	SalienceCoordinator effective_threshold	Modulates the SD-032a switch threshold. Ties offline cadence to threshold sensitivity.
SD-032e (pACC drive bias EMA)	bounded drive_bias added to SD-012 drive_level	alpha=0.002 (~347-step half-life)	GoalState, SalienceCoordinator, AIC, PCC, dACC bundle	Slowest of the cingulate cluster. The meaning of “drive=0.7” drifts under chronic load. Closest existing analogue to neuromodulator-setpoint chronic shift.
MECH-040 (safety baseline / volatility)	dual control channels (provisional, no genuine evidence)	unspecified	control_plane	Registered but never validated on real substrate.
MECH-204 (REM zero-point capture)	precision_at_rem_entry snapshot for sleep recalibration	per-sleep-bout	(capture-only; no writeback)	Captures the recalibration target but does not act on it. The sleep-side writeback is unimplemented – this is the most obvious gap.

Timescale spread is wide – ~10 steps (ARC-016 EMA) up to ~350 steps (SD-032e pACC drive bias) – but no mechanism operates at the cross-substrate / sleep-bout / multi-day timescale at which neuromodulator setpoints recalibrate biologically.

What is missing

A meta-level threshold supervisor would adapt pass/fail or commit/release thresholds across substrates based on system-wide instability metrics. Specifically:

Cross-substrate volatility tracking. Each adaptive locus above tracks variance internal to its own signal. Nothing aggregates across loci. A genuine volatility estimator (Behrens et al 2007, Nat Neurosci) would read multiple PE streams jointly and emit a system-level learning-rate signal that downstream thresholds consume.
Sleep-mediated writeback. MECH-204 captures a precision zero-point at REM entry but the writeback path that uses it to recalibrate waking thresholds is absent. The brain treats sleep as global recalibration (Tononi & Cirelli 2014, Neuron, SHY); REE captures the snapshot and discards it.
Setpoint drift under chronic load. SD-032e’s pACC drive bias is the closest existing analogue but is scoped to a single substrate (drive_level write-back from z_harm_a). Chronic-stress 5-HT/DA setpoint shifts in biology recalibrate many downstream thresholds simultaneously (commit, release, surprise) – REE has no joint mechanism.
Coherence guarantee. Independent EMAs at different timescales can drift into incoherent regimes (e.g., high pACC drive_bias with low PCC stability scalar). A supervisor would either enforce a coherence constraint or expose the divergence as an instability signal in its own right.

Why this is a question, not a substrate

The natural temptation is to write this as a SD (“SD-NNN: threshold supervisor module that reads all adaptive loci and emits joint recalibration signals”). I have resisted that for two reasons:

Premature commitment. Whether the supervisor is needed at all is an open empirical question. The scattered mechanisms may already produce coherent meta-stability under realistic load – the V_s-monostrategy substrate gap currently masks the dependent behaviour, so we do not yet know.
V3-vs-V4 placement is genuinely uncertain. The simplest threshold supervisor is a slow EMA of EMAs – could land in V3. The full sleep-mediated writeback supervisor requires MECH-204 to act on its snapshot, which depends on the sleep substrate enrichment (MECH-285, MECH-286, INV-049 implementation). That is a V4 commitment.

Q-041 registers the question. If experimental evidence licenses a substrate-level commitment, the cluster (SD + supporting MECH) follows.

Anchor literature (registration-time, pre-lit-pull)

Behrens et al. 2007. Learning the value of information in an uncertain world. Nat Neurosci 10:1214-1221. – dACC tracks volatility and adjusts learning rate accordingly.
Friston & Adams 2013 / Adams, Shipp & Friston 2013. Predictions not commands: active inference in the motor system. – precision-weighted PE as the canonical adaptive-control variable.
Tononi & Cirelli 2014. Sleep and the price of plasticity. Neuron 81:12-34. – SHY: sleep as global synaptic recalibration. Bears directly on the MECH-204 writeback gap.

A targeted lit-pull on “adaptive learning rate dACC volatility” + “neuromodulator setpoint chronic stress recalibration” + “synaptic homeostasis SHY recalibration” is the natural Q-041 successor; staged here as a follow-on rather than gating the registration.

Observable signature for a working supervisor

Under sustained drive_level=0.9 for ~1000 steps, an agent equipped with a meta-level threshold supervisor should show coherent shift across substrates – effective commit threshold, effective beta-gate release threshold, AIC switch threshold, dACC pe_cap normalisation should move together along a single low-dimensional trajectory consistent with a shared volatility/setpoint signal. Without a supervisor, the adaptive loci move independently – the trajectory is high-dimensional and substrate- specific.

This is the signature the Q-041 diagnostic experiment proposal targets.