Epistemic PE / Active Inference routing:区分可学习误差与噪声,再驱动 β_t / curiosity
Evaluation modality
Spec-levelA spec-motivation / governance borrow. Evaluated by spec review + contract tests, not A/B or ablation.
- Primary owner
- —
- Phase-A verdict
- —
- Shadow profile
- —
- Source papers
- Curiosity-Critic 2026 + ODAR 2026 + WorldLLM 2025 + PCN≈BP
- Specs
- docs/specs/prediction-error-loop.mddocs/specs/temporal-abstraction.mddocs/specs/lifeform-vitals.md
Blind spot (现状盲点)
DM-1 处理 PE distribution,OA-9 处理 PE faithfulness,但还缺少一个关键区分:哪些 PE 是可学习的 epistemic error,哪些只是 aleatoric noise。如果不分清,系统会把不可学噪声当作 curiosity / credit / needs 的来源,导致 noisy-TV 式探索或关系焦虑误报。
Adoptable suggestions (可落地动作)
- 1.在 prediction-error spec 中把 Curiosity-Critic 的 "cumulative PE improvement" 作为 PE readout 候选:只有可降低的 PE 才进入 curiosity / credit 主路径。PROPOSED
Not a runnable A/B candidate — evaluated by the path above, not ablation.
- 2.在 temporal-abstraction spec 中加入 ODAR / active inference 路由:β_t 切换同时考虑 epistemic gain 与 pragmatic cost。PROPOSED
Not a runnable A/B candidate — evaluated by the path above, not ablation.
- 3.第一阶段只做 SHADOW 对照:epistemic-filtered PE vs raw PE 在 held-out dialogue / social cognition probes 上的校准差异。PROPOSED
Not a runnable A/B candidate — evaluated by the path above, not ablation.
Traceability
No plugins / runs linked yet. Scaffold a suggestion to start.
Expected benefit (预期收益)
- 避免不可学噪声驱动 controller / memory / vitals。 - 给 curiosity 与 intrinsic motivation 一个 R-PE 兼容定义。 - 让 regime / β_t 切换更接近 free-energy / active inference 的统一形式。
Cited paper (引用论文)
**Curiosity-Critic**(2026)、**ODAR**(2026)、**WorldLLM**(2025)、**Predictive Coding ≈ Backprop**。详见 [`research/arxiv-survey-2026-05.md`](../../research/arxiv-survey-2026-05.md) §2 与 [`research/probe/11_vz_implications.md`](../../research/probe/11_vz_implications.md) R-PE。 ---