PRISM: Festina Lente Proactivity—Risk-Sensitive, Uncertainty-Aware Deliberation for Proactive Agents

ICLR 2026 Conference SubmissionAnonymous Authors

OpenReview Score: 6.7 Download Report PDF

Agent

Proactive agents must decide not only what to say but also whether and when to intervene. Many current systems rely on brittle heuristics or indiscriminate long reasoning, which offers little control over the benefit-burden tradeoff. We formulate the problem as cost-sensitive selective intervention and present PRISM, a novel framework that couples a decision-theoretic gate with a dual-process reasoning architecture. At inference time, the agent intervenes only when a calibrated probability of user acceptance exceeds a threshold derived from asymmetric costs of missed help and false alarms. Inspired by festina lente (Latin: "make haste slowly"), we gate by an acceptance-calibrated, cost-derived threshold and invoke a resource-intensive Slow mode with counterfactual checks only near the decision boundary, concentrating computation on ambiguous and high-stakes cases. Training uses gate-aligned, schema-locked distillation: a teacher running the full PRISM pipeline provides dense, executable supervision on unlabeled interaction traces, while the student learns a response policy that is explicitly decoupled from the intervention gate to enable tunable and auditable control. On ProactiveBench, PRISM reduces false alarms by 22.78% and improves F1 by 20.14% over strong baselines. These results show that principled decision-theoretic gating, paired with selective slow reasoning and aligned distillation, yields proactive agents that are precise, computationally efficient, and controllable. To facilitate reproducibility, we release our code, models, and resources at https://prism-festinalente.github.io/; all experiments use the open-source ProactiveBench benchmark.

Abstract:

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper introduces PRISM, a framework coupling cost-derived gating with dual-process reasoning to decide when proactive agents should intervene. It resides in the 'Proactive Agent Intervention with Uncertainty-Aware Gating' leaf, which contains only three papers total. This is a notably sparse research direction within the broader taxonomy, suggesting the specific combination of calibrated acceptance probabilities, asymmetric cost modeling, and selective slow reasoning has received limited prior attention. The leaf sits under 'Decision-Theoretic Intervention Frameworks,' a branch emphasizing formal utility models over heuristic triggers.

The taxonomy reveals neighboring leaves addressing related but distinct challenges. 'Prescriptive Process Monitoring with Causal Inference' applies causal reasoning to business process interventions but excludes non-causal predictive systems. 'Cost-Aware Multi-Agent Active Search' optimizes sensing actions in multi-agent settings, explicitly excluding single-agent scenarios. The paper's focus on single-agent, conversational intervention with calibrated gating distinguishes it from these adjacent directions. The broader 'Reactive and Threshold-Based Intervention Systems' branch uses simpler heuristics without decision-theoretic optimization, underscoring PRISM's methodological departure toward principled uncertainty handling.

Among seven candidates examined across three contributions, none yielded clear refutations. The core PRISM framework examined one candidate with no overlap found. The gate-aligned distillation method examined zero candidates, reflecting either limited prior work or search coverage gaps. The Decision-Consistent Curation filtering examined six candidates, all deemed non-refutable or unclear. These statistics suggest that within the limited search scope—top-K semantic matches plus citation expansion—the specific technical choices (cost-derived thresholds, festina lente gating, schema-locked distillation) appear relatively unexplored, though the small candidate pool precludes strong claims about absolute novelty.

Given the sparse taxonomy leaf and limited refutations among seven candidates, the work appears to occupy a less-crowded niche within decision-theoretic intervention. However, the analysis covers only a narrow semantic neighborhood and does not exhaustively survey adjacent fields like human-AI collaboration or conversational agents. The absence of refutations may reflect genuine novelty in combining these elements or simply the constraints of a top-K search strategy. A broader literature review would be needed to confirm whether the integration of calibrated gating, dual-process reasoning, and distillation-based training is indeed unprecedented.

Taxonomy

Core-task Taxonomy Papers

Claimed Contributions

Contribution Candidate Papers Compared

Refutable Paper

Research Landscape Overview

Core task: cost-sensitive selective intervention for proactive agents. This field addresses how autonomous systems can decide when and where to intervene in dynamic environments while balancing intervention costs against potential benefits. The taxonomy reveals four main branches that capture distinct perspectives on this challenge. Decision-Theoretic Intervention Frameworks emphasize principled uncertainty quantification and gating mechanisms that weigh expected outcomes before committing resources, often drawing on meta-verification or dual-control paradigms (e.g., CoSight Meta Verification[2], Shielding Dual Control[9]). Resource Allocation in Networked Systems focuses on distributed settings where bandwidth, compute, or network slicing must be allocated efficiently across nodes (e.g., ECP Network Slicing[3], OneM2M QoS Analysis[4]). Reactive and Threshold-Based Intervention Systems employ simpler heuristics or risk thresholds to trigger responses in domains like intrusion detection or healthcare monitoring. Domain-Specific Proactive Optimization tailors intervention strategies to particular applications—ranging from cloud cost management (Proactive Cloud Cost[33]) to medical treatment timing—where domain constraints shape when proactive action is justified. A central tension across these branches is the trade-off between model sophistication and operational simplicity: decision-theoretic approaches offer rigorous uncertainty handling but may incur higher computational overhead, while threshold-based methods provide fast responses at the cost of less nuanced reasoning. Another recurring theme is the interplay between proactive prediction and reactive correction, visible in works that blend forecasting with real-time adaptation. PRISM Proactive Agents[0] sits squarely within the Decision-Theoretic Intervention Frameworks branch, specifically under Proactive Agent Intervention with Uncertainty-Aware Gating. It shares conceptual ground with CoSight Meta Verification[2] in its emphasis on gating decisions under uncertainty, yet distinguishes itself by focusing on selective intervention timing rather than meta-level verification alone. Compared to Shielding Dual Control[9], which prioritizes safety constraints in control loops, PRISM Proactive Agents[0] foregrounds cost-benefit analysis for when an agent should act versus defer, positioning it as a bridge between rigorous decision theory and practical resource-constrained deployment.

Claimed Contributions

PRISM framework with cost-sensitive gating and selective slow reasoning

1 retrieved paper

PRISM is a novel framework that combines decision-theoretic gating with dual-process reasoning architecture. It uses calibrated probabilities of user need and acceptance, applies cost-sensitive thresholds to decide when to intervene, and selectively invokes resource-intensive slow reasoning only for ambiguous cases near decision boundaries.

1 retrieved paper

Gate-aligned, schema-locked distillation training method

0 retrieved papers

A training approach where a teacher model generates structured supervision following the PRISM pipeline on unlabeled data, and a student model learns through supervised fine-tuning with an objective that aligns training and deployment by using the same costs, gates, and margins at both stages.

0 retrieved papers

Decision-Consistent Curation (RDC) filtering method

6 retrieved papers

A data curation method that filters training examples by ranking them according to a score that combines acceptance outcomes with calibration quality of need and acceptance probability estimates, enabling efficient distillation from teacher to student models.

6 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

[2] Co-Sight: Enhancing LLM-Based Agents via Conflict-Aware Meta-Verification and Trustworthy Reasoning with Structured Facts PDF

Zhang Hongwei, Lu Ji, Hongwei Zhang, Jiang ShiQing, Ji Lu, Zhu Chen-xiang, Shiqing Jiang, Xie Li, Chenxiang Zhu, Zhong Chen, Li Xie, Chen Haoran, Chen Zhong, Zhu Yurui, Haoran Chen, Du Yongsheng, Yurui Zhu, Gao Yan-qin, Yongsheng Du, Huang Ling-jun, Yanqin Gao, Wang Baoli, Lingjun Huang, Tan Fang, Baoli Wang, Zou Peng, Fang Tan, Peng Zou (2025) • arXiv.org

[9] Active uncertainty reduction for safe and efficient interaction planning: A shielding-aware dual control approach PDF

Haimin Hu, David Isele, S. Bae, J. Fisac, SangJae Bae, Jaime F. Fisac (2023) • Int. J. Robotics Res.

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

PRISM framework with cost-sensitive gating and selective slow reasoning

[40] A Comprehensive Survey of LLM-Driven Collective Intelligence: Past, Present, and Future PDF

Cannot Refute

Contribution

Gate-aligned, schema-locked distillation training method

Contribution

Decision-Consistent Curation (RDC) filtering method

[41] LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification PDF

Cannot Refute

[42] LumiNet: Perception-Driven Knowledge Distillation via Statistical Logit Calibration PDF

Cannot Refute

[43] SelecTKD: Selective Token-Weighted Knowledge Distillation for LLMs PDF

Cannot Refute

[44] Teacher Network Calibration Improves Cross-Quality Knowledge Distillation PDF

Cannot Refute

[45] Hard Gate Knowledge Distillation -- Leverage Calibration for Robust and Reliable Language Model PDF

Cannot Refute

[46] PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection PDF

Cannot Refute

PRISM: Festina Lente Proactivity—Risk-Sensitive, Uncertainty-Aware Deliberation for Proactive Agents

Overview

Overall Novelty Assessment

Taxonomy

Research Landscape Overview

Claimed Contributions

Core Task Comparisons

[2] Co-Sight: Enhancing LLM-Based Agents via Conflict-Aware Meta-Verification and Trustworthy Reasoning with Structured Facts PDF

[9] Active uncertainty reduction for safe and efficient interaction planning: A shielding-aware dual control approach PDF

Contribution Analysis

PRISM framework with cost-sensitive gating and selective slow reasoning

[40] A Comprehensive Survey of LLM-Driven Collective Intelligence: Past, Present, and Future PDF

Gate-aligned, schema-locked distillation training method

Decision-Consistent Curation (RDC) filtering method

[41] LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification PDF

[42] LumiNet: Perception-Driven Knowledge Distillation via Statistical Logit Calibration PDF

[43] SelecTKD: Selective Token-Weighted Knowledge Distillation for LLMs PDF

[44] Teacher Network Calibration Improves Cross-Quality Knowledge Distillation PDF

[45] Hard Gate Knowledge Distillation -- Leverage Calibration for Robust and Reliable Language Model PDF

[46] PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection PDF

Table of Contents