Formalising Human-in-the-Loop: Computational Reductions, Failure Modes, and Legal-Moral Responsibility

ICLR 2026 Conference Submission
Anonymous Authors
Keywords: Human-in-the-loop, Automated decision making systems, Human oversight in sociotechnical systems, Oracle machines, AI safety, Trustworthy AI
Abstract:

We use the notion of oracle machines and reductions from computability theory to formalise different human-in-the-loop (HITL) setups for AI systems, distinguishing between trivial human monitoring (i.e., total functions), single endpoint human action (i.e., many-one reductions), and highly involved human-AI interaction (i.e., Turing reductions). We then show that the legal status and safety of these setups vary greatly. We present a taxonomy that categorises HITL failure modes, highlighting the practical limitations of HITL setups. We then identify omissions in UK and EU legal frameworks, which focus on HITL setups that may not always achieve the desired ethical, legal, and sociotechnical outcomes. We suggest areas where the law should recognise the effectiveness of different HITL setups and assign responsibility in these contexts, avoiding human 'scapegoating'. Our work shows an unavoidable trade-off between attribution of legal responsibility and technical explainability. Overall, we show how HITL setups involve many technical design decisions and can be prone to failures outside the humans' control. Our formalisation and taxonomy open up a new analytic perspective on the challenges in creating HITL setups, helping AI developers and lawmakers design HITL setups that better achieve their desired outcomes.
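The three setups map onto standard notions from computability theory. As a minimal sketch, assuming the paper uses the textbook definitions (the summary above suggests it does, but the exact formulation may differ):

```latex
% Sketch of the three setups in standard computability-theoretic notation.
% These are textbook definitions; the paper's exact formalisation may differ.

% Trivial monitoring: the system computes a total function; the human
% observes, but the output never depends on a human answer.
f \colon \Sigma^* \to \Sigma^* \quad \text{with } f(x) \text{ defined for every } x

% Single endpoint action: a many-one reduction. The machine computes f once;
% the human oracle's verdict on f(x) is the final decision.
A \le_m B \iff \exists\, \text{computable } f \ \forall x \ \bigl[\, x \in A \iff f(x) \in B \,\bigr]

% Highly involved interaction: a Turing reduction. An oracle machine M may
% query the human oracle B adaptively, many times, before deciding A.
A \le_T B \iff \exists\, \text{oracle machine } M \ \text{such that } M^B \ \text{decides } A
```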

Disclaimer
This report is AI-generated using large language models and WisPaper (a scholar search engine). It analyzes a paper's claimed tasks and contributions against retrieved prior work. While the system identifies potential overlaps and novel directions, its coverage is not exhaustive and its judgments are approximate. These results are intended to assist human reviewers and should not be relied upon as a definitive verdict on novelty.
Note that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs), and the system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper formalizes human-in-the-loop (HITL) setups using oracle machines and computational reductions from computability theory, distinguishing trivial monitoring, single-endpoint action, and highly interactive collaboration. It resides in the 'Computational Reduction Models for Human-AI Interaction' leaf, which contains only two papers total. This leaf sits within the broader 'Theoretical Foundations and Formal Frameworks' branch, indicating a relatively sparse research direction focused on rigorous formal characterizations rather than empirical or application-driven work.

The taxonomy reveals neighboring leaves addressing 'Interaction Protocols and Decision Frameworks' (tractable protocols and agreement mechanisms) and 'Safety and Reliability Frameworks' (mode confusion and formal verification). These adjacent areas share the theoretical branch but diverge in focus: the sibling leaves emphasize decision-theoretic models and fault detection, whereas the paper's leaf concentrates on reduction-based abstractions. The taxonomy's scope notes clarify that applied implementations belong elsewhere, reinforcing that this work occupies a foundational niche distinct from domain-specific applications scattered across the 'Application Domains' branch.

Among 29 candidates examined, the three contributions—formalizing HITL via reductions (9 candidates), taxonomizing failure modes (10 candidates), and analyzing legal frameworks (10 candidates)—show no clear refutations. The limited search scope means these statistics reflect top-K semantic matches and citation expansion, not exhaustive coverage. The formalization contribution appears particularly novel given the sparse leaf population, while the failure taxonomy and legal analysis may overlap with broader human-AI interaction literature not captured in this focused search. The absence of refutable pairs suggests either genuine novelty or gaps in the candidate pool.
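For readers unfamiliar with the retrieval step, the sketch below illustrates what "top-K semantic matches and citation expansion" can look like. It is a hypothetical reconstruction: the function, the field names (`vec`, `refs`, `id`), and the scoring are this report's assumptions, not WisPaper's actual pipeline.

```python
import numpy as np

def top_k_with_citation_expansion(claim_vec, corpus, k=10):
    """Hypothetical candidate retrieval: rank papers by cosine similarity to a
    claimed contribution, then expand the pool with references of the top hits.
    Field names are illustrative, not WisPaper's API."""
    def cosine(a, b):
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

    # Top-K semantic matches against precomputed paper embeddings.
    scored = sorted(corpus, key=lambda p: cosine(claim_vec, p["vec"]), reverse=True)
    top = scored[:k]

    # Citation expansion: pull in works cited by the top hits. This is why
    # coverage is approximate -- papers outside both pools are never seen.
    by_id = {p["id"]: p for p in corpus}
    expanded = {p["id"]: p for p in top}
    for p in top:
        for ref_id in p.get("refs", []):
            if ref_id in by_id:
                expanded.setdefault(ref_id, by_id[ref_id])
    return list(expanded.values())
```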

Based on the limited search of 29 candidates, the work appears to occupy a sparsely populated formal niche, with its reduction-based approach distinguishing it from neighboring protocol-oriented or safety-focused frameworks. The analysis cannot confirm whether larger-scale searches or domain-specific legal literature would reveal closer prior work, particularly for the legal responsibility and failure mode contributions.

Taxonomy

Core-task Taxonomy Papers: 25
Claimed Contributions: 3
Contribution Candidate Papers Compared: 29
Refutable Papers: 0

Research Landscape Overview

Core task: formalising human-in-the-loop setups using computational reductions.

The field spans a diverse set of concerns, from foundational theory to practical deployment. At the highest level, the taxonomy organizes work into six main branches:

- Theoretical Foundations and Formal Frameworks, which develop rigorous models and reduction-based abstractions for human-AI interaction;
- Interactive System Design and Optimization, which addresses interface design, user modeling, and adaptive workflows;
- Machine Learning with Human Feedback, covering reinforcement learning from human preferences and related training paradigms;
- Dimensionality Reduction and Reliability, which tackles visualization, interpretability, and robustness;
- Application Domains, spanning robotics, healthcare, energy systems, and beyond;
- Supporting Methodologies, which provide cross-cutting techniques such as protocol design and data fusion.

Representative works illustrate these themes: Humans Out Loop[11] and Formal Frameworks Mode Confusion[12] anchor the theoretical side, RRHF[18] and LLM Interactive Code Generation[3] exemplify machine learning with feedback, and Visual Analytics Dimensionality Reduction[2] highlights interpretability challenges.

Several active lines of work reveal key trade-offs and open questions. One tension lies between formal guarantees, pursued by reduction-based frameworks that treat human input as an oracle or computational resource, and the messy realities of adaptive interfaces and noisy feedback in deployed systems. Another contrast emerges between domain-agnostic methodologies, such as dimensionality reduction techniques like Linear t-SNE[22], and domain-specific applications like Brain Stimulation Optimization[10] or Vehicle Platooning Intervention[14], each of which must reconcile general principles with specialized constraints.

Within this landscape, Formalising Human-in-the-Loop[0] sits squarely in the Theoretical Foundations branch, specifically under Computational Reduction Models for Human-AI Interaction. Its emphasis on rigorous reduction-based abstractions aligns closely with Humans Out Loop[11], which also explores formal characterizations of human involvement, though the two may differ in how they model the boundary between automated and human-driven decision-making. By anchoring human-in-the-loop setups in computational complexity and reduction theory, this work provides a unifying lens that complements more empirical or application-focused studies elsewhere in the taxonomy.

Claimed Contributions

Contribution 1: Formalisation of HITL setups using computational reductions (9 candidate papers retrieved)

The authors introduce a novel computational framework that characterises HITL setups through oracle machines and reduction types from computability theory. This formalisation distinguishes three setup types: trivial monitoring (total functions), endpoint action (many-one reductions), and involved interaction (Turing reductions), unifying disparate HITL concepts under a consistent theoretical lens.
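A minimal illustration of how these three setup types can be rendered in code, with the human modelled as a query-answering oracle. This is the reviewer's sketch of the standard reduction notions, not the paper's implementation; the example computations inside each function are placeholders.

```python
from typing import Callable

Oracle = Callable[[str], bool]  # the human, modelled as a query-answering oracle

def trivial_monitoring(x: str) -> str:
    """Total function: the output is defined on every input and never
    depends on a human answer; the human merely observes."""
    return x.upper()  # placeholder computation

def endpoint_action(x: str, human: Oracle) -> bool:
    """Many-one reduction: the machine transforms the input exactly once,
    and the human's single answer on f(x) *is* the final decision."""
    f_x = x.strip().lower()  # the computable transformation f
    return human(f_x)        # one query; its answer is returned unmodified

def involved_interaction(x: str, human: Oracle) -> bool:
    """Turing reduction: the machine may query the human adaptively and
    post-process the answers before deciding."""
    queries = [x, x[::-1], x + "?"]        # in general, chosen adaptively
    answers = [human(q) for q in queries]  # multiple oracle calls
    return sum(answers) >= 2               # decision computed from the answers

if __name__ == "__main__":
    lazy_human: Oracle = lambda q: len(q) % 2 == 0  # stand-in for a real human
    print(involved_interaction("deploy model?", lazy_human))
```

Note the structural difference the paper's distinction turns on: in `endpoint_action` the human's answer passes through untouched, whereas in `involved_interaction` the final decision is computed by the machine from many answers, which is what blurs responsibility attribution.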
Contribution 2: Taxonomy of HITL failure modes (10 candidate papers retrieved)

The authors develop a taxonomy organised into five main failure categories (machine components, process and workflow, human–machine interface, human component, and exogenous circumstances) that systematically captures how HITL setups can fail in practice. This taxonomy connects failure modes to the different computational reduction types identified in their formalisation.
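The five categories lend themselves to a simple data-structure sketch. The category names come from the summary above; the per-category failure examples are this report's illustrative assumptions, not the paper's own examples.

```python
from enum import Enum

class FailureCategory(Enum):
    """The five top-level failure categories named in the paper's taxonomy."""
    MACHINE_COMPONENTS = "machine components"
    PROCESS_AND_WORKFLOW = "process and workflow"
    HUMAN_MACHINE_INTERFACE = "human-machine interface"
    HUMAN_COMPONENT = "human component"
    EXOGENOUS_CIRCUMSTANCES = "exogenous circumstances"

# Illustrative (assumed) failure per category -- the paper's examples may differ.
EXAMPLES = {
    FailureCategory.MACHINE_COMPONENTS: "model emits a miscalibrated risk score",
    FailureCategory.PROCESS_AND_WORKFLOW: "review step skipped under time pressure",
    FailureCategory.HUMAN_MACHINE_INTERFACE: "alert is shown but easy to miss",
    FailureCategory.HUMAN_COMPONENT: "automation bias: human rubber-stamps output",
    FailureCategory.EXOGENOUS_CIRCUMSTANCES: "network outage severs the oracle channel",
}

for category, example in EXAMPLES.items():
    print(f"{category.value}: {example}")
```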
Contribution 3: Analysis of legal frameworks and responsibility trade-offs (10 candidate papers retrieved)

The authors analyse UK and EU legal frameworks (GDPR and EU AI Act) to identify gaps in how they address HITL requirements, and reveal an inherent trade-off: HITL setups with greater explainability (involved interactions) create responsibility gaps, while setups with clearer responsibility attribution (endpoint actions) are less transparent. They provide suggestions for improving legal frameworks to prevent humans from becoming scapegoats.

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution 1: Formalisation of HITL setups using computational reductions

Contribution 2: Taxonomy of HITL failure modes

Contribution 3: Analysis of legal frameworks and responsibility trade-offs