Knowledgeable Language Models as Black-Box Optimizers for Personalized Medicine

ICLR 2026 Conference SubmissionAnonymous Authors

OpenReview Score: 6.0 Download Report PDF

Large language modelsPersonalized medicineBlack-box optimizationDistribution shift

The goal of personalized medicine is to discover a treatment regimen that optimizes a patient's clinical outcome based on their personal genetic and environmental factors. However, candidate treatments cannot be arbitrarily administered to the patient to assess their efficacy; we often instead have access to an in silico surrogate model that approximates the true fitness of a proposed treatment. Unfortunately, such surrogate models have been shown to fail to generalize to previously unseen patient-treatment combinations. We hypothesize that domain-specific prior knowledge—such as medical textbooks and biomedical knowledge graphs—can provide a meaningful alternative signal of the fitness of proposed treatments. To this end, we introduce LLM-based Entropy-guided Optimization with kNowledgeable priors (LEON), a mathematically principled approach to leverage large language models (LLMs) as black-box optimizers without any task-specific fine-tuning, taking advantage of their ability to contextualize unstructured domain knowledge to propose personalized treatment plans in natural language. In practice, we implement LEON via 'optimization by prompting,' which uses LLMs as stochastic engines for proposing treatment designs. Experiments on real-world optimization tasks show LEON outperforms both traditional and LLM-based methods in proposing individualized treatments for patients.

Abstract:

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Taxonomy

Core-task Taxonomy Papers

Claimed Contributions

Contribution Candidate Papers Compared

Refutable Paper

Research Landscape Overview

Core task: Personalized treatment optimization under distribution shift. The field addresses how to tailor interventions when patient populations or clinical environments evolve over time, creating mismatches between training and deployment conditions. The taxonomy reveals several complementary research directions: Causal Inference and Treatment Effect Estimation Under Covariate Shift focuses on identifying treatment effects when covariate distributions change, often leveraging propensity weighting and doubly robust methods. Reinforcement Learning for Sequential Treatment Optimization tackles dynamic decision-making over multiple time steps, balancing exploration and exploitation in non-stationary environments. Domain Adaptation and Representation Learning Under Distribution Shift seeks invariant features that generalize across hospitals or patient subgroups, as seen in works like Domain-invariant Clinical Representation[10] and Knowledge-Guided Domain Adaptation[7]. Adaptive Clinical Decision Support and Real-Time Monitoring emphasizes responsive systems that adjust to individual patient trajectories, while Optimization and Meta-Learning for Personalized Treatment explores how to efficiently learn treatment policies that transfer across contexts. Specialized Applications demonstrate these principles in concrete settings ranging from anticoagulation dosing to mood disorder monitoring. Recent work highlights tensions between model flexibility and robustness guarantees. Many studies pursue uncertainty quantification through conformal methods, such as Conformal Deep Q-Learning[6] and Conformal Dose-Response[17], to provide reliable prediction intervals under shift. Others emphasize transfer learning strategies, exemplified by Transfer Learning Treatment Rules[3], which adapt policies learned in source populations to new target settings. Within the Optimization and Meta-Learning branch, Knowledgeable Black-Box Optimizers[0] sits alongside Contextually Constrained Optimization[12], both addressing how to incorporate domain knowledge and context-specific constraints when optimizing treatment parameters. While Contextually Constrained Optimization[12] focuses on learning feasible regions from contextual features, Knowledgeable Black-Box Optimizers[0] emphasizes integrating expert priors into black-box search procedures. This cluster reflects a broader trend toward hybrid approaches that blend data-driven optimization with structured domain expertise, aiming to improve sample efficiency and safety when distribution shifts challenge purely empirical methods.

Claimed Contributions

Formulating personalized medicine as a black-box optimization problem

10 retrieved papers

The authors formulate personalized medicine as a conditional black-box optimization problem where the objective is to discover optimal treatment regimens conditioned on patient-specific genetic and environmental features. This formulation provides a mathematical foundation for applying optimization methods to individualized treatment design.

10 retrieved papers

Constrained optimization problem with certainty-based constraints

10 retrieved papers

The authors introduce two constraints to the optimization problem: one that bounds the Wasserstein distance between proposed and historical designs to ensure reliable surrogate predictions, and another that bounds the entropy of proposed designs to encourage consistency based on domain knowledge. These constraints address the challenge of imperfect surrogate models in out-of-distribution settings.

10 retrieved papers

LEON: LLM-based Entropy-guided Optimization with kNowledgeable priors

10 retrieved papers

The authors derive a tractable solution to the constrained optimization problem that leverages large language models as zero-shot optimizers without task-specific fine-tuning. LEON uses statistical analysis of design distributions and an adversarial source critic model, implemented via optimization-by-prompting to propose personalized treatment plans.

10 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

[12] Learning to Optimize Contextually Constrained Problems for Real-Time Decision Generation PDF

Aaron Babier, Timothy C.Y. Chan, A. Babier, Adam Diamant, Timothy C. Y. Chan, Rafid Mahmood (2025)

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

Formulating personalized medicine as a black-box optimization problem

[47] Non-greedy tree-based learning for estimating global optimal dynamic treatment decision rules with continuous treatment dosage PDF

Cannot Refute

[48] Next-Gen Medical Intelligence: Fuzzy Logic-Driven Expert Systems For Clinical Decision-Making PDF

Cannot Refute

[49] Metaheuristic algorithms and medical applications PDF

Cannot Refute

[50] Novel models for the prediction of drugâgene interactions PDF

Cannot Refute

[51] Comparing covariate prioritization via matching to machine learning methods for causal inference using five empirical applications PDF

Cannot Refute

[52] Towards automated patient-specific optimization of deep brain stimulation for movement disorders PDF

Cannot Refute

[53] Machine Learning-Based Surrogate Models and Transfer Learning for Derivative Free Optimization of HTPEM Fuel Cells PDF

Cannot Refute

[54] Optimization in cardiovascular modeling PDF

Cannot Refute

[55] From Coordination to Personalization: A Trust-Aware Simulation Framework for AI-Driven Personalized Decision Support in Emergency Departments PDF

Cannot Refute

[56] Velocity-based cardiac contractility personalization from images using derivative-free optimization PDF

Cannot Refute

Contribution

Constrained optimization problem with certainty-based constraints

[37] Portfolio optimization with transfer entropy constraints PDF

Cannot Refute

[38] Multi-objective drilling trajectory optimization using decomposition method with minimum fuzzy entropy-based comprehensive evaluation PDF

Cannot Refute

[39] Predictive Entropy Search for Bayesian Optimization with Unknown Constraints PDF

Cannot Refute

[40] Bayesian optimization with active learning of design constraints using an entropy-based approach PDF

Cannot Refute

[41] Physics-Guided Multi-Representation Learning with Quadruple Consistency Constraints for Robust Cloud Detection in Multi-Platform Remote Sensing PDF

Cannot Refute

[42] â¦ is the Universe Mathematically Self-Consistent D; Quantum Resource Complementarity Principle: A Cosmic Self-Consistency Explanation Based on Optimal â¦ PDF

Cannot Refute

[43] Entropy-based optimization on individual and global predictions for semi-supervised learning PDF

Cannot Refute

[44] Gradient boundary infiltration in large language models: A projection-based constraint framework for distributional trace locality PDF

Cannot Refute

[45] Predicting Protein Folding Pathways with Quadratic Constraints on Rates of Entropy Change: A Nonlinear Optimization-Based Control Approach PDF

Cannot Refute

[46] Predictive Entropy Search for Multi-objective Bayesian Optimization PDF

Cannot Refute

Contribution

LEON: LLM-based Entropy-guided Optimization with kNowledgeable priors

[57] Medagents: Large language models as collaborators for zero-shot medical reasoning PDF

Cannot Refute

[58] Application of large language models in medicine PDF

Cannot Refute

[59] Automated radiotherapy treatment planning guided by GPT-4Vision PDF

Cannot Refute

[60] Zero-Shot Large Language Model Agents for Fully Automated Radiotherapy Treatment Planning PDF

Cannot Refute

[61] â¦ a retrieval-augmented generation-based large language model to guide oncologists in searching for FDA-approved therapies for patient treatment planning. PDF

Cannot Refute

[62] Current opinions on large cellular models PDF

Cannot Refute

[63] Decoding substance use disorder severity from clinical notes using a large language model PDF

Cannot Refute

[64] Leveraging large language models to extract information on substance use disorder severity from clinical notes: a zero-shot learning approach PDF

Cannot Refute

[65] Implementing large language model-based artificial intelligence (AI) technology in proposing effective treatment plans in patients with cancer. PDF

Cannot Refute

[66] VIVE: An LLM-based approach to identifying and extracting context-specific personal values from text PDF

Cannot Refute

Knowledgeable Language Models as Black-Box Optimizers for Personalized Medicine

Overview

Taxonomy

Research Landscape Overview

Claimed Contributions

Core Task Comparisons

[12] Learning to Optimize Contextually Constrained Problems for Real-Time Decision Generation PDF

Contribution Analysis

Formulating personalized medicine as a black-box optimization problem

[47] Non-greedy tree-based learning for estimating global optimal dynamic treatment decision rules with continuous treatment dosage PDF

[48] Next-Gen Medical Intelligence: Fuzzy Logic-Driven Expert Systems For Clinical Decision-Making PDF

[49] Metaheuristic algorithms and medical applications PDF

[50] Novel models for the prediction of drugâgene interactions PDF

[51] Comparing covariate prioritization via matching to machine learning methods for causal inference using five empirical applications PDF

[52] Towards automated patient-specific optimization of deep brain stimulation for movement disorders PDF

[53] Machine Learning-Based Surrogate Models and Transfer Learning for Derivative Free Optimization of HTPEM Fuel Cells PDF

[54] Optimization in cardiovascular modeling PDF

[55] From Coordination to Personalization: A Trust-Aware Simulation Framework for AI-Driven Personalized Decision Support in Emergency Departments PDF

[56] Velocity-based cardiac contractility personalization from images using derivative-free optimization PDF

Constrained optimization problem with certainty-based constraints

[37] Portfolio optimization with transfer entropy constraints PDF

[38] Multi-objective drilling trajectory optimization using decomposition method with minimum fuzzy entropy-based comprehensive evaluation PDF

[39] Predictive Entropy Search for Bayesian Optimization with Unknown Constraints PDF

[40] Bayesian optimization with active learning of design constraints using an entropy-based approach PDF

[41] Physics-Guided Multi-Representation Learning with Quadruple Consistency Constraints for Robust Cloud Detection in Multi-Platform Remote Sensing PDF

[42] â¦ is the Universe Mathematically Self-Consistent D; Quantum Resource Complementarity Principle: A Cosmic Self-Consistency Explanation Based on Optimal â¦ PDF

[43] Entropy-based optimization on individual and global predictions for semi-supervised learning PDF

[44] Gradient boundary infiltration in large language models: A projection-based constraint framework for distributional trace locality PDF

[45] Predicting Protein Folding Pathways with Quadratic Constraints on Rates of Entropy Change: A Nonlinear Optimization-Based Control Approach PDF

[46] Predictive Entropy Search for Multi-objective Bayesian Optimization PDF

LEON: LLM-based Entropy-guided Optimization with kNowledgeable priors

[57] Medagents: Large language models as collaborators for zero-shot medical reasoning PDF

[58] Application of large language models in medicine PDF

[59] Automated radiotherapy treatment planning guided by GPT-4Vision PDF

[60] Zero-Shot Large Language Model Agents for Fully Automated Radiotherapy Treatment Planning PDF

[61] â¦ a retrieval-augmented generation-based large language model to guide oncologists in searching for FDA-approved therapies for patient treatment planning. PDF

[62] Current opinions on large cellular models PDF

[63] Decoding substance use disorder severity from clinical notes using a large language model PDF

[64] Leveraging large language models to extract information on substance use disorder severity from clinical notes: a zero-shot learning approach PDF

[65] Implementing large language model-based artificial intelligence (AI) technology in proposing effective treatment plans in patients with cancer. PDF

[66] VIVE: An LLM-based approach to identifying and extracting context-specific personal values from text PDF

Table of Contents

[50] Novel models for the prediction of drugâgene interactions PDF

[42] â¦ is the Universe Mathematically Self-Consistent D; Quantum Resource Complementarity Principle: A Cosmic Self-Consistency Explanation Based on Optimal â¦ PDF

[61] â¦ a retrieval-augmented generation-based large language model to guide oncologists in searching for FDA-approved therapies for patient treatment planning. PDF