Task-Agnostic Amortized Multi-Objective Optimization

ICLR 2026 Conference Submission
Anonymous Authors
Keywords: Multi-Objective Optimization, Bayesian Optimization, Transformers, Neural Processes
Abstract:

Balancing competing objectives is omnipresent across disciplines, from drug design to autonomous systems. Multi-objective Bayesian optimization is a promising solution for such expensive, black-box problems: it fits probabilistic surrogates and selects new designs via an acquisition function that balances exploration and exploitation. In practice, however, it requires tailored choices of surrogate and acquisition function that rarely transfer to the next problem, is myopic even though multi-step planning is often required, and incurs refitting overhead, particularly in parallel or time-sensitive loops. We present TAMO, a fully amortized, universal policy for multi-objective black-box optimization. TAMO uses a transformer architecture that operates across varying input and objective dimensions, enabling pretraining on diverse corpora and transfer to new problems without retraining: at test time, the pretrained model proposes the next design with a single forward pass. We pretrain the policy with reinforcement learning to maximize cumulative hypervolume improvement over full trajectories, conditioning on the entire query history to approximate the Pareto frontier. Across synthetic benchmarks and real tasks, TAMO produces fast proposals, reducing proposal time by 50–1000× versus alternatives while matching or improving Pareto quality under tight evaluation budgets. These results show that transformers can perform multi-objective optimization entirely in-context, eliminating per-task surrogate fitting and acquisition engineering, and they open a path to foundation-style, plug-and-play optimizers for scientific discovery workflows.

Disclaimer
This report is AI-generated using large language models and WisPaper (a scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While the system identifies potential overlaps and novel directions, its coverage is not exhaustive and its judgments are approximate. These results are intended to assist human reviewers and should not be relied upon as a definitive verdict on novelty.
Note that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

TAMO proposes a fully amortized transformer policy for multi-objective black-box optimization that operates across varying input and objective dimensions without per-task retraining. The paper sits in the 'Transformer-Based In-Context Multi-Objective Optimization' leaf, which contains only two papers including TAMO itself. This represents a relatively sparse research direction within the broader taxonomy, suggesting the work targets an emerging area where transformer-based amortized policies are applied to multi-objective settings with full in-context reasoning over query histories.

The taxonomy reveals that TAMO's immediate neighbors include 'Preferential Amortized Optimization' (learning from pairwise preferences) and more distant branches covering task-specific Bayesian methods, evolutionary algorithms, and domain-specific generative models. The scope note for TAMO's leaf explicitly excludes preferential feedback and single-objective approaches, positioning the work at the intersection of amortized policy learning and direct multi-objective evaluation. Nearby branches like 'Constrained and Scalable Bayesian Optimization' require per-task surrogate fitting, highlighting TAMO's departure from traditional Bayesian paradigms toward universal, pretrained policies.

Among the thirty candidates examined, the analysis found limited overlap with prior work. For the core amortized policy contribution (Contribution A), ten candidates were examined with zero refutations, and the dimension-agnostic architecture (Contribution B) likewise showed no clear prior work among its ten candidates. However, for the non-myopic trajectory-level reinforcement learning objective (Contribution C), one refutable candidate was identified among the ten examined, suggesting some existing work on multi-step planning in related optimization contexts. These statistics indicate that, within the limited search scope, most contributions appear relatively novel, though the trajectory-level RL framing has at least one overlapping precedent.

Based on the top-thirty semantic matches and taxonomy structure, TAMO appears to occupy a sparsely populated niche combining transformer amortization with multi-objective black-box optimization. The single sibling paper and limited refutations suggest novelty within the examined scope, though the analysis does not cover exhaustive literature or domain-specific evolutionary or Bayesian methods outside the semantic search radius. The trajectory-level RL component shows the most prior work overlap among the three contributions analyzed.

Taxonomy

Core-task Taxonomy Papers: 6
Claimed Contributions: 3
Contribution Candidate Papers Compared: 30
Refutable Paper: 1

Research Landscape Overview

Core task: amortized multi-objective black-box optimization with transformers. The field addresses how to efficiently solve multiple optimization problems by learning reusable policies or models that generalize across tasks, rather than optimizing each problem from scratch. The taxonomy reveals four main branches: Amortized Policy Learning for Multi-Objective Optimization focuses on training neural policies (often transformer-based) that can propose solutions in-context or via learned mappings; Task-Specific Bayesian Optimization Frameworks emphasize surrogate modeling and acquisition strategies tailored to individual problem instances; Deep Learning-Enhanced Evolutionary Algorithms integrate neural components into population-based search; and Domain-Specific Generative Optimization targets specialized application areas such as molecular design or hardware synthesis.

Works like In-Context Multi-Objective[3] and Preferential Amortized[6] illustrate how transformers can be leveraged to handle multi-objective trade-offs directly, while approaches such as Efficient Scalable Bayesian[5] and PABBO[2] represent more classical Bayesian paradigms augmented with modern scalability techniques. A central tension across these branches is the trade-off between task-agnostic generalization and domain-specific performance: amortized methods promise rapid adaptation to new objectives but may sacrifice the fine-grained tuning that task-specific Bayesian or evolutionary methods provide.

Task-Agnostic Amortized[0] sits squarely within the transformer-based amortized policy learning branch, emphasizing in-context learning for multi-objective scenarios much like In-Context Multi-Objective[3]. Compared to In-Context Multi-Objective[3], which also explores transformer-driven multi-objective reasoning, Task-Agnostic Amortized[0] appears to push further on generalization across diverse black-box functions without requiring task-specific retraining. Meanwhile, works like Preferential Amortized[6] incorporate human preferences into the amortization process, and Transformer Multimodal[1] extends the paradigm to multimodal data, highlighting ongoing efforts to broaden the scope and applicability of learned optimization policies.

Claimed Contributions

TAMO: Fully amortized multi-objective optimization policy

The authors introduce TAMO, a transformer-based policy that performs multi-objective optimization through a single forward pass at test time, eliminating the need for per-task surrogate fitting and acquisition function optimization. The policy is pretrained using reinforcement learning to maximize cumulative hypervolume improvement over full trajectories.

10 retrieved papers
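To make the single-forward-pass interface concrete, here is a minimal NumPy sketch: a toy attention layer reads the query history (X, Y) and emits one new design, with no surrogate fitting or acquisition optimization in the loop. The class name, weight shapes, random initialization, and sigmoid squashing are illustrative assumptions, not TAMO's actual architecture; in TAMO the weights would come from RL pretraining.

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

class ToyAmortizedPolicy:
    """Toy stand-in for a pretrained amortized policy: one attention layer
    over the query history, then a linear head that emits the next design."""

    def __init__(self, d_in, d_obj, d_model=16, seed=0):
        r = np.random.default_rng(seed)
        self.W_tok = r.normal(0, 0.1, (d_in + d_obj, d_model))  # token embed
        self.W_q = r.normal(0, 0.1, (d_model, d_model))
        self.W_k = r.normal(0, 0.1, (d_model, d_model))
        self.W_v = r.normal(0, 0.1, (d_model, d_model))
        self.W_out = r.normal(0, 0.1, (d_model, d_in))          # proposal head

    def propose(self, X, Y):
        """Single forward pass: (n, d_in) designs and (n, d_obj) objective
        values from the history -> one new design in [0, 1]^d_in."""
        H = np.concatenate([X, Y], axis=1) @ self.W_tok         # (n, d_model)
        q = H.mean(axis=0, keepdims=True) @ self.W_q            # summary query
        att = softmax(q @ (H @ self.W_k).T / np.sqrt(H.shape[1]))
        ctx = att @ (H @ self.W_v)                              # (1, d_model)
        z = ctx @ self.W_out                                    # (1, d_in)
        return 1.0 / (1.0 + np.exp(-z[0]))                      # sigmoid -> box

rng = np.random.default_rng(0)
X = rng.uniform(size=(8, 3))   # 8 evaluated designs in [0, 1]^3
Y = rng.uniform(size=(8, 2))   # 2 objective values per design
policy = ToyAmortizedPolicy(d_in=3, d_obj=2)
x_next = policy.propose(X, Y)  # one forward pass, no per-task refitting
```

The point of the sketch is the interface: the entire history is consumed in-context, so proposing the next design costs one forward pass rather than a surrogate fit plus an inner acquisition optimization.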
Dimension-agnostic transformer architecture

The authors develop a novel transformer architecture with a dimension-aggregating embedder that can handle varying input and output dimensionalities. This enables the model to be pretrained on heterogeneous tasks and transfer to new problems with different dimensions without requiring retraining.

10 retrieved papers
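The report does not spell out the embedder's mechanics, but one common recipe for dimension-agnostic embedding, shown below purely as an assumption, is to lift each scalar coordinate with shared weights and then pool across coordinates, so the same parameters accept inputs of any dimensionality.

```python
import numpy as np

class DimensionAggregatingEmbedder:
    """Hypothetical sketch of a dimension-aggregating embedder: each scalar
    coordinate is lifted with shared weights, then coordinates are pooled
    by mean, so one set of parameters handles any input dimension d."""

    def __init__(self, d_model=16, seed=0):
        r = np.random.default_rng(seed)
        self.w = r.normal(0, 0.5, (1, d_model))  # shared per-coordinate lift
        self.b = np.zeros(d_model)

    def __call__(self, x):
        # x: (d,) -> per-coordinate features (d, d_model) -> pooled (d_model,)
        feats = np.tanh(x[:, None] @ self.w + self.b)
        return feats.mean(axis=0)  # size-invariant pooling

emb = DimensionAggregatingEmbedder()
e3 = emb(np.array([0.1, 0.5, 0.9]))  # 3-D input
e7 = emb(np.full(7, 0.5))            # 7-D input, same parameters
assert e3.shape == e7.shape == (16,)
```

Note that plain mean pooling discards coordinate identity; practical designs of this kind typically add per-dimension positional information before pooling.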
Non-myopic trajectory-level reinforcement learning objective

The authors formulate the optimization problem as a Markov decision process and train the policy using REINFORCE to optimize hypervolume-based rewards over entire trajectories rather than single-step gains. This encourages long-horizon planning instead of myopic one-step optimization typical in traditional acquisition functions.

10 retrieved papers
Can Refute
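The trajectory-level objective can be illustrated with a small sketch: per-step rewards are hypervolume improvements r_t = HV(Y_1..t) - HV(Y_1..t-1), so the return telescopes to the final hypervolume, and REINFORCE would weight the log-probability gradient of each proposal by this return. The two-objective, minimization-form hypervolume routine below is a standard construction, not code from the paper.

```python
import numpy as np

def hypervolume_2d(Y, ref):
    """Hypervolume dominated by points Y w.r.t. reference point ref
    (two objectives, both minimized)."""
    P = Y[np.all(Y <= ref, axis=1)]
    if len(P) == 0:
        return 0.0
    P = P[np.argsort(P[:, 0])]           # sort by first objective
    hv, prev_f2 = 0.0, ref[1]
    for f1, f2 in P:
        if f2 < prev_f2:                 # dominated points add no area
            hv += (ref[0] - f1) * (prev_f2 - f2)
            prev_f2 = f2
    return hv

def hvi_rewards(trajectory, ref):
    """Per-step hypervolume improvement: r_t = HV(Y_{1..t}) - HV(Y_{1..t-1})."""
    rewards, prev_hv = [], 0.0
    for t in range(1, len(trajectory) + 1):
        hv = hypervolume_2d(np.array(trajectory[:t]), ref)
        rewards.append(hv - prev_hv)
        prev_hv = hv
    return np.array(rewards)

ref = np.array([1.0, 1.0])
traj = [(0.8, 0.2), (0.2, 0.8), (0.4, 0.4), (0.9, 0.9)]  # evaluated objectives
r = hvi_rewards(traj, ref)
# The return telescopes to the final hypervolume of the whole trajectory:
assert np.isclose(r.sum(), hypervolume_2d(np.array(traj), ref))
```

Because the return is trajectory-level, a dominated query (the last point above, with reward 0) is penalized only through the opportunity cost of the whole trajectory, which is what pushes the policy toward long-horizon rather than greedy one-step behavior.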

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

TAMO: Fully amortized multi-objective optimization policy

Contribution

Dimension-agnostic transformer architecture

Contribution

Non-myopic trajectory-level reinforcement learning objective