Inference-time scaling of diffusion models through classical search

ICLR 2026 Conference Submission
Anonymous Authors
Keywords: diffusion models, inference-time scaling, compositional generation, search algorithms
Abstract:

Classical search algorithms have long underpinned modern artificial intelligence. In this work, we tackle the challenge of inference-time control in diffusion models—adapting generated outputs to meet diverse test-time objectives—using principles from classical search. We propose a general framework that orchestrates local and global search to efficiently navigate the generative space. It performs compute-efficient global exploration using breadth-first and depth-first tree search and employs a theoretically grounded, scalable local search via annealed Langevin MCMC. We evaluate our approach on a range of challenging domains, including planning, offline reinforcement learning, and image generation, and observe significant gains in both performance and efficiency over baseline methods. These results demonstrate that classical search offers a principled and practical foundation for inference-time scaling in diffusion models. By jointly scaling local and global search for the first time, our framework establishes a new Pareto frontier across challenging decision-making domains.

Disclaimer
This report is AI-generated using large language models and WisPaper (a scholarly search engine). It analyzes a paper's tasks and contributions against retrieved prior work. While the system identifies potential overlaps and novel directions, its coverage is not exhaustive and its judgments are approximate. The results are intended to assist human reviewers and should not be relied upon as a definitive verdict on novelty.
Note that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper proposes a general framework that orchestrates local and global search for inference-time control in diffusion models, combining breadth-first and depth-first tree search with annealed Langevin MCMC. It resides in the 'Tree Search for Reward-Guided Generation' leaf, which contains five papers including the original work. This leaf sits within a broader cluster of tree search and Monte Carlo methods, indicating a moderately active research direction. The taxonomy shows this is one of several approaches to search-based alignment, with sibling categories exploring discrete diffusion, Monte Carlo guidance, and evolutionary methods.

The taxonomy reveals neighboring research directions that contextualize this work. The parent category 'Tree Search and Monte Carlo Methods for Alignment' encompasses discrete language diffusion and stochastic search guidance, while sibling branches explore evolutionary algorithms and noise trajectory optimization. The 'Inference-Time Scaling and Adaptive Computation' branch addresses related questions about computational allocation during sampling. The scope notes clarify that this leaf focuses on continuous reward-guided generation, excluding gradient-based methods and training-time optimization, which positions the work at the intersection of classical search theory and modern generative modeling.

Among the twenty-one candidates examined, the contribution-level analysis shows mixed novelty signals. For the general framework orchestrating local and global search, none of the ten candidates examined clearly refutes it, suggesting this high-level integration may be relatively novel within the limited search scope. However, the single candidate examined for the adaptive DFS claim appears to refute it, and one of the ten candidates examined for the annealed Langevin MCMC contribution is a refutable match. These statistics indicate that while the overall framework integration may be new, the individual algorithmic components have substantial prior work among the examined papers.

Based on the limited search of twenty-one semantically similar papers, the work appears to offer a novel synthesis of classical search paradigms for diffusion inference, though specific algorithmic contributions show overlap with existing methods. The taxonomy structure suggests this sits in a moderately explored area with clear boundaries from discrete diffusion and evolutionary approaches. The analysis does not cover the full literature landscape, and a broader search might reveal additional related work in adjacent branches or domain-specific applications.

Taxonomy

Core-task taxonomy papers: 50
Claimed contributions: 3
Contribution candidate papers compared: 21
Refutable papers: 2

Research Landscape Overview

Core task: inference-time control in diffusion models using classical search algorithms. The field has organized itself around four main branches that reflect different emphases in how search and control are applied. Search-Based Inference-Time Alignment and Control focuses on tree search and Monte Carlo methods to steer generation toward desired rewards or constraints, often employing evolutionary strategies or dynamic search procedures to refine outputs. Inference-Time Scaling and Adaptive Computation explores how to allocate computational budgets more effectively during sampling, investigating adaptive step selection and scaling laws that trade off quality against inference cost. Domain-Specific Applications and Specialized Guidance addresses tailored solutions for particular modalities such as protein design, RNA generation, or audio super-resolution, where domain constraints shape the search process. Finally, Specialized Inference Techniques and Architectures examines novel sampling frameworks, noise trajectory optimization, and architectural modifications that enable more flexible or efficient control mechanisms.

Recent work has concentrated on reward-guided generation and the interplay between search depth and sample quality. Classical Search Scaling[0] sits within the tree search cluster for reward-guided generation, alongside methods like Tree Search Guidance[41], Dynamic Search Alignment[17], and Diffusion Tree Sampling[29], all of which explore how classical search algorithms can be adapted to navigate the high-dimensional latent spaces of diffusion models. While Tree Search Steering[21] and Tree Reward Search[12] emphasize Monte Carlo rollouts and value estimation, Classical Search Scaling[0] investigates how traditional search paradigms scale with increased inference compute, a theme that resonates with broader efforts in Inference-Time Scaling such as Scaling Inference Compute[9] and Beyond Denoising Steps[4].
A key open question across these branches is whether the gains from deeper search justify the computational overhead, and how to balance exploration breadth with exploitation of high-reward regions during generation.

Claimed Contributions

General framework orchestrating local and global search for diffusion models

The authors introduce a unified framework for inference-time scaling of diffusion models that combines global search (via breadth-first and depth-first tree search) with local search (via annealed Langevin MCMC). This framework enables efficient exploration of the generative space and refinement of samples beyond the base model's capabilities.

10 retrieved papers
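To make the claimed division of labor concrete, the sketch below runs a toy beam-style breadth-first search over scalar "samples" and then polishes the best leaf with annealed Langevin updates. Everything here (the scalar state, `TARGET`, `verifier`, `denoise_step`, the annealing schedule) is a hypothetical stand-in for illustration, not the paper's actual denoiser, verifier, or algorithm.

```python
import math
import random

random.seed(0)

# Toy stand-ins: "samples" are scalars; the verifier rewards proximity
# to a hypothetical target. None of this mirrors the paper's real setup.
TARGET = 1.0

def verifier(x):
    return -(x - TARGET) ** 2  # higher is better

def denoise_step(x, t):
    # One toy "denoising" step: contract x, with noise shrinking as t falls.
    return 0.9 * x + random.gauss(0.0, 0.1 * t)

def search(n_levels=5, branch=4, beam=2):
    # Global search: a BFS/beam pass over the denoising tree,
    # pruned at each level by the verifier.
    frontier = [random.gauss(0.0, 1.0) for _ in range(beam)]
    for t in range(n_levels, 0, -1):
        children = [denoise_step(x, t) for x in frontier for _ in range(branch)]
        frontier = sorted(children, key=verifier, reverse=True)[:beam]
    # Local search: annealed Langevin updates on the verifier's score,
    # refining the single best leaf found by the global stage.
    x = max(frontier, key=verifier)
    grad = lambda v: -2.0 * (v - TARGET)  # exact gradient of this toy verifier
    for step in (0.1, 0.05, 0.01):        # arbitrary annealing schedule
        for _ in range(30):
            x += step * grad(x) + math.sqrt(2.0 * step) * 0.05 * random.gauss(0.0, 1.0)
    return x
```

The intended division of labor: the global stage spreads compute across branches of the generative tree, while the local stage spends its budget refining a single promising candidate beyond what branching alone would reach.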
First adaptive DFS algorithm for diffusion inference scaling

The authors propose a depth-first search algorithm with adaptive backtracking for diffusion models. Unlike prior methods with fixed schedules, this DFS approach dynamically allocates compute based on verifier scores, enabling early backtracking and preventing excessive compute on easy instances.

1 retrieved paper
Can Refute
Theoretically grounded local search via annealed Langevin MCMC

The authors develop a local search method based on annealed Langevin MCMC that samples from compositional distributions. They provide a theoretical unification showing that training-free guidance with recurrence is equivalent to Langevin MCMC in the continuous limit, enabling principled optimization beyond the base model's modes.

10 retrieved papers
Can Refute

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

General framework orchestrating local and global search for diffusion models

The authors introduce a unified framework for inference-time scaling of diffusion models that combines global search (via breadth-first and depth-first tree search) with local search (via annealed Langevin MCMC). This framework enables efficient exploration of the generative space and refinement of samples beyond the base model's capabilities.

Contribution

First adaptive DFS algorithm for diffusion inference scaling

The authors propose a depth-first search algorithm with adaptive backtracking for diffusion models. Unlike prior methods with fixed schedules, this DFS approach dynamically allocates compute based on verifier scores, enabling early backtracking and preventing excessive compute on easy instances.

Contribution

Theoretically grounded local search via annealed Langevin MCMC

The authors develop a local search method based on annealed Langevin MCMC that samples from compositional distributions. They provide a theoretical unification showing that training-free guidance with recurrence is equivalent to Langevin MCMC in the continuous limit, enabling principled optimization beyond the base model's modes.
