Inference-Time Scaling of Discrete Diffusion Models via Importance Weighting and Optimal Proposal Design

ICLR 2026 Conference Submission, Anonymous Authors

OpenReview Score: 6.0

Keywords: discrete diffusion, test-time scaling, reward alignment

Abstract:

Discrete diffusion models have become highly effective across various domains. However, real-world applications often require the generative process to adhere to certain constraints. To this end, we propose a Sequential Monte Carlo (SMC) framework that enables scalable inference-time control of discrete diffusion models through principled importance weighting and optimal proposal construction. Specifically, our approach derives tractable importance weights for a range of intermediate targets and characterises the optimal proposal, for which we develop two practical approximations: a first-order gradient-based approximation and an amortised proposal trained to minimise the log-variance of the importance weights. Empirical results across synthetic tasks, language modelling, biology design, and text-to-image generation demonstrate that our framework enhances controllability and sample quality, highlighting the effectiveness of SMC as a versatile recipe for scaling discrete diffusion models at inference time.

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper proposes a Sequential Monte Carlo framework for inference-time control of discrete diffusion models, deriving tractable importance weights and characterizing optimal proposals through gradient-based and amortized approximations. Within the taxonomy, it resides in the 'Sequential Monte Carlo and Importance Weighting' leaf under 'Inference-Time Guidance and Control Mechanisms,' alongside two sibling papers. This leaf represents a focused research direction within the broader field of inference-time control, which itself comprises four distinct guidance subcategories. The relatively small number of siblings suggests this is a specialized but not overcrowded area.

The taxonomy reveals that inference-time guidance methods span multiple paradigms: gradient-free approaches, gradient-based posterior prediction, tree search strategies, and SMC-based techniques. The paper's leaf sits within a branch that emphasizes principled probabilistic inference, contrasting with neighboring leaves that employ search-based or derivative-free guidance. The taxonomy's scope notes clarify that SMC methods focus on particle-based frameworks and importance weighting, distinguishing them from gradient-based guidance that directly steers generation via reward gradients. This positioning highlights the paper's methodological commitment to probabilistic reweighting rather than direct optimization.

Among thirty candidates examined, the first contribution (SMC framework with tractable importance weights) shows two refutable candidates out of ten examined, indicating some prior work in this specific area. The second contribution (approximately optimal proposals) has one refutable candidate among ten, suggesting moderate overlap with existing methods. The third contribution (versatile multi-domain demonstration) found no refutable candidates across ten examined papers, appearing more novel in its breadth of application. These statistics reflect a limited search scope—top-K semantic matches plus citation expansion—rather than exhaustive coverage, so the presence of refutable candidates signals overlap within a constrained candidate pool.

Given the limited search scale, the analysis suggests the paper occupies a methodologically distinct position within SMC-based guidance, though some foundational elements overlap with prior importance weighting and proposal design work. The multi-domain versatility appears less explored in the examined candidates, potentially offering incremental novelty. However, the search scope (thirty candidates) leaves open the possibility of additional relevant work beyond the examined set, particularly in adjacent probabilistic inference or particle filtering literature not captured by semantic search.

Taxonomy

[Figure: taxonomy diagram. Legend: core-task taxonomy papers, claimed contributions, contribution candidate papers compared, refutable paper.]

Research Landscape Overview

Core task: inference-time control of discrete diffusion models. The field organizes around four main branches that reflect distinct methodological emphases. Inference-Time Guidance and Control Mechanisms explore how to steer generation without retraining, encompassing techniques such as sequential Monte Carlo methods, importance weighting, and corrector-based approaches that refine samples during the reverse process. Sampling and Inference Acceleration focuses on reducing computational cost through faster solvers, remasking strategies like Remasking Inference Scaling[1], and adaptive scheduling. Training-Based Approaches and Model Architecture Design address foundational model improvements, including architectural innovations such as ControlNet[22] and training objectives that better align generation with downstream constraints. Domain-Specific Applications demonstrate how these methods adapt to specialized settings like protein design, layout generation with LayoutDM[6], and multimodal tasks, highlighting the interplay between general-purpose control mechanisms and domain requirements.

A particularly active line of work centers on probabilistic inference methods that treat guidance as a posterior correction problem. Techniques like Particle Gibbs Sampling[11] and Feynman-Kac Correctors[29] leverage sequential Monte Carlo frameworks to incorporate constraints through reweighting or resampling, while Soft Value Decoding[3] and Steering Posterior Prediction[2] explore alternative ways to bias the generative process toward desired properties. Importance Weighting Inference[0] sits within this cluster, emphasizing importance weighting to adjust the sampling distribution at inference time. Compared to Particle Gibbs Sampling[11], which iteratively refines particle sets, and Feynman-Kac Correctors[29], which apply corrector steps grounded in Feynman-Kac theory, Importance Weighting Inference[0] offers a complementary perspective on how to balance computational efficiency with the fidelity of constraint satisfaction, contributing to ongoing discussions about trade-offs between sample quality, diversity, and inference cost in discrete diffusion models.

Claimed Contributions

SMC framework for discrete diffusion models with tractable importance weights

Can Refute

10 retrieved papers

The authors introduce a Sequential Monte Carlo framework specifically designed for discrete diffusion models that enables inference-time control through principled importance weighting. The framework derives tractable importance weights for intermediate target distributions, including product distributions and reward-tilting distributions, providing a general approach for test-time scaling.
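
To make the reweight-and-resample idea concrete, the following is a minimal sketch of generic SMC targeting a reward-tilted distribution on a small finite state space. It is not the authors' algorithm: the transition matrix `P`, the tilt schedule `lambdas`, the function name `smc_reward_tilt`, and the choice of the base kernel as proposal are all assumptions made for illustration. With the base kernel as proposal, the incremental importance weight reduces to the ratio of consecutive tilts, exp(lambda_k r(x_k) - lambda_{k-1} r(x_{k-1})).

```python
import numpy as np

def smc_reward_tilt(P, init, reward, lambdas, n_particles, rng):
    """Toy SMC on a finite state space (all names here are illustrative).

    P        : (S, S) row-stochastic matrix, a stand-in for a pretrained kernel
    init     : (S,) initial distribution
    reward   : (S,) reward of each state
    lambdas  : tilt schedule starting at 0; one transition per later entry
    Returns particles and normalised weights that approximate the distribution
    proportional to p_K(x) * exp(lambdas[-1] * reward[x]).
    """
    S = len(init)
    x = rng.choice(S, size=n_particles, p=init)
    logw = np.zeros(n_particles)
    for lam_prev, lam in zip(lambdas[:-1], lambdas[1:]):
        # Propose from the base kernel by inverse-CDF sampling, row by row.
        cdf = np.cumsum(P[x], axis=1)
        u = rng.random(n_particles)
        x_new = np.minimum((u[:, None] > cdf).sum(axis=1), S - 1)
        # Incremental importance weight: ratio of consecutive reward tilts.
        logw += lam * reward[x_new] - lam_prev * reward[x]
        x = x_new
        # Multinomial resampling when the effective sample size drops below N/2.
        w = np.exp(logw - logw.max())
        w /= w.sum()
        if 1.0 / np.sum(w ** 2) < 0.5 * n_particles:
            x = x[rng.choice(n_particles, size=n_particles, p=w)]
            logw = np.zeros(n_particles)
    w = np.exp(logw - logw.max())
    return x, w / w.sum()

# Hypothetical three-state example with a five-step tilt schedule.
rng = np.random.default_rng(0)
P = np.array([[0.6, 0.3, 0.1],
              [0.2, 0.5, 0.3],
              [0.1, 0.3, 0.6]])
init = np.full(3, 1.0 / 3.0)
reward = np.array([0.0, 1.0, 2.0])
lambdas = np.linspace(0.0, 1.0, 6)

x, w = smc_reward_tilt(P, init, reward, lambdas, 20000, rng)
estimate = np.bincount(x, weights=w, minlength=3)

# Exact tilted marginal for comparison (tractable only because the toy is tiny).
p_K = init @ np.linalg.matrix_power(P, 5)
exact = p_K * np.exp(reward)
exact /= exact.sum()
```

Here `estimate` is the weighted particle histogram and `exact` is the closed-form tilted marginal; in a real discrete diffusion model the exact target is unavailable, which is why the weights matter.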

Two approximately optimal proposal distributions

Can Refute

10 retrieved papers

The authors develop two practical approximations to the optimal SMC proposal: a gradient-based first-order approximation and an amortised neural proposal trained by minimising the log-variance of importance weights. These proposals aim to reduce variance in the SMC procedure and improve sampling efficiency.
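
One common reading of a log-variance objective is the variance of the log importance weights, log w(x) = log gamma(x) - log q_theta(x), computed over samples that need not come from q_theta itself. The sketch below applies that reading to a categorical proposal with logits `theta` on a four-state toy target. The target `gamma`, the uniform reference samples, the learning rate, and both function names are assumptions for illustration, and the report does not confirm that the paper's loss takes exactly this form.

```python
import numpy as np

def log_variance_loss(log_gamma, theta, x):
    """Variance of log importance weights over the samples x."""
    log_q = theta - np.logaddexp.reduce(theta)  # log-softmax of the logits
    log_w = log_gamma[x] - log_q[x]
    return np.var(log_w)

def log_variance_grad(log_gamma, theta, x):
    """Analytic gradient of the loss with respect to the logits.

    The log-normaliser of q is constant across samples, so it cancels in the
    variance and only the per-sample logit theta[x] contributes.
    """
    d = log_gamma[x] - theta[x]
    centred = d - d.mean()
    return np.bincount(x, weights=-2.0 * centred / len(x), minlength=len(theta))

# Hypothetical unnormalised target; the +5.0 mimics an unknown constant.
rng = np.random.default_rng(0)
gamma = np.array([1.0, 2.0, 3.0, 4.0])
log_gamma = np.log(gamma) + 5.0

# Off-policy samples from a uniform reference distribution.
x = rng.integers(0, 4, size=2000)

theta = np.zeros(4)
for _ in range(200):
    theta -= 1.0 * log_variance_grad(log_gamma, theta, x)

q = np.exp(theta - np.logaddexp.reduce(theta))
```

The loss reaches zero exactly when the log weights are constant on the sampled states, that is, when `q` is proportional to `gamma`. The unknown normalising constant does not affect the loss, because adding a constant to every log weight leaves their variance unchanged.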

Versatile framework demonstrated across multiple domains

10 retrieved papers

The authors validate their SMC framework across diverse applications spanning language modelling, biological sequence design, and text-to-image generation. The experiments demonstrate that the proposed methods consistently enhance controllability and sample quality across different domains.

Core Task Comparisons

Comparisons with papers in the same taxonomy category

[11] Inference-Time Scaling of Diffusion Language Models with Particle Gibbs Sampling

Dang, Meihua, Han, Jiaqi, Xu, Minkai, Xu, Kai, Srivastava, Akash, Ermon, Stefano (2025)

[29] Discrete Feynman-Kac correctors

M Hasan, M Skreta, A Aspuru-Guzik (2025)

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution: SMC framework for discrete diffusion models with tractable importance weights

Can Refute:

[11] Inference-Time Scaling of Diffusion Language Models with Particle Gibbs Sampling

[46] Debiasing guidance for discrete diffusion with Sequential Monte Carlo

Cannot Refute:

[25] RNE: a plug-and-play framework for diffusion density estimation and inference-time control

[27] Breaking determinism: Fuzzy modeling of sequential recommendation using discrete state space diffusion model

[45] Reverse Diffusion Sequential Monte Carlo Samplers

[47] Reinforced sequential Monte Carlo for amortised sampling

[48] Efficient schemes for stochastic kinetic models

[49] Importance-Weighted Training of Diffusion Samplers

[50] Advancing Regularization Methods for Interpretable and Robust Deep Learning

[51] Computational methods for complex stochastic systems: Alternatives to MCMC

Contribution: Two approximately optimal proposal distributions

Can Refute:

[52] Auto-Encoding Sequential Monte Carlo

Cannot Refute:

[53] Sequential Monte Carlo approximations of Wasserstein-Fisher-Rao gradient flows

[54] Tuning Sequential Monte Carlo Samplers via Greedy Incremental Divergence Minimization

[55] Online variational Sequential Monte Carlo

[56] Particle-MALA and Particle-mGRAD: Gradient-based MCMC methods for high-dimensional state-space models

[57] Parameter Estimation in Hidden Markov Models with Intractable Likelihoods Using Sequential Monte Carlo

[58] Stochastic gradient Hamiltonian sequential Monte Carlo filter with Earth Mover's Distance sampling for target tracking

[59] SMCP3: Sequential Monte Carlo with probabilistic program proposals

[60] A General-Purpose Fixed-Lag No U-Turn Sampler for Nonlinear Non-Gaussian State Space Models

[61] Enhanced SMC2: Leveraging Gradient Information from Differentiable Particle Filters Within Langevin Proposals

Contribution: Versatile framework demonstrated across multiple domains

Cannot Refute:

[22] Adding Conditional Control to Text-to-Image Diffusion Models

[62] Diffusion models in bioinformatics and computational biology

[63] A survey on generative diffusion models

[64] Vector Quantized Diffusion Model for Text-to-Image Synthesis

[65] Dirichlet diffusion score model for biological sequence generation

[66] SnapFusion: Text-to-image diffusion model on mobile devices within two seconds

[67] Multistate and functional protein design using RoseTTAFold sequence space diffusion

[68] De novo protein design - From new structures to programmable functions

[69] Diffusion language models are versatile protein learners

[70] Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
