Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Jihoon Lee1, Hoyeon Moon1, Kevin Zhai5, Arun Kumar Chithanar, Anit Kumar Sahu2, Soummya Kar3, Chul Lee, Souradip Chakraborty4, Amrit Singh Bedi5
1Yonsei University, 2Oracle, 3CMU, 4UMD, 5UCF
Diffusion-based large language models (dLLMs) are trained flexibly to model extreme dependence in the data distribution; however, how to best utilize this information at inference time remains an open problem. In this work, we uncover an interesting property of these models: dLLMs trained on textual data implicitly learn a mixture of semi-autoregressive experts, where different generation orders reveal different specialized behaviors. We show that committing to any single, fixed inference-time schedule, a common practice, collapses performance by failing to leverage this latent ensemble. To address this, we introduce HEX (Hidden semi-autoregressive EXperts for test-time scaling), a training-free inference method that ensembles across heterogeneous block schedules. By taking a majority vote over generation paths with diverse block sizes, HEX robustly avoids the failure modes associated with any single fixed schedule. On reasoning benchmarks such as GSM8K, it boosts accuracy by up to 3.56× (from 24.72% to 88.10%), outperforming top-K margin inference and specialized fine-tuned methods such as GRPO, without additional training. HEX also yields significant gains on the MATH benchmark (from 16.40% to 40.00%), on scientific reasoning with ARC-C (from 54.18% to 87.80%), and on TruthfulQA (from 28.36% to 57.46%). Our results establish a new paradigm for test-time scaling in dLLMs, revealing that the sequence in which masking is performed plays a critical role in determining performance during inference.
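To make the role of the block schedule concrete, below is a minimal, hypothetical sketch (not the released implementation) of semi-autoregressive block decoding in a masked-diffusion LLM: the block size fixes which positions can be unmasked at each stage, and therefore the generation order. The `fill_masks` callable, the `MASK` placeholder, and the confidence-based reveal rule are illustrative assumptions, not names from this repository.

```python
from typing import Callable, List, Tuple

MASK = -1  # illustrative placeholder id for a masked position

def semi_ar_decode(
    fill_masks: Callable[[List[int]], List[Tuple[int, float]]],
    prompt: List[int],
    gen_len: int,
    block_size: int,
    steps_per_block: int = 4,
) -> List[int]:
    """Decode `gen_len` tokens left-to-right in blocks of `block_size`.

    Within the active block, masked positions are revealed over several
    denoising steps, most confident predictions first; positions outside
    the active block stay masked until their block is reached.
    """
    seq = prompt + [MASK] * gen_len
    for start in range(len(prompt), len(seq), block_size):
        end = min(start + block_size, len(seq))
        for step in range(steps_per_block):
            masked = [i for i in range(start, end) if seq[i] == MASK]
            if not masked:
                break
            # One denoising call: a (token, confidence) proposal per position.
            proposals = fill_masks(seq)
            masked.sort(key=lambda i: proposals[i][1], reverse=True)
            # Ceiling division so every position is revealed by the last step.
            reveal = -(-len(masked) // (steps_per_block - step))
            for i in masked[:reveal]:
                seq[i] = proposals[i][0]
    return seq
```

Under this view, each choice of `block_size` traces a different unmasking order through the same model, which is what the paper interprets as selecting a different hidden semi-autoregressive expert.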
- ✨ Hidden Semi-Autoregressive Experts: Reveals that diffusion LLMs implicitly learn multiple semi-AR experts, each specializing in distinct generation orders.
- 🚀 Training-Free Test-Time Scaling: Ensembles diverse block-sized decoding schedules at inference to unlock latent reasoning capabilities without retraining (a minimal sketch follows below).
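The following is an equally hypothetical sketch of the ensembling step described above: decode once per block schedule and majority-vote the parsed answers. The `decode` and `extract_answer` callables and the default block sizes are assumptions for illustration; see run_eval_HEX.sh for the actual entry point.

```python
from collections import Counter
from typing import Callable, Iterable, List

def hex_majority_vote(
    decode: Callable[[int], List[int]],           # block_size -> decoded token ids
    extract_answer: Callable[[List[int]], str],   # token ids -> parsed final answer
    block_sizes: Iterable[int] = (4, 8, 16, 32, 64),
) -> str:
    """Run one semi-autoregressive decode per block schedule and return the
    answer that the largest number of schedules agrees on."""
    answers = [extract_answer(decode(b)) for b in block_sizes]
    return Counter(answers).most_common(1)[0][0]
```

Because each block size induces a different unmasking order, the vote aggregates over the model's hidden semi-autoregressive experts rather than over repeated samples from a single fixed schedule.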
```bash
# Clone the repository
git clone https://github.com/junos-ai-org/Test-Time-Scaling
cd Test-Time-Scaling/HEX
```
```bash
# Create a virtual environment
conda env create -f env.yml
conda activate dllm_tts
```

Inside the HEX/eval directory, review the arguments described in run_eval_HEX.sh and run the script accordingly.

```bash
cd eval
bash run_eval_HEX.sh
```

Repository structure:

```
Test-Time-Scaling/
└── HEX/   # HEX source code
```
If you find this work useful, please cite our paper:
```bibtex
@article{lee2025hex,
  title={Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts},
  author={Lee, Jihoon and Moon, Hoyeon and Zhai, Kevin and Chithanar, Arun Kumar and Sahu, Anit Kumar and Kar, Soummya and Lee, Chul and Chakraborty, Souradip and Bedi, Amrit Singh},
  journal={Under Submission},
  year={2025}
}
```

This project is licensed under the MIT License - see the LICENSE file for details.
Most of the code in /HEX/ is based on d1.
For questions or issues, please open an issue on GitHub.

