[Nvidia] Add trtllm mnnvl allreduce #12787

wenscarl · 2025-11-06T19:41:07Z

Motivation

Upstreaming the new trtllm_mnnvl_fused_allreduce_add_rmsnorm. Depends on flashinfer-ai/flashinfer#2118

This will improve throughput for multi-GPU decode for NVL systems.

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.

Fridge003 · 2025-11-10T04:36:25Z

@wenscarl Is this PR ready for review?

wenscarl · 2025-11-10T13:55:16Z

@wenscarl Is this PR ready for review?

There is still some issue with the kernel in flashinfer.

wenscarl and others added 8 commits October 30, 2025 19:09

Add mm_fp4 trtllm backend

b0244e1

Merge branch 'sgl-project:main' into mm_fp4_trtllm

69e7be5

Use str env var

153f7b1

Address comment.

d3e2ed6

Merge branch 'main' into mm_fp4_trtllm

a4ad51e

Merge branch 'main' into mm_fp4_trtllm

4488840

Fix typo.

d63fd56

Wip

94d32b6

github-actions bot added documentation Improvements or additions to documentation quant LLM Quantization labels Nov 6, 2025

Fridge003 added high priority and removed documentation Improvements or additions to documentation labels Nov 6, 2025

wip

77e2462

github-actions bot added the documentation Improvements or additions to documentation label Nov 19, 2025

anurlybayev added nvidia blackwell SM100/SM120 labels Dec 11, 2025

anurlybayev assigned wenscarl Dec 11, 2025

anurlybayev added Grace Blackwell and removed blackwell SM100/SM120 labels Dec 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Nvidia] Add trtllm mnnvl allreduce #12787

[Nvidia] Add trtllm mnnvl allreduce #12787

Uh oh!

wenscarl commented Nov 6, 2025 •

edited by anurlybayev

Loading

Uh oh!

Fridge003 commented Nov 10, 2025

Uh oh!

wenscarl commented Nov 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Nvidia] Add trtllm mnnvl allreduce #12787

Are you sure you want to change the base?

[Nvidia] Add trtllm mnnvl allreduce #12787

Uh oh!

Conversation

wenscarl commented Nov 6, 2025 • edited by anurlybayev Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Uh oh!

Fridge003 commented Nov 10, 2025

Uh oh!

wenscarl commented Nov 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

wenscarl commented Nov 6, 2025 •

edited by anurlybayev

Loading