[Roadmap] Blackwell MXFP8 and NVFP4 RL training #615

@humansand

## Miles

- MXFP8 & NVFP4
    - [x] https://github.com/radixark/miles/pull/614
    - [x] https://github.com/radixark/miles/issues/567
    - [ ] https://github.com/radixark/miles/pull/919

- MXFP8
    - [x] https://github.com/radixark/miles/pull/512
    - [x] https://github.com/radixark/miles/pull/963

- NVFP4
    - [x] ~https://github.com/radixark/miles/pull/546~
    - [x] https://github.com/radixark/miles/pull/907
    - [x] https://github.com/radixark/miles/pull/1054
    - [ ] Use FlashInfer nvfp4 quantizer
    - [ ] Avoid QDQ during weight sync and directly use TE data

## SGLang

- MXFP8 & NVFP4
    - [x] https://github.com/sgl-project/sglang/pull/20214
- MXFP8
    - [x] https://github.com/sgl-project/sglang/pull/17449
    - [x] https://github.com/sgl-project/sglang/pull/18742
    - [x] ~https://github.com/sgl-project/sglang/pull/17294~
    - [ ] https://github.com/sgl-project/sglang/pull/26342
    - [x] https://github.com/sgl-project/sglang/pull/19537
    - [x] https://github.com/sgl-project/sglang/pull/21280
    - [x] https://github.com/sgl-project/sglang/pull/21576
    - [x] https://github.com/sgl-project/sglang/pull/22484
    - [x] https://github.com/sgl-project/sglang/pull/26287
    - [x] https://github.com/sgl-project/sglang/pull/28459
- NVFP4
    - [x] ~https://github.com/sgl-project/sglang/pull/18012~
    - [x] https://github.com/sgl-project/sglang/pull/18085
    - [x] https://github.com/sgl-project/sglang/pull/22204
    - [x] https://github.com/sgl-project/sglang/pull/22918

## TransformerEngine

- MXFP8 & NVFP4
    - [x] https://github.com/NVIDIA/TransformerEngine/pull/2644
    - [x] https://github.com/NVIDIA/TransformerEngine/pull/2865
- NVFP4
    - [x] https://github.com/NVIDIA/TransformerEngine/pull/2931
    - [x] https://github.com/NVIDIA/TransformerEngine/pull/2972
    - [x] https://github.com/NVIDIA/cudnn-frontend/pull/251
    - [ ] https://github.com/NVIDIA/TransformerEngine/pull/3042

## FlashInfer

- MXFP8 & NVFP4
    - [x] https://github.com/flashinfer-ai/flashinfer/pull/3387
- MXFP8
    - [x] https://github.com/flashinfer-ai/flashinfer/pull/2581
- NVFP4
    - [x] https://github.com/flashinfer-ai/flashinfer/pull/3027
    - [x] https://github.com/flashinfer-ai/flashinfer/pull/3264
    - [x] https://github.com/flashinfer-ai/flashinfer/pull/3448

