Conversation
Cursor Bugbot has reviewed your changes and found 2 potential issues.
Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.
```python
    Field(
        description="MoE LoRA strategy. 'per_projection' applies separate LoRA to each of w1/w2/w3. 'perft_e' applies a single bypass LoRA to the entire MoE block (PErFT-E).",
    ),
] = "per_projection"
```
New config fields missing CHANGELOG entry
Low Severity
Two new config fields are added to LoRAConfig in src/prime_rl/configs/trainer.py — moe_lora_mode and save_dtype — without a corresponding CHANGELOG.md update. Per the project rule, any PR modifying configuration structures (including added fields) in src/prime_rl/*/config.py must update the changelog.
Additional Locations (1)
Triggered by project rule: BugBot Instructions
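A minimal fix would be a changelog entry along these lines (the section heading and wording are assumptions, since the project's CHANGELOG conventions aren't shown in this review):

```markdown
### Added
- `LoRAConfig.moe_lora_mode`: selects the MoE LoRA strategy (`per_projection` or `perft_e`).
- `LoRAConfig.save_dtype`: dtype that LoRA adapter tensors are cast to before checkpointing/broadcast.
```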
```python
DTYPE_MAP = {
    "bfloat16": torch.bfloat16,
    "float32": torch.float32,
}
```
Duplicate DTYPE_MAP definitions across files
Low Severity
This PR introduces two new identical DTYPE_MAP / _DTYPE_MAP dictionaries (in ckpt.py and filesystem.py) that duplicate the existing DTYPE_MAP already defined in src/prime_rl/trainer/model.py. All three map the same two strings to the same torch dtypes. A single shared definition would avoid inconsistency risk if new dtypes are added later.
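One way to deduplicate is to keep a single mapping that `ckpt.py`, `filesystem.py`, and `model.py` all import. A minimal sketch, assuming a shared module (the module path and the `resolve_dtype` helper name are hypothetical, not part of the PR):

```python
# Hypothetical shared module, e.g. src/prime_rl/trainer/dtypes.py (path assumed).
import torch

# Single source of truth for the string -> torch.dtype mapping.
DTYPE_MAP: dict[str, torch.dtype] = {
    "bfloat16": torch.bfloat16,
    "float32": torch.float32,
}


def resolve_dtype(name: str) -> torch.dtype:
    """Resolve a dtype name, failing loudly on unsupported values."""
    try:
        return DTYPE_MAP[name]
    except KeyError:
        raise ValueError(
            f"Unsupported dtype {name!r}; expected one of {sorted(DTYPE_MAP)}"
        ) from None
```

With this, adding a new dtype (say `float16`) is a one-line change that all three call sites pick up consistently.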


Note
Medium Risk
Adds a new MoE LoRA application path (`perft_e`) with custom forward/EP handling, which could affect training correctness and performance for MoE models. Also changes adapter checkpoint/broadcast serialization to cast to a configurable dtype, which may impact downstream loading expectations and numeric fidelity.

Overview

Adds a new MoE LoRA strategy, `moe_lora_mode="perft_e"`, that applies a single bypass LoRA over the entire MoE block (PErFT-E) via the new `MultiLoRAPERFTE` module, and updates `apply_lora_to_model` to select it vs. the existing per-projection MoE LoRA.

Introduces `LoRAConfig.save_dtype` and updates both weight checkpointing (`WeightCheckpointManager.get_run_adapter_state_dict`) and the filesystem weight broadcast (adapter-only path) to cast LoRA adapter tensors to CPU in the configured dtype before saving.

Written by Cursor Bugbot for commit 4eac31f. This will update automatically on new commits.
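The adapter-serialization change described above amounts to a cast like the following sketch (the function name and flat state-dict shape are assumptions for illustration, not the PR's actual code):

```python
import torch


def cast_adapter_state(
    state_dict: dict[str, torch.Tensor], save_dtype: torch.dtype
) -> dict[str, torch.Tensor]:
    """Move LoRA adapter tensors to CPU and cast them to the configured save dtype."""
    return {
        name: tensor.detach().to(device="cpu", dtype=save_dtype)
        for name, tensor in state_dict.items()
    }
```

Note the fidelity trade-off the review flags: casting float32 adapters to bfloat16 drops mantissa bits, and any downstream loader must now expect tensors in the configured `save_dtype` rather than the training dtype.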