
[cfg,trainer] feat: (MOPD, 1/3): Multi-teacher config dict#5774

Closed
JacobHelwig wants to merge 9 commits into verl-project:main from JacobHelwig:jhelwig/mopdCfgDict

Conversation

@JacobHelwig
Collaborator

What does this PR do?

Config changes for multi-teacher OPD: adds multiple teacher model configs to DistillationConfig while maintaining single-teacher support elsewhere.

Design & Code Changes

Adds a teacher model config dict to the distillation config, while keeping the teacher_model entry for single-teacher OPD.

For the multi-teacher training script, teacher model args will be specified as:

+distillation.teacher_models.gsm8k.task="openai/gsm8k"    
+distillation.teacher_models.gsm8k.model_path="path/to/math_teacher"
+distillation.teacher_models.geo3k.task="hiyouga/geometry3k"
+distillation.teacher_models.geo3k.model_path="path/to/vision_math_teacher"
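To illustrate the shape these dotted overrides produce, here is a minimal sketch. The `parse_dotlist` helper below is a hypothetical stand-in for the Hydra/OmegaConf parsing the trainer would actually use, not verl code; it just shows that each `teacher_models.<name>.<field>=<value>` override lands under its own per-teacher key.

```python
# Illustrative sketch only: expand Hydra-style dotted overrides into a
# nested dict, to show the structure teacher_models ends up with.
# parse_dotlist is a hypothetical helper, not part of verl or Hydra.

def parse_dotlist(overrides):
    """Expand '+a.b.c="v"' strings into a nested dict."""
    cfg = {}
    for item in overrides:
        keys, _, value = item.lstrip("+").partition("=")
        node = cfg
        *parents, leaf = keys.split(".")
        for k in parents:
            node = node.setdefault(k, {})
        node[leaf] = value.strip('"')
    return cfg

cfg = parse_dotlist([
    '+distillation.teacher_models.gsm8k.task="openai/gsm8k"',
    '+distillation.teacher_models.gsm8k.model_path="path/to/math_teacher"',
    '+distillation.teacher_models.geo3k.task="hiyouga/geometry3k"',
    '+distillation.teacher_models.geo3k.model_path="path/to/vision_math_teacher"',
])
# cfg["distillation"]["teacher_models"] now holds one entry per teacher name.
```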

Contributor

@gemini-code-assist bot left a comment


Code Review

This pull request refactors the distillation configuration by moving resource pool settings, such as enable_resource_pool, n_gpus_per_node, and nnodes, from the individual teacher model level to the top-level distillation configuration. It also removes the num_workers parameter and introduces a teacher_models dictionary to support future multi-teacher distillation. Additionally, the changes include improved validation logic for teacher model initialization and resource allocation across the trainer and experimental loops. I have no feedback to provide as there were no review comments to evaluate.
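The review above describes moving resource pool settings to the top level of the distillation config and adding a `teacher_models` dict alongside the single-teacher entry. A rough Python dataclass sketch of that shape, assuming the field names and types beyond those quoted in the review (it is not the actual verl config class):

```python
# Hypothetical sketch of the config shape described by the review:
# resource-pool fields (enable_resource_pool, n_gpus_per_node, nnodes)
# sit on the top-level distillation config, shared by all teachers,
# while per-teacher fields live in the teacher_models dict.
# Defaults and field names not quoted in the review are assumptions.

from dataclasses import dataclass, field
from typing import Dict, Optional

@dataclass
class TeacherModelConfig:
    task: str = ""
    model_path: str = ""

@dataclass
class DistillationConfig:
    # Resource pool settings, now shared across all teachers.
    enable_resource_pool: bool = False
    n_gpus_per_node: int = 8
    nnodes: int = 1
    # Single-teacher entry kept for backward compatibility.
    teacher_model: Optional[TeacherModelConfig] = None
    # Multi-teacher entries keyed by a user-chosen name.
    teacher_models: Dict[str, TeacherModelConfig] = field(default_factory=dict)

cfg = DistillationConfig(
    teacher_models={
        "gsm8k": TeacherModelConfig(task="openai/gsm8k"),
        "geo3k": TeacherModelConfig(task="hiyouga/geometry3k"),
    }
)
```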

@JacobHelwig JacobHelwig changed the title [MOPD, 1/n][cfg,trainer] feat: Multi-teacher config dict [MOPD, 1/3][cfg,trainer] feat: Multi-teacher config dict Apr 7, 2026
@JacobHelwig JacobHelwig changed the title [MOPD, 1/3][cfg,trainer] feat: Multi-teacher config dict [cfg,trainer] feat: (MOPD, 1/3): Multi-teacher config dict Apr 7, 2026
@JacobHelwig
Collaborator Author

/gemini review

Contributor

@gemini-code-assist bot left a comment


Code Review

This pull request refactors the distillation configuration to support multi-teacher setups by moving resource pool settings from the teacher model configuration to the top-level distillation configuration and introducing a teacher_models dictionary. The changes include updating the trainer logic, configuration files, and validation methods to accommodate these structural updates. I have no feedback to provide as there were no review comments.

@wuxibin89
Collaborator

Merged into #5834

@wuxibin89 wuxibin89 closed this Apr 13, 2026
