Skip to content

Add debug-exit-after-rollout to train entrypoints#1455

Open
fzyzcjy wants to merge 1 commit into
tom/pr_chain/trainer_ft/dev_revert_reversed/always-save-rollout-debug-data-regardless-of-rollout-global-datasetfrom
tom/pr_chain/trainer_ft/dev_revert_reversed/add-debug-exit-after-rollout-to-train-entrypoints
Open

Add debug-exit-after-rollout to train entrypoints#1455
fzyzcjy wants to merge 1 commit into
tom/pr_chain/trainer_ft/dev_revert_reversed/always-save-rollout-debug-data-regardless-of-rollout-global-datasetfrom
tom/pr_chain/trainer_ft/dev_revert_reversed/add-debug-exit-after-rollout-to-train-entrypoints

Conversation

@fzyzcjy

@fzyzcjy fzyzcjy commented Jun 22, 2026

Copy link
Copy Markdown
Collaborator

Adds a --debug-exit-after-rollout escape hatch to train.py and train_async.py: after the configured number of rollouts past start_rollout_id, log and break out of the train loop early. Used by FT e2e scenarios to bound run length.

@gemini-code-assist

Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/always-save-rollout-debug-data-regardless-of-rollout-global-dataset branch from 18f5d29 to 9bbb879 Compare June 23, 2026 07:52
@fzyzcjy fzyzcjy requested a review from yushengsu-thu as a code owner June 23, 2026 07:52
@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/add-debug-exit-after-rollout-to-train-entrypoints branch from 5e920bf to fbb0a45 Compare June 23, 2026 07:52
@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/always-save-rollout-debug-data-regardless-of-rollout-global-dataset branch from 9bbb879 to 19a080a Compare June 23, 2026 09:30
@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/add-debug-exit-after-rollout-to-train-entrypoints branch from fbb0a45 to cafb154 Compare June 23, 2026 09:30
Adds a --debug-exit-after-rollout escape hatch to train.py and train_async.py: after the configured number of rollouts past start_rollout_id, log and break out of the train loop early. Used by FT e2e scenarios to bound run length.
@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/always-save-rollout-debug-data-regardless-of-rollout-global-dataset branch from 19a080a to b635d82 Compare June 23, 2026 13:34
@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/add-debug-exit-after-rollout-to-train-entrypoints branch from cafb154 to 14dda88 Compare June 23, 2026 13:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant