Skip to content

Add FT with-failure e2e scenarios#1459

Open
fzyzcjy wants to merge 1 commit into
tom/pr_chain/trainer_ft/dev_revert_reversed/add-ft-deterministic-e2e-scenariosfrom
tom/pr_chain/trainer_ft/dev_revert_reversed/add-ft-with-failure-e2e-scenarios
Open

Add FT with-failure e2e scenarios#1459
fzyzcjy wants to merge 1 commit into
tom/pr_chain/trainer_ft/dev_revert_reversed/add-ft-deterministic-e2e-scenariosfrom
tom/pr_chain/trainer_ft/dev_revert_reversed/add-ft-with-failure-e2e-scenarios

Conversation

@fzyzcjy

@fzyzcjy fzyzcjy commented Jun 22, 2026

Copy link
Copy Markdown
Collaborator

Adds the with-failure FT e2e scenario (scenario_with_failure: a deterministic single-cell fault during training that must heal and converge) plus its per-mode CI entry tests (dp2_cp2_pp2 / real_rollout_dense / dp2_cp2_tp2_ep2).

@gemini-code-assist

Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/add-ft-deterministic-e2e-scenarios branch from f70db36 to f7eef9c Compare June 23, 2026 07:52
@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/add-ft-with-failure-e2e-scenarios branch from d971dbf to 2955715 Compare June 23, 2026 07:52
@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/add-ft-deterministic-e2e-scenarios branch from f7eef9c to 4eeac8d Compare June 23, 2026 09:31
@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/add-ft-with-failure-e2e-scenarios branch from 2955715 to 5230b27 Compare June 23, 2026 09:31
Adds the with-failure FT e2e scenario (scenario_with_failure: a deterministic single-cell fault during training that must heal and converge) plus its per-mode CI entry tests (dp2_cp2_pp2 / real_rollout_dense / dp2_cp2_tp2_ep2).
@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/add-ft-deterministic-e2e-scenarios branch from 4eeac8d to 2294738 Compare June 23, 2026 13:35
@fzyzcjy fzyzcjy force-pushed the tom/pr_chain/trainer_ft/dev_revert_reversed/add-ft-with-failure-e2e-scenarios branch from 5230b27 to d559fa8 Compare June 23, 2026 13:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant