fix(ci): extend watchdog to non-PR events, simplify stale checks#812
Conversation
- Remove the pull_request-only restriction so schedule and workflow_dispatch runs also get automatic retries on git failures. - Use pr-context artifact presence (instead of event type) to decide whether to run PR-specific validation. - Drop the redundant pr_head_sha vs run_head_sha check (both come from the same run). Co-Authored-By: Claude <noreply@anthropic.com>
Codex Review该评论由 review 机器人自动更新。
SummaryReview failed at stage Findings未生成结构化 findings,因为 review 过程提前失败。 Log Tail |
|
Note Gemini is unable to generate a review for this pull request due to the file types involved not being currently supported. |
A3 板测失败
失败用例
|
A3 板测失败详情:PR #812orchestration_example_kernel_add
vector_example_dag_kernel_add_scalar
paged_attention_example_kernel_pv_matmul
paged_attention_example_kernel_init_inplace
vector_example_dag_kernel_add
paged_attention_example_kernel_online_update
paged_attention_example_kernel_softmax_prepare
orchestration_example_kernel_add_scalar
paged_attention_example_kernel_qk_matmul
orchestration_example_kernel_mul
vector_example_dag_kernel_mul
prelu
plan_memory_bind_tile_alias_liveness
plan_memory_peak_exact_capacity
plan_memory_loop_no_reuse_outer_live
plan_memory_if_yield
plan_memory_loop_in_if
plan_memory_peak_8_overlapping
plan_memory_if_in_loop
plan_memory_fragmentation_hole_fit
plan_memory_for_iter_args_yield
plan_memory_no_reuse_overlap
plan_memory_reuse_sequential
plan_memory_nested_loops
plan_memory_fragmentation_two_holes
rems
partition_view_verify_rank_mismatch_valid
partition_view_verify_valid
partition5d_dynamic
partition5d
sparse_attn_test_incore_7
decode_hca_test_incore_54
attention_swa_test_incore_40
decode_swa_test_incore_40
decode_csa_test_incore_81
attention_hca_test_incore_54
attention_csa_test_refresh_incore_81
tensor_view_layout_dn
rope_kv_cache
qwen3_decode_incore_4
post_rmsnorm
qwen3_decode_incore_1
qwen3_decode_incore_10
qwen3_decode_incore_11
rmsnorm
qwen3_decode_incore_6
qwen3_decode_incore_2
qwen3_decode_incore_7
qwen3_decode_incore_5
qwen3_decode_incore_12
test_barrier_sync
matmul
add_double_dynamic
nested_loop_confliect
rar_optimization_test
test_dynamic_valid_shape
test_auto_sync_tail_hint
compensation_test
rem
|
背景
目前 watchdog 只对
pull_request事件的失败做自动重试,schedule和workflow_dispatch如果遇到网络问题不会重试。同时 PR 校验中存在多余的 SHA 比较。修改
if条件中的事件类型限制 — ci-sim 本来只有pull_request、schedule、workflow_dispatch三种触发,不需要在 watchdog 里再过滤pr_head_sha != run_head_sha校验 — 两者都来自同一次 run,必定相等pr-context不存在时改为 fall through 而非直接 exit,让 schedule/dispatch 也能走到重试逻辑行为变化
🤖 Generated with Claude Code