Skip to content

fix(grpo): use 'is not None' for reward_weights check#24

Merged
sacredvoid merged 1 commit into
mainfrom
fix/reward-weights-falsy-check
Mar 26, 2026
Merged

fix(grpo): use 'is not None' for reward_weights check#24
sacredvoid merged 1 commit into
mainfrom
fix/reward-weights-falsy-check

Conversation

@sacredvoid
Copy link
Copy Markdown
Owner

Summary

Fixes #23

Test plan

  • All 152 tests pass

Same falsy-check pattern as the dataset_size bug fixed in #9.
An empty list would be silently skipped.

Fixes #23
@sacredvoid sacredvoid merged commit 3fa51cb into main Mar 26, 2026
@sacredvoid sacredvoid deleted the fix/reward-weights-falsy-check branch March 26, 2026 00:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Bug: GRPO reward_weights uses falsy check instead of is not None

1 participant