Skip to content

fix: align accuracy reward docstring with actual parsing#396

Open
Forostovec wants to merge 1 commit intoNousResearch:mainfrom
Forostovec:fix/accuracy-reward-docstring
Open

fix: align accuracy reward docstring with actual parsing#396
Forostovec wants to merge 1 commit intoNousResearch:mainfrom
Forostovec:fix/accuracy-reward-docstring

Conversation

@Forostovec
Copy link

The _extract_final_answer docstring listed "The answer is 42" as a supported format, but the function only parses GSM8K-style #### ... and LaTeX \boxed{...} patterns. This mismatch could mislead contributors and cause incorrect assumptions about reward behavior.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant