Skip to content

Conversation

sunildkumar
Copy link
Member

@sunildkumar sunildkumar commented Feb 6, 2025

I compared this trainer to the version in https://github.com/Deep-Agent/R1-V and found that they found success generating one completion at a time instead of as a batch.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant