Description
Currently, we have two directories:
training: contains verl / trl code used for training the models for our paper
examples: contains minimal examples with various frameworks
@olliestanley suggested labeling the state of the repo used for the conference submission and moving away from this distinction.
I would suggest we keep only the minimal training examples in examples and update them, as some are quite stale (e.g. 3-4 months old).
Ideally, they would all use the same training configuration (at least in terms of dataset generation) and differ only in the framework-specific implementation.
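As a rough sketch of what sharing the dataset configuration could look like: the snippet below defines one dataset config that each framework example would read from. Every name here (`DatasetConfig`, its fields and defaults, `config_for_framework`) is a hypothetical placeholder, not something taken from the repo or from verl / trl.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class DatasetConfig:
    """Dataset-generation settings shared by every example (hypothetical fields)."""

    task_name: str = "example_task"  # placeholder, not a real task name
    size: int = 1000                 # placeholder number of generated samples
    seed: int = 42                   # placeholder seed for reproducibility


# Single source of truth that each framework example would import.
SHARED_DATASET_CONFIG = DatasetConfig()


def config_for_framework(
    framework: str, config: DatasetConfig = SHARED_DATASET_CONFIG
) -> dict:
    """Return the shared dataset settings tagged with the framework name."""
    settings = asdict(config)
    settings["framework"] = framework
    return settings


if __name__ == "__main__":
    # Both examples see identical dataset settings; only the framework differs.
    for name in ("verl", "trl"):
        print(config_for_framework(name))
```

With something like this, updating the dataset settings in one place would keep all examples in sync, and each example script would only contain the framework-specific training code.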