How to set minimum number of output tokens ? #2451
Unanswered
thangld201
asked this question in
Q&A
Replies: 1 comment
-
You can checkout this PR here #1945 but it has been stalled. If you can help getting this into a good shape that would be even better! |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
This param is equivalent to
min_new_tokens
in huggingface.generate()
. Is this already supported in vLLM ? @WoosukKwon I only see themax_tokens
option inSamplingParam
..Beta Was this translation helpful? Give feedback.
All reactions