You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fix repeated outputs in version 3.16.0 when using include_prompt_in_result=False and a batch input with variable lengths: a typo in the code led to min_length being incorrectly applied
Update the Transformers converter to accept extra tokens for Falcon models
Release the Python GIL when loading the model
Initialize the rotary embeddings on the GPU instead of the CPU
Avoid a copy for the input features passed to the Whisper methods