Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Whisper-v3 #1

Closed
suryasanchez opened this issue Nov 8, 2023 · 3 comments
Closed

Whisper-v3 #1

suryasanchez opened this issue Nov 8, 2023 · 3 comments

Comments

@suryasanchez
Copy link

Update to support Whisper v3? openai/whisper#1762

@thomasmol
Copy link
Owner

Yes I am working on it. Currently there are issues converting whisper v3 to cttranslate SYSTRAN/faster-whisper#544 (comment)

@thomasmol
Copy link
Owner

Currently waiting on this PR to get merged: huggingface/transformers#26699 so we can use batched inference and still get word level timestamps. I'll move away from faster-whisper (the maintainer is now working at Apple and not actively maintaining the repo anymore) and use the hugginface/transformer implementation described here: https://huggingface.co/openai/whisper-large-v3. Should result in even faster inference over faster-whisper. Hopefully somewhere this or next week!

@thomasmol
Copy link
Owner

Faster-whisper 0.10.0 was released, which includes support for whisper v3. Faster-whisper also has a new maintainer and moved here: https://github.com/SYSTRAN/faster-whisper.
Just updated the pipeline to new version of faster-whisper and to new version of pyannote (3.1).
Will still look into the batched inference when that's possible, but only when it supports VAD and word level timestamps.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants