Replies: 1 comment
-
For anyone else who comes across this: I haven't found a way to accomplish this via whisper.cpp, but this is natively built in to whisperX. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Using the 'medium.en' model, I get a lost of "ghosts/echoes" during parts of the recording that are actually silent. Using VAD has eliminated this! Which is great. However, I need to preserve the actual timestamp from the original file in my output. It appears that when using VAD, the timestamps are all shifted forward, since what is passed to whisper is just a contiguous file of non-silence.
Is there a way I can use VAD, but preserve where in the file the audio was found with the original timestamp?
Beta Was this translation helpful? Give feedback.
All reactions