Can VAD preserve original timestamps? #3414

goose-ws · 2025-09-09T05:46:53Z

Using the 'medium.en' model, I get a lot of "ghosts/echoes" during parts of the recording that are actually silent. Using VAD has eliminated this, which is great. However, I need to preserve the actual timestamps from the original file in my output. It appears that when using VAD, the timestamps are all shifted relative to the original file, since what is passed to whisper is just a contiguous stream of non-silence.

Is there a way I can use VAD but still have each segment reported at its original timestamp, i.e. where in the source file the audio was actually found?

goose-ws (Author) · 2025-09-10T07:29:23Z

For anyone else who comes across this: I haven't found a way to accomplish this via whisper.cpp, but it is built into whisperX.
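
For illustration only, here is a rough post-processing sketch of how shifted timestamps could in principle be mapped back. It assumes you can obtain the list of speech segments the VAD kept, as (start, end) times in seconds in the original file, and that those segments were concatenated back to back with no padding or overlap. Whether and how whisper.cpp exposes those segments is not something confirmed in this thread, so `speech_segments`, `build_offsets`, and `to_original_time` are all hypothetical names, not part of any whisper.cpp or whisperX API.

```python
from bisect import bisect_right
from itertools import accumulate


def build_offsets(speech_segments):
    """Start position of each kept segment on the concatenated (VAD) timeline."""
    if not speech_segments:
        raise ValueError("need at least one speech segment")
    durations = [end - start for start, end in speech_segments]
    # Segment i starts where the durations of segments 0..i-1 end.
    return [0.0] + list(accumulate(durations))[:-1]


def to_original_time(t, speech_segments, offsets):
    """Map a timestamp on the concatenated timeline back to the original file."""
    # Find the last segment whose concatenated start is <= t.
    i = bisect_right(offsets, t) - 1
    i = max(0, min(i, len(speech_segments) - 1))
    start, end = speech_segments[i]
    # Clamp so a timestamp never lands past the end of its segment.
    return min(start + (t - offsets[i]), end)


# Hypothetical example: speech at 5-8 s and 20-24 s in the original file.
speech_segments = [(5.0, 8.0), (20.0, 24.0)]
offsets = build_offsets(speech_segments)

for t in (1.0, 3.5, 7.0):
    print(t, "->", to_original_time(t, speech_segments, offsets))
```

A transcript segment that spans a boundary between two speech regions would need its start and end mapped separately, and any padding the VAD adds around speech would have to be accounted for in the offsets.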

