Don't update duration if last timestamp is timestamp_begin #191

vickianand · 2022-09-29T14:06:50Z

There is bug causing start and end timestamps of the segments to be same when they shouldn't be.
If the only timestamp-token found is the tokenizer.timestamp_begin then we shouldn't update the duration (to 0). If there is really no speech, then anyway the segment won't be appended by the add_segment function.
Before this fix:

After this fix:

jongwook · 2022-09-29T19:27:51Z

Thanks. I think it's another failure case where Whisper didn't sample any intermediate timestamp tokens. Would you be able to upload/email the audio file?

vickianand · 2022-09-29T22:21:28Z

Here is the audio file: https://drive.google.com/file/d/1WG17h-3U565uWpl_jW2bkO3eS_l73H7f/view?usp=sharing
For the first three 30-sec chunks, it doesn't predict intermediate timestamps when run with:
--model medium.en --language en --initial_prompt "um, uh"

Don't update duration if last timestamp is same as begin

4c06182

vickianand changed the title ~~Don't update duration if last timestamp is same as begin~~ Don't update duration if last timestamp is timestamp_begin Sep 29, 2022

jongwook merged commit 2b0c297 into openai:main Sep 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't update duration if last timestamp is timestamp_begin #191

Don't update duration if last timestamp is timestamp_begin #191

vickianand commented Sep 29, 2022 •

edited

Loading

jongwook commented Sep 29, 2022

vickianand commented Sep 29, 2022 •

edited

Loading

Don't update duration if last timestamp is timestamp_begin #191

Don't update duration if last timestamp is timestamp_begin #191

Conversation

vickianand commented Sep 29, 2022 • edited Loading

jongwook commented Sep 29, 2022

vickianand commented Sep 29, 2022 • edited Loading

vickianand commented Sep 29, 2022 •

edited

Loading

vickianand commented Sep 29, 2022 •

edited

Loading