
Shorter segments? #15

Closed
ronyfadel opened this issue Feb 22, 2023 · 7 comments

Comments

@ronyfadel

Would it be possible to produce shorter segments? (some are way too long)

@guillaumekln
Contributor

There is no option that can effectively prevent this. The parameter length_penalty can help to some extent but it will not force the model to predict a shorter segment.
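For context, a minimal sketch of how a GNMT-style length penalty typically enters beam search scoring. This is an illustration of the general mechanism, not CTranslate2's exact implementation; the `normalized_score` helper and the sample numbers are mine:

```python
def normalized_score(cum_logprob: float, length: int, length_penalty: float) -> float:
    """Length-normalized beam score in the GNMT style.

    cum_logprob is the sum of token log-probabilities (<= 0). A larger
    normalizer makes long hypotheses look less bad, which is why
    length_penalty nudges segment length rather than forcing it.
    """
    return cum_logprob / (((5 + length) / 6) ** length_penalty)


# Two hypotheses with the same average per-token log-probability.
# Lowering length_penalty below 1 makes the long hypothesis score worse
# relative to the short one; raising it above 1 does the opposite.
short_hyp = normalized_score(-5.0, 5, length_penalty=1.0)
long_hyp = normalized_score(-20.0, 20, length_penalty=1.0)
```

Because the effect is only a re-ranking of beam candidates, no value of the penalty can guarantee a shorter segment.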

Do you get a different output with openai/whisper? If so, it would be great if you could provide a way to reproduce it.

@ronyfadel
Author

There have been discussions in openai/whisper suggesting that you can skew the model toward shorter segments by tweaking max_text_token_logprob: openai/whisper#435 (reply in thread)

Is something similar possible in the faster-whisper codebase?
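The idea from that thread can be sketched as a cap on the per-token log-probabilities during decoding: clamp every text token to a ceiling so that timestamp tokens, which close a segment, become relatively more likely. This is a hypothetical illustration of the mechanism, not an API offered by faster-whisper; `timestamp_begin` stands for the index of the first timestamp token in Whisper's vocabulary:

```python
def cap_text_token_logprobs(logprobs, timestamp_begin, max_text_token_logprob):
    """Clamp text-token log-probabilities to a ceiling.

    Tokens with index >= timestamp_begin (Whisper's timestamp tokens) are
    left untouched, so they win more often and the decoder ends segments
    earlier. Lowering max_text_token_logprob skews toward shorter segments.
    """
    return [
        min(lp, max_text_token_logprob) if i < timestamp_begin else lp
        for i, lp in enumerate(logprobs)
    ]
```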

@ronyfadel
Author

I just saw the addition of length_penalty today. How should it be used? Its default value is set to 1.

@ronyfadel
Author

@guillaumekln from my testing, I've also had great results using the token_timestamps flag here

To be honest, I don't know what CTranslate2 does to the underlying model, or whether such capabilities are lost in the conversion.

@guillaumekln
Contributor

So far we have not implemented any features or parameters that are not available in the reference implementation from openai/whisper. Currently there is no easy way for users to tweak max_text_token_logprob or enable token-level timestamps; both would require changes to the C++ implementation in CTranslate2.

Regarding word-level timestamps, I'm following this development in the openai/whisper repo. If it is merged, I will look into supporting it here as well.

Also, you can ignore my comment regarding length_penalty. It is not relevant to your issue, since you want the model to output more timestamps, not shorter generated sequences.

@guillaumekln
Contributor

I just merged the word-level timestamps branch so the segments can now be as short as you want.
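To illustrate how word-level timestamps enable arbitrarily short segments: once each word carries its own `start`/`end` times (as with faster-whisper's `word_timestamps=True` output), you can re-cut segments yourself. The `regroup` helper and the duration cap below are my own sketch, not part of the library:

```python
from dataclasses import dataclass


@dataclass
class Word:
    """Mirrors the word/start/end fields of faster-whisper's word objects."""
    word: str
    start: float
    end: float


def regroup(words, max_duration=3.0):
    """Re-cut word-level timestamps into segments no longer than max_duration."""
    segments, current = [], []
    for w in words:
        # Start a new segment if adding this word would exceed the cap.
        if current and w.end - current[0].start > max_duration:
            segments.append(current)
            current = []
        current.append(w)
    if current:
        segments.append(current)
    return [
        ("".join(w.word for w in seg), seg[0].start, seg[-1].end)
        for seg in segments
    ]
```

With real transcription output you would collect the words from every `segment.words` list and feed them to a helper like this.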

@stephanedebove

stephanedebove commented Jun 20, 2024

Hi @guillaumekln, do you mind explaining what you mean by "I just merged the word-level timestamps branch so the segments can now be as short as you want"?

How do we control their length now?

And why, a couple of months after this reply, did you say in #452 (comment) that "There is no option to control the segment length."?
