add repeat audio back #1959

Open

yuekaizhang wants to merge 1 commit into openai:main from yuekaizhang:main

yuekaizhang commented Jan 15, 2024

Related to #1483.

I have encountered issues when decoding with beam_size > 1.

In the cross-attention at https://github.com/openai/whisper/blob/main/whisper/model.py#L102, I think the first two dims of the query and key tensors must be the same. However, I couldn't find anywhere that the audio features, or the key and value tensors, are repeated beam_size times.
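To illustrate the shape concern, here is a minimal sketch, not the Whisper code itself. It uses NumPy to stay dependency-light; `np.repeat` along axis 0 plays the role that `torch.repeat_interleave(..., dim=0)` would play in PyTorch. The helper names `repeat_for_beams` and `cross_attention_scores`, the sizes, and the assumed tensor layout `(batch, heads, ctx, head_dim)` are all hypothetical and chosen only for illustration.

```python
import numpy as np


def repeat_for_beams(audio_side, beam_size):
    """Repeat each audio item beam_size times along the batch axis.

    Hypothetical helper: item i ends up at rows
    [i * beam_size, (i + 1) * beam_size), matching a token batch laid out
    as n_audio groups of beam_size hypotheses.
    """
    return np.repeat(audio_side, beam_size, axis=0)


def cross_attention_scores(q, k):
    """Unnormalised attention scores q @ k^T.

    Assumed layout (illustrative only):
      q: (batch, heads, n_tokens, head_dim)
      k: (batch, heads, n_audio_ctx, head_dim)
    """
    return q @ k.transpose(0, 1, 3, 2)


# Illustrative sizes, not taken from any Whisper checkpoint.
n_audio, beam_size, n_head = 2, 3, 4
n_tokens, n_audio_ctx, d_head = 5, 7, 8

rng = np.random.default_rng(0)
# Queries come from the token side: one row per (audio, beam) pair.
q = rng.standard_normal((n_audio * beam_size, n_head, n_tokens, d_head))
# Keys come from the audio side: one row per audio item only.
k = rng.standard_normal((n_audio, n_head, n_audio_ctx, d_head))

# Batch sizes 6 and 2 cannot be broadcast against each other.
try:
    cross_attention_scores(q, k)
    mismatch_raised = False
except ValueError:
    mismatch_raised = True

# After repeating the audio side, the leading dims of q and k agree.
k_rep = repeat_for_beams(k, beam_size)
scores = cross_attention_scores(q, k_rep)
```

Note that a mismatch like this only appears when more than one audio item is decoded at once: a leading dimension of 1 would broadcast silently in this sketch.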


add repeat audio back

9c686cd

yuekaizhang mentioned this pull request

Whisper Fine-tuning Recipe on Aishell1 k2-fsa/icefall#1466

Merged

1 task
