Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reorganize POWER9 SIMD code #369

Merged
merged 3 commits into from
Jan 5, 2023

Conversation

fitzsim
Copy link
Contributor

@fitzsim fitzsim commented Jan 4, 2023

I could not eliminate the separate index argument in the f16 load and store macros, so this patch set needs testing on other architectures.

The existing GGML_F32x4_REDUCE macro performs as well as the implementation in #366 so I used the existing one.

When I test with the F32 model, ggml_vec_dot_f16 and ggml_vec_mad_f16 are still being called. Is that expected?

@ggerganov
Copy link
Owner

When I test with the F32 model, ggml_vec_dot_f16 and ggml_vec_mad_f16 are still being called. Is that expected?

Yes - good point. I have to update whisper.cpp to not use F16 when in F32 mode.
Currently it stores some of the intermediate tensors in F16 which leads to the ggml_vec_dot_f16 and ggml_vec_mad_f16 pathways.

Copy link
Owner

@ggerganov ggerganov left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Tested all platforms - everything is good!
Ready to merge if you are good?

@fitzsim
Copy link
Contributor Author

fitzsim commented Jan 5, 2023

Yes, I don't have any changes left to push. Thanks for reviewing and testing the patch set.

@ggerganov ggerganov merged commit 1944e7c into ggerganov:master Jan 5, 2023
rock3125 pushed a commit to rock3125/whisper.cpp that referenced this pull request Feb 21, 2023
anandijain pushed a commit to anandijain/whisper.cpp that referenced this pull request Apr 28, 2023
jacobwu-b pushed a commit to jacobwu-b/Transcriptify-by-whisper.cpp that referenced this pull request Oct 24, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants