Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix bug in ggml_alibi #143

Merged
merged 1 commit into from
May 11, 2023
Merged

Fix bug in ggml_alibi #143

merged 1 commit into from
May 11, 2023

Conversation

yangyaofei
Copy link
Contributor

According to NouamaneTazi/bloomz.cpp#27, I found there is a wrong implementation in ggml_alibi.

Now, ggml_alibi will get same result compare to the build_alibi_tensor in transformers.models.bloom.modeling_bloom

@Green-Sky
Copy link
Contributor

how does this relate to the discussion here #131 (comment) ?

@lukasmoellerch
Copy link
Contributor

replit implementation seems to be fixed after this change

@ggerganov
Copy link
Owner

Merging this based on the positive feedback

@ggerganov ggerganov merged commit 085f6af into ggerganov:master May 11, 2023
@wangkuiyi
Copy link

Thank you @yangyaofei for this fix! I used the old ALiBi in another model and suffered a lot from it.

@yangyaofei
Copy link
Contributor Author

@wangkuiyi you are welcome 😁

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants