-
Notifications
You must be signed in to change notification settings - Fork 941
Issues: ggerganov/ggml
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Asserting over nb[0] as type check causes issue for tensors after permuting
#530
opened Sep 19, 2023 by
cndn
Why GPT-J performs better on graviton without using simd than x86 using simd
#520
opened Sep 13, 2023 by
xshen053
Converting an arbritrary HF Transformer GPT2 to ggml format
#519
opened Sep 12, 2023 by
JellePiepenbrock
Directly converted from bfloat16 weights are 20x slower than converted from float32 ones.
#516
opened Sep 11, 2023 by
OlegJakushkin
ggml : better way to express implicit node dependencies in a graph
enhancement
New feature or request
refactoring
Refactoring
#502
opened Sep 4, 2023 by
ggerganov
[Feature request] Add support/demo implementation for Qwen-VL GGUF model
#497
opened Aug 30, 2023 by
CoruNethron
gpt-j, starcoder, gptneox examples cause "not enough space in the context's memory pool" for batches >32
#484
opened Aug 27, 2023 by
ravenscroftj
ggml : extend ggml_mul_mat to support non-F32 input for parameter Refactoring
b
refactoring
#455
opened Aug 16, 2023 by
ggerganov
Support for quantized zero degradation matrix multiplication for Large Language Models
#440
opened Aug 8, 2023 by
ThePerfectComputer
ProTip!
Add no:assignee to see everything that’s not assigned.