Issues: ggerganov/ggml
Issues list
Using memcpy on tensor data when the tensor is not contiguous
#584
opened Oct 17, 2023 by
audiovention
How about adding "tokenizer.ggml.cls_token_id" for special cls token.
#541
opened Sep 26, 2023 by
FFengIll
Port k-quants support from ggerganov/llama.cpp to ggerganov/ggml
#532
opened Sep 21, 2023 by
saharNooby
Asserting over nb[0] as type check causes issue for tensors after permuting
#530
opened Sep 19, 2023 by
cndn
Why GPT-J performs better on graviton without using simd than x86 using simd
#520
opened Sep 13, 2023 by
xshen053
Converting an arbitrary HF Transformer GPT2 to ggml format
#519
opened Sep 12, 2023 by
JellePiepenbrock
Directly converted from bfloat16 weights are 20x slower than converted from float32 ones.
#516
opened Sep 11, 2023 by
OlegJakushkin
ggml : better way to express implicit node dependencies in a graph
Labels: enhancement, refactoring
#502
opened Sep 4, 2023 by
ggerganov
[Feature request] Add support/demo implementation for Qwen-VL GGUF model
#497
opened Aug 30, 2023 by
CoruNethron
gpt-j, starcoder, gptneox examples cause "not enough space in the context's memory pool" for batches >32
#484
opened Aug 27, 2023 by
ravenscroftj