Skip to content

Issues: ggerganov/ggml

ggml : unified file format
#220 by philpax was closed Nov 1, 2023
Closed 82
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Error in ggml_get_rows for large tensors with CUDA backend.
#877 opened Jun 30, 2024 by balisujohn updated Jun 30, 2024
ggml : implement a spellcheck model (xfspell, t5-spellchecker, etc) good first issue Good for newcomers help wanted Extra attention is needed model Model specific
#233 opened Jun 6, 2023 by walking-octopus updated Jun 29, 2024
Is ggml support arm architecture inference?
#872 opened Jun 26, 2024 by FanZhang91 updated Jun 26, 2024
Optimizing the ChatTTS Model to Enhance Generation Speed
#871 opened Jun 26, 2024 by land007 updated Jun 26, 2024
Is there interest in ggml_reduce or ggml_add_ext?
#868 opened Jun 22, 2024 by balisujohn updated Jun 25, 2024
Support for Custom Data Types in ggml_arange Function
#869 opened Jun 22, 2024 by WenheLI updated Jun 25, 2024
Text to speech models in GGML?
#59 opened Apr 1, 2023 by simplejackcoder updated Jun 25, 2024
is there interest in ggml_unfold_1d ?
#866 opened Jun 20, 2024 by balisujohn updated Jun 20, 2024
How to get scale / delta from quantized file?
#862 opened Jun 17, 2024 by infiniteloop97 updated Jun 18, 2024
How do pixel unshuffle in ggml ?
#732 opened Feb 14, 2024 by delldu updated Jun 12, 2024
ggml vs onnxruntime on SOC chip
#810 opened Apr 30, 2024 by Francis235 updated Jun 6, 2024
Proposing To Add Naming Convention For GGUF files in documents
#820 opened May 13, 2024 by mofosyne updated Jun 2, 2024
ggml_flip or ggml_pad_reflect?
#819 opened May 12, 2024 by PABannier updated Jun 1, 2024
ggml inference time is significantly slower than onnxruntime
#841 opened May 28, 2024 by Francis235 updated May 31, 2024
User defined operation
#836 opened May 24, 2024 by Francis235 updated May 27, 2024
issues about YaRN
#835 opened May 24, 2024 by foldl updated May 24, 2024
Completion of error handling
#834 opened May 23, 2024 by elfring updated May 23, 2024
ggml how to compute depthwise conv
#833 opened May 21, 2024 by Francis235 updated May 21, 2024
GGML Fragmentation Issue
#830 opened May 19, 2024 by zhouwg updated May 19, 2024
GGML_MAX_NAME is too small.
#825 opened May 16, 2024 by IntptrMax updated May 17, 2024
DirectML support
#406 opened Jul 22, 2023 by brightening-eyes updated May 1, 2024
ggml vs Qualcomm SNPE inference engine on qualcomm soc
#809 opened Apr 30, 2024 by Francis235 updated Apr 30, 2024
Behaviour Mismatch between ggml_opt in Native Program and WASM
#807 opened Apr 29, 2024 by saraalrawi updated Apr 29, 2024
Behavior mismatch between PyTorch GroupNorm and ggml_group_norm
#803 opened Apr 21, 2024 by balisujohn updated Apr 25, 2024
ProTip! Type g i on any issue or pull request to go back to the issue listing page.