Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FIX CLBLAST Compile #222

Closed
wants to merge 1 commit into from
Closed

Conversation

apcameron
Copy link
Contributor

Copy ggml-opencl.cpp from llama.cpp
Update CMakefiles.txt

This addresses the following #216 and #205

Copy ggml-opencl.cpp from llama.cpp
Update CMakefiles.txt
@apcameron
Copy link
Contributor Author

@ggerganov Do I need to make any more updates or is this PR good to go?

@ggerganov
Copy link
Owner

I'll sync latest llama.cpp together with OpenCL, CUDA and Metal implementations later today

@ggerganov ggerganov closed this Jun 18, 2023
CCLDArjun pushed a commit to CCLDArjun/ggml that referenced this pull request Dec 18, 2023
* Improved quantize script

I improved the quantize script by adding error handling and allowing to select many models for quantization at once in the command line. I also converted it to Python for generalization as well as extensibility.

* Fixes and improvements based on Matt's observations

Fixed and improved many things in the script based on the reviews made by @mattsta. The parallelization suggestion is still to be revised, but code for it was still added (commented).

* Small fixes to the previous commit

* Corrected to use the original glob pattern

The original Bash script uses a glob pattern to match files that have endings such as ...bin.0, ...bin.1, etc. That has been translated correctly to Python now.

* Added support for Windows and updated README to use this script

New code to set the name of the quantize script binary depending on the platform has been added (quantize.exe if working on Windows) and the README.md file has been updated to use this script instead of the Bash one.

* Fixed a typo and removed shell=True in the subprocess.run call

Fixed a typo regarding the new filenames of the quantized models and removed the shell=True parameter in the subprocess.run call as it was conflicting with the list of parameters.

* Corrected previous commit

* Small tweak: changed the name of the program in argparse

This was making the automatic help message to be suggesting the program's usage as being literally "$ Quantization Script [arguments]". It should now be something like "$ python3 quantize.py [arguments]".
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants