Tags · 1b5d/llm-api

0.1.2

GH actions free up disk space (#18)

* gh actions free up disk space

* cleanup free disk space commands

* cleanup free disk space commands

Nov 13, 2023
c247d50
zip
tar.gz
Notes

0.1.1

Update readme and bump dep version (#14)

* update readme and bump dep version

* upgrade GPU dockerfile

Oct 25, 2023
85e18f2
zip
tar.gz
Notes

0.1.0

Release 0.1.0 (#12)

* fix gh actions

* remove arm64 from gpu build

* cleanup

* enable cache for docker build on gha

* refine docker build cache ref

* refine gpu dependencies

* quite conda installations

* free disk space on gha

* fix free space command on gha

* fix free space command on gha

Jul 23, 2023
928ec4a
zip
tar.gz
Notes

0.0.4-gptq-llama-triton

fix GPTQ-for-Llama setup

Jun 16, 2023
293f9d9
zip
tar.gz
Notes

0.0.4

move from cuda to triton in regards to GPTQ for Llama

Jun 8, 2023
bd6f512
zip
tar.gz
Notes

0.0.3-gptq-llama-cuda

add a separate parameter for safetensors models

May 5, 2023
b668b59
zip
tar.gz
Notes

0.0.2-gptq-llama-cuda

rebuild the cuda based image

Apr 25, 2023
250ec5c
zip
tar.gz
Notes

0.0.1-gptq-llama-cuda

fix noop placeholder in gptq vendor code

Apr 23, 2023
5bb2199
zip
tar.gz
Notes

0.0.1

add gh actions for release

Apr 13, 2023
981e670
zip
tar.gz
Notes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0.1.2

0.1.1

0.1.0

0.0.4-gptq-llama-triton

0.0.4

0.0.3-gptq-llama-cuda

0.0.2-gptq-llama-cuda

0.0.1-gptq-llama-cuda

0.0.1

Tags: 1b5d/llm-api