Tags: 1b5d/llm-api

0.1.2

GH actions free up disk space (#18)

* gh actions free up disk space

* cleanup free disk space commands

* cleanup free disk space commands

0.1.1

Update readme and bump dep version (#14)

* update readme and bump dep version

* upgrade GPU dockerfile

0.1.0

Release 0.1.0 (#12)

* fix gh actions

* remove arm64 from gpu build

* cleanup

* enable cache for docker build on gha

* refine docker build cache ref

* refine gpu dependencies

* quiet conda installations

* free disk space on gha

* fix free space command on gha

* fix free space command on gha

0.0.4-gptq-llama-triton

fix GPTQ-for-Llama setup

0.0.4

move from cuda to triton for GPTQ-for-Llama

0.0.3-gptq-llama-cuda

add a separate parameter for safetensors models

0.0.2-gptq-llama-cuda

rebuild the cuda based image

0.0.1-gptq-llama-cuda

fix noop placeholder in gptq vendor code

0.0.1

add gh actions for release