Run a LLaMA model locally on a PC.
Based on ggerganov/llama.cpp, with a Flask server and web UI added on top.
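A minimal sketch of what the Flask layer looks like. The `/completion` endpoint name, request shape, and the `generate` helper are assumptions for illustration, not this project's actual API; the real server would forward the prompt to llama.cpp (e.g. via bindings or a subprocess) instead of the stub below.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

def generate(prompt: str, n_predict: int = 128) -> str:
    # Stub: a real implementation would call into llama.cpp here.
    return f"[model output for: {prompt!r}]"

@app.route("/completion", methods=["POST"])
def completion():
    # Accept a JSON body like {"prompt": "...", "n_predict": 128}
    body = request.get_json(force=True)
    prompt = body.get("prompt", "")
    text = generate(prompt, n_predict=int(body.get("n_predict", 128)))
    return jsonify({"content": text})

if __name__ == "__main__":
    # Bind to all interfaces so the port mapping in Docker works.
    app.run(host="0.0.0.0", port=8000)
```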
Guideline (Chinese)
Download a GGUF model file from either source:
- Source: https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/tree/main
- Baidu Netdisk: https://pan.baidu.com/s/1YvAYrDD6DfoxpwD2kT5n3w?pwd=1234
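After downloading, the file can be sanity-checked: valid GGUF files begin with the 4-byte ASCII magic `GGUF`. A small check (the file name in the example is illustrative, not a path this project requires):

```python
def is_gguf(path: str) -> bool:
    # GGUF files start with the ASCII magic bytes b"GGUF".
    with open(path, "rb") as f:
        return f.read(4) == b"GGUF"

# Example: is_gguf("llama-2-7b-chat.Q4_K_M.gguf")
```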
Build and run with Docker:

```shell
docker build -t songgs/llm-cpu -f deploy/Dockerfile .
docker run -it -p 8000:8000 songgs/llm-cpu
```
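With the container running, the server is reachable on localhost:8000. A standard-library client sketch follows; the `/completion` path and JSON payload shape are assumptions about this server's API, so adjust them to match the actual routes:

```python
import json
import urllib.request

def build_request(prompt: str,
                  url: str = "http://localhost:8000/completion") -> urllib.request.Request:
    # Build a JSON POST request for the server.
    data = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"})

# Usage (requires the server to be up):
# with urllib.request.urlopen(build_request("Tell me a joke")) as resp:
#     print(json.load(resp))
```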