
# LLaMa.cpp Gemma Web-UI

This project uses llama.cpp to load a model from a local file, delivering fast and memory-efficient inference.
It is currently designed for Google Gemma, with support for more models planned.

## Deployment

### Prerequisites

### Installation

  1. Download the Gemma model from Google's repository.
  2. Edit the model path in `config.yaml` so that it points to the downloaded model file.
  3. Start the web UI in a `screen` session:

         screen -S "webui" bash ./start-ui.sh
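
For step 2, the edited `config.yaml` might look like the sketch below. The key name (`model-path`) and the example file path are assumptions for illustration; check the actual keys shipped in the repository's `config.yaml`.

```yaml
# config.yaml — hypothetical sketch; verify key names against the repo's own file.
# The model path should point to the downloaded Gemma weights
# (llama.cpp loads models in GGUF format).
model-path: /models/gemma-2b-it.gguf
```

Running the start script under `screen` keeps the server alive after you disconnect; you can detach with `Ctrl-A d` and reattach later with `screen -r webui`.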