Warning: experimental
dfdx cannot yet make full use of your GPU, so this runs slower than ggml- or numba/numpy-based implementations.
- Get the model:

  ```sh
  wget2 https://huggingface.co/BlinkDL/rwkv-4-pile-430m/resolve/main/RWKV-4-Pile-430M-20220808-8066.pth
  # get convert.py from https://github.com/iacore/rwkv-np
  python convert.py RWKV-4-Pile-430M-20220808-8066.pth
  ```
- Run inference:

  ```sh
  cargo run --example infer --release
  ```
In theory, a bigger RWKV model also works if you have enough memory. Just remember to change the Rust model type and the model path in examples/infer.rs.
More than 80% of inference time is spent on matrix multiplication, so faster matrix-multiplication code would help a lot.
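To illustrate the hotspot, here is a minimal sketch of the naive O(m·k·n) matrix multiply that a tuned BLAS-style kernel would replace. This is not the code in this repository (inference goes through dfdx's own tensor ops); it only shows the shape of the operation that dominates runtime.

```rust
// Naive matrix multiply: c = a * b, where a is (m, k) and b is (k, n),
// both stored row-major as flat slices. A sketch of the workload only;
// real implementations block for cache and use SIMD/GPU kernels.
fn matmul(a: &[f32], b: &[f32], m: usize, k: usize, n: usize) -> Vec<f32> {
    let mut c = vec![0.0f32; m * n];
    for i in 0..m {
        for p in 0..k {
            let aip = a[i * k + p];
            for j in 0..n {
                c[i * n + j] += aip * b[p * n + j];
            }
        }
    }
    c
}

fn main() {
    // 2x2 example: [[1, 2], [3, 4]] * [[5, 6], [7, 8]]
    let a = [1.0, 2.0, 3.0, 4.0];
    let b = [5.0, 6.0, 7.0, 8.0];
    let c = matmul(&a, &b, 2, 2, 2);
    assert_eq!(c, vec![19.0, 22.0, 43.0, 50.0]);
    println!("{:?}", c);
}
```

Swapping this loop order (`i`, `p`, `j` rather than `i`, `j`, `p`) keeps the inner loop streaming over contiguous rows of `b` and `c`, which is already noticeably faster than the textbook ordering.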