Skip to content
This repository has been archived by the owner on Nov 9, 2023. It is now read-only.

iacore/rwkv-rs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Warning: experimental

dfdx cannot yet utilize 100% of your GPU. As a result, this is slower than ggml or numba/numpy.

Usage

  1. get model
wget2 https://huggingface.co/BlinkDL/rwkv-4-pile-430m/resolve/main/RWKV-4-Pile-430M-20220808-8066.pth

# get convert.py from https://github.com/iacore/rwkv-np
python convert.py RWKV-4-Pile-430M-20220808-8066.pth
  1. infer
cargo run --example infer --release

In theory, using bigger RWKV is also possible if you got enough memory. Just remember to change the Rust model type and model path in examples/infer.rs.

further optimization

>80% of the time is spent on matrix multiplication. Using faster matrix multiplication code will help a lot.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages