
Making this library more like Hugging Face #25

Closed
oobabooga opened this issue Mar 5, 2023 · 3 comments

@oobabooga
Contributor

I have expressed my interest in having RWKV officially implemented in Hugging Face in huggingface/transformers#17230.

Meanwhile, I have a distilled set of suggestions for how this library could be made more familiar to people who are already used to transformers and AutoModelForCausalLM.

Maybe some of these are already possible in the current version of rwkv. If so, I would be grateful if you could let me know how.

1. Being able to load the tokenizer explicitly

with something like

tokenizer = RWKVTokenizer.from_pretrained("/path/to/20B_tokenizer.json")

and then use it with

prompt = "Hello, my name is "
input_ids = tokenizer.encode(prompt)

Having the ability to count the number of tokens in a given prompt is very useful.
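Counting tokens reduces, in practice, to the length of the encoded id list. A minimal sketch, assuming a tokenizer with the Hugging Face `tokenizers` interface (`encode(text).ids`), which can load this repo's `20B_tokenizer.json`:

```python
# Hedged sketch: `tokenizer` is assumed to behave like the Hugging Face
# `tokenizers` Tokenizer class (pip install tokenizers).

def count_tokens(tokenizer, prompt):
    """Return the number of tokens `prompt` encodes to."""
    return len(tokenizer.encode(prompt).ids)

# Usage (assuming the tokenizer file is present locally):
# from tokenizers import Tokenizer
# tokenizer = Tokenizer.from_file("20B_tokenizer.json")
# print(count_tokens(tokenizer, "Hello, my name is "))
```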

2. Generating text with input_ids as input rather than a string

Something like

output_ids = model.generate(input_ids, temperature=0.8, top_p=0.95)
output_text = tokenizer.decode(output_ids)
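Until an ids-based API exists, a thin adapter can bridge the gap. A sketch under stated assumptions: `generate_text` stands in for this library's string-in/string-out `model.generate`, and the tokenizer exposes `encode(text).ids` and `decode(ids)` as the `tokenizers` library does:

```python
# Hypothetical adapter: drive a string-based generate() with token ids.

def generate_from_ids(generate_text, tokenizer, input_ids, **sampling_kwargs):
    """Decode ids to text, generate, then re-encode the output to ids."""
    prompt = tokenizer.decode(input_ids)
    output_text = generate_text(prompt, **sampling_kwargs)
    return tokenizer.encode(output_text).ids
```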

3. Generation parameters

Many parameters are available for model.generate() in HF, but it seems to me that the absolutely essential ones that everyone uses are:

  1. temperature ✅
  2. top_p ✅
  3. top_k
  4. repetition_penalty

I am aware that alpha_frequency and alpha_presence are implemented, but these parameters are not usually found in presets that people have already come up with while working with other models. For this reason, having repetition_penalty would be valuable.
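For reference, the `repetition_penalty` that transformers presets assume is the CTRL-style one: logits of already-generated tokens are divided by the penalty when positive and multiplied when negative. A minimal sketch of that behaviour (not this library's API), operating on a plain list of logits:

```python
def apply_repetition_penalty(logits, generated_ids, penalty=1.1):
    """CTRL-style repetition penalty, as used by transformers' generate():
    damp the logits of tokens that already appear in the output."""
    out = list(logits)
    for tok in set(generated_ids):
        if out[tok] > 0:
            out[tok] /= penalty  # make a positive logit less likely
        else:
            out[tok] *= penalty  # push a negative logit further down
    return out
```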

@oobabooga
Contributor Author

For reference, I am currently using this wrapper to run the model:

https://github.com/oobabooga/text-generation-webui/blob/main/modules/RWKV.py

@BlinkDL
Owner

BlinkDL commented Mar 6, 2023

from tokenizers import Tokenizer # pip install tokenizers
tokenizer = Tokenizer.from_file("20B_tokenizer.json")
print(tokenizer.encode("Hello World").ids)
print(tokenizer.decode([12092, 3645]))
output_ids = tokenizer.encode(model.generate(tokenizer.decode(input_ids), temperature=0.8, top_p=0.95)).ids
output_text = tokenizer.decode(output_ids)
  1. soon :)

@oobabooga
Contributor Author

Thanks, based on your suggestions I have added a RWKVTokenizer wrapper: oobabooga/text-generation-webui@e91f4bc

@BlinkDL BlinkDL closed this as completed Apr 2, 2023