Overview

This is a fork of QLoRA.

Differences from original

airoboros support

Since I am the creator of the various airoboros models, this fork is tailored to the airoboros instruction/response format and uses the airoboros prompt.

The instructions.jsonl file (or whatever filename you are using) should contain one JSON object per line, newline-separated, each with "instruction" and "response" values.

Add: --dataset_format airoboros
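The file layout described above can be sketched as follows. This is a minimal illustration, not code from this repo: the helper names write_jsonl and validate_jsonl are hypothetical, and the only thing taken from the README is the one-object-per-line layout with "instruction" and "response" keys.

```python
import json


def write_jsonl(path, records):
    """Write each record as a single JSON object on its own line."""
    with open(path, "w", encoding="utf-8") as f:
        for rec in records:
            f.write(json.dumps(rec, ensure_ascii=False) + "\n")


def validate_jsonl(path):
    """Count valid records; raise ValueError on the first malformed line."""
    required = {"instruction", "response"}
    count = 0
    with open(path, encoding="utf-8") as f:
        for lineno, line in enumerate(f, start=1):
            if not line.strip():
                continue  # tolerate blank lines (an assumption, not a repo rule)
            rec = json.loads(line)  # JSONDecodeError is a subclass of ValueError
            if not isinstance(rec, dict):
                raise ValueError(f"line {lineno}: expected a JSON object")
            missing = required - rec.keys()
            if missing:
                raise ValueError(f"line {lineno}: missing {sorted(missing)}")
            count += 1
    return count
```

Running validate_jsonl("instructions.jsonl") before training is a cheap way to catch a bad line early rather than partway through a run.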

experimental MPT support

Supports fine-tuning MPT base models via --mpt True, but requires a PEFT-compatible base model, e.g.: https://huggingface.co/jondurbin/mpt-30b-qlora-compatible

epochs instead of steps

I prefer training for a fixed number of epochs rather than trying to stop at a particular step count. I removed the --max_steps parameter in favor of --num_train_epochs (which I usually set to 3).
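If you are used to thinking in steps, the rough relationship between the two can be sketched like this. It is an approximation under stated assumptions: the function name and its parameters are hypothetical, and the trainer's own rounding of the last partial batch may differ slightly.

```python
import math


def approx_total_steps(num_examples, per_device_batch_size,
                       gradient_accumulation_steps, num_train_epochs,
                       num_devices=1):
    """Estimate optimizer steps for a run configured in epochs."""
    # Examples consumed by one optimizer step across all devices.
    examples_per_step = (per_device_batch_size
                         * gradient_accumulation_steps
                         * num_devices)
    # Round up so a final partial batch still counts as a step.
    steps_per_epoch = math.ceil(num_examples / examples_per_step)
    return steps_per_epoch * num_train_epochs


# 10,000 examples, batch size 1, 16 accumulation steps, 3 epochs
print(approx_total_steps(10_000, 1, 16, 3))  # 1875
```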

experimental flash attention support

Try flash_qlora.py instead of qlora.py

lots of stuff removed

MMLU benchmarks, eval, etc.

Requirements for llama based models

To fine-tune a llama base model, you should use one of these:

Then, you MUST replace the special_tokens_map.json file and tokenizer_config.json file with the ones found in this repo.
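One way to do the replacement is sketched below. It assumes you have a local checkout of this repo and a local copy of the base model; the function name, both directory arguments, and the .bak backup behavior are illustrative choices, not part of this repo.

```python
import shutil
from pathlib import Path

# The two files named above.
TOKENIZER_FILES = ("special_tokens_map.json", "tokenizer_config.json")


def replace_tokenizer_files(repo_dir, model_dir, backup=True):
    """Copy the tokenizer files from repo_dir over those in model_dir."""
    repo_dir, model_dir = Path(repo_dir), Path(model_dir)
    copied = []
    for name in TOKENIZER_FILES:
        src, dst = repo_dir / name, model_dir / name
        if not src.is_file():
            raise FileNotFoundError(f"{src} not found in the repo checkout")
        if backup and dst.exists():
            # Keep the original next to it, e.g. tokenizer_config.json.bak
            shutil.copy2(dst, dst.with_name(dst.name + ".bak"))
        shutil.copy2(src, dst)
        copied.append(dst)
    return copied
```

For example, replace_tokenizer_files("path/to/this/repo", "path/to/base-model"), where both paths are placeholders for your own directories.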
