
😊 Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel 😊

Update Logs

!!! There is an error when you upgrade torch to 2.0. I'll fix it as soon as possible !!!

Features

Dependencies

pip install -r requirements.txt

Train

# Use PEFT with LoRA
accelerate launch --config_file peft_config.yaml finetune.py
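
For reference, here is a minimal sketch of how a LoRA adapter is typically attached to a LLaMA checkpoint with PEFT before training. This is not the repo's finetune.py; the rank, alpha, dropout, and target modules below are illustrative assumptions, not the repo's exact settings.

# Minimal PEFT LoRA sketch (illustrative, not the repo's actual code)
from transformers import AutoModelForCausalLM
from peft import LoraConfig, TaskType, get_peft_model

model = AutoModelForCausalLM.from_pretrained("decapoda-research/llama-7b-hf")

lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                                   # assumed low-rank dimension
    lora_alpha=16,                         # assumed scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],   # LLaMA attention projections
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()         # only the LoRA adapters are trainable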

Alternatively, you can use the Hugging Face arguments to control everything during training; all of the Hugging Face TrainingArguments are supported.

# You can use train.sh.
# Still updating...
python train.py \
    --model_name_or_path decapoda-research/llama-7b-hf \
    --dataset_name alpaca_data.hf \
    --is_dataset_from_disk True \
    --per_device_train_batch_size 8 \
    --per_device_eval_batch_size 8 \
    --do_train \
    --do_eval \
    --output_dir test-clm
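
The --is_dataset_from_disk flag suggests that alpaca_data.hf is a local datasets directory rather than a Hub dataset name. A minimal sketch of loading it, assuming it was written with datasets.save_to_disk:

# Sketch: loading a locally saved Hugging Face dataset
# (assumed to be how alpaca_data.hf was produced; see train.py for details)
from datasets import load_from_disk

dataset = load_from_disk("alpaca_data.hf")
print(dataset)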

The code is still being updated, so there may be some unexpected errors. The base code is from https://github.com/tloen/alpaca-lora. Thanks a lot!!
