Tag-to-Caption Augmentation using Large Language Model

This project aims to generate captions for music using existing tags.

TTMR++: Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval
SeungHeon Doh, Minhee Lee, Dasaem Jeong, Juhan Nam ICASSP 2024

LLaMA-7B Finetune

Download pretrain LLaMA weight from LLaMa Access, and move 7B weight in models/7B
(Optional) Finetune LLaMa2 Model with Quantization + LoRA

cd llamak2c
python lora_finetune/llama_finetuning.py --use_peft --peft_method lora --quantization --model_name ../models/7B --output_dir ../models/k2c_lora2

Inference with huggingface dataset

cd llama_k2c
python inference.py --model_name ../models/7B --peft_model ../models/k2c_lora --dataset_name msd --dataset_split all --

License

This project is licensed under the MIT License.

Contact

For any questions or inquiries, please contact [email protected].

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
chatgpt_k2c		chatgpt_k2c
llama_k2c		llama_k2c
models/k2c_lora		models/k2c_lora
.gitignore		.gitignore
readme.md		readme.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tag-to-Caption Augmentation using Large Language Model

LLaMA-7B Finetune

License

Contact

About

Releases

Packages

Languages

seungheondoh/llm-tag-to-caption

Folders and files

Latest commit

History

Repository files navigation

Tag-to-Caption Augmentation using Large Language Model

LLaMA-7B Finetune

License

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages