
Finetuning with LoRA: output never ends #29

Open
gyupro opened this issue May 22, 2024 · 4 comments

@gyupro

gyupro commented May 22, 2024

Hi, thanks for your wonderful work.

I am struggling with my LoRA-tuned model.

I followed these steps:

  1. Finetuning with LoRA
  • base model: Undi95/Meta-Llama-3-8B-Instruct-hf
  • llama3 conversation template
  2. Inference with Gradio
  • ran the server with model-base Undi95/Meta-Llama-3-8B-Instruct-hf and model-path checkpoints/LLaVA-Meta-Llama-3-8B-Instruct-lora
  3. The model output never ends. (I think something's wrong with the EOS token?); a quick tokenizer check is sketched below.

[screenshot of the model output]
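
One quick way to narrow this down is to check what the tokenizer actually reports as its end-of-sequence token. Below is a minimal sketch, assuming a standard Hugging Face transformers install and the base checkpoint named in the steps above; the "missing <|eot_id|>" explanation is a common cause of runaway Llama-3 generation, not something confirmed for this specific setup.

```python
from transformers import AutoTokenizer

# Base model used for the LoRA finetuning (from the steps above).
tok = AutoTokenizer.from_pretrained("Undi95/Meta-Llama-3-8B-Instruct-hf")

# Llama-3 instruct models end each chat turn with <|eot_id|>. If eos_token
# is still <|end_of_text|>, or <|eot_id|> is never treated as a stop token,
# generation can run on until max_new_tokens is exhausted.
print("eos_token:", tok.eos_token, tok.eos_token_id)
print("<|eot_id|> id:", tok.convert_tokens_to_ids("<|eot_id|>"))
```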

@displaywz

Same question.

@mmaaz60
Member

mmaaz60 commented May 31, 2024

Hi both,

Thanks for your interest in our work. I noticed you are using an unofficial LLaMA-3 base model whose earlier versions reportedly had tokenizer issues.

I would recommend using the official meta-llama/Meta-Llama-3-8B as the base, since the tokenizer issue that was affecting generation has been fixed there. Let me know if this solves the issue.

Thanks and good luck
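
For anyone following along, here is a minimal sketch of reloading the LoRA checkpoint on top of the recommended official base, assuming the LLaVA-style loader (llava.model.builder.load_pretrained_model) that this codebase builds on; the paths and model_name below are placeholders taken from this thread.

```python
from llava.model.builder import load_pretrained_model

# Assumption: as in upstream LLaVA, the loader merges the LoRA weights onto
# model_base when model_name contains "lora".
tokenizer, model, image_processor, context_len = load_pretrained_model(
    model_path="checkpoints/LLaVA-Meta-Llama-3-8B-Instruct-lora",
    model_base="meta-llama/Meta-Llama-3-8B",  # official base suggested above
    model_name="llava-meta-llama-3-8b-instruct-lora",
)
```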

@displaywz

Nice work! I am using the latest llava-llama3 model downloaded from Hugging Face and attempting to use it directly for LoRA. When I use the model without LoRA, it repeatedly outputs the final text of my task until it hits the maximum length, and I suspect this is related to the EOS token. In addition, when I try to use LoRA, the output becomes strange and even contains strings that are not words. Could this be because I reused the original LLaVA finetune_task_lora script as-is, only swapping in the llava-llama3 checkpoint, the llama3 conversation template, and the HF base model? Thank you again for your work. Very helpful to me :)
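
If the EOS theory holds, one workaround is to pass both terminators to generate() explicitly. This is a sketch following Meta's published Llama-3 generation example rather than anything specific to this repo, continuing from the loading sketch in the comment above with the prompt tensors assumed to be already prepared:

```python
# Stop on either the tokenizer's eos token or Llama-3's end-of-turn token.
terminators = [
    tokenizer.eos_token_id,
    tokenizer.convert_tokens_to_ids("<|eot_id|>"),
]

output_ids = model.generate(
    input_ids,            # tokenized multimodal prompt (assumed prepared)
    images=image_tensor,  # assumption: a LLaVA-style generate that accepts images
    max_new_tokens=512,
    eos_token_id=terminators,
)
```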

@lzy-ps

lzy-ps commented Jun 30, 2024


Same problem. The output from the model is a bunch of exclamation marks.
