what is the learning rate then finetuning all LLM parameters during SFT stage? #5

xmu-xiaoma666 · 2024-04-29T11:44:01Z

What is the learning rate then finetuning all LLM parameters during SFT stage?

mmaaz60 · 2024-04-29T13:53:23Z

Thank you for your interest in our work. We use a learning rate of 2e-5 during full fine-tuning of both LLaMA-3 and Phi-3 based models. I hope it will help. Good Luck and let me know if you have any questions.

mmaaz60 · 2024-04-29T18:09:08Z

Hi @xmu-xiaoma666,

We just added the full finetuning script that will reproduce our reported results. Good Luck and let us know if you have any questions.

xmu-xiaoma666 closed this as completed Apr 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

what is the learning rate then finetuning all LLM parameters during SFT stage? #5

what is the learning rate then finetuning all LLM parameters during SFT stage? #5

xmu-xiaoma666 commented Apr 29, 2024

mmaaz60 commented Apr 29, 2024

mmaaz60 commented Apr 29, 2024

what is the learning rate then finetuning all LLM parameters during SFT stage? #5

what is the learning rate then finetuning all LLM parameters during SFT stage? #5

Comments

xmu-xiaoma666 commented Apr 29, 2024

mmaaz60 commented Apr 29, 2024

mmaaz60 commented Apr 29, 2024