What if you use only D_{exp} #84
You can check out the AlpacaEval leaderboard for preliminary results.
Thank you for the update!
Thank you very much!
Hi, thanks for your question! Here are the full results of SFT using only D_exp (GPT-4) and only D_sub (GPT-3.5), along with the C-RLFT and SFT results with both sets.
Dear authors,
I wonder what would happen if you used only D_{exp}. It's not clear whether the ability to differentiate high-quality data from low-quality data is more critical than focusing on fine-tuning toward GPT-4. Please see if you can shed some light on this. Thank you!
--yyrkoon