You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for your outstanding work!
Recently, I have been trying to test the performance of GPT-3.5 turbo on gsm8k, but I have received poor results, as shown in the following figure:
This is the command I am using, and I would like to ask, what is the reason for this? lm_eval --model openai-chat-completions --model_args model="gpt-3.5-turbo" --tasks gsm8k
Looking forward to your help!
The text was updated successfully, but these errors were encountered:
Thank you for your outstanding work!
![image](https://private-user-images.githubusercontent.com/91206112/322295434-ea338777-2401-483b-8208-daddfb0fe050.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjE1OTYwNTksIm5iZiI6MTcyMTU5NTc1OSwicGF0aCI6Ii85MTIwNjExMi8zMjIyOTU0MzQtZWEzMzg3NzctMjQwMS00ODNiLTgyMDgtZGFkZGZiMGZlMDUwLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA3MjElMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNzIxVDIxMDIzOVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWI5ODE5ZWRhNmI3MTNmY2ZmNDdmN2FmNGNkOGU2ZTVjNTVkMDNiMjk5NzAwNTBiNDA5YTYwMWQ1MmMzMmEyYTAmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.5xHAE6_h0z00xT0aFpN43HaLA9xl05h6AXTMR9oCoSE)
Recently, I have been trying to test the performance of GPT-3.5 turbo on gsm8k, but I have received poor results, as shown in the following figure:
This is the command I am using, and I would like to ask, what is the reason for this?
lm_eval --model openai-chat-completions --model_args model="gpt-3.5-turbo" --tasks gsm8k
Looking forward to your help!
The text was updated successfully, but these errors were encountered: