GSM8K Error result produced by gpt-3.5-turbo #1700

APiaoG · 2024-04-14T18:36:55Z

Thank you for your outstanding work!
I have recently been evaluating gpt-3.5-turbo on GSM8K, but the results are poor, as shown in the following figure:

This is the command I am using. What could be causing these results?
lm_eval --model openai-chat-completions --model_args model="gpt-3.5-turbo" --tasks gsm8k

Looking forward to your help!
