-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question aboud evaluating MMLU #1347
Comments
Hi! could you share more how you're downloading the dataset manually beforehand? We describe a few ways to load from a local dataset here: https://github.com/EleutherAI/lm-evaluation-harness/blob/main/docs/new_task_guide.md#using-local-datasets which may be helpful. For instance, the latter example should work with |
Hi, please let us know if you're continuing to experience issues not solved by following the instructions in the linked docs page. |
i have the same problems. I use the "dataset.save_to_disk()" to save gsm8k dataset into "llm/dataset/gsm8k". however when i set gsm8k.yaml as Want any help if possible |
Hi @Jp-17 , could you open a new issue for this documenting what you're running into + steps to replicate? It sounds like the issue may be related to the use of |
thanks for your reply, i have just open a new issue, which contain more detailed info. #1829 |
Hi, thanks for the great work!
I want to evaluate the MMLU benchmark. Because I could not access the huggingface hub, I download the 'hails/mmlu_no_train' on huggingface. But when I run this command 'lm_eval --model hf --model_args pretrained=/root/paddlejob/workspace/env_run/huitingfeng/models/llama-2-7b-chat-hf --tasks mmlu --device cuda:4 --batch_size 8', there has some tracebacks:
My datasets version is 2.16.1. When I downgrade the datasets version to 2.15.0, there is another traceback:
I want to know how could I organize the downloaded MMLU benchmark and its corresponding yaml file.
The text was updated successfully, but these errors were encountered: