Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

id_column contains duplicate values #1667

Open
nicho2 opened this issue Apr 3, 2024 · 0 comments
Open

id_column contains duplicate values #1667

nicho2 opened this issue Apr 3, 2024 · 0 comments

Comments

@nicho2
Copy link
Contributor

nicho2 commented Apr 3, 2024

I tried to send data to zeno visualisator but i have this error:

py ./scripts/zeno_visualize.py --data_path ./output/ --project_name "LG Model Test"
Successfully updated project.
Access your project at https://hub.zenoml.com/project/460c-8698-59a31942ea41/LG%20Model%20Test_gsm8k
Traceback (most recent call last):
File "C:\Projects\transpose\lm-evaluation-harness\scripts\zeno_visualize.py", line 219, in
main()
File "C:\Projects\transpose\lm-evaluation-harness\scripts\zeno_visualize.py", line 103, in main
project.upload_dataset(
File "C:\Python311\Lib\site-packages\zeno_client\client.py", line 117, in upload_dataset
pa_table = df_to_pa(df, id_column)
^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Python311\Lib\site-packages\zeno_client\util.py", line 22, in df_to_pa
raise ValueError("ERROR: id_column contains duplicate values")
ValueError: ERROR: id_column contains duplicate values

it's because result are duplicate in the log file.

command is:
lm_eval --model local-chat-completions --tasks gsm8k,babi --model_args model=sn/mistral-7b-instruct-v0.2-LG,base_url=http:https://10.2.42.198:1234/v1 --log_samples --output_path output/mistral-7b-instruct-v0.2-LG --limit 4

it's concerning only gsm8k task

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant