Support for other languages - Translated datasets #1695

onurgu · 2024-04-11T15:23:22Z

Hi,

I am trying understand the best way to provide a separate LLM benchmark specifically evaluating against Turkish datasets.

This can be done by automatically translating the datasets references by tasks in lm_eval/tasks. This is the approach taken by

(OpenLLMTurkishLeaderboard)[https://huggingface.co/spaces/malhajar/OpenLLMTurkishLeaderboard]

making changes to files like lm_eval/tasks/arc/arc_easy.yaml as below:

https://github.com/malhajar17/lm-evaluation-harness_turkish/blob/main/lm_eval/tasks/arc_tr/arc_easy.yaml

In this file, a reference to the new dataset is given. However, doc_to_text and doc_to_target fields still includes English keywords like Question:.

What is the best practice for supporting other languages? Can we add more datasets into arc_easy.yaml and make it use a different processing method for each different dataset?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for other languages - Translated datasets #1695

Support for other languages - Translated datasets #1695

onurgu commented Apr 11, 2024

Support for other languages - Translated datasets #1695

Support for other languages - Translated datasets #1695

Comments

onurgu commented Apr 11, 2024