By Ali Borji, Mehrdad Mohammadian
to be announced
to be announced
In total, our dataset contains 1002 question-answer pairs. There are 27 categories that can be used to assess the main and important abilities of the large language models. The figure below shows the number of questions per category.
To access the dataset, see the data folder or download the dataset from the release section. Both json
and csv
formats are provided for all categories, you can use them based on your need. For those categories/questions that do not require an answer, "NONE" is replaced as the answer.
to be announced