Truncation #1426

mdocekal · 2024-02-13T15:44:25Z

Is it possible to change truncation strategy?
For example let's say that I want to remove whole few-shot sample or truncate each few-shot sample from left/right by a fair amount of tokens.

MFajcik · 2024-02-14T09:31:30Z

To add more context, with @mdocekal we found that harness truncates task description, when having n-shot prompt. At least for GPT-2-XL and accelerate model. We would like to truncate the content of "shots" (so if it is 10-shot, and it won't fit, we want to change the particular example to e.g., 9-shot, or truncate from the last example, and not to truncate the preceding task description).

baberabb · 2024-02-14T18:23:26Z

Might be a bit tricky. The task description is prepended to the fully constructed fewshot string here:

lm-evaluation-harness/lm_eval/api/task.py

Line 853 in 620d6a1

labeled_examples = self.config.description + self.sampler.get_context(

If you only care about a specific model, one way could be to use a custom sampler and override the get_context method to condition the few-shots how you want by adding a tokenizer.

MFajcik · 2024-02-26T14:21:46Z

we found our "hacky" way to do what we wanted here. The question remains whether we should try, implement and pull request such a thing into lm-harness. We are developing a benchmark, and were hoping people could use harness for its evaluation.

Do you think the truncation strategy could be specified with user function in yaml?

mdocekal mentioned this issue Feb 13, 2024

Harness - Truncation DCGM/lm-evaluation-harness#2

Closed

1 task

haileyschoelkopf added the asking questions For asking for clarification / support on library usage. label Feb 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Truncation #1426

Truncation #1426

mdocekal commented Feb 13, 2024

MFajcik commented Feb 14, 2024 •

edited

Loading

baberabb commented Feb 14, 2024 •

edited

Loading

MFajcik commented Feb 26, 2024

Truncation #1426

Truncation #1426

Comments

mdocekal commented Feb 13, 2024

MFajcik commented Feb 14, 2024 • edited Loading

baberabb commented Feb 14, 2024 • edited Loading

MFajcik commented Feb 26, 2024

MFajcik commented Feb 14, 2024 •

edited

Loading

baberabb commented Feb 14, 2024 •

edited

Loading