
[WIP] Add chat templating for HF models #1287

Closed
wants to merge 31 commits

Conversation

@haileyschoelkopf (Contributor) commented Jan 15, 2024

This is a WIP PR carrying on the draft that @daniel-furman started in #1209, adding the oft-requested chat templating feature.

Current TODOs are:

  • Check performance using e.g. OpenHermes and Llama-2-7b-chat-hf (+ your chat model here)
  • Expose CLI flags for the system prompt and for toggling the chat template on
  • (?) Turn this into a mixin and enable it for vLLM, etc.?
  • Decide what user-controllable functionality needs to be added on top of this (e.g. how to handle few-shot examples in context, how to add "triggers" appended after the assistant's last response, ...)
  • ...
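
For context, the heavy lifting on the HF side is `tokenizer.apply_chat_template`. The sketch below mimics what a ChatML-style template produces, using a hand-rolled formatter (an assumption for illustration, so the example runs without downloading a model; real code would call the tokenizer method):

```python
# Hedged sketch of what chat templating does to a request. In the real PR this
# would be tokenizer.apply_chat_template(); the ChatML format here is an
# assumption chosen only so the example is self-contained.

def render_chatml(messages, add_generation_prompt=True):
    """Mimic a ChatML-style chat template rendering a list of messages."""
    out = []
    for m in messages:
        out.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>")
    if add_generation_prompt:
        # Append the assistant header so the model continues from there.
        out.append("<|im_start|>assistant\n")
    return "\n".join(out)

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Question: What is 2 + 2?\nAnswer:"},
]
prompt = render_chatml(messages)
```

The `add_generation_prompt=True` flag mirrors the transformers argument of the same name, which appends the assistant turn header so generation picks up inside the assistant's turn.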

Feedback appreciated!

@CLAassistant commented Jan 15, 2024

CLA assistant check
All committers have signed the CLA.

@daniel-furman commented Jan 15, 2024

Awesome to see this PR go up! Happy to provide context (pun intended) on some of my commits, my experimentation was starting to get messy at the end there.

Overall, I wasn't seeing the positive movement in eval scores that I had anticipated. I hypothesize that handling few-shot examples in context and adding triggers will help improve scores.

@haileyschoelkopf (Contributor Author)

Likewise, initial tests with OpenHermes on my end show similar results on ARC and LAMBADA (the latter of which does not surprise me too much).

We will also need to address whether keeping `target_delimiter` unchanged is still correct when a chat template is applied.
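
To make the `target_delimiter` concern concrete, here is a minimal sketch (the request-building shown is a simplification for illustration, not the harness's actual code): in the plain path, context and target are joined by a single-space delimiter, but a rendered chat template typically ends with the assistant header and a newline, so keeping the same leading-space delimiter injects a stray space before the target:

```python
# Illustration (assumption: simplified from how loglikelihood requests are
# assembled). Non-chat path: context and target joined by target_delimiter.
target_delimiter = " "
context = "Question: What is 2 + 2?\nAnswer:"
target = "4"
plain_request = context + target_delimiter + target  # "...Answer: 4"

# Chat path: the rendered template already ends the prompt with the assistant
# header and a newline, so the same leading-space delimiter now produces
# "...assistant\n 4" -- the target begins with a spurious space.
chat_context = (
    "<|im_start|>user\n" + context + "<|im_end|>\n"
    "<|im_start|>assistant\n"
)
chat_request = chat_context + target_delimiter + target
```

That stray space can tokenize differently from what the chat model saw in training, which is one plausible reason scores don't improve as expected.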

@haileyschoelkopf (Contributor Author)

Yep, I have a more recent version I’ll push tomorrow morning!

@tmabraham (Contributor)

Does this currently not allow for custom system prompts?

@haileyschoelkopf (Contributor Author)

TODOs:

  • Get numbers on generative tasks with and without chat templating
  • Work out how to handle delimiters / spacing for multiple_choice tasks
  • Confirm this interacts correctly with models' max length
  • .........
  • Ensure this raises an error when a model doesn't set a chat template in its tokenizer config
  • Also port this to the vLLM LM class
  • Decide whether this is the best place in the codebase for these changes
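
The "raise an error when no chat template is set" item could look like the following sketch (assumption: it mirrors transformers, where `tokenizer.chat_template` is `None` when the tokenizer config does not define one; the helper name is hypothetical):

```python
# Hedged sketch of the guard; require_chat_template is a hypothetical helper,
# not code from this PR.

class DummyTokenizer:
    chat_template = None  # stand-in for a tokenizer whose config sets no template

def require_chat_template(tokenizer):
    """Return the tokenizer's chat template, or raise if none is configured."""
    template = getattr(tokenizer, "chat_template", None)
    if template is None:
        raise ValueError(
            "Chat templating was requested, but this tokenizer's config "
            "does not define a chat template."
        )
    return template

try:
    require_chat_template(DummyTokenizer())
    raised = False
except ValueError:
    raised = True
```

Failing fast here is preferable to silently falling back to raw concatenation, since the latter would quietly evaluate a chat model in a format it was not trained on.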

@sanchit-ahuja

Hi @haileyschoelkopf, have the changes been made part of the current main branch? If not, what are the blockers for this? Should I start tackling the TODOs that you have mentioned above for this PR?
Happy to help!

@haileyschoelkopf (Contributor Author)

Courtesy of @KonradSzafer and @clefourrier, we've just merged #1873, which supports chat templating in HF models -- feedback on additional features or blind spots there would be greatly appreciated!

6 participants