
Add chat template #1873

Merged · 30 commits into EleutherAI:main on Jun 3, 2024
Conversation

@KonradSzafer (Contributor)

This pull request adds chat template support: few-shot examples can be provided either as a conversation between user and assistant or as a single message from the user, and a system prompt can also be supplied.

@clefourrier @NathanHB
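
For illustration, here is a minimal sketch of the two few-shot layouts this enables. This is not the PR's exact code: the message dicts simply follow the usual Hugging Face role/content convention, and the questions are made up.

```python
# Illustrative sketch, not the PR's implementation: two ways few-shot
# examples can be arranged as chat messages before a tokenizer's chat
# template is applied.

fewshot_pairs = [
    ("What is 2 + 2?", "4"),
    ("What is 3 + 5?", "8"),
]
question = "What is 7 + 6?"
system_prompt = "You are a helpful assistant."

# Option A: each example as its own user/assistant exchange
# (the multiturn style).
multiturn = [{"role": "system", "content": system_prompt}]
for q, a in fewshot_pairs:
    multiturn.append({"role": "user", "content": q})
    multiturn.append({"role": "assistant", "content": a})
multiturn.append({"role": "user", "content": question})

# Option B: all examples folded into a single user message.
joined = "\n\n".join(f"{q}\n{a}" for q, a in fewshot_pairs)
single_turn = [
    {"role": "system", "content": system_prompt},
    {"role": "user", "content": f"{joined}\n\n{question}"},
]

# Either list can then be rendered with, e.g.,
# tokenizer.apply_chat_template(messages, tokenize=False,
#                               add_generation_prompt=True)
```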

@clefourrier (Contributor)

@KonradSzafer, please fix the linting.

@haileyschoelkopf (Contributor) left a comment


Thanks very much @KonradSzafer for bearing with me!

I had a couple of other small comments about edge cases, but in general it looks really great!

Review threads on lm_eval/api/model.py and lm_eval/api/task.py (all resolved)
@haileyschoelkopf (Contributor) left a comment


Final round of comments! Will also add a docs update to docs/model_guide.md imminently.

Review threads on lm_eval/api/model.py and docs/interface.md (all resolved)
@LSinev (Contributor)

LSinev commented May 31, 2024

May I suggest that the chat template actually used be reported in some artifacts from the run (with or without --log-samples), along with the system instruction.

Review threads on lm_eval/evaluator.py (resolved)
@KonradSzafer (Contributor, Author)

> May I suggest that the chat template actually used be reported in some artifacts from the run (with or without --log-samples), along with the system instruction.

Hi @LSinev! Good point, updated the branch with the changes!
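
Roughly, the idea is that the results artifact now carries the template and system instruction alongside the scores. A hedged sketch of the shape; the key names here are my assumptions, not necessarily the exact ones used in the branch:

```python
import json

from transformers import AutoTokenizer

# Example model; any tokenizer that ships a chat template works here.
tokenizer = AutoTokenizer.from_pretrained("HuggingFaceH4/zephyr-7b-beta")

run_metadata = {
    # The Jinja chat template string that was actually applied.
    "chat_template": tokenizer.chat_template,
    # The system prompt passed to the run, if any.
    "system_instruction": "You are a helpful assistant.",
}

# Stored alongside the evaluation results (and the per-sample logs
# when --log-samples is used).
print(json.dumps(run_metadata, indent=2))
```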

@haileyschoelkopf (Contributor) left a comment


LGTM! Thanks for all your work on this :)

Review threads on docs/model_guide.md (resolved)
Co-authored-by: Hailey Schoelkopf <[email protected]>
@haileyschoelkopf merged commit 070d31d into EleutherAI:main on Jun 3, 2024
8 checks passed
@djstrong (Contributor)

djstrong commented Jun 8, 2024

I have tested our model with the template.

[score tables (images not preserved) comparing results without and with the chat template]

Avg g - average of generate_until versions of tasks
Avg mc - average of multiple_choice versions of tasks

I will evaluate more models.

@djstrong (Contributor)

What do you think: should models be tested with the fewshot_as_multiturn argument, or without it?

@KonradSzafer (Contributor, Author)

Hi @djstrong!
fewshot_as_multiturn tends to result in lower scores, but it is more reflective of how the model is used in the real world.
To learn more about the differences in results when using the chat template, I recommend reading this discussion.
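
For reference, a sketch of how the two modes can be compared from Python. It assumes the post-merge simple_evaluate keyword arguments (apply_chat_template, fewshot_as_multiturn); treat the exact names as assumptions if you are on a different version, and the model here is just an example:

```python
import lm_eval

common = dict(
    model="hf",
    model_args="pretrained=HuggingFaceH4/zephyr-7b-beta",  # example model
    tasks=["gsm8k"],
    num_fewshot=5,
    apply_chat_template=True,
)

# All few-shot examples packed into a single user message.
single_turn = lm_eval.simple_evaluate(**common, fewshot_as_multiturn=False)

# Each few-shot example as its own user/assistant exchange.
multi_turn = lm_eval.simple_evaluate(**common, fewshot_as_multiturn=True)

for name, res in [("single-turn", single_turn), ("multiturn", multi_turn)]:
    print(name, res["results"]["gsm8k"])
```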

@djstrong (Contributor)

I tested this on only one model so far, and I can confirm that scores with fewshot_as_multiturn are mostly lower.

I have tested a dozen models with and without chat templates, but I still need to analyze the results. They are available here: https://huggingface.co/spaces/speakleash/open_pl_llm_leaderboard with the ,chat suffix.
