Allow "weight: 0" in messages to mask them #1703

DavidFarago · 2024-06-11T14:16:05Z

Allow in message objects the additional key weight, which can be set to 0 (or 1) to cause that message to be masked out (or left unmasked) for training (similar to 1).

Description

Extend src/axolotl/prompters.py::_build_result to return the turns with weights as additional tuple element. Do this in axolotl directly instead of modifying fastchat.conversation's Conversation.

Extend src/axolotl/prompt_tokenizers.py::tokenize_prompt to mask out tokens when weight is set to 0.

Motivation and Context

This is helpful for training the model to be robust and capable of error recovery upon a bad assistant message. A missing weight key defaults to weight 1, to guarantee downward compatibility.

How has this been tested?

Extend tests/prompt_strategies/test_sharegpt.py to contain messages with weight keys.

tests/prompt_strategies/test_sharegpt.py

Allow in message objects the additional key `weight`, which can be set to 0 (or 1) to cause that message to be masked out (or left unmasked) for training (similar to [1]). This is helpful for training the model to be robust and capable of error recovery upon a bad assistant message. A missing `weight` key defaults to weight 1, to guarantee downward compatibility. Extend `src/axolotl/prompters.py::_build_result` and `src/axolotl/prompt_strategies/sharegpt.py::SimpleShareGPTPromptTokenizingStrategy::get_conversation_thread` to return the turns with weights as additional tuple element. Do this in axolotl directly instead of modifying `fastchat.conversation`'s `Conversation`. Extend `src/axolotl/prompt_tokenizers.py::tokenize_prompt` to mask out tokens when weight is set to 0. Extend `tests/prompt_strategies/test_sharegpt.py` with four test cases that contain messages with `weight` keys. Switch names `test_w_train_on_input` and `test_no_train_on_input`. [1]: https://github.com/mistralai/mistral-finetune

winglian

thank you!

DavidFarago · 2024-06-13T06:49:35Z

you are welcome, @winglian -- thank you for reviewing.

Will you merge this PR or give me write access to this repository?

winglian · 2024-06-20T14:05:31Z

merged. thanks @DavidFarago !

DavidFarago force-pushed the mr-weight-on-main branch 2 times, most recently from ccebf54 to d9bbf5d Compare June 12, 2024 13:32

winglian requested changes Jun 12, 2024

View reviewed changes

tests/prompt_strategies/test_sharegpt.py Show resolved Hide resolved

DavidFarago force-pushed the mr-weight-on-main branch from d9bbf5d to 14428da Compare June 12, 2024 15:41

winglian approved these changes Jun 12, 2024

View reviewed changes

winglian merged commit 559562d into OpenAccess-AI-Collective:main Jun 20, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow "weight: 0" in messages to mask them #1703

Allow "weight: 0" in messages to mask them #1703

DavidFarago commented Jun 11, 2024 •

edited

winglian left a comment

DavidFarago commented Jun 13, 2024

winglian commented Jun 20, 2024

Allow "weight: 0" in messages to mask them #1703

Allow "weight: 0" in messages to mask them #1703

Conversation

DavidFarago commented Jun 11, 2024 • edited

Description

Motivation and Context

How has this been tested?

winglian left a comment

Choose a reason for hiding this comment

DavidFarago commented Jun 13, 2024

winglian commented Jun 20, 2024

DavidFarago commented Jun 11, 2024 •

edited