Enhancement of chat template handling #4
This PR introduces several enhancements. Specifically, it focuses on the `wrap_chat_template` and `loglikelihood` methods within the `huggingface.py` module.

Context of the Changes

These changes are based on the unmerged PR (EleutherAI#1287) from the original repository. That PR contains valuable improvements that have not yet been merged into the main branch. By integrating them into our fork, we aim to improve the model's handling of chat templates and the accuracy of performance evaluation.
Changes Made

Instance Class Enhancements
- `args` property with setter: a new `args` property with a setter has been added to the `Instance` class to update an instance's arguments. The setter checks that the new arguments match the old ones in length and type, preserving the integrity of instance modifications.

Huggingface Model Improvements
- Added `use_chat_template` and `system_prompt` as optional parameters to the `HFLM` class to facilitate the handling of chat templates.
- Improved the `wrap_chat_template` method to include system prompts and better prepare the context for text generation.
- Sanitized `generation_kwargs` by removing the `temperature` parameter when `do_sample` is set to `False`, giving more consistently deterministic generation behavior.
- The `loglikelihood` method now uses the chat-template functionality when enabled, with debug prints added to monitor the prompt formatting process.

Additional Notes
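To illustrate the `Instance` change described above, the validating `args` setter could look roughly like the following. This is a minimal sketch, not the actual code from the PR; the class body and error messages are assumptions.

```python
class Instance:
    """Minimal stand-in for the evaluation Instance class (illustrative only)."""

    def __init__(self, args: tuple):
        self._args = args

    @property
    def args(self) -> tuple:
        return self._args

    @args.setter
    def args(self, new_args: tuple) -> None:
        # Require the new arguments to match the old ones in length...
        if len(new_args) != len(self._args):
            raise ValueError("new args must have the same length as the old args")
        # ...and element-wise type, so a replacement cannot silently
        # change the shape or meaning of the stored arguments.
        for old, new in zip(self._args, new_args):
            if type(new) is not type(old):
                raise TypeError(
                    f"expected {type(old).__name__}, got {type(new).__name__}"
                )
        self._args = new_args
```

With this in place, `inst.args = ("new prompt", 7)` succeeds for an instance created with `("prompt", 5)`, while a tuple of a different length or with mismatched types raises an error.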
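The improved `wrap_chat_template` behavior, prepending an optional system prompt before applying the tokenizer's chat template, might be sketched as below. The function signature is an assumption based on the PR description; only the `apply_chat_template` call reflects the standard Hugging Face tokenizer API.

```python
from typing import Optional


def wrap_chat_template(
    tokenizer, context: str, system_prompt: Optional[str] = None
) -> str:
    """Format a plain-text context as a chat prompt (illustrative sketch)."""
    messages = []
    if system_prompt:
        # Include the system prompt as the first message when provided.
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": context})
    # tokenize=False returns the formatted prompt as a string;
    # add_generation_prompt=True appends the assistant-turn marker so the
    # model continues as the assistant.
    return tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )
```

The `loglikelihood` path would then route its contexts through this helper whenever chat-template handling is enabled.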
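The `generation_kwargs` sanitization could work along these lines: when greedy decoding is requested (`do_sample=False`), `temperature` is dropped so it cannot conflict with deterministic generation. The helper name is hypothetical; the PR may apply this inline.

```python
def sanitize_generation_kwargs(generation_kwargs: dict) -> dict:
    """Drop `temperature` when sampling is disabled (illustrative sketch)."""
    kwargs = dict(generation_kwargs)  # copy; leave the caller's dict intact
    if not kwargs.get("do_sample", False):
        # temperature has no effect under greedy decoding and can trigger
        # warnings or inconsistent behavior, so remove it entirely.
        kwargs.pop("temperature", None)
    return kwargs
```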
Impact
The proposed enhancements are expected to improve the model's flexibility and reliability when dealing with chat templates. They also aim to make evaluation of the model's capabilities more accurate and efficient.
Please review the proposed changes and let me know of any further requirements or amendments. I am ready to make any necessary adjustments to prepare this PR for merging.
Thank you for considering this contribution to the project.