-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] Add chat templating for HF models #1287
Conversation
Awesome to see this PR go up! Happy to provide context (pun intended) on some of my commits, my experimentation was starting to get messy at the end there. Overall, I wasn't seeing the positive move in eval scores that I had anticipated. I hypothesize that dealing with few-shot examples in context and adding triggers will be helpful to getting scores to be better. |
Likewise, initial tests with OpenHermes on my end are showing similarly, on ARC and Lambada (the latter of which does not surprise me too much). We will also need to address the correctness of |
Yep, I have a more recent version I’ll push tomorrow morning! |
does this currently not allow for custom system prompts? |
Co-authored-by: lewtun <[email protected]>
Co-authored-by: Daniel Furman <[email protected]>
TODOs:
|
Hi @haileyschoelkopf, have the changes been made part of the current main branch? If not, what are the blockers for this? Should I start tackling the TODOs that you have mentioned above for this PR? |
courtesy of @KonradSzafer @clefourrier , we've just merged #1873 , supporting chat templating in HF models -- feedback on additional features or blind spots there would be very greatly appreciated! |
This is a WIP PR , carrying on the draft @daniel-furman in #1209 started of adding the specified oft-requested chat templating feature.
Current TODOs are:
Feedback appreciated!