Add Selfcheckgpt evaluation to tasks #1080
base: main
I am trying to add a new task for LLM hallucination evaluation. This is very early code. There is something I'd like to discuss with everyone:
I have implemented one possible solution, but I am not sure it is the best one. Thank you very much.
I've left a few review comments. I believe the functionality you want (N generations created per document) is already supported without any new YAML config options. Please let me know if I am misunderstanding your intended functionality.
Hi @haileyschoelkopf @lintangsutawika, I have refactored the code so that it has its own task.py to handle its task-specific multiple-generation kwargs. To make it recognized by llm-eval, I imported selfcheckgpt in task/__init__.py, similar to how squadv2 is handled. Thank you very much.
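For readers following the thread, here is a minimal sketch of the "N generations per document" idea under discussion. It is not the code in this PR and does not use any lm-eval API. The `generate` callable, the function names, and the token-overlap measure are all assumptions chosen for illustration, and the overlap score is a deliberately simple stand-in for the consistency measures a real SelfCheckGPT task would use.

```python
from typing import Callable, List


def sample_generations(generate: Callable[[str], str], prompt: str, n: int) -> List[str]:
    """Call a (hypothetical) generate function n times for one document."""
    return [generate(prompt) for _ in range(n)]


def token_overlap(a: str, b: str) -> float:
    """Jaccard overlap between the lowercased token sets of two strings."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    if not tokens_a and not tokens_b:
        return 1.0  # two empty strings are treated as identical
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def inconsistency_score(main_answer: str, samples: List[str]) -> float:
    """Return 1 minus the mean overlap between the main answer and the samples.

    A higher score means the sampled generations disagree more with the
    main answer. This is a toy measure, used here only as an illustration.
    """
    if not samples:
        raise ValueError("need at least one sampled generation")
    mean_overlap = sum(token_overlap(main_answer, s) for s in samples) / len(samples)
    return 1.0 - mean_overlap
```

In use, one would draw the samples with a stochastic decoding setup, for example `samples = sample_generations(my_model_call, prompt, n=5)` where `my_model_call` is whatever function wraps the model, and then pass them to `inconsistency_score` together with the main answer.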
@erenup Thank you for your contribution to our library. We will not be able to merge this PR until you do the following:
hi @StellaAthena Thank you.
It has been addressed and a new review is needed.
@lintangsutawika @haileyschoelkopf this is now ready for your review again.