-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Inverse Scaling Tasks? #1442
Comments
Hasn't been asked before! Supporting these tasks as originally implemented would be very nice! We ourselves probably won't have the bandwidth for it soon, but if anyone wishes to contribute them we'd be happy to assist and review. |
That implementation looks interesting, do you mind if I try it? |
Yes, that'd be fantastic if you're interested! |
Thank you for assigning. I'll get to work soon! |
To address any possible issues, I'm currently asking the inverse scaling slack if it's okay to implement these tasks. I will start implementing them as soon as they are approved. |
@RylanSchaeffer @haileyschoelkopf The initial implementation is done, all that's left is to test that it produces results like the paper. I'll make a pull request once I've verified the results. |
Awesome :)
Cheers,
Rylan Schaeffer
…On Fri, Mar 8, 2024 at 6:25 PM Hanwool Albert Lee ***@***.***> wrote:
@RylanSchaeffer <https://github.com/RylanSchaeffer> @haileyschoelkopf
<https://github.com/haileyschoelkopf> The initial implementation is done,
all that's left is to test that it produces results like the paper. I'll
make a pull request once I've verified the results.
—
Reply to this email directly, view it on GitHub
<#1442 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ACEHLCZBRHSEGU3IKRGC363YXJXLLAVCNFSM6AAAAABDOMNE6WVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTSOBWGY4TONZRGA>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
@haileyschoelkopf |
Or maybe I'll use the code utilized in the evaluation(of inverse scaling prize) as a custom metric. I didn't get anything from the inverse-scaling team, but I did find some related work in the authors' github. |
@h-albert-lee That’s great progress! Would you be able to open a PR with your implementation and resulting scores so we can discuss there? It’s hard to say without being able to look at the implementation differences/concrete numbers. |
@haileyschoelkopf Thanks a lot!, I'll apply the pre-commit and post a pull request with my experimental results soon. |
Apologies if this has been asked before, but I couldn't find the answer in
lm_evals/tasks
or any issues. Are there plans to add Inverse Scaling (https://github.com/inverse-scaling/prize) into thelm-evaluation-harness
?The text was updated successfully, but these errors were encountered: