-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Default behavior bootstrapping #1121
Comments
Hey @tom-doerr , I think I understand the issue here, but I'd love to hear more on what you mean, potentially with an example if possible. Is the idea that there should be more dynamic feedback during the bootstrapping to ensure the demonstration selection will lead to equal or better performance compared to the uncompiled program, and avoid such cases where performance decreases? We could definitely explore some improvements to the existing BootstrapFewShot optimizer. |
I'm not sure what a good solution would be, but the current behavior isn't optimal, in my opinion. Example: tweet, score Bootstrap will use the first 4 nonsense samples in its prompt, making performance much worse. This also happened to me on real data. |
I see! Ironically, as I was updating documentation for Bootstrap from your other issue #1118, the arg |
I think during bootstrapping there should be a special case for a situations where all scores and scalars are non zero.
To me it seems that bootstrapping fails completely and decreases performance when a metric is used that never or almost never is zero.
The text was updated successfully, but these errors were encountered: