Issues: openai/evals
Issues list
Safety Eval Idea: allergen information of different food products in the Israeli market.
#874
opened Apr 30, 2023 by
ido777
Idea for Evals: Sorting numbers with repeats and negatives
Idea for Eval
These issues keep track of requests for different kinds of eval PRs
#782
opened Apr 23, 2023 by
voynow
Idea for Evals: Count how many numbers are greater than or less than X
Idea for Eval
#785
opened Apr 23, 2023 by
voynow
Please add an option to change language or understand other languages
#795
opened Apr 24, 2023 by
0XxMuzanxX0
Idea for eval - Bible knowledge as tested in the yearly International Bible Youth Contest
#840
opened Apr 27, 2023 by
ido777
Feature suggestion - Save and Load Conversation History
Idea for Eval
#907
opened May 3, 2023 by
wawryszukd
pip install evals throws AssertionError
bug
Something isn't working
#918
opened May 4, 2023 by
CholoTook
Eval idea: Security code review for unicode attacks on code
Idea for Eval
#787
opened Apr 24, 2023 by
qrdlgit
Add BigBench Tasks for evaluation
Idea for Eval
#153
opened Mar 15, 2023 by
Muhtasham
Evaluation on computer vision benchmarks
Idea for Eval
#235
opened Mar 16, 2023 by
finitearth
Evaluate GPT-4 on classical NLP tasks
Idea for Eval
#246
opened Mar 16, 2023 by
LifeIsStrange
Create an evaluation that measures a model's ability to remember specifics about texts in its dataset?
Idea for Eval
#383
opened Mar 21, 2023 by
mrconter1