Issues: openai/evals

Issues list

Mandarin homophones eval 🇨🇳
#689 opened Apr 15, 2023 by ofou
fix error
#765 opened Apr 23, 2023 by Huymoixode
Idea for Evals: Sorting numbers with repeats and negatives [label: Idea for Eval (these issues keep track of requests for different kinds of eval PRs)]
#782 opened Apr 23, 2023 by voynow
Idea for Evals: Count how many numbers are greater than or less than X [label: Idea for Eval]
#785 opened Apr 23, 2023 by voynow
docs out of date [label: bug (something isn't working)]
#929 opened May 7, 2023 by rustam-e
Inaccuracy of AI-generated responses [label: bug]
#906 opened May 3, 2023 by vmn2014
Feature suggestion - Save and Load Conversation History [label: Idea for Eval]
#907 opened May 3, 2023 by wawryszukd
pip install evals throws AssertionError [label: bug]
#918 opened May 4, 2023 by CholoTook
Eval idea: Security code review for unicode attacks on code [label: Idea for Eval]
#787 opened Apr 24, 2023 by qrdlgit
Make GPT4 aware of the evals format
#143 opened Mar 15, 2023 by bhack
Add BigBench Tasks for evaluation [label: Idea for Eval]
#153 opened Mar 15, 2023 by Muhtasham
Evaluation on computer vision benchmarks [label: Idea for Eval]
#235 opened Mar 16, 2023 by finitearth
Evaluate GPT-4 on classical NLP tasks [label: Idea for Eval]
#246 opened Mar 16, 2023 by LifeIsStrange
Windows path and unicode decoding
#379 opened Mar 21, 2023 by ulasdilek
Create an evaluation that measures a model's ability to remember specifics about texts in its dataset? [label: Idea for Eval]
#383 opened Mar 21, 2023 by mrconter1