EleutherAI / lm-evaluation-harness Public

Notifications You must be signed in to change notification settings
Fork 1.5k
Star 5.7k

Code
Issues 210
Pull requests 73
Actions
Projects 1
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Issues: EleutherAI/lm-evaluation-harness

[Discussion] Add Major Code Benchmarks

#1157 opened Dec 18, 2023 by haileyschoelkopf

Open 4

Labels 10 Milestones 1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clear current search query, filters, and sorts

170 Open 588 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

Implement GPT2 greedy_until feature request

A feature that isn't implemented yet.

#95 by leogao2 was closed Nov 21, 2022

Implement the QuAC evaluation feature request

A feature that isn't implemented yet.

good first issue

Good for newcomers

#18 by StellaAthena was closed Nov 14, 2023

1 of 2 tasks

Implement the Natural Language Inference (NLI) evaluation feature request

A feature that isn't implemented yet.

good first issue

Good for newcomers

#23 by StellaAthena was closed Feb 12, 2021

1 of 2 tasks

Implement the English Grammar Correction evaluation feature request

A feature that isn't implemented yet.

good first issue

Good for newcomers

#28 by StellaAthena was closed Nov 21, 2022

2 tasks

Implement the News Article Generation evaluation feature request

A feature that isn't implemented yet.

#29 by StellaAthena was closed Nov 21, 2022

2 tasks

Implement the Novel Word evaluation feature request

A feature that isn't implemented yet.

good first issue

Good for newcomers

#30 by StellaAthena was closed Nov 21, 2022

2 tasks

Support richer example-packing functionality. feature request

A feature that isn't implemented yet.

#31 by zphang was closed Jan 4, 2021

Support writing out predictions feature request

A feature that isn't implemented yet.

#32 by zphang was closed Jan 4, 2021

Double check all of the zero/few-shot formats documentation

Improvements or additions to documentation.

#34 by leogao2 was closed Jan 4, 2021

Implement Semantic Search evaluation feature request

A feature that isn't implemented yet.

#58 by cauefcr was closed Nov 21, 2022

Implement all GLUE evaluations

#92 by leogao2 was closed Jan 28, 2021

Implement GLUE STSB feature request

A feature that isn't implemented yet.

#93 by leogao2 was closed Feb 8, 2021

Implement the StoryCloze evaluation feature request

A feature that isn't implemented yet.

good first issue

Good for newcomers

#8 by StellaAthena was closed Apr 1, 2022

1 of 2 tasks

Implement GPT3 logprobs for new framework feature request

A feature that isn't implemented yet.

#94 by leogao2 was closed Nov 21, 2022

Implement GPT3 greedy_until feature request

A feature that isn't implemented yet.

#96 by leogao2 was closed Nov 21, 2022

Implement a shuffle option in main.py

#102 by leogao2 was closed Nov 21, 2022

Implement the BioASQ evaluation

#114 by leogao2 was closed Nov 21, 2022

Implement the BioREAD evaluation

#116 by leogao2 was closed Nov 21, 2022

Implement the BMKC evaluation

#117 by leogao2 was closed Nov 21, 2022

Implement the HendrycksTest evaluation

#120 by leogao2 was closed Mar 26, 2021

Make all of the natural language prompts

#131 by leogao2 was closed Nov 21, 2022

Implement the EpicQA Evaluation feature request

A feature that isn't implemented yet.

#137 by nicholaskross was closed Nov 21, 2022

Write unit tests for GPT3 LM

#149 by leogao2 was closed Jun 12, 2021

MATH

#164 by leogao2 was closed Mar 25, 2021

vllm backend faild

#2028 by chunniunai220ml was closed Jun 27, 2024

Previous 1 2 3 4 5 … 23 24 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly