Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SuperGLUE part 1 #4

Merged
merged 3 commits into from
Sep 16, 2020
Merged

Conversation

zphang
Copy link
Contributor

@zphang zphang commented Sep 14, 2020

  • Adding 6/8 SuperGLUE tasks (I'm confirming the task formulation for MultiRC and ReCoRD with the OpenAI folks)
  • Documentation/API tweaks (e.g. supporting truncation)
  • write_out.py script for writing out the LM inputs for inspection

@StellaAthena StellaAthena merged commit 635a215 into EleutherAI:master Sep 16, 2020
@StellaAthena StellaAthena linked an issue Sep 16, 2020 that may be closed by this pull request
2 tasks
@StellaAthena StellaAthena linked an issue Oct 23, 2020 that may be closed by this pull request
2 tasks
leogao2 pushed a commit that referenced this pull request Feb 11, 2021
StellaAthena pushed a commit to dirkgr/lm-evaluation-harness that referenced this pull request Apr 27, 2022
lintangsutawika pushed a commit that referenced this pull request Jun 22, 2023
Fully merge hf-causal and seq2seq impls.
qmdnls pushed a commit to qmdnls/lm-evaluation-harness that referenced this pull request Aug 17, 2023
qmdnls pushed a commit to qmdnls/lm-evaluation-harness that referenced this pull request Aug 17, 2023
qmdnls pushed a commit to qmdnls/lm-evaluation-harness that referenced this pull request Aug 17, 2023
LZY-the-boys pushed a commit to LZY-the-boys/lm-evaluation-harness-fast that referenced this pull request Sep 12, 2023
lintangsutawika pushed a commit that referenced this pull request Jul 8, 2024
bash script for gpt models
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Implement the SuperGLUE evaluation Implement the WSC273 Winograd Schemas Challenge evaluation
2 participants