Skip to content

Commit

Permalink
Amend eval location in readme
Browse files Browse the repository at this point in the history
  • Loading branch information
haileyschoelkopf committed Apr 26, 2023
1 parent 2e5330c commit ce49833
Showing 1 changed file with 1 addition and 5 deletions.
6 changes: 1 addition & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -130,11 +130,7 @@ We also provide benchmark 0-shot and 5-shot results on a variety of NLP datasets
- BLiMP (`blimp_*`)
- MMLU (`hendrycksTest*`)

Evaluations were performed in GPT-NeoX using the [LM Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness), and are viewable by model and step at `results/json/*` in this repository.

### Plotting Results

We will also provide utilities for creating plots based on the dumped zero and few-shot results. Sample notebook and data format forthcoming.
Evaluations were performed in GPT-NeoX using the [LM Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness), and are viewable by model and step at `results/json/v1.1-evals/*` in this repository.



Expand Down

0 comments on commit ce49833

Please sign in to comment.