Train/valid/test split #102

choidami · 2023-05-05T22:28:01Z

Hello,

I was wondering if the validation and test sets were separated from the train set, or sampled from the same distribution.
According to here: https://github.com/EleutherAI/pythia/blob/main/models/70M/pythia-70m.yml#L91
and how the datasets area formed here: https://github.com/EleutherAI/gpt-neox/blob/main/megatron/data/data_utils.py#L332
it seems like valid and test sets are overlapping with the test set.

StellaAthena · 2023-05-21T04:08:46Z

Hello,

I was wondering if the validation and test sets were separated from the train set, or sampled from the same distribution. According to here: https://github.com/EleutherAI/pythia/blob/main/models/70M/pythia-70m.yml#L91 and how the datasets area formed here: https://github.com/EleutherAI/gpt-neox/blob/main/megatron/data/data_utils.py#L332 it seems like valid and test sets are overlapping with the test set.

Ah yes, that’s a hack we used because we were being lazy and probably a misleading thing to have posted. There are official Pile validation and test sets that you can download from pile.eleuther.ai and which you can evaluate models on using our evaluation codebase. These two datasets are deduplicated against the rest of the Pile and each other. Details on their construction can be found in the Pile paper.

We didn’t actually have them downloaded on the cluster we trained the models with, so filled out those values with the training set path and just ignored the results.

StellaAthena closed this as completed May 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train/valid/test split #102

Train/valid/test split #102

choidami commented May 5, 2023

StellaAthena commented May 21, 2023

Train/valid/test split #102

Train/valid/test split #102

Comments

choidami commented May 5, 2023

StellaAthena commented May 21, 2023