Refactor train eval #147
Conversation
Looks like type tests are failing. The type of device should probably be str |
I think this looks good overall! I just want to make the train/val change for Eval. I can write the code for this.
        output directory. Defaults to False.
    """

    data: Extract
Currently extract requires there to be both a train and a validation split, which I don't think we should mandate when running Eval (I'd like to be able to run TruthfulQA without having to make a fake training split). I think we should modify the __post_init__ of Evaluate to only extract the validation dataset.
Just for the others, from our conversation in the chat: your change requests should probably be implemented as a new feature on their own, since the current pull request is only about refactoring, but we can add them soon after merging.
"""Get a list of indices of hidden layers given a `DatasetDict`.""" | ||
layers = [ | ||
int(feat[len("hidden_") :]) | ||
for feat in ds["train"].features |
ds might not contain a "train" split, so we should just take ds.values().pop().features or the equivalent.
Don't you always still get at least a "train" split if there is no explicit split?
I'm thinking about the refactor I want to do to support evaluations on a single split; in the case of, e.g., TruthfulQA there is only a validation split. This can wait for the next PR though.
devices = select_usable_devices(cfg.num_gpus)
num_devices = len(devices)
_, _, test_x0, test_x1, _, test_labels = self.prepare_data(
I think we should split up the extraction of train and test data so we don't have to do this.
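One way to avoid unpacking a six-tuple and discarding the train half is to have the preparation step take a split name, as in this hypothetical sketch (`prepare_data` and the `hiddens` layout here are illustrative, not the repo's actual API):

```python
# Hypothetical refactor: prepare one split at a time, so Eval can ask
# for just the test data instead of discarding the train arrays.

def prepare_data(hiddens: dict, split: str) -> tuple:
    """Return (x0, x1, labels) for a single named split."""
    return hiddens[split]

hiddens = {
    "train": ("train_x0", "train_x1", "train_labels"),
    "test": ("test_x0", "test_x1", "test_labels"),
}

# Eval only needs the test split:
test_x0, test_x1, test_labels = prepare_data(hiddens, "test")
```

The caller then reads naturally in both Train (which requests both splits) and Eval (which requests one), and no `_` placeholders are needed.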
Force-pushed from 0c8bc0f to f43d354
^ Sorry, my bad, I was branching off from you and forgot to switch my branch when pushing. Corrected it; ignore the previous push.
Create dataclasses for writing to CSV, refactor CSV logging, fix eval CSV columns
LGTM. We can support single splits in the next PR.