Create dataclasses for writing to CSV, refactor CSV logging, fix eval CSV columns #155

thejaminator · 2023-03-26T09:29:59Z

Depends on #147
Closes #153

thejaminator · 2023-03-26T09:38:01Z

elk/evaluation/evaluate_log.py

+ self.eval_result.acc,
+ self.eval_result.cal_acc,
+ self.eval_result.auroc,
+ self.eval_result.ece,


previously the columns were ["layer", "loss", "acc", "cal_acc", "auroc"] for whatever Eval logged. So i think there was a bug where acc was written to to "loss" instead, and "cal_acc" to "acc" instead, etc.

thejaminator · 2023-03-26T09:43:01Z

tests/test_write_iterator_to_file.py

+ # and don't want to fail the test
+ pass
+ # We should still have results for layer 1, 3, even though layer 2 failed
+ with open(tmp_path / "eval.csv", "r") as f:


test for kabooms

thejaminator · 2023-03-26T09:43:35Z

elk/utils/csv.py

+ row = to_csv_line(row)
+ writer.writerow(row)
+ if debug:
+ save_debug_log(dataset, out_dir)


factored this out so its much easier to test

thejaminator · 2023-03-26T09:44:18Z

elk/utils/csv.py

+The layer field is used to sort the logs by layer."""
+Log = TypeVar("Log", EvalLog, ElicitLog)
+
+


this typevar has to be bounded on EvalLog, ElicitLog since we call .layer on it (to sort the CSV)

thejaminator · 2023-03-26T09:46:42Z

elk/run.py

+ to_csv_line: A function that converts a Log to a list of strings.
+ This has to be injected in because the Run class does not know
+ the extra options e.g. skip_baseline to apply to function.
+ csv_columns: The columns of the CSV file."""
 self.out_dir = assert_type(Path, self.out_dir)


i made func have a signature like

Callable[[int], Log]

previously it was something like

Callable[[int, int, list[int]], Log] # layer, device, list of devices

which could be pretty confusing

previously func wasn't annotated so pyright wouldn't catch keyword mispelling / mistakes in partial.

we could also make it a paramspec instead so you indiate that is a function that has keywords world_size, devices, but i didn't want to complicate things

thejaminator · 2023-03-26T11:01:37Z

elk/evaluation/evaluate.py

- def evaluate_reporter(self, layer: int, devices: list[str], world_size: int):
+ def evaluate_reporter(
+ self, layer: int, devices: list[str], world_size: int = 1
+ ) -> EvalLog:


it now returns a dataclass instead of a list[str].
Which i think is nicer since

Whoever wants to call Evaluate manually (e.g. in a notebook) gets back a dataclass rather than a list of unlabelled strings

The responsbility of writing a CSV format is done somewhere else.

for more information, see https://pre-commit.ci

lauritowal

Thanks for the improvements! Looks good to me, I've added a few comments, but I'll merge it into the refactor-train-eval branch later.

lauritowal · 2023-03-26T14:38:44Z

elk/run.py

 self.out_dir = assert_type(Path, self.out_dir)
+ # Should we write to different CSV files for elicit vs eval?


Should we write to different CSV files for elicit vs eval?

we are writing to different CSV files, since for eval the out_dir is usually by default under ..../transfer_eval... However, we could rename "eval.csv" to something like "transfer_eval.csv", maybe.

lauritowal · 2023-03-26T14:53:43Z

elk/run.py

- writer.writerow(row)
- if self.cfg.debug:
- save_debug_log(self.dataset, self.out_dir)
+ iterator: Iterator[Log] = tqdm( # type: ignore


Is the type: ignore necessary here?

thejaminator added 10 commits March 25, 2023 21:13

clean up

b277817

dep inject columns and writing

a204748

refactor names

e458afc

write iterator tests

97e88f7

add log csv elements tests

1fdcce1

refactor generics

d2b5da7

add multiprocessing tests

f794b03

remove comment

0c8bc0f

remove comment

6f72e5a

add ece to eval log

d9d62b8

thejaminator commented Mar 26, 2023

View reviewed changes

add docstrings

8c43028

thejaminator commented Mar 26, 2023

View reviewed changes

thejaminator changed the title ~~Create dataclasses for writing to CSV, refactor the CSV logging~~ Create dataclasses for writing to CSV, refactor CSV logging, fix eval CSV columns Mar 26, 2023

thejaminator added 2 commits March 26, 2023 18:34

wait for other processes to finish before crashing the pool

54c4ac1

fix tests

eb11b4d

thejaminator commented Mar 26, 2023

View reviewed changes

thejaminator and others added 2 commits March 26, 2023 19:04

make flake happy

72719ce

[pre-commit.ci] auto fixes from pre-commit.com hooks

71f796f

for more information, see https://pre-commit.ci

lauritowal approved these changes Mar 26, 2023

View reviewed changes

lauritowal merged commit a55219b into refactor-train-eval Mar 26, 2023

lauritowal deleted the refactor-csv branch March 26, 2023 15:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create dataclasses for writing to CSV, refactor CSV logging, fix eval CSV columns #155

Create dataclasses for writing to CSV, refactor CSV logging, fix eval CSV columns #155

thejaminator commented Mar 26, 2023 •

edited

Loading

thejaminator Mar 26, 2023 •

edited

Loading

thejaminator Mar 26, 2023

thejaminator Mar 26, 2023

thejaminator Mar 26, 2023 •

edited

Loading

thejaminator Mar 26, 2023

thejaminator Mar 26, 2023

thejaminator Mar 26, 2023

lauritowal left a comment

lauritowal Mar 26, 2023 •

edited

Loading

lauritowal Mar 26, 2023

		The layer field is used to sort the logs by layer."""
		Log = TypeVar("Log", EvalLog, ElicitLog)

		self.out_dir = assert_type(Path, self.out_dir)
		# Should we write to different CSV files for elicit vs eval?

Create dataclasses for writing to CSV, refactor CSV logging, fix eval CSV columns #155

Create dataclasses for writing to CSV, refactor CSV logging, fix eval CSV columns #155

Conversation

thejaminator commented Mar 26, 2023 • edited Loading

thejaminator Mar 26, 2023 • edited Loading

Choose a reason for hiding this comment

thejaminator Mar 26, 2023

Choose a reason for hiding this comment

thejaminator Mar 26, 2023

Choose a reason for hiding this comment

thejaminator Mar 26, 2023 • edited Loading

Choose a reason for hiding this comment

thejaminator Mar 26, 2023

Choose a reason for hiding this comment

thejaminator Mar 26, 2023

Choose a reason for hiding this comment

thejaminator Mar 26, 2023

Choose a reason for hiding this comment

lauritowal left a comment

Choose a reason for hiding this comment

lauritowal Mar 26, 2023 • edited Loading

Choose a reason for hiding this comment

lauritowal Mar 26, 2023

Choose a reason for hiding this comment

thejaminator commented Mar 26, 2023 •

edited

Loading

thejaminator Mar 26, 2023 •

edited

Loading

thejaminator Mar 26, 2023 •

edited

Loading

lauritowal Mar 26, 2023 •

edited

Loading