Sim eval #8

schuemie · 2023-12-07T09:59:21Z

Adding a framework for evaluating various settings using simulated data. Switching from ini files to yaml files to support more complicated config (for the evaluation experiment). Depending on the configuration, the meta-evaluator fits various pre-trained models, fine-tunes those on the simulated training sets, and evaluates them on the simulated test sets. Results are aggregated in to CSV files. Added metrics for evaluation (loss, Brier score, AUROC, and AUPRC). Also added a simple regression model to server as baseline.

…en fine-tuning

…RC and Brier score

…(evaluation across many model settings)

… number of batches in fine-tuned model

schuemie and others added 26 commits October 27, 2023 12:50

Passing forward settings used to generate model, to avoid mistakes wh…

193c696

…en fine-tuning

Work on simulation evaluation

3144429

Switching label prediction to logistic regression. Computing AUC, AUP…

8a42846

…RC and Brier score

Applying settings pattern to simulator

9ed7acc

Allowing user to limit number of simulated prediction tasks

6ff3cc6

Implemented fine-tuning and testing of single prediction example

770c21c

Removing debug code

2a7a6e1

Not writing configs and tokenizers when evaluating model only.

3c6b31c

Switcing from ini to yaml config files. Implementing meta-evaluation …

1a4ba18

…(evaluation across many model settings)

More work on seperating inputs from learning objectives

1ce82ef

Adding support for MPS

8f029e4

Combining evaluation results across runs into single CSV

c0157ab

Adding option to select specific epoch of pretrained model, and limit…

bf33203

… number of batches in fine-tuned model

Fix typo

36b9259

Defining a pilot simulation study

7da3cb4

Fix unit tests

b6049d6

Attempting to fix GA build

1e028cb

Requiring visit concept tokenizer when using visit concept embedding

ed34b19

Add checkpoint_every option to save disk space

29ce336

Merge branch 'sim_eval' of https://github.com/OHDSI/Apollo into sim_eval

6d38908

Forgot to pass through checkpoint_every in evaluator.

61921fe

Fixing evaluation output file

ed81bc2

Also grouping results by pretrain_epoch

2a4a691

Adding simple regression model as baseline

f7d36ca

Fix error when training

153d5a0

Fixing another error

6152ee6

schuemie merged commit 53ead53 into main Dec 7, 2023
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sim eval #8

Sim eval #8

schuemie commented Dec 7, 2023

Sim eval #8

Sim eval #8

Conversation

schuemie commented Dec 7, 2023