Run eval harness during training #367

sdtblck · 2021-06-24T19:35:42Z

addresses #366

The generation tasks aren't that fast, so it's best to stick to the log-likelihood tasks (you can add them to the YAML with the "eval_tasks" parameter).

e.g.

"eval_tasks": ["lambada", "wikitext", "piqa"],
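For illustration only, here is a minimal sketch of how a training script might read that parameter from an already-parsed config. The helper name `read_eval_tasks` and the plain-dict config are assumptions made for this example; they are not taken from this PR's code.

```python
from typing import Any, Dict, List


def read_eval_tasks(config: Dict[str, Any]) -> List[str]:
    """Return the task names listed under "eval_tasks" (hypothetical helper).

    A missing key means "run no eval tasks". Anything other than a list of
    strings is rejected early, before training starts.
    """
    tasks = config.get("eval_tasks", [])
    if not isinstance(tasks, list) or not all(isinstance(t, str) for t in tasks):
        raise ValueError('"eval_tasks" must be a list of task-name strings')
    return list(tasks)


# Config fragment from the comment above, as it would look once parsed.
config = {"eval_tasks": ["lambada", "wikitext", "piqa"]}
tasks = read_eval_tasks(config)
```

Validating the list up front means a malformed entry fails at startup rather than at the first evaluation interval.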

StellaAthena

It looks like it's locking on piqa. We need to check out the HF dataset builder internals and make sure it's thread safe.

StellaAthena · 2021-07-11T01:39:43Z

Notes on this PR's behavior:

Running just lambada works fine
Running just piqa works fine
Running both results in the code freezing as shown below, regardless of which is listed first

Running loglikelihood requests
  0%|          | 0.00/1.82M [00:00<?, ?byte/s]Running loglikelihood requests
Running loglikelihood requests
100%|##########| 1.82M/1.82M [00:01<00:00, 1.79Mbyte/s]File downloaded. Checksum: 4aa8d02cd17c719165fc8a7887fddd641f43fcafa4b1c806ca8abc31fabdb226

Running loglikelihood requests
 42%|####1     | 3668/8827 [00:40<00:36, 139.70it/s]

StellaAthena · 2021-07-11T12:50:06Z

mathqa + piqa has been running for over 8 hours without any problems. I’m wondering if the core problem is about mixing HF and non-HF tasks.

sdtblck · 2021-08-31T11:44:11Z

Hey @StellaAthena can you test out the above change, and let me know if it makes a difference?
I try to pre-download all the tasks on local rank 0, to get rid of any multithreading problems
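A rough sketch of the "download on local rank 0, then synchronize" pattern described here, not the code from this PR. The function name and the `download_fn` / `barrier_fn` parameters are hypothetical, and are passed in so the sketch runs without a distributed setup. In a `torch.distributed` job, `barrier_fn` could be `torch.distributed.barrier`.

```python
from typing import Callable, Iterable, List


def predownload_on_local_rank_zero(
    task_names: Iterable[str],
    local_rank: int,
    download_fn: Callable[[str], None],
    barrier_fn: Callable[[], None],
) -> List[str]:
    """Download each task's data on local rank 0 only, then sync all ranks.

    Returns the tasks this rank downloaded (an empty list on other ranks).
    """
    downloaded: List[str] = []
    if local_rank == 0:
        for name in task_names:
            download_fn(name)  # assumed to populate a shared on-disk cache
            downloaded.append(name)
    # Every rank waits here, so no rank reads the cache before it is complete.
    barrier_fn()
    return downloaded


# Single-process demonstration with stand-in callables.
fetched: List[str] = []
syncs: List[int] = []
rank0 = predownload_on_local_rank_zero(
    ["lambada", "piqa"], 0, fetched.append, lambda: syncs.append(1)
)
rank1 = predownload_on_local_rank_zero(
    ["lambada", "piqa"], 1, fetched.append, lambda: syncs.append(1)
)
```

The point of the barrier is that only one process per node ever writes the dataset cache, and the others touch it only after the write has finished.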

StellaAthena · 2021-08-31T13:14:40Z

> Hey @StellaAthena can you test out the above change, and let me know if it makes a difference?
> I try to pre-download all the tasks on local rank 0, to get rid of any multithreading problems

It's on today's TODO list :)

sdtblck · 2021-08-31T14:38:59Z

Just some peace of mind for @StellaAthena that this is definitely working :)

sdtblck added 4 commits June 24, 2021 17:16

run eval harness during training

2c08d4d

fix logging

63fc228

cleanup imports + change print string

356a155

cleanup imports

77a310e

sdtblck requested a review from a team as a code owner June 24, 2021 19:35

sdtblck requested review from joshlk and StellaAthena June 24, 2021 19:35

StellaAthena requested changes Jun 24, 2021

StellaAthena linked an issue Jun 24, 2021 that may be closed by this pull request

Eval Harness doesn't log during training #366

Closed

Update adaptor.py

25b9ae1

sdtblck requested a review from StellaAthena August 31, 2021 13:12

sweinbach approved these changes Aug 31, 2021

StellaAthena approved these changes Aug 31, 2021

StellaAthena merged commit 069f856 into main Aug 31, 2021

StellaAthena deleted the eval_harness_update branch August 31, 2021 14:42
