Pythia 12b flash config #162

jvendrow · 2024-05-26T19:18:45Z

The pythia 12b config has:

"attention-config": [[["flash"], 40]],

However, in the gpt-neox repo the 40 is replaced by 36, and in the file:

This value of 36. Is this a mistake? Also, the attention config line at:

Line 19 in 1ff5ade

"attention-config": [[["flash"], 40]]

seems to be missing a comma?

Edit: The num-layers value also seems off, 36 v. 40.

The text was updated successfully, but these errors were encountered:

Provide feedback