-
Notifications
You must be signed in to change notification settings - Fork 1k
Pull requests: EleutherAI/gpt-neox
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Return to broadcasting megatron and deepspeed configs as training.py arguments
#948
by dashstander
was merged May 23, 2023
Loading…
Bump transformers version and update enwik8 link
#1024
by dashstander
was merged Sep 13, 2023
Loading…
Fix bf16 for zero > 0 and pipeline parallelism > 0
#1032
by dashstander
was merged Sep 18, 2023
Loading…
Remove support for lazy dataset implementation
#1033
by dashstander
was merged Sep 18, 2023
Loading…
Remove the NeoX implementation of GPT2Tokenizer
#1042
by dashstander
was merged Sep 25, 2023
Loading…
Add section to the README detailing how to start distributed jobs
#1048
by dashstander
was merged Sep 29, 2023
Loading…
Add documentation about using labelled datasets
#1056
by dashstander
was merged Oct 4, 2023
Loading…
Updates bf16 demo config and mixed precision docutmentation.
#941
by dashstander
was merged May 18, 2023
Loading…
ProTip!
Add no:assignee to see everything that’s not assigned.