-
Notifications
You must be signed in to change notification settings - Fork 981
Pull requests: EleutherAI/gpt-neox
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add documentation about using labelled datasets
#1056
by dashstander
was merged Oct 4, 2023
Loading…
Add section to the README detailing how to start distributed jobs
#1048
by dashstander
was merged Sep 29, 2023
Loading…
Remove the NeoX implementation of GPT2Tokenizer
#1042
by dashstander
was merged Sep 25, 2023
Loading…
Remove support for lazy dataset implementation
#1033
by dashstander
was merged Sep 18, 2023
Loading…
Fix bf16 for zero > 0 and pipeline parallelism > 0
#1032
by dashstander
was merged Sep 18, 2023
Loading…
Bump transformers version and update enwik8 link
#1024
by dashstander
was merged Sep 13, 2023
Loading…
Return to broadcasting megatron and deepspeed configs as training.py arguments
#948
by dashstander
was merged May 23, 2023
Loading…
Updates bf16 demo config and mixed precision docutmentation.
#941
by dashstander
was merged May 18, 2023
Loading…
Adds a script to convert NeoX 2.0 checkpoints to DeepSpeed's universal checkpoint format
merge-queue
This PR is next on the queue to merge
#836
opened Mar 14, 2023 by
dashstander
Loading…
ProTip!
Follow long discussions with comments:>50.