Skip to content

Pull requests: EleutherAI/gpt-neox

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

test
#1 by lucidrains was merged Dec 22, 2020 Loading…
Create CODEOWNERS
#8 by StellaAthena was merged Dec 24, 2020 Loading…
PR for Deepspeed Integration
#9 by trisongz was merged Dec 24, 2020 Loading…
remove enwik8 data from repository
#13 by lucidrains was merged Dec 26, 2020 Loading…
add tensorboard logging
#15 by sdtblck was merged Dec 27, 2020 Loading…
GPT-3 Small Works
#24 by StellaAthena was merged Jan 3, 2021 Loading…
add linear warmup over 5000 steps and gradient clipping
#29 by lucidrains was merged Jan 4, 2021 Loading…
untie classifier weights by default
#30 by lucidrains was merged Jan 4, 2021 Loading…
Automatically download owt2
#33 by steven-mi was merged Jan 4, 2021 Loading…
Update requirements.txt
#36 by sdtblck was merged Jan 4, 2021 Loading…
Fix deprecation warning
#42 by sdtblck was merged Jan 5, 2021 Loading…
update tensorflow to 2.4.0
#47 by sdtblck was merged Jan 5, 2021 Loading…
Added link to an installation walk-through
#48 by StellaAthena was merged Jan 6, 2021 Loading…
Stella athena patch 1
#49 by StellaAthena was closed Jan 7, 2021 Loading…
Updating branch with new PR code
#53 by StellaAthena was merged Jan 12, 2021 Loading…
Updating from main
#54 by StellaAthena was merged Jan 12, 2021 Loading…
Add enron_jsonl and enron_tfr datasets (mostly for testing)
#56 by sdtblck was merged Jan 13, 2021 Loading…
Revert GPT2Dataset back to old working state
#57 by sdtblck was merged Jan 13, 2021 Loading…
implement gradient checkpointing
#59 by sdtblck was merged Jan 13, 2021 Loading…
Pipeline parallelism for enwik8
#60 by sdtblck was merged Jan 13, 2021 Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.