Skip to content

Pull requests: EleutherAI/gpt-neox

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

PR for Deepspeed Integration
#9 by trisongz was merged Dec 24, 2020 Loading… updated Dec 24, 2020
get rid of test file
#10 by sdtblck was merged Dec 26, 2020 Loading… updated Dec 26, 2020
test
#1 by lucidrains was merged Dec 22, 2020 Loading… updated Dec 26, 2020
fix small bug where sequence length is not passed into attention class
#21 by lucidrains was merged Jan 1, 2021 Loading… updated Jan 1, 2021
make mask value smaller by factor of 2
#25 by lucidrains was merged Jan 4, 2021 Loading… updated Jan 4, 2021
update tensorflow to 2.4.0
#47 by sdtblck was merged Jan 5, 2021 Loading… updated Jan 5, 2021
fix everything that i broke
#61 by sdtblck was merged Jan 13, 2021 Loading… updated Jan 13, 2021
Minor fixes
#85 by sdtblck was merged Jan 23, 2021 Loading… updated Jan 23, 2021
Implement distributed training using Kubernetes
#77 by leogao2 was merged Jan 23, 2021 Loading… updated Jan 23, 2021
2
6
Batch size needs to be specified
#87 by joshlk was merged Jan 23, 2021 Loading… updated Jan 23, 2021
Fix train pipeline
#89 by sdtblck was merged Jan 25, 2021 Loading… updated Jan 25, 2021
2
1
Update base_model.json
#93 by srulikbd was merged Jan 26, 2021 Loading… updated Jan 26, 2021
Miscellaneous docker QoL improvements
#91 by leogao2 was merged Jan 26, 2021 Loading… updated Jan 26, 2021
Update deploy_k8s.sh
#101 by leogao2 was merged Jan 27, 2021 Loading… updated Jan 27, 2021
Update deploy_k8s.sh
#102 by leogao2 was merged Jan 27, 2021 Loading… updated Jan 27, 2021
Add checkpoint saving / loading
#90 by sdtblck was merged Jan 28, 2021 Loading… updated Jan 28, 2021
Create runmany_k8s.sh
#106 by leogao2 was merged Jan 30, 2021 Loading… updated Jan 30, 2021
Better deployment
#107 by joshlk was merged Feb 1, 2021 Loading… updated Feb 1, 2021
Monitoring using wandb
#108 by joshlk was merged Feb 1, 2021 Loading… updated Feb 1, 2021
Remove layer caching
#109 by joshlk was merged Feb 1, 2021 Loading… updated Feb 1, 2021
Replace neox code with megatron
#119 by leogao2 was merged Feb 16, 2021 Loading… updated Feb 16, 2021
Small patch. Otherwise stops before setting up new service
#125 by joshlk was merged Feb 17, 2021 Loading… updated Feb 17, 2021
Wandb
#129 by joshlk was merged Feb 17, 2021 Loading… updated Feb 17, 2021
Documentation
#134 by joshlk was merged Feb 26, 2021 Loading… updated Feb 26, 2021
Config documentation
#145 by joshlk was merged Feb 28, 2021 Loading… updated Feb 28, 2021
ProTip! Filter pull requests by the default branch with base:main.