Skip to content

Pull requests: EleutherAI/gpt-neox

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

test
#1 by lucidrains was merged Dec 22, 2020 Loading…
PR for Deepspeed Integration
#9 by trisongz was merged Dec 24, 2020 Loading…
get rid of test file
#10 by sdtblck was merged Dec 26, 2020 Loading…
make mask value smaller by factor of 2
#25 by lucidrains was merged Jan 4, 2021 Loading…
update tensorflow to 2.4.0
#47 by sdtblck was merged Jan 5, 2021 Loading…
fix everything that i broke
#61 by sdtblck was merged Jan 13, 2021 Loading…
Implement distributed training using Kubernetes
#77 by leogao2 was merged Jan 23, 2021 Loading…
2
6
Minor fixes
#85 by sdtblck was merged Jan 23, 2021 Loading…
Batch size needs to be specified
#87 by joshlk was merged Jan 23, 2021 Loading…
Fix train pipeline
#89 by sdtblck was merged Jan 25, 2021 Loading…
2
1
Add checkpoint saving / loading
#90 by sdtblck was merged Jan 28, 2021 Loading…
Miscellaneous docker QoL improvements
#91 by leogao2 was merged Jan 26, 2021 Loading…
Update base_model.json
#93 by srulikbd was merged Jan 26, 2021 Loading…
Update deploy_k8s.sh
#101 by leogao2 was merged Jan 27, 2021 Loading…
Update deploy_k8s.sh
#102 by leogao2 was merged Jan 27, 2021 Loading…
Create runmany_k8s.sh
#106 by leogao2 was merged Jan 30, 2021 Loading…
Better deployment
#107 by joshlk was merged Feb 1, 2021 Loading…
Monitoring using wandb
#108 by joshlk was merged Feb 1, 2021 Loading…
Remove layer caching
#109 by joshlk was merged Feb 1, 2021 Loading…
Replace neox code with megatron
#119 by leogao2 was merged Feb 16, 2021 Loading…
Small patch. Otherwise stops before setting up new service
#125 by joshlk was merged Feb 17, 2021 Loading…
Wandb
#129 by joshlk was merged Feb 17, 2021 Loading…
Documentation
#134 by joshlk was merged Feb 26, 2021 Loading…
Satisfy fp cross entropy loss arg in pipeline parallel model
#140 by sdtblck was merged Feb 28, 2021 Loading…
ProTip! Filter pull requests by the default branch with base:main.