
Pull requests: EleutherAI/gpt-neox


Pull requests list

Update issue templates
#123 by StellaAthena was closed Feb 17, 2021
T5rpe
#141 by sdtblck was merged Feb 28, 2021
Satisfy fp cross entropy loss arg in pipeline parallel model
#140 by sdtblck was merged Feb 28, 2021
Fix sparsity
#139 by sdtblck was merged Feb 27, 2021
Wandb validation loss and megatron timers
#138 by joshlk was merged Feb 27, 2021
Merging from main
#137 by StellaAthena was merged Feb 26, 2021
Documentation
#134 by joshlk was merged Feb 26, 2021
Clean up Neox configuration
#132 by joshlk was merged Feb 27, 2021
4 of 5 tasks
Update ds_zero_stage_1_config.json
#131 by StellaAthena was merged Feb 19, 2021
Updating from main
#130 by StellaAthena was merged Feb 18, 2021
Wandb
#129 by joshlk was merged Feb 17, 2021
Small patch. Otherwise stops before setting up new service
#125 by joshlk was merged Feb 17, 2021
Update issue templates (label: documentation)
#124 by StellaAthena was merged Feb 19, 2021
Create runmany_k8s.sh
#106 by leogao2 was merged Jan 30, 2021
Openmpi, PV and deployment
#122 by joshlk was merged Feb 16, 2021
Added copyright disclaimers
#121 by StellaAthena was merged Feb 15, 2021
Replace neox code with megatron
#119 by leogao2 was merged Feb 16, 2021
New codebase based on megatron
#118 by StellaAthena was closed Feb 13, 2021
mish
#117 by ClashLuke was merged Feb 11, 2021
typos; add train_batch_size back to base config
#115 by Muennighoff was closed Feb 15, 2021
merged script for train.py, train_pipeline.py
#113 by ShivanshuPurohit was closed Feb 15, 2021
Updating from main
#111 by StellaAthena was merged Feb 3, 2021
Remove layer caching
#109 by joshlk was merged Feb 1, 2021
Monitoring using wandb
#108 by joshlk was merged Feb 1, 2021
Better deployment
#107 by joshlk was merged Feb 1, 2021