Skip to content

Pull requests: EleutherAI/gpt-neox

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Improve Conversion Utilities
#1124 by haileyschoelkopf was merged Feb 8, 2024 Loading…
9 tasks done
Add megablocks dropless MoE
#1192 by yang was merged May 4, 2024 Loading…
Updating default configs to be less bad
#665 by StellaAthena was merged Nov 18, 2022 Loading…
Optimize data preprocessing by using numpy
#771 by zhuzilin was merged Jan 20, 2023 Loading…
Extra DeepSpeed Argument Capability
#819 by curt-tigges was merged Mar 26, 2023 Loading…
Simplify and relax dependencies (Take 2)
#818 by EricHallahan was merged Mar 9, 2023 Loading…
Extend ci suite
#1080 by mkerin was merged Dec 4, 2023 Loading…
Fused Rotary Embeddings (fixed)
#1108 by yang was merged Jan 5, 2024 Loading…
Make rotary freqs buffer non-persistent
#1168 by haileyschoelkopf was merged Mar 4, 2024 Loading…
add rwkv support merge-queue This PR is next on the queue to merge
#1198 by jahatef was merged May 6, 2024 Loading…
LR scheduler fix no longer breaks inference
#1060 by dashstander was merged Oct 17, 2023 Loading…
Lion Optimizer
#1062 by andylolu2 was merged Oct 20, 2023 Loading…
PR for Deepspeed Integration
#9 by trisongz was merged Dec 24, 2020 Loading…
get rid of test file
#10 by sdtblck was merged Dec 26, 2020 Loading…
make mask value smaller by factor of 2
#25 by lucidrains was merged Jan 4, 2021 Loading…
test
#1 by lucidrains was merged Dec 22, 2020 Loading…
Update base_model.json
#93 by srulikbd was merged Jan 26, 2021 Loading…
Implement distributed training using Kubernetes
#77 by leogao2 was merged Jan 23, 2021 Loading…
2
6
Batch size needs to be specified
#87 by joshlk was merged Jan 23, 2021 Loading…
Add checkpoint saving / loading
#90 by sdtblck was merged Jan 28, 2021 Loading…
Remove layer caching
#109 by joshlk was merged Feb 1, 2021 Loading…
Create runmany_k8s.sh
#106 by leogao2 was merged Jan 30, 2021 Loading…
ProTip! Follow long discussions with comments:>50.