Skip to content

Pull requests: EleutherAI/gpt-neox

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Support for LLaMA
#841 by zphang was merged May 2, 2023 Loading…
Add support for Flash attention
#725 by VHellendoorn was merged Dec 10, 2022 Loading… Release V2
Megatron-LM style Sequence Parallel
#1257 by haileyschoelkopf was merged Aug 23, 2024 Loading…
4 tasks done
align gpt-j layernorm to hf
#481 by sweinbach was merged Dec 6, 2022 Loading…
Add Mamba Architecture
#1157 by haileyschoelkopf was merged Mar 10, 2024 Loading…
Add intermediate_size to GPT-NeoX models
#1212 by dtamayo-nlp was merged Sep 7, 2024 Loading…
fix alibi inference shapes for cached layer_past
#452 by sweinbach was merged Dec 12, 2021 Loading…
Extend ci suite
#1080 by mkerin was merged Dec 4, 2023 Loading…
fused layernorm
#1105 by yang was merged Jan 26, 2024 Loading…
Monitoring using wandb
#108 by joshlk was merged Feb 1, 2021 Loading…
Improve Eval Harness
#471 by sdtblck was merged Dec 20, 2021 Loading…
[Bug] Make Configs Consistent
#928 by austinburnett was merged May 9, 2023 Loading…
3 tasks done
Adding the possibility of passing a label dataset
#958 by honglu2875 was merged Jun 7, 2023 Loading…
Draft PR Adding mistral 0.1
#1131 by AIproj was merged Feb 23, 2024 Loading…
fixes the use of uninitialized variable
#539 by cloudcell was merged Feb 12, 2022 Loading…
Add DeepSpeed bf16 configuration
#787 by dashstander was merged May 16, 2023 Loading…
Fix flash attention
#910 by liamcli was merged May 9, 2023 Loading…
Better deployment
#107 by joshlk was merged Feb 1, 2021 Loading…
Fix different positional embeddings clashing
#147 by sdtblck was merged Mar 4, 2021 Loading…
Refactor how we configure sparsity
#284 by sdtblck was merged May 2, 2021 Loading…
add log_grad_pct_zeros to neox_args
#477 by CoEich was merged Feb 24, 2022 Loading…
Lm eval 0.4.0 support
#1101 by haileyschoelkopf was merged Dec 23, 2023 Loading…
[AMD] Supporting fused kernels build using JIT
#1188 by R0n12 was merged Apr 1, 2024 Loading…
2 tasks done
Lion Optimizer
#1062 by andylolu2 was merged Oct 20, 2023 Loading…
ProTip! What’s not been updated in a month: updated:<2024-09-04.