
Pull requests: EleutherAI/gpt-neox

Pull requests list

Cleaner dockerfile: Remove already installed deps
#1175 by tf-nv was merged Mar 8, 2024
remove best_download as dependency
#1179 by haileyschoelkopf was merged Mar 8, 2024
Resolve issues between kv_cache and flash attention.
#1178 by chaochen99 was merged Mar 8, 2024
Remove gas from Pythia configs
#1181 by yang was merged Mar 8, 2024
Fix moe_loss in gpt_j_residual path
#1180 by yang was merged Mar 8, 2024
Add Mamba Architecture
#1157 by haileyschoelkopf was merged Mar 10, 2024
Mamba + Tensor Parallel Support
#1184 by haileyschoelkopf was merged Mar 15, 2024
[ZeRO-3] Partitioned init with deepspeed.zero.Init()
#1190 by R0n12 was merged Mar 19, 2024
Support Lion with Zero Optimizer
#1166 by DayOfThePenguin was merged Mar 4, 2024
Remove unnecessary fp32/bf16 conversion
#1169 by DayOfThePenguin was merged Mar 4, 2024
Fixes a weird typo
#1207 by StellaAthena was merged Apr 25, 2024
add rwkv support (label: merge-queue)
#1198 by jahatef was merged May 6, 2024
Add megablocks dropless MoE
#1192 by yang was merged May 4, 2024
Jaimemcc intel/ci composite cpu tests
#1205 by jaimemcc-intel was merged May 4, 2024
Bump transformers from 4.36.0 to 4.38.0 in /requirements (label: dependencies)
#1199 by dependabot[bot] was merged May 4, 2024
Bump jinja2 from 3.1.3 to 3.1.4 in /requirements (label: dependencies)
#1211 by dependabot[bot] was merged May 13, 2024
[AMD] Supporting fused kernels build using JIT
#1188 by R0n12 was merged Apr 1, 2024 (2 tasks done)
Better run_eval_harness import
#1139 by R0n12 was merged Jan 30, 2024
Small tidying
#1222 by yang was merged May 21, 2024
Rwkv pipeline parallelism
#1221 by jahatef was merged May 21, 2024