-
Notifications
You must be signed in to change notification settings - Fork 998
Pull requests: EleutherAI/gpt-neox
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix documentation for --jsonl-keys argument of preprocess_data scripts
#1176
by KeitaW
was merged Mar 8, 2024
Loading…
Resolve issues between kv_cache and flash attention.
#1178
by chaochen99
was merged Mar 8, 2024
Loading…
[streaming_ds branch] Allow resuming from latest checkpoint when using StreamingDataset
#1163
by LeoGrin
was merged Mar 15, 2024
Loading…
[ZeRO-3] Partitioned init with
deepspeed.zero.Init()
#1190
by R0n12
was merged Mar 19, 2024
Loading…
making PR triggered CPU test for changes to megatron
#1195
by jaimemcc-intel
was merged Apr 1, 2024
Loading…
add rwkv support
merge-queue
This PR is next on the queue to merge
#1198
by jahatef
was merged May 6, 2024
Loading…
Bump transformers from 4.36.0 to 4.38.0 in /requirements
dependencies
Pull requests that update a dependency file
#1199
by dependabot
bot
was merged May 4, 2024
Loading…
Fix bug in tools/ckpts/convert_neox_to_hf.py for setting intermediate_size
#1209
by jvendrow
was merged May 4, 2024
Loading…
Bump jinja2 from 3.1.3 to 3.1.4 in /requirements
dependencies
Pull requests that update a dependency file
#1211
by dependabot
bot
was merged May 13, 2024
Loading…
[ZeRO-3] Ensured passing neox deepspeed_config when using partitioned init
#1191
by R0n12
was merged Apr 1, 2024
Loading…
[AMD] Supporting fused kernels build using JIT
#1188
by R0n12
was merged Apr 1, 2024
Loading…
2 tasks done
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.