Pull requests: EleutherAI/gpt-neox

Pull requests list

Fix paper reference in init_functions.py
#1241 by rasbt was merged Jun 28, 2024 • updated Jun 28, 2024
Add a chat data preprocessing script
#1239 by dmahan93 was merged Jun 25, 2024 • updated Jun 25, 2024
fix python version and pytest install
#1234 by jahatef was merged Jun 19, 2024 • updated Jun 19, 2024
Conversion script bugfixes
#1218 by haileyschoelkopf was merged Jun 7, 2024 • updated Jun 7, 2024
Fix changed behavior of pipe_parallel
#1219 by yang was merged Jun 7, 2024 • updated Jun 7, 2024
fix conversion of hf -> neox for pythia in model parallel
#1220 by dmahan93 was merged Jun 7, 2024 • updated Jun 7, 2024
Change python invocation syntax
#1223 by jaimemcc-intel was merged Jun 5, 2024 • updated Jun 5, 2024
init changes to README
#1232 by jaimemcc-intel was merged Jun 5, 2024 • updated Jun 5, 2024
add workflow_dispatch to gh actions pr so we can run on command
#1233 by jahatef was merged Jun 4, 2024 • updated Jun 4, 2024
Fix markdown formatting error
#1217 by StellaAthena was merged May 26, 2024 • updated May 26, 2024
fixed fused_rope naming in JIT + Readme
#1224 by R0n12 was merged May 21, 2024 • updated May 22, 2024
2 tasks done
Small tidying
#1222 by yang was merged May 21, 2024 • updated May 21, 2024
Add Torch Profiler Support
#1226 by DayOfThePenguin was merged May 21, 2024 • updated May 21, 2024
Rwkv pipeline parallelism
#1221 by jahatef was merged May 21, 2024 • updated May 21, 2024
Better run_eval_harness import
#1139 by R0n12 was merged Jan 30, 2024 • updated May 20, 2024
[AMD] Supporting fused kernels build using JIT
#1188 by R0n12 was merged Apr 1, 2024 • updated May 20, 2024
2 tasks done
[ZeRO-3] Ensured passing neox deepspeed_config when using partitioned init
#1191 by R0n12 was merged Apr 1, 2024 • updated May 20, 2024
Run document update again
#1216 by jahatef was merged May 16, 2024 • updated May 16, 2024
Bump jinja2 from 3.1.3 to 3.1.4 in /requirements (label: dependencies)
#1211 by dependabot (bot) was merged May 13, 2024 • updated May 13, 2024
add rwkv support (label: merge-queue)
#1198 by jahatef was merged May 6, 2024 • updated May 6, 2024
Fix bug in tools/ckpts/convert_neox_to_hf.py for setting intermediate_size
#1209 by jvendrow was merged May 4, 2024 • updated May 4, 2024
Add megablocks dropless MoE
#1192 by yang was merged May 4, 2024 • updated May 4, 2024
Jaimemcc intel/ci composite cpu tests
#1205 by jaimemcc-intel was merged May 4, 2024 • updated May 4, 2024
Bump transformers from 4.36.0 to 4.38.0 in /requirements (label: dependencies)
#1199 by dependabot (bot) was merged May 4, 2024 • updated May 4, 2024
Fixes a weird typo
#1207 by StellaAthena was merged Apr 25, 2024 • updated Apr 27, 2024