-
Notifications
You must be signed in to change notification settings - Fork 977
Issues: EleutherAI/gpt-neox
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
ONNX Export / Inference Engine
feature request
New feature or request
#537
opened Feb 10, 2022 by
Mistobaan
2 tasks
OOM issues running inference with large contexts on 2x3090 system
bug
Something isn't working
#631
opened Jun 9, 2022 by
fpgaminer
Text generation yields different outputs despite temperature = 0.0
bug
Something isn't working
good first issue
Good for newcomers
#643
opened Jul 5, 2022 by
ScTof
Multi-node training without shared memory
deprioritized
Issues that are not closed, but are low priority and unlikely to be solved soon
feature request
New feature or request
#765
opened Jan 6, 2023 by
VHellendoorn
Why we need to average LayerNorm values over mp ranks when converting to HFformat checkpoint?
#983
opened Jun 26, 2023 by
forceshorty
Implement neox_args processing when OMPI_COMM_WORLD_SIZE>1
#1073
opened Nov 7, 2023 by
kyuheejang
Loading…
PyTorch Lightning Fused optimizer step
feature request
New feature or request
#1160
opened Feb 29, 2024 by
jahatef
Added infinite lr schedules
merge-queue
This PR is next on the queue to merge
#1194
opened Mar 25, 2024 by
kshitijkg
Loading…
Add Transformer Engine's version of RMSNorm and LayerNorm
#1235
opened Jun 11, 2024 by
lintangsutawika
•
Draft
SFT improvements (labeling fixes, different packing implementations)
#1240
opened Jun 21, 2024 by
dmahan93
Loading…
batch_input and elapsed time per iteration suddenly slow down during model training
bug
Something isn't working
#1248
opened Jun 29, 2024 by
Yuhanleeee
loss stuck in overflow for RPE position embedding together with sparse attention
bug
Something isn't working
#292
opened May 4, 2021 by
sweinbach
Increase Documentation Coverage
feature request
New feature or request
#458
opened Nov 7, 2021 by
sdtblck
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.