-
Notifications
You must be signed in to change notification settings - Fork 982
Issues: EleutherAI/gpt-neox
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Assertion Error when Setting pipe_parallel_size or model_parallel_size in GPT-NeoX
bug
Something isn't working
#1251
opened Jul 10, 2024 by
lieh1203
updated Jul 23, 2024
How to Load Model from pytorch_model.bin into Trained Model for Text Generation?
feature request
New feature or request
#1254
opened Jul 15, 2024 by
lieh1203
updated Jul 15, 2024
what's the biggest dataset you've tried?
bug
Something isn't working
#1253
opened Jul 15, 2024 by
exnx
updated Jul 15, 2024
For nucleus sampling, top-p sampling appears to happen on the softmax-normalized top-k logits
bug
Something isn't working
#1250
opened Jul 3, 2024 by
j-frei
updated Jul 3, 2024
batch_input and elapsed time per iteration suddenly slow down during model training
bug
Something isn't working
#1248
opened Jun 29, 2024 by
Yuhanleeee
updated Jun 29, 2024
Conversion for CI from self-hosted hardware
#1245
opened Jun 28, 2024 by
jaimemcc-intel
Loading…
updated Jun 28, 2024
Replace unsafe
pyyaml
loader with SafeLoader
(#2)
#1243
opened Jun 27, 2024 by
pixeeai
Loading…
updated Jun 27, 2024
SFT improvements (labeling fixes, different packing implementations)
#1240
opened Jun 21, 2024 by
dmahan93
Loading…
updated Jun 25, 2024
Cannot convert neox model to HF
bug
Something isn't working
#1231
opened May 28, 2024 by
srivassid
updated Jun 22, 2024
My servers used for multi-node training do not have ssh. How can I launch multi-node training using the torchrun command?
feature request
New feature or request
#1203
opened Apr 23, 2024 by
dingning97
updated Jun 20, 2024
How to set the ffn hidden size parameter in gpt neox
feature request
New feature or request
#1230
opened May 28, 2024 by
IronMan-WangJinxi
updated Jun 19, 2024
Add Transformer Engine's version of RMSNorm and LayerNorm
#1235
opened Jun 11, 2024 by
lintangsutawika
•
Draft
updated Jun 11, 2024
Added infinite lr schedules
merge-queue
This PR is next on the queue to merge
#1194
opened Mar 25, 2024 by
kshitijkg
Loading…
updated May 14, 2024
Create cmake-multi-platform.yml
#1201
opened Apr 22, 2024 by
Romario242003
Loading…
updated Apr 23, 2024
Integrate TransformerEngine
feature request
New feature or request
#1098
opened Dec 21, 2023 by
Quentin-Anthony
updated Mar 13, 2024
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.