Skip to content

Issues: EleutherAI/gpt-neox

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Assertion Error when Setting pipe_parallel_size or model_parallel_size in GPT-NeoX bug Something isn't working
#1251 opened Jul 10, 2024 by lieh1203 updated Jul 23, 2024
How to Load Model from pytorch_model.bin into Trained Model for Text Generation? feature request New feature or request
#1254 opened Jul 15, 2024 by lieh1203 updated Jul 15, 2024
what's the biggest dataset you've tried? bug Something isn't working
#1253 opened Jul 15, 2024 by exnx updated Jul 15, 2024
For nucleus sampling, top-p sampling appears to happen on the softmax-normalized top-k logits bug Something isn't working
#1250 opened Jul 3, 2024 by j-frei updated Jul 3, 2024
batch_input and elapsed time per iteration suddenly slow down during model training bug Something isn't working
#1248 opened Jun 29, 2024 by Yuhanleeee updated Jun 29, 2024
Add hf llama to neox conversion
#1247 opened Jun 28, 2024 by dmahan93 Loading… updated Jun 28, 2024
Conversion for CI from self-hosted hardware
#1245 opened Jun 28, 2024 by jaimemcc-intel Loading… updated Jun 28, 2024
Add Reward Model training
#1246 opened Jun 28, 2024 by dmahan93 Draft updated Jun 28, 2024
Add KTO training
#1244 opened Jun 28, 2024 by dmahan93 Draft updated Jun 28, 2024
Replace unsafe pyyaml loader with SafeLoader (#2)
#1243 opened Jun 27, 2024 by pixeeai Loading… updated Jun 27, 2024
Add DPO training
#1242 opened Jun 25, 2024 by dmahan93 Loading… updated Jun 26, 2024
SFT improvements (labeling fixes, different packing implementations)
#1240 opened Jun 21, 2024 by dmahan93 Loading… updated Jun 25, 2024
Cannot convert neox model to HF bug Something isn't working
#1231 opened May 28, 2024 by srivassid updated Jun 22, 2024
How to set the ffn hidden size parameter in gpt neox feature request New feature or request
#1230 opened May 28, 2024 by IronMan-WangJinxi updated Jun 19, 2024
Add tensor parallelism for RWKV
#1237 opened Jun 19, 2024 by jahatef Draft updated Jun 19, 2024
Deepspeed benchmarking
#878 opened Apr 11, 2023 by cr458 Draft updated Jun 18, 2024
Add Transformer Engine's version of RMSNorm and LayerNorm
#1235 opened Jun 11, 2024 by lintangsutawika Draft updated Jun 11, 2024
Add Transformer Engine
#1213 opened May 10, 2024 by Quentin-Anthony Draft updated May 28, 2024
Dmoe integration
#1210 opened May 6, 2024 by DayOfThePenguin Loading… updated May 22, 2024
Add lora support
#1225 opened May 20, 2024 by mkerin Draft updated May 20, 2024
Added infinite lr schedules merge-queue This PR is next on the queue to merge
#1194 opened Mar 25, 2024 by kshitijkg Loading… updated May 14, 2024
[muP] Rework
#1087 opened Dec 1, 2023 by lintangsutawika Draft updated May 2, 2024
Create cmake-multi-platform.yml
#1201 opened Apr 22, 2024 by Romario242003 Loading… updated Apr 23, 2024
Integrate TransformerEngine feature request New feature or request
#1098 opened Dec 21, 2023 by Quentin-Anthony updated Mar 13, 2024
ProTip! Type g i on any issue or pull request to go back to the issue listing page.