Skip to content

Issues: EleutherAI/gpt-neox

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Add hf llama to neox conversion
#1247 opened Jun 28, 2024 by dmahan93 Loading…
Add Reward Model training
#1246 opened Jun 28, 2024 by dmahan93 Draft
Conversion for CI from self-hosted hardware
#1245 opened Jun 28, 2024 by jaimemcc-intel Loading…
Add KTO training
#1244 opened Jun 28, 2024 by dmahan93 Draft
Replace unsafe pyyaml loader with SafeLoader (#2)
#1243 opened Jun 27, 2024 by pixeeai Loading…
Add DPO training
#1242 opened Jun 25, 2024 by dmahan93 Loading…
Add tensor parallelism for RWKV
#1237 opened Jun 19, 2024 by jahatef Draft
Add lora support
#1225 opened May 20, 2024 by mkerin Draft
Add Transformer Engine
#1213 opened May 10, 2024 by Quentin-Anthony Draft
Add intermediate_size to GPT-NeoX models
#1212 opened May 10, 2024 by dtamayo-nlp Loading…
Dmoe integration
#1210 opened May 6, 2024 by DayOfThePenguin Loading…
Create cmake-multi-platform.yml
#1201 opened Apr 22, 2024 by Romario242003 Loading…
Adding replay into GPT-NeoX
#1200 opened Apr 13, 2024 by AIproj Loading…
Added infinite lr schedules merge-queue This PR is next on the queue to merge
#1194 opened Mar 25, 2024 by kshitijkg Loading…
Add DS inference
#1130 opened Jan 25, 2024 by yang Draft
[muP] Rework
#1087 opened Dec 1, 2023 by lintangsutawika Draft
Adding AxoNN's 3D tensor parallelism [WIP] feature request New feature or request
#1086 opened Nov 28, 2023 by siddharth9820 Draft
1 of 3 tasks
Fixing MuP
#1061 opened Oct 19, 2023 by marcobellagente93 Loading…
Deepspeed benchmarking
#878 opened Apr 11, 2023 by cr458 Draft
ProTip! Add no:assignee to see everything that’s not assigned.