Skip to content

Pull requests: EleutherAI/gpt-neox

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Fix failling tests
#1301 by AI-WAIFU was merged Oct 8, 2024 Loading…
Add additional asserts and update post training readme
#1300 by AI-WAIFU was merged Oct 8, 2024 Loading…
hotfix for tp >= 2 and pp > 2 in autoitercount
#1296 by AI-WAIFU was merged Oct 1, 2024 Loading…
readded RM training removed during merge conflict in KTO
#1295 by dmahan93 was merged Sep 26, 2024 Loading…
Add KTO Post-training example
#1294 by dmahan93 was merged Sep 26, 2024 Loading…
update args docs
#1293 by Quentin-Anthony was merged Sep 23, 2024 Loading…
update neox arg docs
#1292 by Quentin-Anthony was closed Sep 23, 2024 Loading…
mamba flop calculations
#1291 by jahatef was merged Sep 23, 2024 Loading…
Fix dataset bug
#1290 by Quentin-Anthony was merged Sep 22, 2024 Loading…
Remove the remaining two hanging wandb config fields
#1287 by Quentin-Anthony was merged Sep 18, 2024 Loading…
Make monitors consistent
#1286 by Quentin-Anthony was merged Sep 18, 2024 Loading…
Fix off by 1 error on masked tokens for RM training
#1285 by dmahan93 was merged Sep 18, 2024 Loading…
Update Comet integration instructions
#1284 by Lothiraldan was merged Sep 18, 2024 Loading…
Add model parallel group to reduce scatter
#1281 by bclyang was merged Sep 15, 2024 Loading…
Do not fail when git is not installed
#1280 by gcaillaut was merged Sep 24, 2024 Loading…
fix the imports needed for comet integration
#1279 by Quentin-Anthony was merged Sep 11, 2024 Loading…
fix gpt-j residual bias assumption
#1278 by dmahan93 was merged Sep 10, 2024 Loading…
Post training examples
#1277 by dmahan93 was merged Sep 14, 2024 Loading…
Hotfix llama models
#1276 by dmahan93 was merged Sep 10, 2024 Loading…
Add more informative checks for ZeRO incompatibility.
#1275 by AI-WAIFU was merged Sep 9, 2024 Loading…
Expand Docstring
#1273 by AI-WAIFU was merged Sep 9, 2024 Loading…
TE Import Hotfix
#1272 by Quentin-Anthony was merged Sep 9, 2024 Loading…
Hotfix Activation Typo
#1271 by Quentin-Anthony was merged Sep 9, 2024 Loading…
Formatting and Fix Mamba Config
#1270 by Quentin-Anthony was merged Sep 8, 2024 Loading…
ProTip! Exclude everything labeled bug with -label:bug.