Skip to content

Pull requests: OpenAccess-AI-Collective/axolotl

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

add support for multipack for deepseek_v2
#1712 opened Jun 19, 2024 by winglian Loading…
Allow "weight: 0" in messages to mask them
#1703 opened Jun 11, 2024 by DavidFarago Loading…
Update multi-node.qmd
#1688 opened Jun 7, 2024 by shahdivax Loading…
sanity check ranges in freeze.py
#1686 opened Jun 6, 2024 by josharian Loading…
typo
#1685 opened Jun 6, 2024 by Klingefjord Loading…
improve Pre-Tokenized Dataset docs
#1684 opened Jun 5, 2024 by josharian Loading…
jagged lr restart scheudler
#1680 opened Jun 3, 2024 by winglian Loading…
use L4 vs A10G for modal cicd
#1673 opened May 29, 2024 by winglian Loading…
Fix setting correct repo id when pushing dataset to hub
#1657 opened May 25, 2024 by chrislee973 Loading…
1 task done
Fused Cross Entropy Loss
#1601 opened May 7, 2024 by winglian Loading…
add support for SPPO
#1585 opened May 2, 2024 by winglian Loading…
WIP test out new dockerfile with more nvidia tools
#1557 opened Apr 21, 2024 by winglian Loading…
Add experimental install guide for ROCm
#1550 opened Apr 19, 2024 by xzuyn Loading…
Feat: Add cohere (commandr)
#1547 opened Apr 19, 2024 by NanoCode012 Loading…
Add data streaming support through mosaic-streaming
#1525 opened Apr 16, 2024 by fmv1992 Loading…
add tests for merging lora and validating the dtype
#1512 opened Apr 10, 2024 by winglian Loading…
add optimizer step to prevent warning in tests
#1502 opened Apr 9, 2024 by winglian Loading…
add support for adamw schedulefree
#1486 opened Apr 6, 2024 by winglian Loading…
improved Jamba deepspeed z3 compat
#1471 opened Mar 31, 2024 by tmm1 Loading…
fix optimizer reset
#1414 opened Mar 16, 2024 by winglian Loading…
implement post training
#1407 opened Mar 15, 2024 by ehartford Loading…
ProTip! Exclude everything labeled bug with -label:bug.