Skip to content

v0.4.0

Latest
Compare
Choose a tag to compare
@winglian winglian released this 24 Jan 20:08
· 261 commits to main since this release
1427d5b

New Features (highlights)

  • Streaming multipack for continued pre-training
  • Mistral & Mixtral support
  • Simplified Multipack for Mistral, Falcon, Qwen2, and Phi
  • DPO/IPO/KTO-pairs RL-training support via trl
  • Improve BatchSampler for multipack support, allows for resume from checkpointing, shuffling data each epoch
  • bf16: auto support
  • add MLFlow support
  • save YAML configs to WandB
  • save predictions during evals to WandB
  • more tests! more smoke tests for smol model training
  • NEFTune support

What's Changed

New Contributors

Full Changelog: v0.3.0...v0.4.0