Skip to content

Release v0.1.17

Compare
Choose a tag to compare
@merrymercy merrymercy released this 08 Jun 02:58
· 144 commits to main since this release
e8a2327

Highlights

  • Add data parallelim #480
  • Add speculative execution for OpenAI API #250
  • Update vllm to v0.4.3 for new quantization features #511
  • Better error handling (#457, #449, #514)

What's Changed

New Contributors

Full Changelog: v0.1.16...v0.1.17