Tags: bdalal/lorax

v0.10.1

Hotfix for LoRA batching logic

v0.10.0

Speculative decoding, sgmv + bgmv

v0.9.0

Adapter memory manager

v0.8.1

Gemma support

v0.8.0

Structured output via Outlines

v0.7.0

LoRA merging per request

v0.6.0

OpenAI compatible API

v0.5.0

CUDA graph compile

v0.4.1

Fixes GPT-Q and Phi LoRAs

v0.4.0

Mixtral, Phi