We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Update pyproject.toml (#322)
Add flash_attn support (#306) (#313) * Add flash_attn support (#306) * add dockerfile for flash_attn setup * remove test.py * parametrize model name and engine * Update Dockerfile --------- Co-authored-by: Michael Feil <[email protected]> * Delete libs/infinity_emb/Dockerfile.flash --------- Co-authored-by: Göktürk <[email protected]>
bump versions
Update pyproject.toml Release0.0.50
api changes sync async (#286) * api: fix minor mismatches * add infer.py
Update pyproject.toml