Tags: intel/intel-extension-for-transformers
Tags
[vLLM] Support vLLM CPU backend and provide QBits acceleration (#1551) Co-authored-by: VincyZhang <[email protected]> Co-authored-by: Wang, Chang <[email protected]>
[Transformers] Support load mode from HF Hub when use Neural Speed (#… …1449) Co-authored-by: Wenxin Zhang <[email protected]> Co-authored-by: changwangss <[email protected]>
[LLM Runtime] Fix convert mistral script missing parameter issue (#1100)
Merge branch 'lvl/fix_config_parameters' into llmaas1.3.0
PreviousNext