Tags · intel/intel-extension-for-transformers

v1.4.2

[vLLM] Support vLLM CPU backend and provide QBits acceleration (#1551)

Co-authored-by: VincyZhang <[email protected]>
Co-authored-by: Wang, Chang <[email protected]>

May 24, 2024
0e13607
zip
tar.gz
Notes

v1.4.1

Update modeling_auto.py (#1499)

Signed-off-by: Wang, Chang <[email protected]>

Apr 21, 2024
0fc6e01
zip
tar.gz
Notes

v1.4

[Transformers] Support load mode from HF Hub when use Neural Speed (#1449)

Co-authored-by: Wenxin Zhang <[email protected]>
Co-authored-by: changwangss <[email protected]>

Apr 3, 2024
346211c
zip
tar.gz
Notes

v1.4rc1

[NeuralChat] Fix tgi endpoint in test (#1388)

Mar 18, 2024
a8e5295
zip
tar.gz

v1.3.2

fix ut (#1307)

Feb 24, 2024
9e9e4c7
zip
tar.gz
Notes

v1.3.1

Support weight-only kernel with IPEX for intel GPU (#1153)

Jan 19, 2024
81d4c56
zip
tar.gz
Notes

v1.3.1.dev0

[LLM Runtime] Fix convert mistral script missing parameter issue (#1100)

Jan 5, 2024
e01d330
zip
tar.gz

v1.3

[LLM Runtime] dynamic link the layer to compress binary size (#1059)

Co-authored-by: Ding, Yi <[email protected]>

Dec 22, 2023
6e3a514
zip
tar.gz
Notes

llmaas1.3.0

Merge branch 'lvl/fix_config_parameters' into llmaas1.3.0

Dec 14, 2023
43ea9e9
zip
tar.gz

v1.2.2

fix checkmarx issue (#842)

Signed-off-by: Wenxin Zhang <[email protected]>

Dec 1, 2023
8644604
zip
tar.gz
Notes
