
[Question] Can LLaVA run inference on CPU? #865

Open
wenli135 opened this issue Nov 27, 2023 · 5 comments

Comments

@wenli135

Question

I was trying to run LLaVA inference on CPU, but it complains "Torch not compiled with CUDA enabled". I noticed that cuda() is called when loading the model. If I remove all the cuda() invocations, is it possible to run inference on CPU?

Thanks.

@papasanimohansrinivas

You need to install the CPU build of torch and set the device map to CPU when loading the model @wenli135

For reference, a minimal sketch of what that could look like with the repo's own loader. The device_map/device arguments are assumptions on my side; check llava/model/builder.py in your checkout to confirm they exist in your version:
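```python
# Sketch only: load LLaVA weights on CPU with a CPU-only torch build.
# Assumes llava.model.builder.load_pretrained_model accepts device_map/device;
# verify against your version of the repo.
import torch
from llava.model.builder import load_pretrained_model
from llava.mm_utils import get_model_name_from_path

model_path = "liuhaotian/llava-v1.5-7b"
tokenizer, model, image_processor, context_len = load_pretrained_model(
    model_path=model_path,
    model_base=None,
    model_name=get_model_name_from_path(model_path),
    device_map="cpu",  # keep every module on CPU instead of calling .cuda()
    device="cpu",
)
model = model.to(dtype=torch.float32)  # plain float32 is the safest dtype on CPU
```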

@morteza102030

You need to install the CPU build of torch and set the device map to CPU when loading the model @wenli135

Is it possible for you to give a complete example of how to run LLaVA_13b_4bit_vanilla_colab without a GPU?

@akkimind

akkimind commented Dec 1, 2023

I made some changes in the code to run inference on CPU. The model loads, but while trying to optimize the model (model = ipex.optimize(model, dtype=torch.bfloat16)) I get an error:
BF16 weight prepack needs the cpu support avx512bw, avx512vl and avx512dq, please set dtype to torch.float or set weights_prepack to False
If I set dtype to torch.float, the model doesn't support it, and if I set weights_prepack to False, the model takes forever to return a response. Is there a specific CPU I should use?

The error message itself points at the two fallbacks. A small sketch of both, with a toy module standing in for the LLaVA model (assumed to already be loaded on CPU):
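```python
import torch
import intel_extension_for_pytorch as ipex

# Toy module standing in for the LLaVA model already loaded on CPU.
model = torch.nn.Sequential(torch.nn.Linear(16, 16), torch.nn.ReLU()).eval()

# Option 1: optimize in float32, which does not need BF16 weight prepack.
model_fp32 = ipex.optimize(model, dtype=torch.float32)

# Option 2: keep bfloat16 but disable weight prepacking, so the
# avx512bw/avx512vl/avx512dq requirement goes away (at a speed cost).
model_bf16 = ipex.optimize(model, dtype=torch.bfloat16, weights_prepack=False)
```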

@ratan

ratan commented Jan 9, 2024

Was anyone able to run LLaVA inference on CPU without installing the Intel Extension for PyTorch environment? Any pointer would be really helpful.

@feng-intel
Contributor

Hi Ratan,
Here is a bare-metal Intel CPU solution for LLMs, intel xFasterTransformer, but there is no LLaVA support yet. You can try that first.
llama.cpp also supports CPU inference; we will enable Intel dGPU/iGPU support later.

Could you tell us why you don't want to use the Intel Extension for PyTorch? Thanks.

For the llama.cpp route, here is a sketch using the llama-cpp-python binding with a GGUF conversion of LLaVA running fully on CPU. The model and projector file names below are placeholders for whatever conversion you have locally:
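```python
from llama_cpp import Llama
from llama_cpp.llama_chat_format import Llava15ChatHandler

# Placeholder GGUF files: a converted LLaVA checkpoint plus its CLIP projector.
chat_handler = Llava15ChatHandler(clip_model_path="mmproj-model-f16.gguf")
llm = Llama(
    model_path="llava-v1.5-7b-Q4_K_M.gguf",
    chat_handler=chat_handler,
    n_ctx=2048,
    n_gpu_layers=0,  # 0 offloaded layers = pure CPU inference
)

out = llm.create_chat_completion(
    messages=[{
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": "file:///path/to/image.jpg"}},
            {"type": "text", "text": "Describe this image."},
        ],
    }]
)
print(out["choices"][0]["message"]["content"])
```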
