AI PC: Text Generation - a Intel Collection

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

AurelPx/Pegasus-7b-slerp

Text Generation • Updated Mar 22 • 70 • 1

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

BAAI/Aquila-7B

Updated Sep 18, 2023 • 679 • 17

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

BAAI/Aquila2-7B

Text Generation • Updated Jun 7 • 770 • 6

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

BAAI/AquilaChat-7B

Updated Sep 21, 2023 • 378 • 48

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

BAAI/AquilaChat2-7B

Text Generation • Updated Aug 15 • 865 • 15

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

BigSalmon/GPT2Neo1.3BPoints

Text Generation • Updated Apr 4, 2022 • 15

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

EleutherAI/pythia-1.4b

Text Generation • Updated Jul 9, 2023 • 23.3k • 22

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

EleutherAI/pythia-12b

Text Generation • Updated Jul 9 • 80.4k • 131

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

EleutherAI/pythia-14m

Text Generation • Updated Jul 26, 2023 • 317k • 18

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

EleutherAI/pythia-160m

Text Generation • Updated Jul 9, 2023 • 94.5k • 25

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

EleutherAI/pythia-1b

Text Generation • Updated Jul 9, 2023 • 58.1k • 32

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

EleutherAI/pythia-2.8b

Text Generation • Updated Jun 9, 2023 • 20.3k • 28

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

EleutherAI/pythia-410m

Text Generation • Updated Jul 9, 2023 • 34.7k • 20

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

EleutherAI/pythia-6.9b

Text Generation • Updated Jun 8, 2023 • 26.6k • 47

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

EleutherAI/pythia-70m

Updated Nov 21, 2023 • 83.9k • 55

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

HuggingFaceH4/zephyr-7b-beta

Text Generation • Updated 23 days ago • 809k • • 1.6k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Intel/neural-chat-7b-v1-1

Text Generation • Updated Jan 15 • 40 • 23

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Intel/neural-chat-7b-v3-3

Text Generation • Updated Sep 9 • 53.4k • 75

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/CodeQwen1.5-7B-Chat

Text Generation • Updated Apr 30 • 48.5k • 312

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen-1_8B

Text Generation • Updated Dec 13, 2023 • 3.26k • 61

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen-1_8B-Chat

Text Generation • Updated Dec 13, 2023 • 58.5k • 108

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen-7B

Text Generation • Updated Jan 4 • 19k • 368

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen-7B-Chat

Text Generation • Updated Mar 19 • 63.5k • 751

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen1.5-0.5B

Text Generation • Updated Apr 5 • 401k • 143

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen1.5-0.5B-Chat

Text Generation • Updated Apr 30 • 331k • 74

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen1.5-1.8B

Text Generation • Updated Apr 5 • 354k • 43

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen1.5-1.8B-Chat

Text Generation • Updated Apr 30 • 12k • 45

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen1.5-4B

Text Generation • Updated Apr 5 • 12.8k • 33

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen1.5-4B-Chat

Text Generation • Updated Apr 30 • 6.18k • 37

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen1.5-7B

Text Generation • Updated Apr 5 • 203k • 45

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Qwen/Qwen1.5-7B-Chat

Text Generation • Updated Apr 30 • 31.8k • 162

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Salesforce/codegen-2B-multi

Text Generation • Updated Oct 3, 2022 • 11k • 35

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Salesforce/codegen-350M-mono

Text Generation • Updated Oct 3, 2022 • 14.1k • 86

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Salesforce/codegen-6B-multi

Text Generation • Updated Oct 3, 2022 • 4.47k • 19

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Salesforce/codegen2-1B_P

Text Generation • Updated Jul 6, 2023 • 2.58k • 38

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Salesforce/codegen2-3_7B_P

Text Generation • Updated Jul 6, 2023 • 129 • 15

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

Salesforce/codegen2-7B_P

Text Generation • Updated Jul 6, 2023 • 239 • 26

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

WizardLMTeam/WizardMath-7B-V1.1

Text Generation • Updated Jan 12 • 6.92k • 75

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

X-D-Lab/MindChat-Qwen2-4B

Text Generation • Updated Feb 4 • 15 • 4

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

adept/persimmon-8b-chat

Text Generation • Updated Oct 11, 2023 • 1.23k • 42

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

baichuan-inc/Baichuan2-13B-Chat

Text Generation • Updated Feb 26 • 93.8k • 421

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

baichuan-inc/Baichuan2-7B-Base

Text Generation • Updated Jan 31 • 1.89k • 77

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

baichuan-inc/Baichuan2-7B-Chat

Text Generation • Updated Feb 26 • 22.9k • 158

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

bigscience/bloom-560m

Text Generation • Updated Sep 26, 2023 • 222k • 346

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

bigscience/bloomz-1b1

Text Generation • Updated May 27, 2023 • 2.67k • 32

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

bigscience/bloomz-3b

Text Generation • Updated May 27, 2023 • 10.5k • 77

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

bigscience/bloomz-7b1-mt

Text Generation • Updated Jan 10 • 2.61k • 140

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

facebook/opt-1.3b

Text Generation • Updated Sep 15, 2023 • 18.6M • 151

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

facebook/opt-125m

Text Generation • Updated Sep 15, 2023 • 7.24M • 163

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

facebook/opt-13b

Text Generation • Updated Jan 24, 2023 • 19.8k • 65

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

facebook/opt-2.7b

Text Generation • Updated Sep 15, 2023 • 30.3k • 80

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

facebook/opt-350m

Text Generation • Updated Sep 15, 2023 • 226k • 129

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

facebook/opt-6.7b

Text Generation • Updated Jan 24, 2023 • 142k • 109

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

facebook/opt-iml-1.3b

Text Generation • Updated Jan 26, 2023 • 875 • 29

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

google/codegemma-1.1-2b

Text Generation • Updated Aug 7 • 111 • 17

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

google/codegemma-1.1-7b-it

Text Generation • Updated Aug 7 • 179 • 49

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

google/codegemma-2b

Text Generation • Updated Aug 7 • 5.75k • 72

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

google/codegemma-7b

Text Generation • Updated Aug 7 • 3.86k • 165

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

google/gemma-1.1-2b-it

Text Generation • Updated Jun 27 • 91.4k • 151

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

google/gemma-1.1-7b-it

Text Generation • Updated Jun 27 • 16.3k • • 263

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

google/gemma-2b

Text Generation • Updated Sep 27 • 495k • 908

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

google/gemma-2b-it

Text Generation • Updated Sep 27 • 95.8k • • 670

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

google/gemma-7b

Text Generation • Updated Jun 27 • 342k • • 3.05k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

google/gemma-7b-it

Text Generation • Updated Aug 14 • 467k • 1.14k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

ibm-granite/granite-3b-code-base-2k

Text Generation • Updated Sep 2 • 44.4k • 35

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

ibm-granite/granite-3b-code-instruct-2k

Text Generation • Updated Sep 2 • 13.2k • 31

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

ibm-granite/granite-8b-code-base-4k

Text Generation • Updated Sep 2 • 2.99k • 29

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

ibm-granite/granite-8b-code-instruct-4k

Text Generation • Updated Sep 2 • 9.51k • 108

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

internlm/internlm2-1_8b

Text Generation • Updated Aug 20 • 16.8k • 28

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

internlm/internlm2-7b

Text Generation • Updated Aug 20 • 12.7k • 40

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

internlm/internlm2-chat-1_8b

Text Generation • Updated Aug 20 • 8.94k • 29

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

internlm/internlm2-chat-7b

Text Generation • Updated Aug 20 • 15.9k • 82

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

internlm/internlm2-chat-7b-sft

Text Generation • Updated Aug 20 • 7.93k • 6

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

internlm/internlm2-math-7b

Text Generation • Updated Aug 20 • 259 • 27

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

internlm/internlm2-math-base-7b

Text Generation • Updated Aug 20 • 568 • 2

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

ise-uiuc/Magicoder-CL-7B

Text Generation • Updated Dec 6, 2023 • 96 • 21

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

ise-uiuc/Magicoder-DS-6.7B

Text Generation • Updated Mar 6 • 720 • 37

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

ise-uiuc/Magicoder-S-CL-7B

Text Generation • Updated Dec 6, 2023 • 1.93k • 44

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

ise-uiuc/Magicoder-S-DS-6.7B

Text Generation • Updated Mar 6 • 3.18k • 200

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

meta-llama/Llama-2-13b-chat-hf

Text Generation • Updated Apr 17 • 1.02M • 1.02k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

meta-llama/Llama-2-13b-hf

Text Generation • Updated Apr 17 • 190k • 573

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

meta-llama/Llama-2-7b-chat-hf

Text Generation • Updated Apr 17 • 830k • • 3.97k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

meta-llama/Llama-2-7b-hf

Text Generation • Updated Apr 17 • 1.28M • 1.77k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

meta-llama/Meta-Llama-3-8B

Text Generation • Updated Sep 27 • 664k • 5.81k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

meta-llama/Meta-Llama-3-8B-Instruct

Text Generation • Updated Sep 27 • 2.19M • • 3.59k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

meta-llama/Meta-Llama-Guard-2-8B

Text Generation • Updated May 13 • 12.6k • 281

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

microsoft/Phi-3-medium-4k-instruct

Text Generation • Updated Aug 20 • 42.1k • 211

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

microsoft/Phi-3-mini-128k-instruct

Text Generation • Updated Aug 20 • 584k • 1.6k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

microsoft/phi-2

Text Generation • Updated Apr 29 • 248k • 3.24k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

mistralai/Mistral-7B-Instruct-v0.2

Text Generation • Updated Sep 27 • 1.1M • • 2.57k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

mistralai/Mistral-7B-Instruct-v0.3

Text Generation • Updated Aug 21 • 484k • • 1.12k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

mistralai/Mistral-7B-v0.3

Text Generation • Updated Jul 24 • 288k • 387

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

mosaicml/mpt-7b

Text Generation • Updated Mar 5 • 42.7k • 1.16k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

mosaicml/mpt-7b-8k

Text Generation • Updated Mar 5 • 1.85k • 26

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

mosaicml/mpt-7b-8k-chat

Text Generation • Updated Mar 5 • 1.21k • 40

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

mosaicml/mpt-7b-chat

Text Generation • Updated Mar 5 • 8.3k • 512

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

mosaicml/mpt-7b-instruct

Text Generation • Updated Mar 5 • 8.05k • 467

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

mosaicml/mpt-7b-storywriter

Text Generation • Updated Mar 5 • 2.15k • 822

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

openai-community/gpt2

Text Generation • Updated Feb 19 • 15.9M • • 2.35k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

openbmb/MiniCPM-2B-sft-bf16

Text Generation • Updated Sep 7 • 1.86k • 118

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

openchat/openchat-3.6-8b-20240522

Text Generation • Updated May 28 • 14.6k • 149

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

stabilityai/stablelm-2-12b

Text Generation • Updated Jul 10 • 927 • 115

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

stabilityai/stablelm-2-12b-chat

Text Generation • Updated May 20 • 2.39k • 86

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

stabilityai/stablelm-2-1_6b

Text Generation • Updated Jul 10 • 4.39k • 185

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

stabilityai/stablelm-2-1_6b-chat

Text Generation • Updated Jun 3 • 4.54k • 31

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

stabilityai/stablelm-2-zephyr-1_6b

Text Generation • Updated Jun 3 • 8.98k • 180

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

stabilityai/stablelm-3b-4e1t

Text Generation • Updated Mar 7 • 16k • 309

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

stabilityai/stablelm-base-alpha-3b

Text Generation • Updated Oct 19, 2023 • 2.16k • 82

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

stabilityai/stablelm-tuned-alpha-7b

Text Generation • Updated Apr 19, 2023 • 4.15k • 357

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

stabilityai/stablelm-zephyr-3b

Text Generation • Updated Jul 10 • 12.8k • 247

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

stanford-crfm/BioMedLM

Text Generation • Updated Mar 28 • 2.67k • 394

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

tiiuae/falcon-11B

Text Generation • Updated Sep 4 • 17.9k • 212

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

tiiuae/falcon-7b

Text Generation • Updated 27 days ago • 105k • 1.08k

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

tiiuae/falcon-7b-instruct

Text Generation • Updated 27 days ago • 168k • • 918

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

togethercomputer/Pythia-Chat-Base-7B

Text Generation • Updated Mar 29, 2023 • 1.59k • 66

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

togethercomputer/RedPajama-INCITE-7B-Base

Text Generation • Updated Jun 6, 2023 • 1.09k • 94

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

togethercomputer/RedPajama-INCITE-7B-Chat

Text Generation • Updated Jun 5, 2023 • 1.71k • 92

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

togethercomputer/RedPajama-INCITE-7B-Instruct

Text Generation • Updated Aug 9, 2023 • 1.63k • 104

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

togethercomputer/RedPajama-INCITE-Chat-3B-v1

Text Generation • Updated May 9, 2023 • 8.52k • 152

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation

xverse/XVERSE-7B-Chat

Text Generation • Updated Nov 6, 2023 • 78 • 8

Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation