
Why does GPT-J perform better on Graviton without SIMD than on x86 with SIMD? #520

Open
xshen053 opened this issue Sep 13, 2023 · 0 comments

@xshen053

I ran `gpt-j -t 1 -m ../build/models/gpt-j-6B/ggml-model.bin -p "This is an example"` on both a c6i.8xlarge and a c7g.8xlarge instance.

Graviton (c7g.8xlarge)

By default, GGML_SIMD is not enabled on Graviton. With GGML_SIMD disabled, I got these results (a compile-time check for what the toolchain enables is sketched after the numbers below):

Threads: 4  | Average ms/token: 298.39
Threads: 16 | Average ms/token: 80.54
Threads: 32 | Average ms/token: 59.32

With GGML_SIMD enabled:

Threads: 4  | Average ms/token: 131.38
Threads: 16 | Average ms/token: 63.96
Threads: 32 | Average ms/token: 54.81
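For reference, whether ggml compiles its GGML_SIMD paths generally follows the vector-extension macros the compiler predefines for the target (that mapping is an assumption here; the exact conditions live in ggml.c). A minimal probe like the one below, built with the same compiler flags as the gpt-j binary, shows what each instance's toolchain enables by default:

```c
/* simd_probe.c -- minimal sketch: print which SIMD-related predefined
 * macros the compiler sets for this target. Assumption: ggml keys its
 * GGML_SIMD paths off macros like these (see ggml.c for the real checks).
 *
 * Build with the same -march/-mcpu flags as the gpt-j build, e.g.:
 *   cc -O3 simd_probe.c -o simd_probe && ./simd_probe
 */
#include <stdio.h>

int main(void) {
#if defined(__ARM_NEON)
    printf("__ARM_NEON defined (Arm NEON available)\n");
#endif
#if defined(__ARM_FEATURE_FP16_VECTOR_ARITHMETIC)
    printf("__ARM_FEATURE_FP16_VECTOR_ARITHMETIC defined (fp16 NEON arithmetic)\n");
#endif
#if defined(__AVX2__)
    printf("__AVX2__ defined (x86 AVX2 available)\n");
#endif
#if defined(__AVX512F__)
    printf("__AVX512F__ defined (x86 AVX-512 available)\n");
#endif
#if defined(__FMA__)
    printf("__FMA__ defined (x86 fused multiply-add available)\n");
#endif
    return 0;
}
```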

x86 Intel (c6i.8xlarge)

On the Intel x86 instance, I got these results (GGML_SIMD enabled):

Threads: 4  | Average ms/token: 270.19
Threads: 16 | Average ms/token: 105.40
Threads: 32 | Average ms/token: 97.06

Why is Intel with SIMD not as fast as Arm without SIMD?
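For context on what the flag changes: with GGML_SIMD enabled on AArch64, ggml's hot dot-product loops use NEON intrinsics instead of plain scalar code. The sketch below only illustrates that scalar-vs-NEON difference; it is not ggml's actual kernel, and the function names are made up for the example:

```c
/* dot_sketch.c -- illustration only, not ggml's real kernels: the kind of
 * inner loop that GGML_SIMD vectorizes on Arm.
 * Build on the Graviton box:  cc -O3 dot_sketch.c -o dot_sketch
 */
#include <stdio.h>
#if defined(__ARM_NEON) && defined(__aarch64__)
#include <arm_neon.h>
#endif

/* Plain scalar dot product: roughly what runs when GGML_SIMD is disabled. */
static float dot_scalar(const float *a, const float *b, int n) {
    float s = 0.0f;
    for (int i = 0; i < n; i++) s += a[i] * b[i];
    return s;
}

#if defined(__ARM_NEON) && defined(__aarch64__)
/* NEON dot product: 4 floats per fused multiply-add, plus a scalar tail. */
static float dot_neon(const float *a, const float *b, int n) {
    float32x4_t acc = vdupq_n_f32(0.0f);
    int i = 0;
    for (; i + 4 <= n; i += 4)
        acc = vfmaq_f32(acc, vld1q_f32(a + i), vld1q_f32(b + i));
    float s = vaddvq_f32(acc);            /* horizontal sum of the 4 lanes */
    for (; i < n; i++) s += a[i] * b[i];  /* leftover elements */
    return s;
}
#endif

int main(void) {
    enum { N = 1024 };
    float a[N], b[N];
    for (int i = 0; i < N; i++) { a[i] = 0.001f * i; b[i] = 0.002f * i; }
    printf("scalar: %f\n", dot_scalar(a, b, N));
#if defined(__ARM_NEON) && defined(__aarch64__)
    printf("neon:   %f\n", dot_neon(a, b, N));
#endif
    return 0;
}
```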
