
Slow sparse quantized models #22

Open
clementpoiret opened this issue Apr 29, 2022 · 1 comment
Labels
bug Something isn't working

Comments

@clementpoiret
Owner

Describe the bug

Even on CPUs with AVX-512 VNNI support, the sparse int8-quantized models run slower than expected.

To Reproduce
Steps to reproduce the behavior:

  1. Use bagging_sq or single_sq segmentations for inference
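A first sanity check for this report is to time both model variants directly. The sketch below is a generic timing harness, not HSF's actual API: `fp32_infer` and `int8_infer` are hypothetical stand-ins for the real FP32 and sparse int8 model calls.

```python
import time
import statistics

def benchmark(fn, warmup=3, runs=10):
    """Run a few warmup passes, then return the median latency in seconds."""
    for _ in range(warmup):
        fn()
    times = []
    for _ in range(runs):
        t0 = time.perf_counter()
        fn()
        times.append(time.perf_counter() - t0)
    return statistics.median(times)

# Hypothetical stand-ins for the actual model inference calls.
def fp32_infer():
    sum(i * i for i in range(10_000))

def int8_infer():
    sum(i * i for i in range(10_000))

print(f"fp32 median latency: {benchmark(fp32_infer):.6f}s")
print(f"int8 median latency: {benchmark(int8_infer):.6f}s")
```

With the real models substituted in, the int8 median should come out clearly lower than the FP32 one on a VNNI-capable CPU; in this issue it does not.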

Expected behavior

Inference with the sparse int8-quantized models should be faster than with the FP32 models.

Screenshots

N/A

Environment (please complete the following information):

  • OS: Ubuntu
  • Python: 3.8
  • HSF Version: 1.1.1
  • Relevant settings: segmentation=bagging_sq or segmentation=single_sq

Additional context
N/A

@clementpoiret clementpoiret added the bug Something isn't working label Apr 29, 2022
@clementpoiret
Owner Author

See neuralmagic/sparseml#733
