src: cpu: aarch64: add support for s8:s8:s8 in ACL lowp matmul #1966

michalowski-arm · 2024-06-17T15:11:34Z

Performance results:

matrix scale | speed-up

 128x128   |  x0.82
 256x256   |  x2.63
 512x512   |  x33.3
1024x1024  |  x60.7
2048x2048  |  x60.3

To select the correct s8->s8 ACL kernel, we need to send
all quantization info at configuration but oneDNN does
not make these available until execution. This change
goes around this issue by first performing s8->f32 matmul
and then requantizing back to s8.

jondea

This looks great, thank you @michalowski-arm. This has also been reviewed internally.

jondea · 2024-06-24T09:04:08Z

If there are no more comments, would it be possible to get this merged please? The failures look common

vpirogov · 2024-06-24T15:47:56Z

@jondea, the failures are caused by an MSVC bug.

Thanks for the code review!

michalowski-arm and others added 2 commits June 17, 2024 15:05

src: cpu: aarch64: add support for s8:s8:s8 in ACL lowp matmul

91fc2d4

Fix memory_tracking.hpp

ebae5ce

jondea added the platform:aarch64 label Jun 18, 2024

jondea approved these changes Jun 18, 2024

View reviewed changes

vpirogov added this to the v3.6 milestone Jun 21, 2024

vpirogov merged commit 5806809 into oneapi-src:main Jun 24, 2024
7 of 10 checks passed

annop-w mentioned this pull request Jun 24, 2024

cpu: aarch64: matmul: Move allocation of temporary tensors to scratchpad in acl_matmul #1935

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src: cpu: aarch64: add support for s8:s8:s8 in ACL lowp matmul #1966

src: cpu: aarch64: add support for s8:s8:s8 in ACL lowp matmul #1966

michalowski-arm commented Jun 17, 2024

jondea left a comment

jondea commented Jun 24, 2024

vpirogov commented Jun 24, 2024

src: cpu: aarch64: add support for s8:s8:s8 in ACL lowp matmul #1966

src: cpu: aarch64: add support for s8:s8:s8 in ACL lowp matmul #1966

Conversation

michalowski-arm commented Jun 17, 2024

matrix scale | speed-up

jondea left a comment

Choose a reason for hiding this comment

jondea commented Jun 24, 2024

vpirogov commented Jun 24, 2024