cpu: aarch64: matmul: Move allocation of temporary tensors to scratchpad in acl_matmul #1935

annop-w · 2024-05-29T20:41:34Z

Description

Introduce 3 new scrathpad memory key names.

Checklist

General

Do all unit and benchdnn tests (make test and make test_benchdnn_*) pass locally for each commit?
Have you formatted the code using clang-format?

src/cpu/aarch64/matmul/acl_matmul_utils.hpp

mgouicem · 2024-05-30T11:31:07Z

tagging @snadampal as this relates to #1470

vpirogov · 2024-06-24T15:49:39Z

+@snadampal, could you please help reviewing these changes?

@annop-w, please resolve merge conflict.

…pad in acl_matmul Introduce 3 new scrathpad memory key names.

annop-w · 2024-06-24T16:00:13Z

@annop-w, please resolve merge conflict.

Done.

src/common/memory_tracking.hpp

snadampal · 2024-06-24T17:15:31Z

glad to see this change finally coming.
Hi @annop-w to understand the change a bit in detail, the scratchpad buffers for src and weights are the same buffers allocated in the framework, right . for example, by ideep in PyTorch and by mkldnn wrapper in TensorFlow?

thanks @mgouicem and @vpirogov for tagging me here.

annop-w · 2024-06-24T18:12:33Z

@snadampal I am not sure how the scratchpads are currently managed in ideep (or Tensorflow), but the idea here is to allow for users (i.e. PyTorch or TF) a chance to decide that, isn't it ? For example, PyTorch can now choose to allocate the same buffer for both src and wei, if sensible, which was not possible before. Does this help ?

snadampal · 2024-06-24T22:33:31Z

i mean reusing the buffer allocated in PT or TF via oneDNN user mode scratchpad, you clarified it, thanks.
this change LGTM.

annop-w · 2024-06-24T22:36:24Z

@snadampal Ah, yes, in that case, you're absolutely right. Thanks for the review.

vpirogov · 2024-06-25T05:05:54Z

Awesome. Thanks, @snadampal.

dzarukin approved these changes May 29, 2024

View reviewed changes

src/cpu/aarch64/matmul/acl_matmul_utils.hpp Outdated Show resolved Hide resolved

annop-w force-pushed the matmul branch from ac49e61 to b80ec23 Compare May 29, 2024 21:07

vpirogov added this to the v3.6 milestone May 29, 2024

mgouicem added the platform:aarch64 label May 30, 2024

cpu: aarch64: matmul: Move allocation of temporary tensors to scratch…

323a9d6

…pad in acl_matmul Introduce 3 new scrathpad memory key names.

annop-w force-pushed the matmul branch from b80ec23 to 323a9d6 Compare June 24, 2024 15:58

dzarukin reviewed Jun 24, 2024

View reviewed changes

src/common/memory_tracking.hpp Show resolved Hide resolved

vpirogov merged commit 6f14365 into oneapi-src:main Jun 25, 2024
6 of 10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cpu: aarch64: matmul: Move allocation of temporary tensors to scratchpad in acl_matmul #1935

cpu: aarch64: matmul: Move allocation of temporary tensors to scratchpad in acl_matmul #1935

annop-w commented May 29, 2024

mgouicem commented May 30, 2024

vpirogov commented Jun 24, 2024

annop-w commented Jun 24, 2024

snadampal commented Jun 24, 2024

annop-w commented Jun 24, 2024

snadampal commented Jun 24, 2024 •

edited

Loading

annop-w commented Jun 24, 2024

vpirogov commented Jun 25, 2024

cpu: aarch64: matmul: Move allocation of temporary tensors to scratchpad in acl_matmul #1935

cpu: aarch64: matmul: Move allocation of temporary tensors to scratchpad in acl_matmul #1935

Conversation

annop-w commented May 29, 2024

Description

Checklist

General

mgouicem commented May 30, 2024

vpirogov commented Jun 24, 2024

annop-w commented Jun 24, 2024

snadampal commented Jun 24, 2024

annop-w commented Jun 24, 2024

snadampal commented Jun 24, 2024 • edited Loading

annop-w commented Jun 24, 2024

vpirogov commented Jun 25, 2024

snadampal commented Jun 24, 2024 •

edited

Loading