[CODEGEN][METAL] Fix unaligned vector load #14332

tqchen · 2023-03-19T14:27:24Z

This PR fixes the implementation of unaligned vector load. Previously vector construction was printed as (float2)(v0, v1). This will cause problem as C have comma expression, and (v0, v1) will be evaluated as v1. The final result will become float2(v1, v1). The bug affects all codegen that uses the default implementation, such as metal. We added a testcase on metal to cover this case.

tvm-bot · 2023-03-19T14:27:27Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @echuraev _{See #10317 for details}

_{Generated by tvm-bot}

spectrometerHBH

Shall we somehow examine the output C source code is as expected?

tqchen · 2023-03-19T17:08:44Z

I have thought a bit about it in the end feel test correctness coverage is better in this case if we want future edits. We can indeed followup with source checking if new related regression arises

junrushao · 2023-03-19T18:51:44Z

it seems that this patch breaks OpenCL/Vulkan codegen tests..

tqchen · 2023-03-19T19:52:15Z

interesting, will dig a bit.

tqchen · 2023-03-19T19:55:45Z

seems that (float2)(v0,v1) was the right syntax for opencl, will add an overload to the opencl codegen

This PR fixes the implementation of unaligned vector load. Previously vector construction was printed as (float2)(v0, v1). This will cause problem as C have comma expression, and (v0, v1) will be evaluated as v1. The final result will become float2(v1, v1). The bug affects all codegen that uses the default implementation, such as metal. We added a testcase on metal to cover this case. Also updated codegen opencl to keep the old style as that is the convention opencl follows.

tqchen force-pushed the metal2 branch from a238376 to 4e855bd Compare March 19, 2023 14:29

junrushao approved these changes Mar 19, 2023

View reviewed changes

MasterJH5574 approved these changes Mar 19, 2023

View reviewed changes

spectrometerHBH approved these changes Mar 19, 2023

View reviewed changes

tqchen force-pushed the metal2 branch from 4e855bd to 8e914e1 Compare March 19, 2023 21:29

tqchen force-pushed the metal2 branch from 8e914e1 to 7d33223 Compare March 19, 2023 21:32

MasterJH5574 merged commit fc2a9e5 into apache:main Mar 20, 2023

ysh329 mentioned this pull request Apr 17, 2023

[Release] v0.12.0 Release Candidate Notes #14645

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CODEGEN][METAL] Fix unaligned vector load #14332

[CODEGEN][METAL] Fix unaligned vector load #14332

tqchen commented Mar 19, 2023

tvm-bot commented Mar 19, 2023

spectrometerHBH left a comment

tqchen commented Mar 19, 2023

junrushao commented Mar 19, 2023 •

edited

Loading

tqchen commented Mar 19, 2023

tqchen commented Mar 19, 2023

[CODEGEN][METAL] Fix unaligned vector load #14332

[CODEGEN][METAL] Fix unaligned vector load #14332

Conversation

tqchen commented Mar 19, 2023

tvm-bot commented Mar 19, 2023

spectrometerHBH left a comment

Choose a reason for hiding this comment

tqchen commented Mar 19, 2023

junrushao commented Mar 19, 2023 • edited Loading

tqchen commented Mar 19, 2023

tqchen commented Mar 19, 2023

junrushao commented Mar 19, 2023 •

edited

Loading