Implement AtomicFAddEXT for the CUDA BE #2853

AGindinson · 2020-12-02T18:59:24Z

After 4fdbfae, there are preparations to switch atomic fetch_add/fetch_sub FP implementations to using the new SPIR-V operand. Providing a "native" implementation in the CUDA BE would enable us to use the leveraged function for NVPTX targets as well (#if !defined(__NVPTX__) macros would have to be removed to achieve this).

The text was updated successfully, but these errors were encountered:

ldrumm · 2023-01-09T17:16:00Z

I think this is now implemented. It looks like @AGindinson did the meat of this work in 37a9a2a

Additionally, relevant libclc support went in in the following PRs:
#4820
#4853
#5025
#5191

@AGindinson is there anything missing? Perhaps we can close this?

AGindinson · 2023-01-10T10:48:49Z

@AlexeySachkov, could you please help with evaluating this one?

npmiller · 2023-05-09T14:52:02Z

@AlexeySachkov @AGindinson any updates on this?

AlexeySachkov · 2023-05-09T15:15:28Z

@AlexeySachkov @AGindinson any updates on this?

Not really. Both of us are not directly working on CUDA, so this item is a lower priority for us both. Feel free to pick it up. I'm also fine with closing it if we believe that everything is implemented already

npmiller · 2023-05-09T15:35:13Z

From a quick look into the headers I don't see any #if !defined(__NVPTX__) usage for fetch_add/fetch_sub, so I believe this is implemented, I'm fine with closing it, what do you think @ldrumm ?

The `OpSizeOf` instruction was added in SPIR-V 1.1, but not supported yet. Original commit: KhronosGroup/SPIRV-LLVM-Translator@9aeb7eb92d7c0cb

AGindinson added the cuda CUDA back-end label Dec 2, 2020

AGindinson mentioned this issue Jan 13, 2021

[SYCL] Specialize atomic fetch_add for floating point types #2765

Merged

AlexeySachkov added the enhancement New feature or request label Feb 2, 2021

Pennycook mentioned this issue Mar 1, 2021

[SYCL][CUDA] Add initial support for FP atomics #3276

Merged

AGindinson assigned Pennycook Mar 3, 2021

AerialMantis added the performance Performance related issues label Aug 18, 2021

ldrumm closed this as completed May 9, 2023

jsji pushed a commit that referenced this issue Nov 22, 2024

SPIRVReader: Add OpSizeOf support (#2853)

ed629d3

The `OpSizeOf` instruction was added in SPIR-V 1.1, but not supported yet. Original commit: KhronosGroup/SPIRV-LLVM-Translator@9aeb7eb92d7c0cb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement AtomicFAddEXT for the CUDA BE #2853

Implement AtomicFAddEXT for the CUDA BE #2853

AGindinson commented Dec 2, 2020

ldrumm commented Jan 9, 2023

AGindinson commented Jan 10, 2023

npmiller commented May 9, 2023

AlexeySachkov commented May 9, 2023

npmiller commented May 9, 2023

Implement AtomicFAddEXT for the CUDA BE #2853

Implement AtomicFAddEXT for the CUDA BE #2853

Comments

AGindinson commented Dec 2, 2020

ldrumm commented Jan 9, 2023

AGindinson commented Jan 10, 2023

npmiller commented May 9, 2023

AlexeySachkov commented May 9, 2023

npmiller commented May 9, 2023