Officially Support AMD GPUs #954

Closed · 4 tasks done
Quentin-Anthony opened this issue May 26, 2023 · 5 comments
Labels: feature request

Quentin-Anthony (Member) commented May 26, 2023

Currently, AMD GPU support lives experimentally in the AMD branch (see the main...AMD comparison).

We should port the kernel guards from microsoft/Megatron-DeepSpeed@b4d4a0e#diff-059209398b62b21e2524b387fc6fd23ded28ec725798322c266ee4246d253670 to gpt-neox and bring this support into main.

  • Test the latest gpt-neox main on AMD GPUs with fused kernels disabled
  • Port the fused kernels to AMD hardware and test them on AMD GPUs
  • Add conditional HIP guards so that the same fused-kernel code can run on AMD and NVIDIA GPUs without modification (a build-level sketch of this idea follows this list)
  • Add flash-attn fallbacks, since flash-attn 2.x is not yet supported on AMD GPUs
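A minimal build-level sketch of the guarding idea, assuming PyTorch's ROCm detection via `torch.version.hip`; the helper names and source list below are illustrative, not the actual gpt-neox build code, and the guards in the Megatron-DeepSpeed diff above are C preprocessor checks inside the kernel sources themselves:

```python
# Hypothetical sketch: guard the fused-kernel extension build so the same
# setup path works on ROCm (AMD) and CUDA (NVIDIA) PyTorch builds.
import torch
from torch.utils.cpp_extension import CUDAExtension


def is_rocm_pytorch() -> bool:
    # torch.version.hip is a version string on ROCm builds and None on CUDA builds.
    return getattr(torch.version, "hip", None) is not None


def fused_softmax_extension() -> CUDAExtension:
    # Illustrative source list; the real kernels live under megatron/fused_kernels.
    sources = ["scaled_masked_softmax.cpp", "scaled_masked_softmax_cuda.cu"]
    if is_rocm_pytorch():
        # On ROCm, torch's extension machinery hipifies the .cu sources;
        # NVIDIA-only nvcc flags (e.g. -gencode arch lists) must be skipped.
        gpu_flags = ["-O3"]
    else:
        gpu_flags = ["-O3", "--use_fast_math"]
    return CUDAExtension(
        name="scaled_masked_softmax_cuda",
        sources=sources,
        extra_compile_args={"cxx": ["-O3"], "nvcc": gpu_flags},
    )
```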
Quentin-Anthony added the feature request label on May 26, 2023
R0n12 (Contributor) commented Oct 18, 2023

Will take a look at this!

R0n12 (Contributor) commented Jan 15, 2024

  • Flash-Attention-2 detection code
  • All fused kernels (except fused_rotary_positional_embedding) build successfully on MI250X + ROCm 5.6.0 without HIP guards
  • fused_rotary_positional_embedding build on AMD GPUs
  • Adding HIP guards to generalize the build process across AMD and NVIDIA GPUs
  • Tests on NVIDIA platforms with no modifications

Do we still need to detect flash attention 1 at this point since AMD has already ported their version 2? (https://github.com/ROCmSoftwarePlatform/flash-attention)

Branch: https://github.com/R0n12/gpt-neox-fork/tree/lang/amd
Based on: https://github.com/EleutherAI/gpt-neox/tree/AMD

Quentin-Anthony (Member, Author) commented Jan 15, 2024

> Do we still need to detect flash attention 1 at this point since AMD has already ported their version 2?

No, I think we can drop that.
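Given that only the 2.x API needs to be detected, a minimal sketch of such a check, assuming the `flash_attn` package exposes `__version__`; the helper name and backend labels are illustrative, not actual gpt-neox code:

```python
# Hypothetical sketch: detect Flash-Attention 2 at import time and fall back
# to the unfused attention path when it is unavailable (e.g. on unsupported
# ROCm setups).
try:
    import flash_attn
    from packaging.version import Version

    HAVE_FLASH_ATTN_2 = Version(flash_attn.__version__) >= Version("2.0.0")
except ImportError:
    HAVE_FLASH_ATTN_2 = False


def attention_backend() -> str:
    # "flash2" / "unfused" are illustrative labels, not gpt-neox config values.
    return "flash2" if HAVE_FLASH_ATTN_2 else "unfused"
```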

R0n12 (Contributor) commented Mar 10, 2024

Status update: I've been busy with life things recently; I'll push out a new, clean branch by next week.

Quentin-Anthony (Member, Author) commented
Done! Great work @R0n12
