AMDGPU JIT compiler #453

mratsim · 2024-08-04T16:19:05Z

🔥 🔥 🔥

This adds an end-to-end LLVM IR -> AMD GPU JIT compiler.

The good news is that AMD GPUs support vectorized add-with-carry. The bad news is that unlike Nvidia GPUs, you cannot use inline assembly to guarantee it so you need to cajole the compiler into producing those:

See details there GPU backends #92 (comment)

Another good news is that the device function is properly vectorized without needing to use tricks like __forceinline__ or "Scalable Vector" types in LLVM.

… not object code)

mratsim added 2 commits August 3, 2024 18:55

feat(AMD GPU): initial commit to support AMD GPU (working for ASM but…

9e6f30b

… not object code)

feat(AMD GPU): 🔥 🔥 end to end JIT compilation to AMD GPU is working

9a4dfaf

mratsim added the enhancement New feature or request label Aug 4, 2024

mratsim merged commit 1e34ec2 into master Aug 5, 2024
24 checks passed

mratsim deleted the amdgpu branch August 5, 2024 05:23

This was referenced Aug 27, 2024

Nvidia remastered #464

Merged

[GPU] GPU / LLVM IR Elliptic curves implementation plan #465

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AMDGPU JIT compiler #453

AMDGPU JIT compiler #453

mratsim commented Aug 4, 2024

AMDGPU JIT compiler #453

AMDGPU JIT compiler #453

Conversation

mratsim commented Aug 4, 2024