-
Notifications
You must be signed in to change notification settings - Fork 0
Home
Luca Parisi edited this page Aug 15, 2024
·
9 revisions
- Naive Jacobi implementation [ teams only, teams + parallel ]
- Add mapping directives to control memory transfers
- Use a custom mapper for transferring the whole class
- Split up in subdomains for overlapping transfer and computation ( use streams )
- Use shared memory to improve performance bandwidth ( custom memory allocators ? )
- Use fft ( Cuda/rocm fft library ) instead of Jacobi to solve the equations