Luca Parisi edited this page Aug 15, 2024 · 9 revisions

Exercises

Poisson equation

  • Naive Jacobi implementation [teams only, teams + parallel]
  • Add mapping directives to control memory transfers
  • Use a custom mapper for transferring the whole class
  • Split the domain into subdomains to overlap transfers with computation (use streams)
  • Use shared memory to improve effective memory bandwidth (custom memory allocators?)
  • Use an FFT (CUDA/ROCm FFT library) instead of Jacobi to solve the equations