Potential problem with ExeSpaceUtils view_reduction and parallel_reduce #254
Comments
Upon a second look,
I should also note that the error only occurs when team_size > 1. That is the case when MIMIC_GPU is On (team size 7), which is the default for Debug builds.
I believe the problems with
Yes, you're right. I was working on completing last Friday, but did not finish by week's end. I should be done today.
I think this was completed in #258. Closing.
Describe the bug
This was discovered when porting shoc_energy_integrals to small kernels. I was getting large differences in the outputs of the view_reductions when num_threads > 1. I suspect the problem is in how the garbage values in the last pack are handled, because the problem went away when nlev % pack_size == 0.
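To illustrate the suspected failure mode, here is a minimal sketch in plain C++. It does not use Kokkos or the actual ExeSpaceUtils code; the `Pack` struct, `kPackSize`, and the helper functions are hypothetical stand-ins. It only shows why a reduction that sums every lane of the last pack can go wrong when nlev is not a multiple of the pack size, and why the problem disappears when it is.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a SIMD pack; this is NOT the real pack type.
constexpr std::size_t kPackSize = 4;
struct Pack { double lane[kPackSize]; };

// Store a column of nlev values into ceil(nlev / kPackSize) packs.
// Unused lanes of the last pack are filled with `garbage` to mimic padding.
std::vector<Pack> pack_column(const std::vector<double>& col, double garbage) {
  const std::size_t npacks = (col.size() + kPackSize - 1) / kPackSize;
  std::vector<Pack> packs(npacks);
  for (std::size_t p = 0; p < npacks; ++p) {
    for (std::size_t l = 0; l < kPackSize; ++l) {
      const std::size_t k = p * kPackSize + l;
      packs[p].lane[l] = (k < col.size()) ? col[k] : garbage;
    }
  }
  return packs;
}

// Naive reduction: sums every lane, including padding lanes.
double sum_all_lanes(const std::vector<Pack>& packs) {
  double sum = 0.0;
  for (const Pack& p : packs)
    for (std::size_t l = 0; l < kPackSize; ++l) sum += p.lane[l];
  return sum;
}

// Masked reduction: only lanes whose scalar index is below nlev contribute.
double sum_valid_lanes(const std::vector<Pack>& packs, std::size_t nlev) {
  assert(nlev <= packs.size() * kPackSize);
  double sum = 0.0;
  for (std::size_t k = 0; k < nlev; ++k)
    sum += packs[k / kPackSize].lane[k % kPackSize];
  return sum;
}
```

With nlev = 6 and a pack size of 4, the last pack has two padding lanes, so the naive sum picks up whatever those lanes contain. With nlev = 8 there is no padding and both reductions agree, which matches the observation above.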
To Reproduce
Steps to reproduce the behavior:
1. Configure with -DSCREAM_SMALL_KERNELS=On -DCMAKE_BUILD_TYPE=Debug
2. Run OMP_NUM_THREADS=16 ./shoc_tests shoc_main_bfb
Expected behavior
view_reduction should produce bit-for-bit (bfb) identical results to the Fortran implementation.
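As general background on why bfb agreement is delicate for reductions: floating-point addition is not associative, so the order in which partial sums are combined can change the result. The sketch below is plain C++ with hypothetical helper names, not the project's reduction code, and it does not claim to be the cause of this bug. It compares a strictly serial sum with one that first sums contiguous chunks, as a team of threads might.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Strict left-to-right sum, as a serial loop would compute it.
double serial_sum(const std::vector<double>& v) {
  double sum = 0.0;
  for (double x : v) sum += x;
  return sum;
}

// Sum contiguous chunks separately, then combine the partial sums in order.
// This mimics (hypothetically) a team splitting the range among its threads.
double chunked_sum(const std::vector<double>& v, std::size_t nchunks) {
  assert(nchunks > 0);
  const std::size_t chunk = (v.size() + nchunks - 1) / nchunks;
  double total = 0.0;
  for (std::size_t begin = 0; begin < v.size(); begin += chunk) {
    const std::size_t end = std::min(begin + chunk, v.size());
    double partial = 0.0;
    for (std::size_t i = begin; i < end; ++i) partial += v[i];
    total += partial;
  }
  return total;
}
```

For the input {1e16, 1.0, -1e16, 1.0} in IEEE double precision, the serial sum and the two-chunk sum differ, because adding 1.0 to a value of magnitude 1e16 is lost to rounding. A bfb-reproducible reduction therefore has to fix the summation order regardless of team size.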