Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

UCX w/ MPI deadlocking in CI #1048

Open
davidozog opened this issue Mar 23, 2022 · 3 comments
Open

UCX w/ MPI deadlocking in CI #1048

davidozog opened this issue Mar 23, 2022 · 3 comments

Comments

@davidozog
Copy link
Member

This row:
UCX (--enable-pmi-mpi CC=mpicc --disable-fortran)

Example here:
https://github.com/Sandia-OpenSHMEM/SOS/runs/5650454734?check_suite_focus=true

@davidozog
Copy link
Member Author

I haven't seen this failure in a long time - it may be a false positive.

@davidozog
Copy link
Member Author

This issue recurred at least twice in PR #1108, so reopening the issue for now. In one case on the spec-example test shmem_team_broadcast deadlocked.

We may prefer to disable or swap out this test row until it's resolved because it can slow down PR procedures considerably.

@davidozog davidozog reopened this Feb 7, 2024
@davidozog
Copy link
Member Author

Seeing another instance of probable deadlock on PR #1106, on unit test cxx_test_shmem_put I think.

davidozog added a commit to kholland-intel/SOS that referenced this issue Feb 7, 2024
davidozog added a commit to kholland-intel/SOS that referenced this issue Feb 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant