Add shmem_{initialized, finalized} #470

nspark · 2021-06-25T20:31:09Z

This PR adds shmem_initialized and shmem_finalized to query the state of the library.

Closes #457

nspark · 2021-08-16T19:40:19Z

Suggestion from 8/16 WG: Add note clarifying that shmem_initialized() == 0 is not necessarily sufficient to prevent multiple threads from racing to call shmem_init.

manjugv · 2021-08-27T17:59:58Z

August spec meeting: Please add this note in the shmem_initialized section as API notes.

BryantLam · 2021-08-27T20:32:43Z

These were the states I could think of in a K-map style:

Before the library is initialized, shmem_initialized() == 0 and shmem_finalized() == 0.
- If initialization routine is called, go to (2).
- If finalization routine is called, spec doesn't say what happens, but implies undefined behavior 🐢 backed up by non-authoritative error note in backmatter.
- (mentioned above and will be added as a note) If library is not initialized (and is it valid to do so), it is user error to have multiple threads race on calling shmem_init.
After library is initialized, shmem_initialized() == 1 and shmem_finalized() == 0.
- If initialization routine is called, undefined behavior 🐌 with error note.
- If finalization routine is called, go to (3).
After library is finalized, shmem_initialized() == <don't care> and shmem_finalized() == 1.
- If initialization routine is called, undefined behavior like (2) 🐌.
- If finalization routine is called, undefined behavior like (1) 🐢 and because it's not the last OpenSHMEM call 🐙.
- As currently defined, we can get away with shmem_initialized() == <don't care> because we are now finalized and any subsequent OpenSHMEM call (except shmem_initialized() and shmem_finalized()) would result in undefined behavior 🐌🐙. As written, we have to set shmem_initialized() to some value but the value doesn't matter; it's effectively useless (or in a different respect, error prone) due to all the undefined behavior.

From these states, this would be one of the only valid conditional guards:

void app_startup() {
    if (shmem_finalized()) {
        // OpenSHMEM is done. Undefined behavior to call any OpenSHMEM routine.
        return or abort();
    }
    if (!shmem_initialized()) {
        // OpenSHMEM is not initialized. Okay to call OpenSHMEM initialization logic.
        shmem_init() or shmem_init_thread();
        this_app_initialized_openshmem = 1;
    }
    // OpenSHMEM is initialized.
}

void app_teardown() {
    if (this_app_initialized_openshmem == 1) {
        shmem_finalize();
    }
}

Or if your apps are not tearing down in reverse-startup order, you can choose to not (RE: must not?) put shmem_finalize() in app_teardown() and finalize OpenSHMEM from the parent scope:

in parent {
    app_startup();
    // do work
    app_teardown();
    if (!shmem_finalized()) { // Note: Should not use `shmem_initialized()` due to aforementioned `<don't care>`.
        shmem_finalize();
    }
}

I think this is a bit error-prone in my mind, but it works and it's what we've specified.

🐌 If subsequent calls to initialization routines were allowed in an active OpenSHMEM program, one behavior could be to "do nothing and increment value of shmem_initialized()" for nesting or counting references (shmem_finalized() is counting now too 🐙). This would then change the conditional guard to something like:

void app_startup() {
    if (shmem_finalized() == shmem_initialized()) {
        return or abort();
    }
    shmem_init() or shmem_init_thread();
    // OpenSHMEM is initialized.
}

void app_teardown() {
    shmem_finalize();
}

in parent {
    app_startup();
    // do work
    app_teardown();
    assert(shmem_finalized() == shmem_initialized());
    // All apps tore down correctly, or either one of them or you have a bug.
}

The "do nothing" behavior could be a problem if subsequent calls of shmem_init_thread had incompatible arguments.

jdinan · 2021-09-02T13:47:53Z

If we follow the MPI semantics, the guard for calling shmem_init would be !shmem_initialized() && !shmem_finalized().

nspark · 2021-09-08T19:17:42Z

Added the following note:

Although shmem_initialized is thread-safe, its return value is not a sufficient guard to prevent multiple threads from racing to initialize the OpenSHMEM library concurrently, as shmem_initialized may return 0 to one thread while library initialization is in progress due to a call from another thread. Applications must ensure that only one call to shmem_init[_thread] is made to initialize the OpenSHMEM library.

jdinan · 2021-09-08T20:14:39Z

@nspark The note is reasonable, but why not add reference counted init/finalize and require that init functions are thread safe?

nspark · 2021-09-08T20:43:41Z

@jdinan IDK, that just seemed like a change of a larger scope. I'm happy to draft it, but I'd probably do it as a separate PR.

jdinan · 2022-03-16T17:08:55Z

content/shmem_initialized.tex

+
+\apidescription{
+  The \FUNC{shmem\_initialized} routine returns a value indicating
+  whether the \openshmem library has been initialized (i.e, a call to


Do we want "has been" or "is currently"?

jdinan · 2022-03-16T17:12:04Z

content/shmem_finalized.tex

+
+\apidescription{
+  The \FUNC{shmem\_finalized} routine returns a value indicating
+  whether the \openshmem library has been finalized (i.e, a call to


Do we want "has been" or "is currently"? The problem with "has been" is that it tells you whether the API was ever called. If we assume that an OpenSHMEM library can't be reinitialized, then this is enough information to tell you the library state. If an OpenSHMEM library supports reinitialization then "has been" is not enough information to know the state of the library. You need to know "is currently".

For reference, here is the NVSHMEM library state query function: https://docs.nvidia.com/hpc-sdk/nvshmem/api/docs/gen/api/setup.html#nvshmemx-init-status

jdinan · 2024-05-30T14:06:08Z

Superseded by #512

Add shmem_{initialized, finalized}

f1a9a6c

nspark added the NeedsChangeLogEntry label Jun 25, 2021

Changelog for shmem_{initialized,finalized}

5ed7209

nspark removed the NeedsChangeLogEntry label Jun 25, 2021

nspark mentioned this pull request Jun 25, 2021

Fix GitHub Actions CI #472

Open

nspark added the PendingReading label Aug 20, 2021

manjugv added the HadReading label Aug 27, 2021

nspark removed the PendingReading label Aug 31, 2021

Add note on multithreaded races for shmem_initialized

c8296d3

nspark removed the HadReading label Jan 28, 2022

jdinan reviewed Mar 16, 2022

View reviewed changes

jdinan added the HadReading label May 27, 2022

jdinan added this to the OpenSHMEM 1.6 milestone May 27, 2022

jdinan closed this May 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add shmem_{initialized, finalized} #470

Add shmem_{initialized, finalized} #470

nspark commented Jun 25, 2021

nspark commented Aug 16, 2021

manjugv commented Aug 27, 2021

BryantLam commented Aug 27, 2021 •

edited

Loading

jdinan commented Sep 2, 2021

nspark commented Sep 8, 2021

jdinan commented Sep 8, 2021

nspark commented Sep 8, 2021

jdinan Mar 16, 2022

jdinan Mar 16, 2022

jdinan May 27, 2022

jdinan commented May 30, 2024

Add shmem_{initialized, finalized} #470

Add shmem_{initialized, finalized} #470

Conversation

nspark commented Jun 25, 2021

nspark commented Aug 16, 2021

manjugv commented Aug 27, 2021

BryantLam commented Aug 27, 2021 • edited Loading

jdinan commented Sep 2, 2021

nspark commented Sep 8, 2021

jdinan commented Sep 8, 2021

nspark commented Sep 8, 2021

jdinan Mar 16, 2022

Choose a reason for hiding this comment

jdinan Mar 16, 2022

Choose a reason for hiding this comment

jdinan May 27, 2022

Choose a reason for hiding this comment

jdinan commented May 30, 2024

BryantLam commented Aug 27, 2021 •

edited

Loading