Update monitoring task_inputs after input Futures are resolved #3054

benclifford · 2024-02-07T13:33:56Z

The monitoring task table stores a column that should store the inputs argument of a task, after Future resolution.

Prior to this PR:

Although the sending side of the code, DataFlowKernel._send_task_log_info, is correctly invoked to send that information, it first sends an earlier representation of that task where inputs are still unresolved futures - for example:

[<AppFuture at 0x7fca83d22610 state=pending>, <AppFuture at 0x7fca83d7b550 state=pending>, <AppFuture at 0x7fca83d80110 state=pending>]

When a later update is sent, not all columns of the task table are updated; only a specific list of columns that are expected to change. task_inputs was not on that list, so even after future resolution, the monitoring task_inputs column was not updated.

This PR adds task_inputs to the list of columns that are updated on every task update message.

This PR adds a test which if run without this change results in this test failure:

            result = connection.execute(text("SELECT task_inputs FROM task"))
            (task_inputs, ) = result.first()
>           assert task_inputs == "[" + repr(TOKEN) + "]"
E           AssertionError: assert '[<Future at ...ate=pending>]' == '[54739]'
E             - [54739]
E             + [<Future at 0x7f946e3f9010 state=pending>]

parsl/tests/test_monitoring/test_incomplete_futures.py:66: AssertionError

Changed Behaviour

Better info in monitoring db for tasks with inputs= parameters.

Type of change

Bug fix

The monitoring task table stores a column that should store the inputs argument of a task, after Future resolution. Prior to this PR: Although the sending side of the code, DataFlowKernel._send_task_log_info, is correctly invoked to send that information, it first sends an earlier representation of that task where inputs are still unresolved futures - for example: [<AppFuture at 0x7fca83d22610 state=pending>, <AppFuture at 0x7fca83d7b550 state=pending>, <AppFuture at 0x7fca83d80110 state=pending>] When a later update is sent, not all columns of the task table are updated; only a specific list of columns that are expected to change. task_inputs was not on that list, so even after future resolution, the monitoring task_inputs column was not updated. This PR adds task_inputs to the list of columns that are updated on every task update message. This PR adds a test which if run without this change results in this test failure: ``` result = connection.execute(text("SELECT task_inputs FROM task")) (task_inputs, ) = result.first() > assert task_inputs == "[" + repr(TOKEN) + "]" E AssertionError: assert '[<Future at ...ate=pending>]' == '[54739]' E - [54739] E + [<Future at 0x7f946e3f9010 state=pending>] parsl/tests/test_monitoring/test_incomplete_futures.py:66: AssertionError ```

parsl/tests/test_monitoring/test_incomplete_futures.py

khk-globus · 2024-02-08T20:23:46Z

parsl/tests/test_monitoring/test_incomplete_futures.py

+    f2 = this_app(inputs=[f1])
+
+    f1.set_result(TOKEN)
+
+    assert f2.result() == TOKEN


Is there any value in checking the intermediate state of the monitoring DB between these statements?

it's a bit non-deterministic so I didn't go that way - after line 43, the task row will appear (with an in-progress value) "eventually" in the monitoring database, but it's not synchronous with line 43 completing.

What I would expect is that various records would appear - basically the asserts that are in eg. PR #3056 - but I think it would be some code that has to sit there and poll the DB a few times until the "eventually consistent" DB reflects that the task is launched.

The interesting thing to check for the specific bug in this PR would be to check that the input field does not have TOKEN stored in it until after line 45, as a pre-condition.

That's also what could happen instead of shutting down the DFK with cleanup a few lines later: don't shut down parsl here, but instead poll the DB until it looks right.

I don't really have any strong feelings about whether it's the right thing to do or not.

…g-record-resolved-inputs

…s expecting

khk-globus approved these changes Feb 8, 2024

View reviewed changes

benclifford added 6 commits February 12, 2024 15:56

Merge remote-tracking branch 'origin/master' into benc-desc-monitorin…

bd34b8a

…g-record-resolved-inputs

Remove loggers; make arbitrary token more arbitrary

d924ee0

Give a range to randint - it doesn't have a default range, which I wa…

00622b3

…s expecting

Put monitoring.db in a test temporary directory

ac76fe8

Put runinfo in temporary test directory

835cf81

Fix whitespace flake8

33fe536

benclifford merged commit 6784355 into master Feb 13, 2024
6 checks passed

benclifford deleted the benc-desc-monitoring-record-resolved-inputs branch February 13, 2024 11:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update monitoring task_inputs after input Futures are resolved #3054

Update monitoring task_inputs after input Futures are resolved #3054

benclifford commented Feb 7, 2024

khk-globus Feb 8, 2024

benclifford Feb 9, 2024

Update monitoring task_inputs after input Futures are resolved #3054

Update monitoring task_inputs after input Futures are resolved #3054

Conversation

benclifford commented Feb 7, 2024

Changed Behaviour

Type of change

khk-globus Feb 8, 2024

Choose a reason for hiding this comment

benclifford Feb 9, 2024

Choose a reason for hiding this comment