Make deliberately scaled-in unstarted blocks not be failures #3594

benclifford · 2024-08-19T14:25:15Z

This PR adds a new terminal job state, SCALED_IN. None of the existing providers will return it, but the scaling layer will use it to mark a job as deliberately scaled in, so that error handling code will not regard it as failed.

Fixes #3568
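
As a rough illustration of the idea in the description (not the code in this PR), the sketch below models a terminal state that error handling does not count as a failure. The enum `JobStateSketch`, the two sets, and the helper `count_failed_blocks` are hypothetical names invented here. Which states count as failures is also an assumption of this sketch, not a statement about Parsl's error handler.

```python
from enum import Enum
from typing import Dict


class JobStateSketch(Enum):
    """Hypothetical stand-in for a job-state enum; not Parsl's definition."""
    PENDING = "PENDING"
    RUNNING = "RUNNING"
    COMPLETED = "COMPLETED"
    FAILED = "FAILED"
    MISSING = "MISSING"
    SCALED_IN = "SCALED_IN"


# States that a job never leaves once it has entered them.
TERMINAL_STATES = {
    JobStateSketch.COMPLETED,
    JobStateSketch.FAILED,
    JobStateSketch.MISSING,
    JobStateSketch.SCALED_IN,
}

# Assumption for this sketch: only these terminal states are failures.
# SCALED_IN is terminal but deliberately left out.
FAILURE_STATES = {JobStateSketch.FAILED, JobStateSketch.MISSING}


def count_failed_blocks(statuses: Dict[str, JobStateSketch]) -> int:
    """Count blocks whose state should be treated as a failure."""
    return sum(1 for state in statuses.values() if state in FAILURE_STATES)


if __name__ == "__main__":
    statuses = {"0": JobStateSketch.SCALED_IN, "1": JobStateSketch.RUNNING}
    # Block "0" is terminal, yet it does not add to the failure count.
    print(statuses["0"] in TERMINAL_STATES, count_failed_blocks(statuses))
```

The point of keeping the two sets separate is that "this block is finished" and "this block failed" become independent questions, which is what lets the scaling layer mark a block as finished without tripping error handling.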

khk-globus

The block comment is unfortunately necessary, so thank you for including it. The rest are window dressing suggestions for clarity.

khk-globus · 2024-08-26T13:09:47Z

parsl/tests/test_scaling/test_regression_3568_scaledown_vs_MISSING.py

+    # permanent htex failure, and so the task execution below would raise
+    # a BadStateException rather than attempt to run the task.
+
+    assert htex.provider.launcher.prepend != "", "Pre-req: prepend attribute should exist and be non-empty"


Would use of .startswith() work? If so, it makes for a stronger assertion / good faith in the test. Perhaps:

    _inf_sleep = "sleep inf ; "
    ...
    launcher=WrappedLauncher(prepend=_inf_sleep),
    ...
    assert htex.provider.launcher.prepend.startswith(_inf_sleep), "..."

khk-globus · 2024-08-26T14:45:39Z

parsl/tests/test_scaling/test_regression_3568_scaledown_vs_MISSING.py

+    try_assert(lambda: len(htex.status_facade) == 1 and htex.status_facade['0'].terminal,
+               timeout_ms=10000)


Consider using the fail_msg argument to explain that this is a block. (Analogous to assert ..., "fail_msg")
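
To make the suggestion concrete, here is a minimal self-contained sketch of a polling assertion that carries a failure message. `try_assert_sketch` is a hypothetical stand-in written for this illustration; it is not the `try_assert` helper used by the test, and its parameter names and defaults are assumptions apart from `fail_msg` and `timeout_ms`, which appear in this thread.

```python
import time
from typing import Callable


def try_assert_sketch(test_func: Callable[[], bool], fail_msg: str = "",
                      timeout_ms: float = 5000, check_period_ms: float = 20) -> None:
    """Poll test_func until it returns True; raise AssertionError on timeout."""
    deadline = time.monotonic() + timeout_ms / 1000
    while True:
        if test_func():
            return
        if time.monotonic() >= deadline:
            # The message is what a reader sees first when the test fails.
            raise AssertionError(fail_msg or "condition not met before timeout")
        time.sleep(check_period_ms / 1000)


if __name__ == "__main__":
    # Hypothetical status table: block id -> whether the block is terminal.
    block_terminal = {"0": True}
    try_assert_sketch(
        lambda: len(block_terminal) == 1 and block_terminal["0"],
        fail_msg="Expected exactly one block, '0', in a terminal state",
        timeout_ms=1000,
    )
```

With a message like this, a timeout reports what was being waited for rather than a bare assertion failure on a lambda.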

khk-globus · 2024-08-26T14:47:44Z

parsl/tests/test_scaling/test_regression_3568_scaledown_vs_MISSING.py

+    htex.provider.launcher.prepend = ""
+    assert task().result() == 7


Can you distill the comment above into the fail msg for this assert? Not an easy task, I know, but when this does fail, it makes it that much easier to dig in.

benclifford added 9 commits August 19, 2024 14:01

WIP

a56397b

tag questionable test

7ad5608

fix lints

66f94af

fix up monitoring test that now sees a different status

86f2ff1

Merge remote-tracking branch 'origin/master' into benc-talk

5575b19

Adapt missing.py test from issue as a regression test

e569c2a

validated that it fails when run against master 789ee82

lint test

058187a

Fixup/reenable test_multiple_disconnected_blocks - it was failing because scaled down blocks are no longer failures, so this test should not do any scale down

4e4017d

Tidy up some debug logs

e875dda

benclifford changed the title ~~[not for merge] test some WIP in CI~~ Make deliberately scaled-in unstarted blocks not be failures Aug 23, 2024

benclifford added 4 commits August 23, 2024 11:18

Remove a debug line

660318a

Undo some unnecessary test changes

7d9b420

Rework test to be more driven by events than sleep

885d094

test documentation and whitespace fix

2e5ea4f

benclifford requested review from khk-globus, yadudoc and rjmello and removed request for khk-globus August 26, 2024 09:55

benclifford marked this pull request as ready for review August 26, 2024 09:56

Simplify config, add more description about test flow

99924ae

khk-globus approved these changes Aug 26, 2024

benclifford merged commit 4ea3fbc into master Aug 26, 2024
7 checks passed

benclifford deleted the benc-talk branch August 26, 2024 15:10
