Launch tasks in from a shallow Python stack to avoid recursion errors #3478

benclifford · 2024-06-07T08:32:11Z

Description

Prior to this PR, task dependencies were launched from fairly arbitrary places in the call stack: for example, wherever a particular Future callback happens, sometimes deep inside executor-specific threads and/or in launch code for earlier tasks. In situations described in #3472 this could result in a a Python call stack overflow, as long call chains accumulate.

This PR makes all task launches be queued to happen from a near-empty stack in a single thread (managed as a concurrent.futures.ThreadPoolExcutor) - with a "invoke this call soon" pattern.

The launch_if_ready method was already intended to be launched multiple times from multiple threads. this PR might make the invocation to launch_if_ready happen a bit later, but correctness-wise that should be fine: a task can only become more ready to run, not less ready.

This PR introduces a test test_dependency_deep, which simulates a situation where this #3472 failure happens (before this PR), and tests (for the future) that tasks are not launched beyond a fairly arbitrary stack depth.

Changed Behaviour

Performance of task launching is probably a bit different now. I have not quantified this.

Exceptions during task launch (from parsl, or from plugged in executors) will now appear in a different call stack, and so be reported differently.

Fixes

Fixes #3472

Type of change

Bug fix

... rather than recursively deep in a dependency processing chain probably some change in performance? launch_if_ready was already intended to be launched multiple times from multiple threads. this PR might make the invocation to launch_if_ready happen a bit later, but correctness-wise that should be fine: a task can only become *more* ready to run, not less *ready*.

benclifford added 10 commits June 5, 2024 14:03

Add a stack-depth test

49203a9

Add notes on test constant

b9eba7a

fix flake8, mypy

d0b369b

WIP: dependency launcher pool

0e978eb

Merge remote-tracking branch 'origin/master' into benc-3472-recursion

862c6c3

isort

324465a

Remove cancel futures because it needs Python 3.9

dce6eb3

rename for consistency with other test

464c71e

Logging and docstrings

d6930a4

benclifford changed the title ~~fix 3472 recursion problems by launching new tasks with a fairly new Python call stack~~ Launch tasks in from a shallow Python stack to avoid recursion errors Jun 7, 2024

benclifford marked this pull request as ready for review June 7, 2024 11:20

khk-globus approved these changes Jun 10, 2024

View reviewed changes

Merge branch 'master' into benc-3472-recursion

9725dce

benclifford merged commit a18f0a7 into master Jun 10, 2024
6 checks passed

benclifford deleted the benc-3472-recursion branch June 10, 2024 20:42

benclifford mentioned this pull request Jun 24, 2024

[not for merge] Temporary release branch for Sander Vandenhaute testing multiple outstanding PRs #3479

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Launch tasks in from a shallow Python stack to avoid recursion errors #3478

Launch tasks in from a shallow Python stack to avoid recursion errors #3478

benclifford commented Jun 7, 2024 •

edited

Loading

Launch tasks in from a shallow Python stack to avoid recursion errors #3478

Launch tasks in from a shallow Python stack to avoid recursion errors #3478

Conversation

benclifford commented Jun 7, 2024 • edited Loading

Description

Changed Behaviour

Fixes

Type of change

benclifford commented Jun 7, 2024 •

edited

Loading