
Vine: Measure and Tune Dispatch Rates #3284

Closed · 3 tasks done
dthain opened this issue Apr 20, 2023 · 15 comments

dthain commented Apr 20, 2023

Now that we have fast serverless tasks running in TaskVine+Parsl, the scheduling of tasks is likely to be the performance constraint if we intend to run millions of tasks on thousands of nodes. Let's understand the performance of the current TaskVine scheduler and, if necessary, see what algorithmic improvements can be made to achieve dispatch rates that are reasonably stable with respect to the number of tasks and workers.

  • Create a simple dispatch benchmark for null tasks.
  • Write up a high level pseudo-code description of the current scheduling loop.
  • Propose some improvements to our approach.

dthain commented May 30, 2023

@colinthomas-z80 please summarize here what you have found and keep some running notes as you go.

colinthomas-z80 commented

  • Added entries to the performance log and vine_profile_dispatch to plot accumulated scheduling time.
  • Wrote high-level pseudocode of the scheduling loop, identifying the number of workers as the main complexity-increasing factor.
  • Profiled the scheduling loop with gprof to identify hot spots.
  • Optimized common functions in rmsummary for TaskVine and Work Queue.
  • Optimized the scheduling loop for SCHEDULE_FILES.

colinthomas-z80 commented

We have observed that null-task throughput declines as larger batches of tasks are submitted and processed by a single worker. Since the number of waiting tasks does not appear to influence the cost of scheduling, we will investigate this behavior further in Work Queue and TaskVine, as well as in the context of Parsl, where high task throughput is desirable.

colinthomas-z80 commented

Separate from the issue of scheduling tasks to workers, we have observed that an exceptionally large waiting-task queue causes the manager to spend unnecessary time iterating through tasks to schedule when it should instead be fetching results so that workers become available again.

That is to say, with 10k+ tasks submitted, the manager dispatches tasks to all workers, yet continues to iterate through the 10k tasks trying to find one that will fit the busy workers. In throughput tests, and perhaps in some practical cases, the tasks finish before the manager has even finished iterating through the list. It would therefore be more effective for the manager to retrieve completed tasks and perform other bookkeeping rather than iterate through tasks that cannot be scheduled.

A simple test in which we only attempt to schedule the waiting task at the head of the list shows much better retained throughput as the task queue grows. However, this would severely limit how effectively workers can be packed in the case of diverse tasks (e.g. 3-core and 1-core tasks submitted to run in parallel). Making a quick judgment about the resources available in the cluster versus the task requirements in the queue is perhaps not possible.

One possible method is to attempt scheduling a fixed number of tasks, and if none succeed, assume that no workers are available and that results should be fetched. This method shows good throughput results when considering 100 tasks per pass. The implications for other aspects of the workflow still need to be studied.
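The fixed-attempt heuristic might be sketched roughly as follows (illustrative C, not the actual manager code; `task_fits_some_worker` is a hypothetical stand-in for the real worker-matching logic):

```c
/* Minimal sketch of the bounded-attempt heuristic: try to schedule at
 * most MAX_ATTEMPTS waiting tasks per pass; if none can be placed,
 * give up so the caller can fetch results instead. */

#include <stdbool.h>

#define MAX_ATTEMPTS 100

struct task { int cores; };

/* Hypothetical stand-in for worker matching: here, no worker ever has
 * more than 4 free cores. */
static bool task_fits_some_worker(const struct task *t) {
    return t->cores <= 4;
}

/* Scan at most MAX_ATTEMPTS tasks from the waiting list; return the
 * index of the first schedulable task, or -1 if none was found within
 * the window. */
static int find_schedulable_task(struct task *waiting, int nwaiting) {
    int limit = nwaiting < MAX_ATTEMPTS ? nwaiting : MAX_ATTEMPTS;
    for (int i = 0; i < limit; i++) {
        if (task_fits_some_worker(&waiting[i])) return i;
    }
    return -1; /* assume no workers available; fetch results instead */
}
```

The key property is that the cost per scheduling pass is bounded by MAX_ATTEMPTS rather than by the total queue length, so throughput no longer degrades as the waiting queue grows.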


dthain commented Jun 5, 2023

A slight variation: just keep a cursor in the list, and examine ~100 before going back through the main loop. Then, next time, pick up where you left off. That way, you eventually make it through the entire list, just not all in one scheduling pass.
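That cursor variation could be sketched like this (illustrative C with hypothetical names; the dispatch attempt itself is elided):

```c
/* Sketch of the cursor idea: keep a persistent index into the waiting
 * list and examine ~WINDOW entries per scheduling pass, resuming where
 * the previous pass left off, so the whole list is eventually covered
 * across passes rather than in one. */

#define WINDOW 100

static int cursor = 0; /* persists across scheduling passes */

/* Visit up to WINDOW tasks starting at the cursor, wrapping around;
 * returns the number of tasks examined this pass. */
static int schedule_pass(int ntasks) {
    if (ntasks <= 0) return 0;
    int examined = 0;
    for (int i = 0; i < WINDOW && examined < ntasks; i++, examined++) {
        int idx = (cursor + i) % ntasks;
        (void)idx; /* ...attempt to dispatch waiting task idx here... */
    }
    cursor = (cursor + examined) % ntasks;
    return examined;
}
```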


dthain commented Jun 6, 2023

Alternative discussed today: add a list_rotate operation that moves the head of the list to the tail; then you can just consider one item at a time and keep going, up to N items.

@colinthomas-z80 colinthomas-z80 moved this to In Progress in TaskVine Phase 2 Jun 6, 2023
colinthomas-z80 commented

list_rotate implemented and merged into Work Queue. TaskVine equivalent pending.
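For reference, the head-to-tail rotation can be sketched on a simple singly linked list (illustrative only; the real cctools list structure and API differ):

```c
/* Sketch of a head-to-tail rotation: move the head node to the tail so
 * that repeated single-item scheduling attempts eventually visit every
 * waiting task without rescanning the whole list each pass. */

#include <stddef.h>

struct node { int value; struct node *next; };
struct list { struct node *head; struct node *tail; };

static void list_rotate(struct list *l) {
    if (!l->head || l->head == l->tail) return; /* 0 or 1 items: no-op */
    struct node *old_head = l->head;
    l->head = old_head->next;   /* second node becomes the new head   */
    old_head->next = NULL;      /* old head becomes the new tail      */
    l->tail->next = old_head;
    l->tail = old_head;
}
```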

colinthomas-z80 commented

Further main-loop optimizations for WQ in #3380. This applies the list-rotate method in the expire_tasks routine, as it was previously applied to send_one_task. expire_waiting_tasks is by itself rather expensive, and it likely often runs unnecessarily in workflows where tasks are not given deadlines.
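One way to avoid the unnecessary runs entirely is to keep a running count of deadline-bearing tasks, so the common no-deadline workflow pays nothing per pass. A minimal sketch, with illustrative names rather than the actual Work Queue code:

```c
/* Sketch: count tasks that carry deadlines as they are submitted (and
 * decrement on completion), then skip the expiration scan entirely
 * whenever that count is zero. */

#include <time.h>

static int tasks_with_deadline = 0; /* maintained at submit/complete time */

/* Returns the number of tasks expired this pass. */
static int expire_waiting_tasks(time_t now) {
    if (tasks_with_deadline == 0) {
        return 0; /* nothing can expire; skip the list scan */
    }
    /* ...otherwise walk the waiting list and expire tasks whose
     * deadline is before `now`... */
    (void)now;
    return 0;
}
```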


colinthomas-z80 commented Jun 20, 2023

Running the manager through gprof shows that the majority of time is spent in work_queue_get_stats, which is a logging mechanism. It may be called multiple times during a single pass of the main loop, and each invocation performs a 3n iteration over the task list.

```
            0.01   22.53   25000/50005       add_task_report [13]
            0.01   22.53   25005/50005       log_queue_stats [14]
[7]         64.7    0.02   45.06   50005     work_queue_get_stats [7]

            4.48   40.22  150015/150015      work_queue_get_stats [7]
[8]         64.2    4.48   40.22  150015     task_state_count [8]
           24.83    0.00 1875300018/2871551875  itable_nextkey [9]
            2.36   12.79 1875150003/2871071303  task_state_is [12]
            0.23    0.00  150015/312725      itable_firstkey [28]
```


dthain commented Jun 21, 2023

Indeed! But have a look at task_state_count, which likely does a troubling amount of work...
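A common fix for this pattern is to maintain per-state counters incrementally on every state transition, turning the O(n) table scan into an O(1) lookup. A minimal sketch, with illustrative enum and function names rather than the actual Work Queue ones:

```c
/* Sketch: instead of scanning the whole task table on every
 * task_state_count call, keep a counter per state and update it at each
 * transition, so the count query becomes O(1). */

enum task_state { STATE_WAITING, STATE_RUNNING, STATE_DONE, STATE_MAX };

static int state_count[STATE_MAX]; /* updated on every transition */

/* Record a newly submitted task in its initial state. */
static void register_task(enum task_state s) {
    state_count[s]++;
}

/* Move a task between states, keeping the counters consistent. */
static void change_task_state(enum task_state *current, enum task_state next) {
    state_count[*current]--;
    state_count[next]++;
    *current = next;
}

/* O(1) replacement for the scanning count. */
static int task_state_count(enum task_state s) {
    return state_count[s];
}
```

The trade-off is that every code path that changes a task's state must go through change_task_state, or the counters drift out of sync with the table.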


dthain commented Jun 21, 2023

#3380 is a nice tune-up for Work Queue. Now let's focus on "doing it right" in TaskVine with some more fundamental changes.


dthain commented Jun 22, 2023

@colinthomas-z80 after thinking about this a little, I don't believe the last_waiting_task and last_retrieved_task shortcuts are safe under all conditions. If some other action causes the task to be changed or removed from the data structure (e.g. removing a task), it will result in a crash.

Please remove that shortcut, and let's keep the others in place.


dthain commented Jul 10, 2023

#3387


dthain commented Jul 10, 2023

@colinthomas-z80 please summarize here what sort of dispatch rates we have now after your latest work.


dthain commented Jan 29, 2024

Fixed
