Decrease load on DB #2291
Conversation
We have seen that in some cases, when storage is slow or the node is far behind on its event stream, it can trigger many fetches of old proposals that other nodes have all garbage collected. These fetches accumulate over time, making the problem worse. This change limits parallelism in proposal fetching: instead of spawning a new task each time we need to fetch a proposal, we spawn a fixed number of worker tasks, each of which fetches only one proposal at a time. A scanner task follows the event stream, detects when a proposal needs to be fetched, and broadcasts the fetch request to the worker tasks; the request is picked up as soon as a worker is free.
// If we fail fetching the proposal, don't let it clog up the fetching task. Just push
// it back onto the queue and move on to the next proposal.
sender.broadcast_direct((view, leaf)).await.ok();
Does this create a busy loop if the task is failing consistently?
I guess it kind of does. I was thinking we would be rate limited by the finite number of workers and time spent waiting on I/O for each failure. But I guess that's not guaranteed, depending on what the failure is. I'll add a sleep here
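The fix discussed above can be sketched as a retry loop that sleeps after each failure. This is a simplified synchronous sketch, not the actual async implementation; `fetch_proposal` here is a hypothetical stand-in that always fails, to exercise the failure path.

```rust
use std::thread::sleep;
use std::time::Duration;

// Hypothetical stand-in for the real fetch, which asks peers over the
// network. Here it fails every time, to exercise the retry path.
fn fetch_proposal(_view: u64) -> Result<(), ()> {
    Err(())
}

// Retry a fetch, sleeping after each failure. Without the sleep, a proposal
// that consistently fails to fetch (e.g. one already garbage collected by
// every peer) would be re-queued immediately and spin the workers in a
// busy loop.
fn fetch_with_backoff(view: u64, max_attempts: u32, delay: Duration) -> (bool, u32) {
    let mut attempts = 0;
    while attempts < max_attempts {
        attempts += 1;
        match fetch_proposal(view) {
            Ok(()) => return (true, attempts),
            // Rate-limit the retry instead of looping immediately.
            Err(()) => sleep(delay),
        }
    }
    (false, attempts)
}
```

The sleep bounds how often a single failing request can occupy a worker, so even a permanently missing proposal only costs one worker a small, fixed fraction of its time.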
Backport failed for release-thehounds. Please cherry-pick the changes locally and resolve any conflicts:
git fetch origin release-thehounds
git worktree add -d .worktree/backport-2291-to-release-thehounds origin/release-thehounds
cd .worktree/backport-2291-to-release-thehounds
git switch --create backport-2291-to-release-thehounds
git cherry-pick -x 4f4efbaee341f679fa33bf676e3b0f90904ce295
While debugging this, I also noticed that we don't "unfill" the leaf payload when storing anchor leaves, so DB performance degrades when blocks are very large: we store and load these large payloads when we think we are only dealing with leaves.
This PR:
- Limits parallelism in proposal fetching. Instead of spawning a new task each time we need to fetch a proposal, we spawn a fixed number of worker tasks, each of which fetches only one proposal at a time. A scanner task follows the event stream, detects when a proposal needs to be fetched, and broadcasts the fetch request to the worker tasks; the request is picked up as soon as a worker is free.
- Unfills leaf payloads before storing decided leaves.
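The scanner/worker split described above can be illustrated with a minimal sketch. The real change uses async tasks and a broadcast channel; this sketch uses std threads and an `mpsc` channel behind a mutex instead, and `fetch_proposal` and `run_fetch_pool` are hypothetical names introduced for illustration. The point it demonstrates is the bound: at most `num_workers` fetches are ever in flight, no matter how many requests the scanner queues.

```rust
use std::sync::{mpsc, Arc, Mutex};
use std::thread;

// Hypothetical fetch: returns the proposal for a view. In the real system
// this asks peers over the network and may fail.
fn fetch_proposal(view: u64) -> Result<u64, ()> {
    Ok(view)
}

// Run a fixed-size worker pool over a queue of fetch requests. Each worker
// takes one request at a time, so parallelism is capped at `num_workers`.
fn run_fetch_pool(views: Vec<u64>, num_workers: usize) -> Vec<u64> {
    let (tx, rx) = mpsc::channel::<u64>();
    let rx = Arc::new(Mutex::new(rx));

    let handles: Vec<_> = (0..num_workers)
        .map(|_| {
            let rx = Arc::clone(&rx);
            thread::spawn(move || {
                let mut fetched = Vec::new();
                loop {
                    // Take the next request; stop when the scanner closes
                    // the channel. The lock is released before fetching.
                    let view = match rx.lock().unwrap().recv() {
                        Ok(v) => v,
                        Err(_) => break,
                    };
                    if let Ok(p) = fetch_proposal(view) {
                        fetched.push(p);
                    }
                }
                fetched
            })
        })
        .collect();

    // The "scanner": in the real change it follows the event stream and
    // queues a request whenever it detects a missing proposal.
    for view in views {
        tx.send(view).unwrap();
    }
    drop(tx); // close the channel so workers exit

    let mut all: Vec<u64> = handles
        .into_iter()
        .flat_map(|h| h.join().unwrap())
        .collect();
    all.sort();
    all
}
```

Compared with spawning one task per missing proposal, a backlog of old proposals now queues up behind a fixed number of workers instead of multiplying concurrent storage and network load.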
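The "unfill" fix can be sketched as dropping the optional payload before persisting. The `Leaf` type and field names here are hypothetical simplifications of whatever the sequencer codebase actually uses; the sketch only shows the shape of the idea, that the stored copy carries leaf metadata but not the full block payload.

```rust
// Hypothetical simplified leaf: small metadata plus an optionally
// attached ("filled") full block payload.
#[derive(Clone, Debug, PartialEq)]
struct Leaf {
    view: u64,
    payload: Option<Vec<u8>>,
}

impl Leaf {
    // Detach the payload so the anchor-leaf storage holds only the
    // small leaf metadata, not the full block.
    fn unfill(&mut self) {
        self.payload = None;
    }
}

// Build the copy that actually gets written to the DB.
fn storable_copy(leaf: &Leaf) -> Leaf {
    let mut to_store = leaf.clone();
    to_store.unfill();
    to_store
}
```

With very large blocks, this is the difference between writing and reading a few bytes of metadata per decided leaf and moving the entire block payload through the DB on every anchor-leaf operation.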