Improve the performance of computing the destination for each shred #1131

ptaffet-jump · 2024-01-05T20:26:49Z

Performance investigations showed that computing the destination for each shred was extremely expensive compared to computing the shred data itself. With a lot of effort (and including the recent Chacha20 improvements), performance increased from 28.3 us/shred to 8.3 us/shred on my workstation.

Shred footprint changed, and that is causing some tests to fail. I should also rewrite the explanation to update it to radix 9, and switch the right_sums to left_sums which are much more intuitive, since the performance win from right sums didn't materialize.

Switch from using treap to using a light-weight radix-9 tree inspired by a B-tree. Almost entirely branch-free because branch mispredictions were killer.

fd_shred_dest: add benchmark based on testnet distribution

c84e088

ptaffet-jump force-pushed the ptaffet/shred-perf-improvements branch 2 times, most recently from a69f46e to c377173 Compare January 12, 2024 23:47

ptaffet-jump marked this pull request as ready for review January 12, 2024 23:48

fd_wsample: improve performance

357b4d7

Switch from using treap to using a light-weight radix-9 tree inspired by a B-tree. Almost entirely branch-free because branch mispredictions were killer.

ptaffet-jump force-pushed the ptaffet/shred-perf-improvements branch from c377173 to 357b4d7 Compare January 16, 2024 21:38

ptaffet-jump requested review from nbridge-jump and mmcgee-jump January 17, 2024 17:26

mmcgee-jump approved these changes Jan 31, 2024

View reviewed changes

ptaffet-jump added this pull request to the merge queue Jan 31, 2024

Merged via the queue into main with commit 0bb91a0 Jan 31, 2024
7 checks passed

ptaffet-jump deleted the ptaffet/shred-perf-improvements branch January 31, 2024 16:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve the performance of computing the destination for each shred #1131

Improve the performance of computing the destination for each shred #1131

ptaffet-jump commented Jan 5, 2024

Improve the performance of computing the destination for each shred #1131

Improve the performance of computing the destination for each shred #1131

Conversation

ptaffet-jump commented Jan 5, 2024