Memory consumption issue #4

Mr-Milk · 2020-12-08T04:37:36Z

I tried to scan motif on a genome region with hg38 build with -t 18 corresponded to my CPU number

but it raised:

OSError: [Errno 12] Cannot allocate memory

And then I tried with -t 8, the program ate up to around 50G of my RAM. I ran it on WSL2 ubuntu 20.04 TLS.

The text was updated successfully, but these errors were encountered:

hongduosun · 2020-12-08T05:32:02Z

Sorry, but how many regions were scanned?

Mr-Milk · 2020-12-08T05:53:26Z

More than 200K

hongduosun · 2020-12-08T06:58:23Z

I'm afraid this is a temporary limit for MotifScan because only small parts of codes are refactored using C to speed up calculating motif scores. So every single motif score is stored and passing back to Python and this requires O(n_region * length_per_region * n_motif) memories.
I'll improve this in the next update.

Mr-Milk · 2020-12-08T09:55:58Z

Thanks for your answer. Just a little suggestion, I looked at your code, the parallelism is using python's multiprocessing which might be the reason for such huge memory consumption. Since it will basically copy the whole process of the current python process. It might help if you could try to implement the parallelism from C-side.

hongduosun · 2020-12-08T15:59:55Z

Thanks a lot for your advice!

hongduosun · 2021-01-21T14:08:57Z

This has been fixed in v1.3.0 after using pthread in the C extension. Thanks again!

Mr-Milk · 2021-01-22T07:58:27Z

I tried it with the same dataset, at some point, the programme still ate up all of my RAM and caused a system exit 😥, but it's sure better than before 🥰. Is it possible to free some unused memory, save the results to the disk during the process?

hongduosun added the enhancement label Dec 8, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Memory consumption issue #4

Memory consumption issue #4

Mr-Milk commented Dec 8, 2020

hongduosun commented Dec 8, 2020

Mr-Milk commented Dec 8, 2020

hongduosun commented Dec 8, 2020

Mr-Milk commented Dec 8, 2020

hongduosun commented Dec 8, 2020

hongduosun commented Jan 21, 2021

Mr-Milk commented Jan 22, 2021

Memory consumption issue #4

Memory consumption issue #4

Comments

Mr-Milk commented Dec 8, 2020

hongduosun commented Dec 8, 2020

Mr-Milk commented Dec 8, 2020

hongduosun commented Dec 8, 2020

Mr-Milk commented Dec 8, 2020

hongduosun commented Dec 8, 2020

hongduosun commented Jan 21, 2021

Mr-Milk commented Jan 22, 2021