
memmap refactoring #1265

Draft · wants to merge 1 commit into master
Conversation

samuelgarcia (Contributor)

This is a proposal to change the current memmap behavior in the rawio layer.

spikeinterface uses neo.rawio heavily, with extensive IO demand from parallel reading/computing.
Many users experience severe memory saturation, even on big machines, when the files are enormous.
This is quite OS-dependent (mainly Windows, but some Linux systems have the problem too).
Somehow, the memmap does not release pages after a buffer has been consumed, so memory usage grows forever.

The current strategy in almost all binary-based IOs is (a minimal sketch follows the list):

  1. In parse_header, open one or several numpy.memmap objects and store them as attributes or in a nested dict
  2. In get_analogsignal_chunk, lazily fetch the buffer via numpy slicing on these memmap arrays
    After consuming the traces chunk, the memory should be released and memory usage should stay low, but in some cases it keeps increasing until saturation.
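A minimal sketch of this pattern (illustrative only; the file name, dtype, and channel count are placeholders, not the actual neo code):

import numpy as np

class BinaryRawIOSketch:
    def _parse_header(self):
        # a persistent memmap is created once and kept as an attribute
        self._raw = np.memmap("data.bin", dtype="int16", mode="r").reshape(-1, 384)

    def _get_analogsignal_chunk(self, i_start, i_stop, channel_indexes):
        # lazy numpy slicing on the long-lived memmap; in theory the mapped pages
        # are released after the chunk is consumed, but in practice they may not be
        return self._raw[i_start:i_stop, channel_indexes]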

@h-mayorquin and @samuelgarcia ran some benchmarks to find a better strategy.
Here are some code examples:
https://gist.github.com/h-mayorquin/8b8d1899c724690ce7c18fb49f56af82

The best strategy we found to fix this is:

  1. In parse_header, open the file with mode='rb'
  2. In get_analogsignal_chunk, create a new memmap object from the already-opened file

For step 2 we mimic numpy.memmap, but in a slightly simplified way to make it faster.
The reading speed is almost the same and the memory problem seems to be fixed. A hedged sketch of the idea is shown below.
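The following sketch shows the idea under stated assumptions: create_memmap_buffer is a stand-in for the helper added in this PR (its real signature may differ), and the file name, dtype, and channel count are placeholders.

import mmap
import numpy as np

def create_memmap_buffer(fid, dtype, offset, shape):
    # map only the bytes needed for this buffer; the mmap offset must be a
    # multiple of the allocation granularity, so round down and slice past it
    dtype = np.dtype(dtype)
    nbytes = int(np.prod(shape)) * dtype.itemsize
    start = offset - offset % mmap.ALLOCATIONGRANULARITY
    delta = offset - start
    mm = mmap.mmap(fid.fileno(), delta + nbytes, access=mmap.ACCESS_READ, offset=start)
    return np.frombuffer(mm, dtype=dtype, offset=delta).reshape(shape)

class BinaryRawIOSketch:
    def _parse_header(self):
        # keep only a plain file handle, not a long-lived memmap
        self._fid = open("data.bin", mode="rb")

    def _get_analogsignal_chunk(self, i_start, i_stop, channel_indexes):
        # build a short-lived memmap for just this call; once the chunk is
        # consumed the mapping can be garbage-collected and its pages released
        n_channels = 384  # placeholder for the sketch
        itemsize = np.dtype("int16").itemsize
        offset = i_start * n_channels * itemsize
        shape = (i_stop - i_start, n_channels)
        chunk = create_memmap_buffer(self._fid, "int16", offset, shape)
        return chunk[:, channel_indexes]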

@maxjuv @TomBugnon @vncntprvst @juliencarponcy: we need more tests!
I have started to change spikeglx and openephysrawbinary.
Could you try combining this branch with this PR on spikeinterface?
And then:

  • read a recording with read_spikeglx() or read_openephys()
  • apply a simple preprocessing step
  • rec.save(folder=folder, n_jobs=64, chunk_duration='1s', progress_bar=True)
    and monitor the memory usage (a possible end-to-end script is sketched after this list).
    Do you still see an infinite memory increase or not?
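A possible end-to-end test, assuming spikeinterface's standard API; the paths and the exact preprocessing steps are placeholders:

import spikeinterface.full as si

rec = si.read_spikeglx("/path/to/spikeglx_folder", stream_id="imec0.ap")
rec = si.bandpass_filter(rec, freq_min=300., freq_max=6000.)
rec = si.common_reference(rec, operator="median")

# run the parallel save while watching memory (htop, Task Manager, ...)
rec.save(folder="/path/to/output_folder", n_jobs=64, chunk_duration="1s", progress_bar=True)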

@zm711 (Contributor) commented May 5, 2023

Hey Sam, just as an FYI and to support the need for this refactor: I was doing some EMG processing (outside of the spike sorting world) and saw the same thing. I import neo inside a function, create a reader, and return the analog EMG data for post-processing, and it seems that as long as my IPython kernel is active the memmap and header information are not released (i.e., eventually saturating the RAM). Despite the reader being "cleaned up" after the function, if I call the same function on the binary file again it runs significantly faster (it seems like the header and memmap have been cached). Closing the session finally releases the RAM. I'm on Windows for this work currently (which does, annoyingly, hold onto files much longer than Mac and Linux do).

@samuelgarcia (Contributor, Author)

@zm711 thanks for the feedback. So do you think this patch will help your case?

@zm711 (Contributor) commented May 5, 2023

@samuelgarcia
I'm using IntanRawIO, so I can't specifically test this, and this issue only occurs when I'm trying to batch-process data, so it's a rarer issue for me (I never reported it; I just dealt with my computer crashing every once in a while). If this actually works for the more extreme spikeinterface case, then I would predict it would also help in the edge cases of data analysis I run into. But I would need to see whether this is a general Windows issue, an IPython issue, or an np.memmap issue.

@jpgill86 (Contributor) commented May 5, 2023

@zm711, I wonder if your memory release issue is related to #684. Manual garbage collection might help:

import gc
gc.collect()

@zm711 (Contributor) commented May 5, 2023

@jpgill86

I tried gc.collect() and the memory usage didn't change. I ran two rounds of experiments, with the first gc.collect() returning 23 unreachable objects and the second round returning 86.

I do have some try/except blocks for some of this analysis, so the main memory lock occurs when part of the meta-function fails and it moves on to trying a different sub-function, due to needing something like #1249 to gain digital access in addition to analog access (which again is IntanRawIO-specific).

The memory getting locked up is a bit inconsistent, so it's hard for me to know for sure what is causing it, and it is rare enough that hunting it down isn't hugely important to us. I'm happy to try new implementations to see if anything helps, but I don't want to bog down this PR with memory-leak stuff, so if we want to continue this we could chat on #684?

@maxjuv commented May 9, 2023

Hello,
I tried these Neo & SpikeInterface branches on my Windows machine. The memory used is a bit high (7 GB) for 14 cores with a 1 s chunk size, where the processing consisted of highpass / phase shift / detect bad channels / CMR on a Neuropixels dataset. However, the memory stays stable at 7 GB, so it is way better than before.

@JuliaSprenger (Member)

Hi @samuelgarcia, the tests in the current master are passing again. You might need to rebase/merge for proper CI testing.

@TomBugnon
Hi @samuelgarcia
Sorry for the late reply.

I did not check the memory usage (which I didn't specifically have issues with), but relative to the parent commit (2d63e18) this fixes an OSError: cannot allocate memory that I'm getting when too many memmaps/recordings relying on the same file are opened!

For instance this would raise the following error:

import spikeinterface.extractors as se
path = "/path/to/sglx"
extractors = []
for i in range(50):
    # each iteration opens another reader (and memmap) on the same file
    extr = se.SpikeGLXRecordingExtractor(path, stream_id="imec0.ap")
    extractors.append(extr)

Traceback (most recent call last):
  File "/home/tbugnon/projects/ecephys_dev_si_pipeline/pipeline_tests/test_cannot_allocate_memory_error.py", line 49, in <module>
    extr = se.SpikeGLXRecordingExtractor("/Volumes/ceph-tononi/npx_archive/CNPIX2-Segundo/1-21-2020/SpikeGLX/1-21-2020_g0/1-21-2020_g0_imec0", stream_id="imec0.ap")
  File "/home/tbugnon/projects/ecephys_dev_si_pipeline/spikeinterface/src/spikeinterface/extractors/neoextractors/spikeglx.py", line 51, in __init__
    NeoBaseRecordingExtractor.__init__(
  File "/home/tbugnon/projects/ecephys_dev_si_pipeline/spikeinterface/src/spikeinterface/extractors/neoextractors/neobaseextractor.py", line 94, in __init__
    _NeoBaseExtractor.__init__(self, block_index, **neo_kwargs)
  File "/home/tbugnon/projects/ecephys_dev_si_pipeline/spikeinterface/src/spikeinterface/extractors/neoextractors/neobaseextractor.py", line 48, in __init__
    self.neo_reader = get_neo_io_reader(self.NeoRawIOClass, **neo_kwargs)
  File "/home/tbugnon/projects/ecephys_dev_si_pipeline/spikeinterface/src/spikeinterface/extractors/neoextractors/neobaseextractor.py", line 39, in get_neo_io_reader
    neo_reader.parse_header()
  File "/home/tbugnon/projects/ecephys_dev_si_pipeline/python-neo/neo/rawio/baserawio.py", line 178, in parse_header
    self._parse_header()
  File "/home/tbugnon/projects/ecephys_dev_si_pipeline/python-neo/neo/rawio/spikeglxrawio.py", line 103, in _parse_header
    data = np.memmap(info['bin_file'], dtype='int16', mode='r', offset=0, order='C')
  File "/home/tbugnon/miniconda3/envs/ecephys_dev_si_pipeline/lib/python3.10/site-packages/numpy/core/memmap.py", line 267, in __new__
    mm = mmap.mmap(fid.fileno(), bytes, access=acc, offset=start)
OSError: [Errno 12] Cannot allocate memory

And it is fixed by this PR.
This happens for our very large recordings. I was NOT able to replicate it on the short recordings from spikeinterface.core.datasets (maybe the number of memmaps that can be opened depends on file size?).

I did not have to check out PR 1602 on spikeinterface for this behavior.

Let me know if I can help somehow! Looking forward to seeing it merged
Best
Tom

@samuelgarcia (Contributor, Author)

Hi Tom.
Thanks for the feedback.
I will try to finalize this soon.

@JuliaSprenger added this to the 0.13.0 milestone on Jun 14, 2023
@TomBugnon
Hi @samuelgarcia
do you have any ETA for this PR? And is this branch safe to use as-is, or does it need more testing?

memmap = self._memmaps[seg_index, stream_id]
#~ memmap = self._memmaps[seg_index, stream_id]
key = (seg_index, stream_id)
memmap = create_memmap_buffer(*self._memmap_args[key])
Review comment (Contributor) on the diff hunk above:

Can't you use get_memmap_shape to avoid creating a buffer here?
