Replies: 3 comments 13 replies
-
Hi Jan, we have used datasets of a similar size with no problem. How many pixels you can use to train the SOM depends on how much RAM your machine has. You can decrease the |
Beta Was this translation helpful? Give feedback.
-
Hi Ark team, we want to use Pixie on a highly-multiplexed imaging dataset from the Lunaphore COMET platform (Sequential Immunofluorescence). Our images are similar to what @honzee described, about 35k x 30k pixels but we only have 12 channels. Our current cohort consists of 9 such images, but we have other cohorts that are even bigger. We currently try to run pixel clustering with 1% subsetting for SOM building, but it fails during preprocessing due to memory, when asking for 240GB. I'am currently running it with 500GB and will see if this works.
Are there any other ways in the pipeline to reduce the memory footprint or speed up the calculations? I tried using multiprocessing, but that created some errors based on the multiprocess pool on our Slurm cluster (I am not very familiar with using multiprocess on SLURM). Thanks for your help! Best, |
Beta Was this translation helpful? Give feedback.
-
Hi Pixie team, We have been discussing internally a lot recently about a couple of key aspects about pixie, that limit our usage of it. Some of them are already mentioned above by @honzee and me. I will summarize the main aspects here again to make it easier to follow. I will preface this by saying that I really like the approach you have established in Pixie and it has performed great on some of the data we have tested it on. However, here are the major features limiting our usage of it:
The main reason I am asking these questions is to see what you have tried in the past to answer some of these challenges and whether it thus would be worth to direct efforts into implementing some of them. Thanks again and cheers! 😊 |
Beta Was this translation helpful? Give feedback.
-
Hi,
I am also wondering, how would you approach analyzing large images, e.g. images generated from multiplexed imaging platforms. Our images have 40 000 x 40 000 pixels with ~30 channels. Do you think the pixie clustering/annotation approach could be applied?
Best,
Jan
Beta Was this translation helpful? Give feedback.
All reactions