FastOMA icon indicating copy to clipboard operation
FastOMA copied to clipboard

Report step uses very large amount of memory

Open ens-sb opened this issue 1 year ago • 1 comments

Hello,

I have run FastOMA on about 2200 genomes. The inference of HOGs finished all right (with some tweaks in the resource config), however the fastoma_report step failed as it consumes a very large amount of memory (did not fit in 1200GBs!). I just wanted to let you know about this issue in the case it is possible to improve the memory usage.

Regards, Botond

ens-sb avatar Apr 12 '24 14:04 ens-sb

Dear Botond

Thanks for reporting this. I was able to reproduce the error, which rooted in seaborn's sns.displot used for visualisation of protein length distribution in fastoma_notebook_stat.ipynb. Sorry for the inconveniences. We are fixing this and will update the notebook soon.

Best, Sina

sinamajidian avatar Apr 15 '24 01:04 sinamajidian

We decided not to include this visualization for big datasets and limit it to 100 species simply using a condition if len(species_list)<100: in the jupyter report file. Thanks again.

sinamajidian avatar Jun 19 '24 14:06 sinamajidian