seqkit icon indicating copy to clipboard operation
seqkit copied to clipboard

rmdup memory consumption

Open liushuqing506 opened this issue 1 year ago • 2 comments

I have a 100GB fasta file that needs to remove duplicate sequences. How much memory and analysis time does it require?

liushuqing506 avatar Apr 29 '24 08:04 liushuqing506

I'm not sure, < 10GB probably. It's memory efficient.

Just try it, it's faster than waiting for the author's reply.

shenwei356 avatar Apr 29 '24 08:04 shenwei356

ok,thanks,now trying

liushuqing506 avatar Apr 29 '24 09:04 liushuqing506