blitzgsea icon indicating copy to clipboard operation
blitzgsea copied to clipboard

Gene set size?

Open robinpaul85 opened this issue 8 months ago • 1 comments

What does the column geneset_size actually represent in the output ? I had initially thought it is the total number of genes that are present in a given geneset. But the value is substantially lower than this number. Can you please explain?

robinpaul85 avatar May 27 '25 21:05 robinpaul85

The geneset_size is the intersection of that term's gene set and the input signatures list of genes. So it can be much smaller than that terms actual gene set if your signature gene's do not overlap much with it. You can see that this happens in the function strip_gene_set on line 86 of init.py. Hoped this helped.

terynin avatar Oct 17 '25 16:10 terynin