
How to infer different species-level abundances based on strain-level abundances

Open xujialupaoli opened this issue 2 years ago • 5 comments

Hi! Thanks for the clear documentation and awesome tool. I have read your explanation of StrainScan, and it seems to be designed to analyze the abundance of each strain within one specified species. However, I want to analyze multiple species. Can I build a strain database from multiple species, analyze the abundance of each strain, and use those results to infer the abundance of the different species?

xujialupaoli avatar Jan 27 '24 04:01 xujialupaoli

Hi, thanks for using StrainScan!

Currently, StrainScan supports strain identification for only one specified species per run. To identify strains from multiple species, one possible solution is to build a separate strain database for each species and run StrainScan once per database. Then, given the identified strains and their abundances, you can infer the abundance of the strains of each species.
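As a rough illustration of that last step, the sketch below sums strain-level values per species and normalizes across species. It assumes each per-species run yields a depth-like (coverage) estimate for every identified strain. Relative abundances that already sum to 1 within a species would not work here, because every species would then total 1. The function name, the input layout, and the strain names are hypothetical and do not come from StrainScan itself.

```python
def species_abundance(strain_depths):
    """Collapse strain-level depths into species-level fractions.

    strain_depths: {species_name: {strain_name: estimated_depth}}
    (hypothetical layout, filled in from one StrainScan run per species).
    Returns {species_name: fraction of the total depth}.
    """
    # Total depth per species = sum over its identified strains.
    totals = {
        species: sum(strains.values())
        for species, strains in strain_depths.items()
    }
    grand_total = sum(totals.values())
    if grand_total == 0:
        # Nothing was detected in any run.
        return {species: 0.0 for species in totals}
    return {species: total / grand_total for species, total in totals.items()}


# Made-up numbers, for illustration only.
example = {
    "Escherichia coli": {"strain_A": 30.0, "strain_B": 10.0},
    "Staphylococcus aureus": {"strain_C": 10.0},
}
result = species_abundance(example)
for species, fraction in result.items():
    print(f"{species}\t{fraction:.2f}")
```

Because depth is roughly proportional to the number of genome copies, the fractions approximate relative genome abundance rather than the share of reads, and they cover only the species for which a database was built.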

liaoherui avatar Jan 29 '24 04:01 liaoherui

Thank you!

xujialupaoli avatar Jan 29 '24 04:01 xujialupaoli

Can the strain assigned to each individual read be obtained from the StrainScan output?

xujialupaoli avatar Jan 29 '24 04:01 xujialupaoli

I regret to say that the current version doesn't support that either. I am considering developing a new tool to support these features (strains from multiple species + read assignment).

However, I think Kraken2 can do that, although its performance at the strain level is sometimes worse. In my experience, its performance at the species level is satisfactory. You could try that tool if needed.
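For the read-assignment part, a minimal sketch of tallying reads per taxon from Kraken2's standard per-read output might look like the following. It assumes the default tab-separated layout (classification flag `C` or `U`, read ID, taxonomy ID, read length, LCA k-mer mapping) without `--use-names`. The function name and the sample lines are made up for illustration.

```python
import io
from collections import Counter


def count_reads_per_taxid(lines):
    """Count classified reads per taxonomy ID in Kraken2 per-read output."""
    counts = Counter()
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        status, _read_id, taxid = fields[0], fields[1], fields[2]
        if status == "C":  # "U" marks unclassified reads
            counts[taxid] += 1
    return counts


# Made-up sample in the assumed format (562 = Escherichia coli,
# 1280 = Staphylococcus aureus in the NCBI taxonomy).
sample = io.StringIO(
    "C\tread1\t562\t150\t562:116\n"
    "C\tread2\t1280\t150\t1280:116\n"
    "U\tread3\t0\t150\t0:116\n"
    "C\tread4\t562\t150\t562:116\n"
)
read_counts = count_reads_per_taxid(sample)
print(read_counts)
```

With a real run, the same function could be fed an open file handle instead of the in-memory sample. Reads assigned to a strain-level taxon would still need to be rolled up to their parent species using the taxonomy.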

liaoherui avatar Jan 29 '24 04:01 liaoherui

Thank you very much!

xujialupaoli avatar Jan 29 '24 06:01 xujialupaoli