GenGraph icon indicating copy to clipboard operation
GenGraph copied to clipboard

[HELP / FEATURE] is it possible to use a single fasta file containing many genomes?

Open minu7 opened this issue 4 years ago • 1 comments

The national library of medicine (https://www.ncbi.nlm.nih.gov/datasets) only allows you to download a list of genomes in a single fasta file (extension .fna), I tried to provide those files as input but it did not work. It's my fault? Have you encountered similar problems? Thank you!

minu7 avatar Mar 23 '21 15:03 minu7

Hi minu7,

More a problem of assumptions and standards, for example a fasta file representing one genome can have multiple chromosomes represented in the same file as different > delimited entries. Having multiple genomes with multiple chromosomes in one file can get messy!

It is doable, but a lot of work for a fringe case for now. May make sense for viral genomes though. Will look into it and think about the pros / cons / things that will break.

jambler24 avatar Mar 23 '21 17:03 jambler24