resources icon indicating copy to clipboard operation
resources copied to clipboard

pipeline for what doesn't align to a target?

Open grace-ac opened this issue 4 months ago • 5 comments

wanting to see what doesn't align to seastar genome with my sea star RNAseq data

grace-ac avatar Oct 01 '25 18:10 grace-ac

Almost all aligners have an option to save unaligned reads. Check options for whichever aligner you're using.

kubu4 avatar Oct 01 '25 19:10 kubu4

It seems like you're using HISAT2, so you'd use this option:

--un <path>, --un-gz <path>, --un-bz2 <path> Write unpaired reads that fail to align to file at . These reads correspond to the SAM records with the FLAGS 0x4 bit set and neither the 0x40 nor 0x80 bits set. If --un-gz is specified, output will be gzip compressed. If --un-bz2 is specified, output will be bzip2 compressed. Reads written in this way will appear exactly as they did in the input file, without any modification (same sequence, same name, same quality string, same quality encoding). Reads will not necessarily appear in the same order as they did in the input.

kubu4 avatar Oct 02 '25 17:10 kubu4

@grace-ac can you drop in a url to a

  • genome file
  • gff file
  • gtf
  • directory with trimmed reads

for any one of your species

sr320 avatar Oct 04 '25 14:10 sr320

@grace-ac ?

sr320 avatar Nov 01 '25 13:11 sr320