Sam Chorlton
Sam Chorlton
+1 this, median length before and after filtering would be super helpful as it should not be assumed that read lengths follow a normal distribution. Thanks for your consideration!
@opengene and @sfchen, Hoping you can (re)visit this, as this is actually pretty critical as it breaks downstream popular tools such as biopython. The issue is the merged FASTQ like...
OK, thanks! That's rather unfortunate but understandable.
Perfect, looking forward to the next release! Yes, we also just symlinked in the file to where `mob_recon` was looking for it for now. Thanks again!
Hi @jrober84, thanks for your hard work. It's actually not clear to me that this issue was resolved? It looks like MOB-suite still looks for the ETE3 file in the...
Sorry for my digging further, but how does it cause issues with BLAST? As far as I'm aware, BLAST outputs contig ID as per FASTA standard in many/most of its...
Thanks @jrober84 for the effort! One suggestion: I'd suggest putting the sequence ID in the `contig_id` field, and not the whole FASTA header. Eg. with this fix it puts `contig_1...
Thanks @kbessonov1984. While the output may be "correctly" written to TSV by python, it won't be parseable as it will have variable numbers of columns depending on if a FASTA...
> so as I understand the problem is there isn't enough reads to sample and every value should be 0 in that tsv aside from F1 so that is definitely...
> Are you seeing different FASTA header formats in the final output (i.e. `rnabloom.transcripts.fa`) of different assemblies? Yes this. Different reads used as input leads to differently formatted FASTA headers....