Sam Chorlton comments

Results 45 comments of


                                            Sam Chorlton

Changing ID during merging leads to 'corrupt' fastq-files

@opengene and @sfchen, Hoping you can (re)visit this, as this is actually pretty critical as it breaks downstream popular tools such as biopython. The issue is the merged FASTQ like...

Is --split-prefix supported?

OK, thanks! That's rather unfortunate but understandable.

Bug: `--database_directory` not working in `mob_recon`

Perfect, looking forward to the next release! Yes, we also just symlinked in the file to where `mob_recon` was looking for it for now. Thanks again!

Bug: `--database_directory` not working in `mob_recon`

Hi @jrober84, thanks for your hard work. It's actually not clear to me that this issue was resolved? It looks like MOB-suite still looks for the ETE3 file in the...

contig_id in any of the files cannot be mapped back to original FASTA

Sorry for my digging further, but how does it cause issues with BLAST? As far as I'm aware, BLAST outputs contig ID as per FASTA standard in many/most of its...

contig_id in any of the files cannot be mapped back to original FASTA

Thanks @jrober84 for the effort! One suggestion: I'd suggest putting the sequence ID in the `contig_id` field, and not the whole FASTA header. Eg. with this fix it puts `contig_1...

contig_id in any of the files cannot be mapped back to original FASTA

Thanks @kbessonov1984. While the output may be "correctly" written to TSV by python, it won't be parseable as it will have variable numbers of columns depending on if a FASTA...

integer underflow bug?

> so as I understand the problem is there isn't enough reads to sample and every value should be 0 in that tsv aside from F1 so that is definitely...

Transcript headers follow different formats

> Are you seeing different FASTA header formats in the final output (i.e. `rnabloom.transcripts.fa`) of different assemblies? Yes this. Different reads used as input leads to differently formatted FASTA headers....