hh-suite icon indicating copy to clipboard operation
hh-suite copied to clipboard

Why need to refer a database when I build a customized database?

Open EileenLLL opened this issue 3 years ago • 2 comments

Dear all, I want to use hhblits to build my own database(1,040,000seqs). I followed the tutorial "Building customized databases", and the second step is to build an MSA with HHblits for each sequence, comands as following: """mpirun -np <number_threads>
hhblits_mpi -i _fas -d <path_to/uniclust30> -oa3m _a3m_wo_ss -n 2 -cpu 1 -v 0"""

I wonder why I need to define a -d <path_to/uniclust30>, the procedure is using my seqs to search homologous sequence from uniclust30? why do that? Also I learned that, I can split my fasta database to many single seqs, and then search hhr for each seq using hhblits from uniclust30. However I don't understand why we should do that, and why I can just change my fasta database to hhms directly?

Thanks

EileenLLL avatar Dec 08 '22 13:12 EileenLLL

Hi, I am facing the same thing, are you able to get it? I am not able to get the exact steps to run the my 2 million query sequences against my own database of 250 seq.

Please let me know

Thanks

Citugulia40 avatar Oct 12 '23 19:10 Citugulia40

HMM-HMM comparisions require a diverse MSA on both sides to build good profiles for either.

If you just want to do a sequence-sequence, profile-sequence or sequence-profile search, please use MMseqs2 as it doesn't require involved steps to build databases.

milot-mirdita avatar Dec 26 '23 04:12 milot-mirdita