openfold icon indicating copy to clipboard operation
openfold copied to clipboard

mmseqs inference

Open kchu02 opened this issue 3 years ago • 3 comments

There is clear instructions on running Alphafold on its original sequence alignment protocol but that on mmseqs is unclear. Is mmseqs supported for both monomeric and multimeric structures?

If so, should the multimeric fasta remains the same structure as that in Alphafold? What is the corresponding flag(s)/command(s) and is precomputed MSA required?

kchu02 avatar Jul 11 '22 14:07 kchu02

Currently, the stock inference script w/ complex support doesn't support mmseqs inference automatically. For that, you'll have to use the alignment precomputation scripts in scripts and then pass them to the inference script using the --use_precomputed_alignments flag.

gahdritz avatar Jul 11 '22 14:07 gahdritz

While we try to proceed with precompute_alignments_mmseqs.py, we experience another error.

python3 scripts/precompute_alignments_mmseqs.py input.fasta \
    data/mmseqs_dbs \
    uniref30_2103_db \
    alignment_dir \
    /working/MMseqs2/build/bin/mmseqs \
    --hhsearch_binary_path hhsearch \
    --env_db colabfold_envdb_202108_db \
    --pdb70 data/pdb70/pdb70

Error:

Traceback (most recent call last):
  File "scripts/precompute_alignments_mmseqs.py", line 175, in <module>
    main(args)
  File "scripts/precompute_alignments_mmseqs.py", line 84, in main
    cmd, stdout=subprocess.PIPE, stderr=subprocess.PIPE
  File "/working/openfold/lib/conda/envs/openfold_venv/lib/python3.7/subprocess.py", line 800, in __init__
    restore_signals, start_new_session)
  File "/working/openfold/lib/conda/envs/openfold_venv/lib/python3.7/subprocess.py", line 1551, in _execute_child
    raise child_exception_type(errno_num, err_msg, err_filename)
OSError: [Errno 8] Exec format error: 'scripts/colabfold_search.sh'

kchu02 avatar Jul 14 '22 06:07 kchu02

Looks like that script was missing a shebang. Fixed in 1c279f9.

gahdritz avatar Jul 14 '22 11:07 gahdritz