Code for generating AFDB embedded database

Open aidenkoloj opened this issue 4 months ago • 1 comments

Hi,

What was the approach to embedding the entire AFDB? Were the AFDB structures parsed according to TED, then each one individually saved?

aidenkoloj avatar Oct 02 '25 16:10 aidenkoloj

I just uploaded the script to https://github.com/greener-group/progres/blob/main/scripts/embed_afdb.py.

cluster_rep_file contains the TED domain data, and using multiple such files allows the work to be run on a cluster. Each AFDB file is downloaded, the domain is extracted and embedded, and the file is deleted. Afterwards, the individual output files are stitched together into one.
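
For readers who want the shape of that loop without opening the script, here is a rough sketch of the download, extract, embed, delete and stitch steps. It is not the code in embed_afdb.py. Everything specific in it is an assumption made for illustration: the domain boundaries are assumed to be strings like 5-40_60-90, the structures are assumed to be PDB-format text, `fetch` and `embed` are hypothetical callables supplied by the caller (a real run would download from AFDB and call the embedding model), and the tab-separated output format is made up.

```python
import os
from typing import Callable, Iterable, List, Tuple

Segment = Tuple[int, int]


def parse_chopping(chopping: str) -> List[Segment]:
    """Parse an assumed boundary string, e.g. '5-40_60-90', into (start, end) pairs."""
    segments = []
    for seg in chopping.split("_"):
        start, end = seg.split("-")
        segments.append((int(start), int(end)))
    return segments


def extract_domain(pdb_text: str, segments: List[Segment]) -> List[str]:
    """Keep ATOM records whose residue number falls inside any segment."""
    kept = []
    for line in pdb_text.splitlines():
        if line.startswith("ATOM"):
            resnum = int(line[22:26])  # PDB columns 23-26 hold the residue number
            if any(start <= resnum <= end for start, end in segments):
                kept.append(line)
    return kept


def process_chunk(
    entries: Iterable[Tuple[str, str, str]],       # (domain_id, afdb_id, chopping)
    fetch: Callable[[str, str], None],             # hypothetical: fetch(afdb_id, dest_path)
    embed: Callable[[List[str]], List[float]],     # hypothetical: embed(atom_lines)
    out_path: str,
    workdir: str,
) -> None:
    """Process one chunk of domains and write one output file for that chunk."""
    with open(out_path, "w") as out:
        for domain_id, afdb_id, chopping in entries:
            pdb_path = os.path.join(workdir, f"{afdb_id}.pdb")
            fetch(afdb_id, pdb_path)               # download the full AFDB entry
            try:
                with open(pdb_path) as f:
                    domain = extract_domain(f.read(), parse_chopping(chopping))
                vector = embed(domain)             # embed only the domain
                values = " ".join(f"{v:.4f}" for v in vector)
                out.write(f"{domain_id}\t{values}\n")
            finally:
                os.remove(pdb_path)                # delete the download straight away


def stitch(part_paths: Iterable[str], final_path: str) -> None:
    """Concatenate the per-chunk output files into a single file."""
    with open(final_path, "w") as final:
        for path in part_paths:
            with open(path) as part:
                final.write(part.read())
```

Each cluster job would call `process_chunk` on its own input chunk and output path, and `stitch` would run once at the end. Deleting each download inside the loop keeps disk use bounded to one structure per job.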

jgreener64 avatar Oct 03 '25 14:10 jgreener64