
Minimum number of structures to train model

Open tanoramb opened this issue 2 years ago • 1 comments

Hello,

I was running some tests, and it seems that there is a minimum number of protein structures required to train a model. I have tested datasets with 2 through 10 structures (similar domains), and the pipeline only runs once the dataset reaches 10 structures.

Is this correct, or is there something I am not considering?

Thanks

tanoramb, Feb 14 '23 14:02

I don't think there's anything that would cause it to fail with fewer structures. The only thing that comes to mind is that we filter out structures with too few or too many amino acids. Is it possible that the structures in your small datasets are themselves too short (or too long) and get filtered out, leaving you with an empty dataset?

wukevin, Feb 17 '23 23:02
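
One way to check the length-filtering explanation above is to count the residues in each input structure and see which ones would survive a length filter. The following is only a rough sketch, not foldingdiff's own code: `count_residues`, `split_by_length`, and the `min_len` / `max_len` parameters are hypothetical names, and the thresholds in the usage example are placeholders that should be replaced with whatever values the training configuration actually uses.

```python
from typing import Dict, Tuple


def count_residues(pdb_text: str) -> int:
    """Count residues in PDB-format text by counting unique alpha-carbon (CA) ATOM records."""
    seen = set()
    for line in pdb_text.splitlines():
        # Fixed-width PDB columns: atom name is cols 13-16, chain ID col 22,
        # residue number cols 23-26, insertion code col 27.
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            seen.add((line[21:22], line[22:26], line[26:27]))
    return len(seen)


def split_by_length(
    lengths: Dict[str, int], min_len: int, max_len: int
) -> Tuple[Dict[str, int], Dict[str, int]]:
    """Split structures into (kept, dropped) using an inclusive length window.

    min_len and max_len are hypothetical parameters; they are NOT taken
    from the foldingdiff source.
    """
    kept = {name: n for name, n in lengths.items() if min_len <= n <= max_len}
    dropped = {name: n for name, n in lengths.items() if name not in kept}
    return kept, dropped


if __name__ == "__main__":
    # Placeholder lengths and thresholds, for illustration only.
    example_lengths = {"domain_a": 25, "domain_b": 90, "domain_c": 400}
    kept, dropped = split_by_length(example_lengths, min_len=40, max_len=128)
    print("kept:", kept)
    print("dropped:", dropped)
```

If every structure lands in `dropped`, the dataset would be empty after filtering regardless of how many files were supplied, which would match the behaviour described in the question.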