MiniFold icon indicating copy to clipboard operation
MiniFold copied to clipboard

Use Physicochemical features of AAs as input.

Open hypnopump opened this issue 6 years ago • 5 comments

It would be useful to use Physicochemical properties beyond Van der Waals radius of AAs as input sucha as:

  • surface exposure
  • predicted solvent accessibility
  • polarity
  • isoelectric point
  • pairwise potential ...?

hypnopump avatar Mar 02 '19 11:03 hypnopump

Those all sound like desirable traits to include.

daniel-z-kaplan avatar Mar 15 '19 16:03 daniel-z-kaplan

Implementation should be straightforward. If you want inspiration you can see the function onehotter_aa from preprocessing/angle_data_preparation_py.ipynb and apply the method to the function wider from models/predicting_distances.ipynb. @Daniel-z-Kaplan What do you think? Can you do it? Contributors are welcome!

hypnopump avatar Mar 17 '19 10:03 hypnopump

Hi, Im working with prot structure prediction as my masters project. I implemented some of your operations as vectorized functions, and I include some physchem properties on input. Im even using your model, with slight modifications (adding dilated convolutions and aproximating to the architecture of Raptor-X-Contacts).

n4ndoz avatar May 09 '20 01:05 n4ndoz

Hi there! Thanks for the interest and continuing the work on the project! I'm sharing my newest version here with you in a zip folder (it includes cool ideas, but i moved on to different projects before integrating them and producing good results).

It includes the following features:

  • vectorized data processing functions
  • handling protein arbitrary length
  • predictions by X squared crops of Y*Y AAs each and averaging at prediction time
  • deeper resnet
  • physicochemical features of AAs

You can grab ideas and copy/paste from this codebase. I would like to ask you if, in return, you could integrate this code with the one in the repo and also share your dilated convolution model (don't overwrite the existing model code, just provide the option of using your model as well via loading a diferent model).

Also if you could share some results it would be very nice as well. Contact me via email so I can send you the newest version of the code: [email protected]

It would be awesome if you could do this! I'll add your name to the contributors / collaborators / authors list!

hypnopump avatar May 09 '20 11:05 hypnopump

Hi, I will reach you by email!

Thanks for sharing the code and ideas. I don't have the time now, at least until July, to add to the repo, since I'm still coding adn writing, but as of July I can update and integrate my models. Since I use a similar workflow than yours (except that I generally prepare the data before passing to a generator).

n4ndoz avatar May 13 '20 20:05 n4ndoz