OpenFold2 icon indicating copy to clipboard operation
OpenFold2 copied to clipboard

Attempt at reproduction of AlphaFold2

OpenFold2

GitHub Logo

Attempt at reproduction of AlphaFold2. This repository is an ecxample of dataset-driven model development. First, we generate a dataset, using a procedure that mimics some aspect of the real data. Then we develop a model, that tries to learn this particular dataset.

Documentation

Toy datasets & models

  1. toy_gpt : GPT model, mainly Karpathy's code but rewritten in a more structured way
  2. toy_se3 : Iterative SE(3)-transformer and simple particle dynamics dataset
  3. toy_prot: Toy protein dataset and structural part of the AlphaFold2 model
  4. toy_msa: Toy multiple sequence alignment dataset, supervised case and complete MSA+structural parts of AlphaFold2