| Tool | Inputs | Outputs | Capabilities |
|---|---|---|---|
| AlphaFold2 | Protein | Average pLDDT, PAE matrix, ipTM/pTM | 5 samples |
| Boltz | Protein, nucleic acid, small molecule (ccd, smiles) | Confidence, PDE/PAE matrix, ipTM/pTM | Multiple samples |
| Chai | Protein, nucleic acid, small molecule (smiles) | Aggregate score, Average pLDDT, PAE/PDE matrix, pTM/ipTM | Multiple samples |
| Protenix | Protein, nucleic acid, small molecule (ccd) | pLDDT, pTM/ipTM, PAE/PDE matrix | Multiple samples |
Recommend for specific cases
- ImmuneBuilder - for fast antibody/nanobody/TCR structure prediction (very fast)
- HighFold - for cyclic peptides, uses AlphaFold2 modified with a cyclic offset
- ESMFold - uses ESM2 language model, very fast
More experimental tools
- AFSample2 - uses AlphaFold jackhammer MSAs instead of MMseqs2 MSAs, so takes several hours. Generates arbitrary number of conformations
- UniFold - retrainable AlphaFold, also uses jackhammer MSAs
- CombFold - combine large complexes with many repeating domains from smaller structure predictions of subunits
- AlphaFlow - generates more diverse conformations
- RosettaFoldNA - Protein/nucleic acid complexes
- RosettaFold-All-Atom - Protein/small molecule complexes - allows for uploading a sdf for ligand
Multiple Sequence Alignment
All of our structure prediction tools require a Multiple Sequence Alignment (MSA) as input. This queries a large database of sequences to find similar sequences to your input to inform the structure prediction. The alignment produces a .a3m file, which stores the result sequences aligned with the input sequence. We then use this file as input to your structure prediction. We use the ColabFold MMseqs2 method to query uniref30, bfd, and the colabfold environmental db to generate the MSA. We host our own servers for MSA, with the databases loaded into memory, for a great improvement in calculation speed.Input limits
All Alphafold3 models have a hard input limit of 2048 residues due to limitations of the models. Alphafold2 has a limit of ~5000 residues due to compute restraints.Alphafold2
AlphaFold2 was a major leap in structure prediction when it came out, shifting the field from physics-based modeling to deep learning. It uses attention-based neural networks and evolutionary data, reaching near-experimental accuracy and outperforming everything that came before it. The training cutoff for AlphaFold2 is April 30, 2018. This can be helpful to keep in mind when evaluating or benchmarking results.Overview of outputs
- pTM (predicted Template Modeling score): Evaluates the confidence in the overall arrangement of the predicted structure, ranging from 0 to 1. Scores above 0.5 suggest a good prediction; scores below 0.5 indicate poor confidence.
- ipTM (interface predicted TM-score): Assesses the confidence in the predicted relative positions of interacting subunits within a protein complex. Values above 0.8 indicate high-quality interface predictions, while scores below 0.6 suggest potential inaccuracies.
- pAE (predicted Aligned Error) matrix: A 2D plot estimating the expected positional error between residue pairs in Ångströms. Lower pAE values denote higher confidence in the relative positioning of residues, whereas higher values indicate uncertainty. This matrix is particularly useful for evaluating the confidence in spatial relationships between different domains or subunits.
Alphafold3
While Alphafold3 is not commercially available, various reproductions of it have come out, which match its accuracy. We offer Chai-1, Boltz, and Protenix. These models are all-atom models, meaning they can incorporate small molecules and nucleic acids into the structure as well. Alphafold3 models are all generative models, so they can sample an unlimited number of output structures (up to thousands). If you are looking for a particular pose, we recommend generating many samples and taking the highest confidence / clustering afterwards. The confidence metric produced from these models can be a good indicator of the model’s confidence in the predicted structure. The other metrics are less clearly useful. We are members of the OpenFold consortium, which is working on OpenFold, a more complete AlphaFold3 reproduction. We’ll have it available when it’s released in the coming weeks!Prediction speed
You can expect most AlphaFold3 structure prediction jobs to complete within ~10 minutes on Tamarind. AlphaFold2 is generally slower, taking ~30 minutes to several hours for larger jobs. Prediction time increases linearly with the number of recycles (AlphaFold2) or samples (AlphaFold3) selected.Antibody structure prediction
Chai and Boltz both provide the option to specify restraints, including contact (between 2 residues) and pocket (between a residue and a chain) restraints. Depending on what epitope and paratope information you have, we recommend constraining your structure prediction. We’ve found using either with a large number of samples to produce good results.Restraints with Boltz/Chai
- Contact restraints - constrain a specific residue on Chain 1 with a specific residue on Chain 2 - only between 2 protein chains
- Pocket restraints - constrain a specific residue on Chain 1 with a different entire chain or a ligand - between 2 protein chains or a protein and a ligand
- Covalent restraints - create a covalent bond between a specific atom on Chain 1 with a specific atom on Chain 2 - between 2 proteins chains or a protein and a ligand