Skip to content

Alphafold

Predicts 3D protein structures from multiple sequence alignments (MSA) produced earlier in the workflow. It converts A3M MSAs to features and runs the Alphafold model to generate ranked PDB structures. Supports mock/test modes for fast iteration and seeding for reproducibility.
Preview

Usage

Use this node after obtaining MSA search results (A3M) for one or more protein sequences. Provide the A3M dictionary keyed by sequence IDs, choose the Alphafold model preset, optionally toggle template search, set which model indices to skip, and run. The output is a dictionary of ranked PDB strings keyed as '_'.

Inputs

FieldRequiredTypeDescriptionExample
a3mTrueA3MMSA search results in A3M format as a dictionary mapping sequence IDs to A3M content. Each key should uniquely identify a sequence; each value is the A3M alignment for that sequence.{"seq1": "", "seq2": ""}
search_templatesTrueBOOLEANWhether to search for structural templates before prediction. In TEST mode this is automatically disabled.false
model_presetTrue["monomer", "monomer_ptm", "monomer_casp14", "multimer"]Select which Alphafold model preset to run. 'multimer' is not supported.monomer
models_to_relaxTrue["NONE", "BEST", "ALL"]Whether to run relaxation on results. Relaxation is not supported yet; only 'NONE' is allowed.NONE
enable_gpu_relaxTrueBOOLEANWhether to use GPU for relaxation if relaxation were enabled. Currently has no effect because relaxation is not supported.true
skip_modelsTrueSTRINGComma-separated list of model indices (1–5) to skip during prediction (e.g., "3,5"). You cannot skip all five models. In TEST mode, this is overridden to skip [2,3,4,5].2,4
seedTrueINTBase random seed (0 to 4294967295). For multiple sequences, each subsequent sequence uses seed + index to ensure deterministic variation per sample.42
modeTrue["MOCK", "PROD", "TEST"]Execution mode. MOCK returns predefined data, PROD runs full prediction, TEST runs a fast prediction with minimal settings (disables templates and skips all but the first model).PROD

Outputs

FieldTypeDescriptionExample
folding.pdbPDBDictionary mapping '_' to PDB file content (string) for each predicted ranking.{"seq1_rank_1": "ATOM ...", "seq1_rank_2": "ATOM ..."}

Important Notes

  • Model support: 'multimer' preset is not supported.
  • Relaxation: Relaxation is not supported yet; set models_to_relax to 'NONE'.
  • skip_models: Allowed indices are 1–5; you cannot skip all five models. Use a comma-separated string (e.g., "1,3").
  • TEST mode overrides: In TEST mode, template search is disabled and skip_models is set to [2,3,4,5] regardless of input.
  • Seeding behavior: For multiple sequences, the effective seed per sequence is seed + sequence_index, enabling reproducible but distinct results per sequence.
  • Input format: The A3M input must be a dict where keys (sequence IDs) map to A3M content; these IDs are used to build output names and to match features internally.

Troubleshooting

  • Error: 'multimer model is not supported': Choose a supported model_preset (monomer, monomer_ptm, or monomer_casp14).
  • Error: 'Relaxation is not supported': Set models_to_relax to 'NONE'.
  • Error: 'Expected models indices in skip_models to be in range [1, 5]': Ensure skip_models contains only numbers 1–5 (e.g., "2,5").
  • Error: 'Cannot skip all 5 models': Remove at least one index from skip_models.
  • Unexpected output naming or missing results: Verify the A3M input dictionary keys are correct and unique; each key becomes a prefix in output PDB names.
  • Results differ despite same seed: Remember seed increments per sequence (seed + i). For single-sequence reproducibility, keep inputs identical and process one sequence.