Skip to content

Alphafold

Runs AlphaFold protein structure prediction from MSA inputs in A3M format. It converts A3M to Stockholm, prepares features, and performs model inference to produce ranked PDB structures per input sequence. Supports MOCK, PROD, and TEST modes; currently, multimer models and relaxation are not supported.
Preview

Usage

Use this node after obtaining multiple sequence alignments (MSAs). A typical workflow is: provide sequences -> run MSA search -> feed the resulting A3M dict into Alphafold. Choose the model preset and whether to search templates (monomer-only). In TEST mode, it speeds up runs by only evaluating the first model and disabling template search. The output is a dictionary of ranked PDB strings keyed by combined sequence ID and rank label.

Inputs

FieldRequiredTypeDescriptionExample
a3mTrueA3MDictionary mapping sequence IDs to A3M MSA contents. Keys are sequence IDs; values are A3M strings for each sequence.{"seq1": ">seq1\nMKT...\n>seq1_homolog1\nM-T...\n", "seq2": ">seq2\nAGV..."}
search_templatesTrueBOOLEANWhether to search structural templates before prediction. Ignored in TEST mode (forced off).false
model_presetTrueSTRING (enum: monomer \| monomer_ptm \| monomer_casp14 \| multimer)AlphaFold model variant to use. Multimer is currently not supported.monomer
models_to_relaxTrueSTRING (enum: NONE \| BEST \| ALL)Selects whether to run relaxation. Currently not supported; must be NONE.NONE
enable_gpu_relaxTrueBOOLEANIf relaxation were enabled, choose GPU vs CPU. Currently not used since relaxation is unsupported.true
skip_modelsTrueSTRINGComma-separated model indices (1–5) to skip during inference (e.g., "2,4"). Cannot include all five. Validated at runtime.2,4
seedTrueINTBase random seed. If multiple sequences are present, each subsequent sample uses seed + index.42
modeTrueSTRING (enum: MOCK \| PROD \| TEST)Execution mode: MOCK uses bundled mock results, PROD runs the actual services, TEST uses quick settings (first model only, no templates) for short sequences.PROD

Outputs

FieldTypeDescriptionExample
folding.pdbPDBDictionary of ranked PDB predictions per input sequence. Keys are combined as {sequence_id}_{rank_label}; values are PDB contents.{"seq1_ranked_0.pdb": "ATOM ...", "seq1_ranked_1.pdb": "ATOM ..."}

Important Notes

  • Relaxation is not supported. models_to_relax must be set to NONE.
  • Multimer preset is not supported. Choose a monomer preset.
  • skip_models must contain only integers 1–5 and cannot include all five; otherwise, the node raises an error.
  • In TEST mode, the node automatically disables template search and evaluates only the first model to speed up execution.
  • The input A3M dictionary keys (sequence IDs) must remain consistent through processing; mismatched IDs between derived FASTA and features will cause validation errors.

Troubleshooting

  • Error: Expected models indices in skip_models to be in range [1, 5]. Resolution: Provide a comma-separated list using only 1–5 (e.g., 2,4) and ensure no spaces or invalid characters.
  • Error: Cannot skip all 5 models. Resolution: Remove at least one index from skip_models.
  • Error: Relaxation is not supported yet. Resolution: Set models_to_relax to NONE.
  • Error: Multimer model is not supported yet. Resolution: Choose monomer, monomer_ptm, or monomer_casp14.
  • Error: Expected FASTA IDs to match features IDs. Resolution: Ensure the a3m input is a dict keyed by the correct sequence IDs and that each value is valid A3M content for that sequence.

Example Pipelines

Example
Example