Alphafold¶

Runs AlphaFold protein structure prediction from MSA inputs in A3M format. It converts A3M to Stockholm, prepares features, and performs model inference to produce ranked PDB structures per input sequence. Supports MOCK, PROD, and TEST modes; currently, multimer models and relaxation are not supported.

Usage¶

Use this node after obtaining multiple sequence alignments (MSAs). A typical workflow is: provide sequences -> run MSA search -> feed the resulting A3M dict into Alphafold. Choose the model preset and whether to search templates (monomer-only). In TEST mode, it speeds up runs by only evaluating the first model and disabling template search. The output is a dictionary of ranked PDB strings keyed by combined sequence ID and rank label.

Inputs¶

Field	Required	Type	Description	Example
a3m	True	A3M	Dictionary mapping sequence IDs to A3M MSA contents. Keys are sequence IDs; values are A3M strings for each sequence.	{"seq1": ">seq1\nMKT...\n>seq1_homolog1\nM-T...\n", "seq2": ">seq2\nAGV..."}
search_templates	True	BOOLEAN	Whether to search structural templates before prediction. Ignored in TEST mode (forced off).	false
model_preset	True	STRING (enum: monomer \\| monomer_ptm \\| monomer_casp14 \\| multimer)	AlphaFold model variant to use. Multimer is currently not supported.	monomer
models_to_relax	True	STRING (enum: NONE \\| BEST \\| ALL)	Selects whether to run relaxation. Currently not supported; must be NONE.	NONE
enable_gpu_relax	True	BOOLEAN	If relaxation were enabled, choose GPU vs CPU. Currently not used since relaxation is unsupported.	true
skip_models	True	STRING	Comma-separated model indices (1–5) to skip during inference (e.g., "2,4"). Cannot include all five. Validated at runtime.	2,4
seed	True	INT	Base random seed. If multiple sequences are present, each subsequent sample uses seed + index.	42
mode	True	STRING (enum: MOCK \\| PROD \\| TEST)	Execution mode: MOCK uses bundled mock results, PROD runs the actual services, TEST uses quick settings (first model only, no templates) for short sequences.	PROD

Outputs¶

Field	Type	Description	Example
folding.pdb	PDB	Dictionary of ranked PDB predictions per input sequence. Keys are combined as {sequence_id}_{rank_label}; values are PDB contents.	{"seq1_ranked_0.pdb": "ATOM ...", "seq1_ranked_1.pdb": "ATOM ..."}

Important Notes¶

Relaxation is not supported. models_to_relax must be set to NONE.
Multimer preset is not supported. Choose a monomer preset.
skip_models must contain only integers 1–5 and cannot include all five; otherwise, the node raises an error.
In TEST mode, the node automatically disables template search and evaluates only the first model to speed up execution.
The input A3M dictionary keys (sequence IDs) must remain consistent through processing; mismatched IDs between derived FASTA and features will cause validation errors.

Troubleshooting¶

Error: Expected models indices in skip_models to be in range [1, 5]. Resolution: Provide a comma-separated list using only 1–5 (e.g., 2,4) and ensure no spaces or invalid characters.
Error: Cannot skip all 5 models. Resolution: Remove at least one index from skip_models.
Error: Relaxation is not supported yet. Resolution: Set models_to_relax to NONE.
Error: Multimer model is not supported yet. Resolution: Choose monomer, monomer_ptm, or monomer_casp14.
Error: Expected FASTA IDs to match features IDs. Resolution: Ensure the a3m input is a dict keyed by the correct sequence IDs and that each value is valid A3M content for that sequence.