Skip to content

Evaluate Unconditional Initial Guess

Evaluates generated protein structures against provided foldings while incorporating an initial guess score table to prioritize candidates. It selects the best folding per protein and produces a CSV with key evaluation metrics (e.g., pst, rmsd, pae, plddt). Supports MOCK, PROD, and TEST modes for flexibility and quick iteration.
Preview

Usage

Use this node after you have: (1) a set of generated protein structures, (2) corresponding predicted foldings, and (3) the score table from an initial guess prediction step. It compares foldings to designs, leverages the initial guess scores to assess prediction quality, selects the best folding per design, and outputs both the best PDBs and a CSV of evaluation scores. In quick pipelines, use TEST mode to process only the first structure; in production runs, use PROD.

Inputs

FieldRequiredTypeDescriptionExample
generation_pdbTruePDBA mapping of design names to PDB text for the generated proteins to evaluate.{'design_A': 'ATOM ...\nEND\n', 'design_B': 'ATOM ...\nEND\n'}
folding_pdbTruePDBA mapping of folding names to PDB text for candidate foldings to evaluate against the generated proteins.{'design_A_ranked_0': 'ATOM ...\nEND\n', 'design_A_ranked_1': 'ATOM ...\nEND\n'}
initial_guess_scoreTrueSTRINGScore table from an initial guess prediction step. This should be the content of score.sc returned from an Alphafold Initial Guess process. Used to evaluate the quality of initial guess predictions.description pst rmsd pae plddt model_1 0.75 1.9 2.1 85.3 model_2 0.72 2.2 2.4 83.9
modeTrue['MOCK', 'PROD', 'TEST']Execution mode. MOCK returns predefined example results, PROD performs the full evaluation, and TEST limits inputs for faster checks.PROD

Outputs

FieldTypeDescriptionExample
best_folding.pdbPDBMapping from protein name to the selected best folding PDB content.{'design_A': 'ATOM ...\nEND\n', 'design_B': 'ATOM ...\nEND\n'}
score.csvCSVCSV table of evaluation metrics for the assessed foldings (e.g., pst, rmsd, pae, plddt).name,pst,rmsd,pae,plddt design_A,0.76,1.8,2.0,86.1 design_B,0.70,2.3,2.5,82.4

Important Notes

  • Initial guess score must be provided as a text string (the contents of score.sc), not a file path.
  • In TEST mode, only the first structure from generation_pdb and folding_pdb is processed to speed up iteration.
  • In MOCK mode, the node returns deterministic example outputs useful for UI/testing; do not use for scientific conclusions.
  • Runtime scales with the number of input designs; large batches may take significantly longer.
  • Ensure that generation and folding inputs are correctly keyed and correspond to the same designs for meaningful evaluation.

Troubleshooting

  • If score.csv is empty or missing rows, verify that generation_pdb and folding_pdb contain entries and that initial_guess_score is provided.
  • If results look incorrect, confirm that the initial_guess_score string is the actual score.sc content and is not truncated or formatted incorrectly.
  • When TEST mode seems to ignore many inputs, this is expected: only the first item of each input is evaluated for speed.
  • If processing takes too long, reduce the number of designs or switch to TEST mode to validate setup before full runs.
  • If you see mismatches in names between best_folding.pdb and score.csv, ensure consistent naming in your input mappings so designs and foldings align.