Skip to content

Boltz YAML Combiner

Combines molecular sequences, constraints, templates, and properties into a single Boltz YAML configuration plus auxiliary files. It validates structure and uniqueness of chain IDs, rewrites inline MSA/template contents into file references, and prepares outputs suitable for Boltz prediction and partial diffusion workflows.
Preview

Usage

Use this node after building inputs with Boltz Sequence/Constraint/Template/Property nodes and optionally aggregating them with Boltz List Combiner. Connect the resulting boltz_yaml and boltz_files to Boltz Predict or Boltz Partial Diffusion to run structure generation or refinement. Ideal when assembling multimers, protein–ligand systems, or models with constraints/templates.

Inputs

FieldRequiredTypeDescriptionExample
sequencesTrue*One or more sequence objects describing proteins/DNA/RNA/ligands, typically produced by Boltz Sequence nodes and combined with Boltz List Combiner. Each entry must include chain id(s) and required fields (e.g., sequence for biopolymers; smiles or ccd for ligands).[{'protein': {'id': 'A', 'sequence': 'MAA...', 'msa': 'empty'}}, {'ligand': {'id': 'L', 'smiles': 'CCO'}}]
constraintsFalse*Optional list of constraints (e.g., pocket, bond, contact) from Boltz Constraint Builder, usually passed via Boltz List Combiner.[{'pocket': {'binder': 'L', 'contacts': [['A', 10], ['A', 25]]}}]
templatesFalse*Optional list of template objects with structure content (e.g., PDB). The node will convert inline structure content into file references in the YAML and place the actual content in boltz_files.[{'pdb': '', 'chain_id': 'A'}]
propertiesFalse*Optional list of properties to compute (e.g., affinity), typically from Boltz Property Builder.[{'affinity': {'binder': 'L'}}]

Outputs

FieldTypeDescriptionExample
boltz_yamlBOLTZ_YAMLStructured Boltz YAML data (as an object) containing version, sequences, and optional constraints/templates/properties. MSA/template entries point to filenames present in boltz_files.{'version': 1, 'sequences': [{'protein': {'id': 'A', 'sequence': 'MAA...', 'msa': 'msa_1.a3m'}}, {'ligand': {'id': 'L', 'smiles': 'CCO'}}], 'templates': [{'pdb': 'template_1.pdb'}], 'properties': [{'affinity': {'binder': 'L'}}]}
boltz_filesBOLTZ_FILESAuxiliary files referenced by boltz_yaml (e.g., MSA .a3m files and template .pdb). Keys are filenames; values are file contents.{'msa_1.a3m': '>seq\nMAA...', 'template_1.pdb': 'ATOM ...'}

Important Notes

  • All chain IDs across all sequences must be unique; duplicates cause an error.
  • At least one sequence is required. Proteins/DNA/RNA must include a sequence; ligands must include either smiles or ccd (but not both).
  • MSA handling: 'empty' is allowed and does not create a file. If multiple protein chains share the exact same sequence, a single shared MSA file is created and referenced by all such chains.
  • Template structure content is moved to external files (e.g., template_#.pdb) and the YAML references those filenames.
  • Version is set to 1 automatically if not provided.
  • Outputs are designed to feed directly into Boltz Predict or Boltz Partial Diffusion nodes.

Troubleshooting

  • Duplicate chain ID found: Modify chain_id or multiple_chains in your sequence inputs so that all chain IDs are unique.
  • Missing required sequence fields: Ensure each protein/DNA/RNA entry has a sequence and each ligand has exactly one of smiles or ccd.
  • Invalid ligand specification (both smiles and ccd): Provide only one field for ligand specification.
  • Empty or invalid template content: Verify that the provided structure content (e.g., PDB) is non-empty and correctly passed to the Template Builder.
  • Unexpected input format: Use Boltz List Combiner to aggregate multiple items; this node accepts either a single object or a list, but each item must follow the expected schema from the builder nodes.
  • Downstream errors in prediction: Ensure that MSA filenames and template filenames output in boltz_yaml are present in boltz_files and that constraints/properties reference valid chain IDs.