Skip to content

Load PDB

Loads a Protein Data Bank (PDB) structure provided as plain text and exposes it as a structured PDB output for downstream biotech nodes. It stores the content under a user-defined identifier without modifying or validating the text. Use this node to introduce existing PDB data directly into a workflow.
Preview

Usage

Use when you have a PDB structure in text form (e.g., copied from a file or fetched elsewhere) and want to feed it into a biotech workflow. Provide a unique ID for the structure; if you plan to pair it with a FASTA sequence later, ensure the FASTA sequence header ID matches this PDB ID. Connect the output to nodes that consume PDB inputs (e.g., visualization, sequence extraction, chain operations, or batch/combiner nodes).

Inputs

FieldRequiredTypeDescriptionExample
pdb_stringTrueSTRINGThe full PDB file content as a text string. The content is passed through unchanged to downstream nodes.HEADER TEST STRUCTURE\nATOM 1 N MET A 1 11.104 13.207 9.101 1.00 20.00 N\nATOM 2 CA MET A 1 12.560 13.420 9.320 1.00 20.00 C\nEND
pdb_idTrueSTRINGIdentifier assigned to this PDB entry. Must be unique within the workflow if multiple PDBs are used. If pairing with a FASTA record, the FASTA sequence ID should match this value.my_protein

Outputs

FieldTypeDescriptionExample
structure.pdbPDBA dictionary mapping the provided PDB ID to the PDB text content, suitable for use by nodes that accept PDB inputs.{'my_protein': 'ATOM 1 N MET A 1 11.104 13.207 9.101 1.00 20.00 N\\n...\\nEND'}

Important Notes

  • IDs should be unique across the workflow to avoid collisions when combining or batching structures.
  • If you intend to pair this PDB with a FASTA sequence, ensure the FASTA header ID matches the provided pdb_id.
  • The node does not validate or correct PDB formatting; downstream nodes may fail if the content is malformed.
  • Content is treated as plain text; ensure it follows standard PDB conventions (e.g., ATOM/HETATM records, END line) for best compatibility.

Troubleshooting

  • Empty or missing output: Ensure pdb_string is not empty and contains valid PDB text.
  • Downstream node errors about parsing: Verify the PDB content is properly formatted (correct columns, ATOM/HETATM lines, and an END record).
  • ID mismatch when pairing with FASTA: Update either pdb_id here or the FASTA header so they match exactly.
  • Conflicts in combined workflows: If merging multiple PDB inputs later, make sure each pdb_id is unique to avoid overwriting entries.

Example Pipelines

Example
Example