China Pharmaceutical University

Help Documentation

Complete guide for using the CpuPDB platform

Last updated: June 2025

Overview

The CpuPDB platform provides distinct search modalities across multiple pages: a Start Page for standard queries, an Explore Page for family relationship analysis, a Dynamics Page for molecular dynamics data, and a Detail Page for comprehensive interaction analysis.

CpuPDB Overview

1. Start Page

1.1 Search Based on PDB ID, Ligand ID, or Ligand SMILES

Users can perform single-parameter or combined searches using the above identifiers.

Search Interface

Workflow:

  1. Select "PDB ID" as the search type.
  2. Enter a PDB ID (e.g., 11GS).
  3. Click "Search" or press Enter.
  4. Refine results using secondary filters (e.g., input ligand ID: GSH).
  5. Search Results:
    • PDB ID: Links to the detailed protein-ligand interaction interface (Detail Page).
    • DOI/UniProt ID/PubMed ID: Hyperlinks to their respective official databases.
    • restypeLig: Ligand identifier (HET code).
    • pocket_type:
      • LVPocket_pocket: Pockets predicted by LvPocket.
      • detected_pocket: Pockets calculated by pykvFinder based on ligand positions.

1.2 Protein Sequence-Based Search

Users can submit protein sequences for homology-based retrieval (results generated via BLAST alignment).

Protein Sequence Search

Workflow:

  1. Select "Protein Sequence" mode.
  2. Input a protein sequence (FASTA format).
  3. Click "Search" or press Enter.
  4. Refine results using in-result filters.

Result Metrics:

  • PDB ID Length: Full sequence length of the matched protein.
  • Aligned Length: Overlapping region between the query and target sequences.
  • Identity (Normalized by query): Percentage of identical residues relative to the query length.
  • Identity (Normalized by aligned length): Percentage identity within the aligned region.
  • E-value: Statistical significance of the alignment (lower values indicate higher confidence).
  • Homologs to Hit: PDB IDs with ≥95% sequence similarity (via CD-HIT). Clicking a PDB ID redirects to the Detail Page.

2. Explore Page

This interface visualizes protein family relationships through a topological network. Nodes represent protein-ligand complexes, and proteins from the same family are color-coded.

2.1 PDB ID-Based Search

PDB ID Search in Explore
  1. Select "PDB ID" mode.
  2. Enter a PDB ID (e.g., 11GS)
  3. Download Embedding: Obtain ESM-calculated protein embeddings (FASTA format).
  4. "Detail" Button: Available only for entries with precomputed pocket data.
  5. InterPro ID: Hyperlinks to the InterPro database for family/superfamily details.
  6. Toggle between Family and Superfamily views.

2.2 InterPro ID-Based Search

InterPro ID Search
  1. Select "Protein Sequence" mode.
  2. Enter an InterPro ID (e.g., IPR000123).
  3. Non-matching nodes turn gray; matching nodes turn blue for emphasis.

2.3 Protein Sequence-Based Search

Protein Sequence in Explore
  1. Select "Protein Sequence" mode.
  2. Input a protein sequence.
  3. A Query Results table appears.
  4. Clicking a PDB ID displays the corresponding protein node in the network.

3. Dynamics Page

Dynamics Data Search

Workflow:

  1. Select "DynamicsData" as the search type from the Start Page.
  2. Enter a PDB ID (e.g., 12AS).
  3. Click "Search" or press Enter.
  4. The search result links to dynamics data analysis page.

3.1 3D Structure Visualization

3D Structure
  • Visualization Panel: 3D Structure of Molecular Dynamics pocket predicted by mdpocket model.
  • Isovalue adjustment tool:
Isovalue Tool

You can play with the Isovalue slider to get more or less conserved cavities during the MD trajectory. The unit of this isovalue can be expressed as number of Voronoi Vertices (alpha sphere centers) in a 8ų cube around each grid point per snapshot.

Download Structure Files

Our website offers three types of files for download:

  • Reference.pdb: The centroid frame obtained through clustering analysis.
  • Density Map (mdpocket_dens.dx): Output grid file containing frequency measures.
  • Frequency Isosurface (mdpocket_freq_iso_0_5.pdb): Contains all grid points having 3 or more Voronoi Vertices.

3.2 Dynamics Data Analysis

Dynamics Analysis

We provide four line charts to show the distance of ligand to proteins, rmsd, SASA, and energy fluctuations in MD.

  • Distance: The average distance between ligand and protein.
  • RMSD: Root Mean Square Deviation
  • SASA: Solvent Accessible Surface Area
  • Energy: ETOT (Total Energy), EKTOT (Total Kinetic Energy), EPTOT (Total Potential Energy), EKCMT (Kinetic Energy of Center of Mass Translation)

4. Protein-Ligand Interaction Page (Detail)

Detail Page

4.1 Visualization Panel

  • 3D Structure: Interactive display of the protein-ligand complex.
  • Controls:
    • Rotate: Left-click and drag.
    • Zoom: Mouse scroll wheel.
  • Naming Convention: PDBID-Ligand-Chain-Pocketsite

4.2 Interaction Control Panel

Display Customization

  • Binding Pocket Location (CHAIN)
  • Ligand Conformation (Ligand)
  • Interaction Sites (Site)

Ligand Details

Clicking Ligand displays its molecular properties.

Ligand Details

Data Download

  • LVPocketDownload: Download predicted pockets (if available).
  • Download: Export selected protein/ligand/site combinations (PDB/mol2 formats).

4.3 Interaction Analysis

Quantitative characterization of non-covalent interactions (calculated via PLIP).

Take the water bridge, for example:

Water Bridge Example
Parameter Description
pdbid Unique identifier in the Protein Data Bank (PDB)
resnr Residue Number
restype Residue Type
reschain Protein chain hosting the residue
resnrLig Ligand residue number (positional identifier)
restypeLig Ligand Residue Type
reschainLig Ligand-Associated Chain. Chain where the ligand is bound
distAW Atom-Water Distance. Spatial separation between a protein atom and bridging water
distDW Donor-Water Distance. Length between hydrogen bond donor and water molecule
donAngle Donor-Acceptor-Water Angle. Geometric angle reflecting hydrogen bond geometry
waterAngle Water-Mediated Angle. Spatial orientation of water relative to interacting atoms
protisdon Protein as Donor. Boolean flag indicating if the protein atom donates a hydrogen
donorIdx Donor Atom Index
donortype Donor Atom Type
acceptorIdx Acceptor Atom Index
acceptortype Acceptor Atom Type
waterIdx Water Molecule Index
ligcoo Ligand Atom Coordinates. 3D spatial position (X, Y, Z in Å) of the ligand atom
protoo Protein Atom Coordinates. Spatial location of the interacting protein atom
watercoo Water Coordinates. Position of the bridging water molecule