The protein common assembly database (ProtCAD)––a comprehensive structural resource of protein complexes

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Proteins often act through oligomeric interactions with other proteins. X-ray crystallography and cryoelectron microscopy provide detailed information on the structures of biological assemblies, defined as the most likely biologically relevant structures derived from experimental data. In crystal structures, the most relevant assembly may be ambiguously determined, since multiple assemblies observed in the crystal lattice may be plausible. It is estimated that 10–15% of PDB entries may have incorrect or ambiguous assembly annotations. Accurate assemblies are required for understanding functional data and training of deep learning methods for predicting assembly structures. As with any other kind of biological data, replication via multiple independent experiments provides important validation for the determination of biological assembly structures. Here we present the Protein Common Assembly Database (ProtCAD), which presents clusters of protein assembly structures observed in independent structure determinations of homologous proteins in the Protein Data Bank (PDB). ProtCAD is searchable by PDB entry, UniProt identifiers, or Pfam domain designations and provides downloads of coordinate files, PyMol scripts, and publicly available assembly annotations for each cluster of assemblies. About 60% of PDB entries contain assemblies in clusters of at least 2 independent experiments. All clusters and coordinates are available on ProtCAD web site (http://dunbrack2.fccc.edu/protcad).

Original languageEnglish
Pages (from-to)D466-D478
JournalNucleic Acids Research
Volume51
Issue numberD1
DOIs
StatePublished - Jan 6 2023

Keywords

  • Cryoelectron Microscopy
  • Crystallography, X-Ray
  • Databases, Protein
  • Multiprotein Complexes/chemistry
  • Proteins/chemistry

Fingerprint

Dive into the research topics of 'The protein common assembly database (ProtCAD)––a comprehensive structural resource of protein complexes'. Together they form a unique fingerprint.

Cite this