Collagen is a family of large, fibrous biomacromolecules common in animals, distinguished by unique molecular, structural, and functional properties. Despite the relatively low complexity of their sequences and the repetitive conformation of the triple helix, which is the defining feature of this family, unraveling sequence–stability and structure–function relationships in this group of proteins remains a challenging task. Considering the importance of the structural aspects in collagen chain recognition and selection, we reviewed our current knowledge of the heterotrimeric structures of non-collagenous (NC) regions that lack the triple helix sequence motif, Gly-X-Y, and are crucial for the correct folding of the functional states of these proteins. This study was conducted by simultaneously surveying the current literature, mining the structural database, and making predictions of the three-dimensional structure of these domains using highly reliable approaches based on machine learning techniques, such as AlphaFold. The combination of experimental structural data and predictive analyses offers some interesting clues about the structural features of heterotrimers formed by collagen NC regions. Structural studies carried out in the last decade show that for fibrillar collagens (types I, V, XI, and mixed V/XI), key factors include the formation of specific disulfide bridges and electrostatic interaction patterns. In the subgroup of collagens whose heterotrimers create supramolecular networks (types IV and VIII), available structural information provides a solid ground for the definition of the basis of the molecular and supramolecular organization. Very recent AlphaFold predictions and structural analyses of type VI collagen offer strong evidence of the specific domains in the NC region of the protein that are involved in chain selection and their staggering. Insightful crystallographic studies have also revealed some fundamental elements of the chain selection process in type IX collagen. Collectively, the data reported here indicate that, although some aspects (particularly the quantification of the relative contribution of the NC and triple helix regions to correct collagen folding) are yet to be fully understood, the available structural information provides a solid foundation for future studies aimed at precisely defining sequence–structure–function relationships in collagens.

Determinants of Chain Selection and Staggering in Heterotrimeric Collagens: A Comprehensive Review of the Structural Data

Vitagliano L.;Doti N.;Balasco N.
2025

Abstract

Collagen is a family of large, fibrous biomacromolecules common in animals, distinguished by unique molecular, structural, and functional properties. Despite the relatively low complexity of their sequences and the repetitive conformation of the triple helix, which is the defining feature of this family, unraveling sequence–stability and structure–function relationships in this group of proteins remains a challenging task. Considering the importance of the structural aspects in collagen chain recognition and selection, we reviewed our current knowledge of the heterotrimeric structures of non-collagenous (NC) regions that lack the triple helix sequence motif, Gly-X-Y, and are crucial for the correct folding of the functional states of these proteins. This study was conducted by simultaneously surveying the current literature, mining the structural database, and making predictions of the three-dimensional structure of these domains using highly reliable approaches based on machine learning techniques, such as AlphaFold. The combination of experimental structural data and predictive analyses offers some interesting clues about the structural features of heterotrimers formed by collagen NC regions. Structural studies carried out in the last decade show that for fibrillar collagens (types I, V, XI, and mixed V/XI), key factors include the formation of specific disulfide bridges and electrostatic interaction patterns. In the subgroup of collagens whose heterotrimers create supramolecular networks (types IV and VIII), available structural information provides a solid ground for the definition of the basis of the molecular and supramolecular organization. Very recent AlphaFold predictions and structural analyses of type VI collagen offer strong evidence of the specific domains in the NC region of the protein that are involved in chain selection and their staggering. Insightful crystallographic studies have also revealed some fundamental elements of the chain selection process in type IX collagen. Collectively, the data reported here indicate that, although some aspects (particularly the quantification of the relative contribution of the NC and triple helix regions to correct collagen folding) are yet to be fully understood, the available structural information provides a solid foundation for future studies aimed at precisely defining sequence–structure–function relationships in collagens.
2025
Istituto di Biologia e Patologia Molecolari - IBPM
Istituto di Biostrutture e Bioimmagini - IBB - Sede Napoli Via Pietro Castellino 111
AlphaFold
chain recognition
collagen heterotrimers
collagen staggering
non-collagenous regions
protein folding
sequence–structure–function relationships
structural biology
File in questo prodotto:
File Dimensione Formato  
ijms-26-10134-v2.pdf

accesso aperto

Descrizione: main text
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 27.39 MB
Formato Adobe PDF
27.39 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/559698
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact