Abstract
Protein dynamics and related conformational changes are essential for their function but difficult to characterise and interpret. Amino acids in a protein behave according to their local energy landscape, which is determined by their local structural context and environmental conditions. The lowest energy state for a given residue can correspond to sharply defined conformations, e.g. in a stable helix, or can cover a wide range of conformations, e.g. in intrinsically disordered regions. A good definition of such low energy states is therefore important to describe the behaviour of a residue and how it changes with its environment. We propose a data-driven probabilistic definition of six low energy conformational states typically accessible for amino acid residues in proteins. This definition is based on solution NMR information of 1322 proteins through a combined analysis of structure ensembles with interpreted chemical shifts. We further introduce a conformational state variability parameter that captures, based on an ensemble of protein structures from molecular dynamics or other methods, how often a residue moves between these conformational states. The approach enables a different perspective on the local conformational behaviour of proteins that is complementary to their static interpretation from single structure models.
| Original language | English |
|---|---|
| Article number | lqae082 |
| Number of pages | 13 |
| Journal | NAR genomics and bioinformatics |
| Volume | 6 |
| Issue number | 3 |
| DOIs | |
| Publication status | Published - Sept 2024 |
Bibliographical note
Funding Information:European Union\u2019s Horizon 2020 research and innovation program under the Marie Sk\u0142odowska-Curie [813239 to J.R.-M., J.G.-G.]; Research Foundation Flanders (FWO) [G.032816N to G.O., G.028821N to D.B.]; Research Foundation Flanders (FWO) International Research Infrastructure [I000323N to W.V.]; COST Action ML4NGP, CA21160, supported by COST (European Cooperation in Science and Technology); the resources and services used in this work were provided by the VSC (Flemish Supercomputer Center), funded by the Research Foundation\u2014Flanders (FWO) and the Flemish Government.
Funding Information:
We thank Adrian Diaz for the invaluable help in the distribution of this software. Author contributions: W.V., D.R. and G.O. conceptualised the study. G.O. developed the initial methodology. W.V., D.R., G.O. and D.B. provided supervision. W.V. provided the NMR data and directed the project. J.G.-G. and D.B. developed, implemented and validated the method. J.R.-M. performed the MD simulations. All authors contributed to the writing of the manuscript. European Union\u2019s Horizon 2020 research and innovation program under the Marie Sk\u0142odowska-Curie [813239 to J.R.-M., J.G.-G.]; Research Foundation Flanders (FWO) [G.032816N to G.O., G.028821N to D.B.]; Research Foundation Flanders (FWO) International Research Infrastructure [I000323N to W.V.]; COST Action ML4NGP, CA21160, supported by COST (European Cooperation in Science and Technology); the resources and services used in this work were provided by the VSC (Flemish Supercomputer Center), funded by the Research Foundation\u2014Flanders (FWO) and the Flemish Government.
Publisher Copyright:
© The Author(s) 2024. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.