Data-driven probabilistic definition of the low energy conformational states of protein residues

Jose Gavalda-Garcia, David Bickel, Joel Roca-Martinez, Daniele Raimondi, Gabriele Orlando, Wim Vranken

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)
49 Downloads (Pure)

Abstract

Protein dynamics and related conformational changes are essential for their function but difficult to characterise and interpret. Amino acids in a protein behave according to their local energy landscape, which is determined by their local structural context and environmental conditions. The lowest energy state for a given residue can correspond to sharply defined conformations, e.g. in a stable helix, or can cover a wide range of conformations, e.g. in intrinsically disordered regions. A good definition of such low energy states is therefore important to describe the behaviour of a residue and how it changes with its environment. We propose a data-driven probabilistic definition of six low energy conformational states typically accessible for amino acid residues in proteins. This definition is based on solution NMR information of 1322 proteins through a combined analysis of structure ensembles with interpreted chemical shifts. We further introduce a conformational state variability parameter that captures, based on an ensemble of protein structures from molecular dynamics or other methods, how often a residue moves between these conformational states. The approach enables a different perspective on the local conformational behaviour of proteins that is complementary to their static interpretation from single structure models.

Original languageEnglish
Article numberlqae082
Number of pages13
JournalNAR genomics and bioinformatics
Volume6
Issue number3
DOIs
Publication statusPublished - Sept 2024

Bibliographical note

Funding Information:
European Union\u2019s Horizon 2020 research and innovation program under the Marie Sk\u0142odowska-Curie [813239 to J.R.-M., J.G.-G.]; Research Foundation Flanders (FWO) [G.032816N to G.O., G.028821N to D.B.]; Research Foundation Flanders (FWO) International Research Infrastructure [I000323N to W.V.]; COST Action ML4NGP, CA21160, supported by COST (European Cooperation in Science and Technology); the resources and services used in this work were provided by the VSC (Flemish Supercomputer Center), funded by the Research Foundation\u2014Flanders (FWO) and the Flemish Government.

Funding Information:
We thank Adrian Diaz for the invaluable help in the distribution of this software. Author contributions: W.V., D.R. and G.O. conceptualised the study. G.O. developed the initial methodology. W.V., D.R., G.O. and D.B. provided supervision. W.V. provided the NMR data and directed the project. J.G.-G. and D.B. developed, implemented and validated the method. J.R.-M. performed the MD simulations. All authors contributed to the writing of the manuscript. European Union\u2019s Horizon 2020 research and innovation program under the Marie Sk\u0142odowska-Curie [813239 to J.R.-M., J.G.-G.]; Research Foundation Flanders (FWO) [G.032816N to G.O., G.028821N to D.B.]; Research Foundation Flanders (FWO) International Research Infrastructure [I000323N to W.V.]; COST Action ML4NGP, CA21160, supported by COST (European Cooperation in Science and Technology); the resources and services used in this work were provided by the VSC (Flemish Supercomputer Center), funded by the Research Foundation\u2014Flanders (FWO) and the Flemish Government.

Publisher Copyright:
© The Author(s) 2024. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Fingerprint

Dive into the research topics of 'Data-driven probabilistic definition of the low energy conformational states of protein residues'. Together they form a unique fingerprint.

Cite this