Abstract
The Protein Ensemble Database (PED) (URL: https://proteinensemble.org) is the primary resource for depositing structural ensembles of intrinsically disordered proteins. This updated version of PED reflects advancements in the field, denoting a continual expansion with a total of 461 entries and 538 ensembles, including those generated without explicit experimental data through novel machine learning (ML) techniques. With this significant increment in the number of ensembles, a few yet-unprecedented new entries entered the database, including those also determined or refined by electron paramagnetic resonance or circular dichroism data. In addition, PED was enriched with several new features, including a novel deposition service, improved user interface, new database cross-referencing options and integration with the 3D-Beacons network-all representing efforts to improve the FAIRness of the database. Foreseeably, PED will keep growing in size and expanding with new types of ensembles generated by accurate and fast ML-based generative models and coarse-grained simulations. Therefore, among future efforts, priority will be given to further develop the database to be compatible with ensembles modeled at a coarse-grained level.
| Original language | English |
|---|---|
| Article number | gkad947 |
| Pages (from-to) | D536-D544 |
| Number of pages | 9 |
| Journal | Nucleic Acids Research |
| Volume | 52 |
| Issue number | D1 |
| Early online date | Oct 2023 |
| DOIs | |
| Publication status | Published - 5 Jan 2024 |
Bibliographical note
Publisher Copyright:© The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research.