TY - JOUR
T1 - Scaling up oligogenic diseases research with OLIDA
T2 - the Oligogenic Diseases Database
AU - Nachtegael, Charlotte
AU - Gravel, Barbara
AU - Dillen, Arnau
AU - Smits, Guillaume
AU - Nowé, Ann
AU - Papadimitriou, Sofia
AU - Lenaerts, Tom
N1 - © The Author(s) 2022. Published by Oxford University Press.
PY - 2022/4/12
Y1 - 2022/4/12
N2 - Improving the understanding of the oligogenic nature of diseases requires access to high-quality, well-curated Findable, Accessible, Interoperable, Reusable (FAIR) data. Although first steps were taken with the development of the Digenic Diseases Database, leading to novel computational advancements to assist the field, these were also linked with a number of limitations, for instance, the ad hoc curation protocol and the inclusion of only digenic cases. The OLIgogenic diseases DAtabase (OLIDA) presents a novel, transparent and rigorous curation protocol, introducing a confidence scoring mechanism for the published oligogenic literature. The application of this protocol on the oligogenic literature generated a new repository containing 916 oligogenic variant combinations linked to 159 distinct diseases. Information extracted from the scientific literature is supplemented with current knowledge support obtained from public databases. Each entry is an oligogenic combination linked to a disease, labelled with a confidence score based on the level of genetic and functional evidence that supports its involvement in this disease. These scores allow users to assess the relevance and proof of pathogenicity of each oligogenic combination in the database, constituting markers for reporting improvements on disease-causing oligogenic variant combinations. OLIDA follows the FAIR principles, providing detailed documentation, easy data access through its application programming interface and website, use of unique identifiers and links to existing ontologies.DATABASE URL: https://olida.ibsquare.be.
AB - Improving the understanding of the oligogenic nature of diseases requires access to high-quality, well-curated Findable, Accessible, Interoperable, Reusable (FAIR) data. Although first steps were taken with the development of the Digenic Diseases Database, leading to novel computational advancements to assist the field, these were also linked with a number of limitations, for instance, the ad hoc curation protocol and the inclusion of only digenic cases. The OLIgogenic diseases DAtabase (OLIDA) presents a novel, transparent and rigorous curation protocol, introducing a confidence scoring mechanism for the published oligogenic literature. The application of this protocol on the oligogenic literature generated a new repository containing 916 oligogenic variant combinations linked to 159 distinct diseases. Information extracted from the scientific literature is supplemented with current knowledge support obtained from public databases. Each entry is an oligogenic combination linked to a disease, labelled with a confidence score based on the level of genetic and functional evidence that supports its involvement in this disease. These scores allow users to assess the relevance and proof of pathogenicity of each oligogenic combination in the database, constituting markers for reporting improvements on disease-causing oligogenic variant combinations. OLIDA follows the FAIR principles, providing detailed documentation, easy data access through its application programming interface and website, use of unique identifiers and links to existing ontologies.DATABASE URL: https://olida.ibsquare.be.
KW - Databases, Factual
KW - Software
KW - Vocabulary, Controlled
UR - http://www.scopus.com/inward/record.url?scp=85128089333&partnerID=8YFLogxK
U2 - 10.1093/database/baac023
DO - 10.1093/database/baac023
M3 - Article
C2 - 35411390
VL - 2022
JO - Database : the journal of biological databases and curation
JF - Database : the journal of biological databases and curation
SN - 1758-0463
M1 - baac023
ER -