Abstract
Supervised machine learning (ML) is used extensively in biology and deserves closer scrutiny. The Data Optimization Model Evaluation (DOME) recommendations aim to enhance the validation and reproducibility of ML research by establishing standards for key aspects such as data handling and processing, optimization, evaluation, and model interpretability. The recommendations help to ensure that key details are reported transparently by providing a structured set of questions. Here, we introduce the DOME registry (URL: registry.dome-ml.org), a database that allows scientists to manage and access comprehensive DOME-related information on published ML studies. The registry uses external resources like ORCID, APICURON, and the Data Stewardship Wizard to streamline the annotation process and ensure comprehensive documentation. By assigning unique identifiers and DOME scores to publications, the registry fosters a standardized evaluation of ML methods. Future plans include continuing to grow the registry through community curation, improving the DOME score definition and encouraging publishers to adopt DOME standards, and promoting transparency and reproducibility of ML in the life sciences.
| Original language | English |
|---|---|
| Article number | giae094 |
| Number of pages | 8 |
| Journal | GigaScience |
| Volume | 13 |
| DOIs | |
| Publication status | Published - 2 Jan 2024 |
Bibliographical note
© The Author(s) 2024. Published by Oxford University Press GigaScience.Keywords
- Supervised Machine Learning
- Registries
- Reproducibility of Results
- Databases, Factual
- Humans
Fingerprint
Dive into the research topics of 'DOME Registry: implementing community-wide recommendations for reporting supervised machine learning in biology'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver