Abstract
To study the ways in which compounds can induce adverse effects, toxicologists have been constructing Adverse Outcome Pathways (AOPs). An AOP can be considered as a pragmatic tool to capture and visualize mechanisms underlying different types of toxicity inflicted by any kind of stressor, and describes the interactions between key entities that lead to the adverse outcome on multiple biological levels of organization.The construction or optimization of an AOP is a labor intensive process, which currently depends on the manual search, collection, reviewing and synthesis of available scientific literature. This process could however be largely facilitated using Natural Language Processing (NLP) to extract information contained in scientific literature in a systematic, objective, and rapid manner that would lead to greater accuracy and reproducibility. This would support researchers to invest their expertise in the substantive assessment of the AOPs by replacing the time spent on evidence gathering by a critical review of the data extracted by NLP. As case examples, we selected two frequent adversities observed in the liver: namely cholestasis and steatosis denoting accumulation of bile and lipid, respectively. We used deep learning language models to recognize entities of interest in text and establish causal relationships between them. We demonstrate how an NLP pipeline combining Named Entity Recognition and a simple rules-based relationship extraction model helps screen compounds related to liver adversities in the literature, but also extract mechanistic information for how such adversities develop, from the molecular to the organismal level. Finally, we provide some perspectives 1 CORRADI et al.opened by the recent progress in Large Language Models and how these could be used in the future.We propose this work brings two main contributions:• A proof-of-concept that NLP can support the extraction of information from text for modern toxicology.• A template open-source model for recognition of toxicological entities and extraction of their relationships.All resources are openly accessible via GitHub (https://github.com/ontox-project/en-tox).
| Original language | English |
|---|---|
| Article number | 1393662 |
| Number of pages | 10 |
| Journal | Frontiers in toxicology |
| Volume | 6 |
| DOIs | |
| Publication status | Published - 27 May 2024 |
Bibliographical note
Funding Information:The authors declare that financial support was received for the research, authorship, and/or publication of this article. This work was performed in the context of the ONTOX project ( https://ontox-project.eu/ ) that has received funding from the European Union\u2019s Horizon 2020 Research and Innovation programme under grant agreement No. 963845. ONTOX is part of the ASPIS project cluster ( https://aspis-cluster.eu/ ).
Publisher Copyright:
Copyright © 2024 Corradi, Luechtefeld, de Haan, Pieters, Freedman, Vanhaecke, Vinken and Teunis.
Keywords
- natural language processing
- toxicology
- adverse outcome pathway
- risk assessment
- machine learning
- open science
Fingerprint
Dive into the research topics of 'The application of natural language processing for the extraction of mechanistic information in toxicology'. Together they form a unique fingerprint.Projects
- 1 Active
-
EUAR61: H2020: Ontology-driven and artificial intelligence-based repeated dose toxicity testing of chemicals for next generation risk assessment
Vinken, M. (Administrative Promotor), Vanhaecke, T. (CoI (Co-Promotor)) & Rogiers, V. (CoI (Co-Promotor))
1/05/21 → 31/10/26
Project: Fundamental
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver