Interpretable Semi-Supervised Classifier for Predicting Cancer Stages

Isel Grau, Dipankar Sengupta, Ann Nowe

Onderzoeksoutput: Chapterpeer review

2 Citaten (Scopus)
81 Downloads (Pure)


Machine learning techniques in medicine have been at the forefront addressing challenges such as diagnosis, prognosis prediction, or precision medicine. In this field, the data is sometimes abundant but comes from different data sources or lack assigned labels. The process of manually labeling this data when conforming to a curated dataset for supervised classification can be costly. Semi-supervised classification offers a wide range of methods for leveraging unlabeled data when learning prediction models. However, these classifiers are commonly deep or ensemble learning structures that often result in black boxes. The requirement of interpretable models for medical settings led us to propose the self-labeling grey-box classifier, which outperforms other semi-supervised classifiers on benchmarking datasets while providing interpretability. In this chapter, we illustrate the applications of the self-labeling grey-box on the omics and clinical datasets from the cancer genome atlas. We show that the self-labeling grey-box is accurate in predicting cancer stages of rare cancers by leveraging the unlabeled instances from more common cancer types. We discuss insights, the features influencing prediction, as well as a global representation of the knowledge through decision trees or rule lists, which can aid clinicians and researchers.
Originele taal-2English
TitelMachine Learning, Big Data, and IoT for Medical Informatics
RedacteurenPardeep Kumar, Yugal Kumar, Mohamed Tawhid
Aantal pagina's19
ISBN van elektronische versie9780128217818
ISBN van geprinte versie9780128217771
StatusPublished - 1 jan 2021


Duik in de onderzoeksthema's van 'Interpretable Semi-Supervised Classifier for Predicting Cancer Stages'. Samen vormen ze een unieke vingerafdruk.

Citeer dit