Dealing with Label Scarcity in Computational Pathology: A Use Case in Prostate Cancer Classification

Koen Dercksen; Wouter Bulten; Geert Litjens

Dealing with Label Scarcity in Computational Pathology: A Use Case in Prostate Cancer Classification

Koen Dercksen, Wouter Bulten, Geert Litjens

17 Apr 2019 (modified: 05 May 2023)MIDL Abstract 2019Readers: Everyone

Keywords: computational pathology, prostate cancer, unsupervised learning, deep learning

TL;DR: Comparison of unsupervised, semi-supervised and supervised methods for prostate cancer classification with label scarcity.

Abstract: Large amounts of unlabelled data are commonplace for many applications in computational pathology, whereas labelled data is often expensive, both in time and cost, to acquire. We investigate the performance of unsupervised and supervised deep learning methods when few labelled data are available. Three methods are compared: clustering autoencoder latent vectors (unsupervised), a single layer classifier combined with a pre-trained autoencoder (semi-supervised), and a supervised CNN. We apply these methods on hematoxylin and eosin (H&E) stained prostatectomy images to classify tumour versus non-tumour tissue. Results show that semi-/unsupervised methods have an advantage over supervised learning when few labels are available. Additionally, we show that incorporating immunohistochemistry (IHC) stained data provides an increase in performance over only using H&E.

Code Of Conduct: I have read and accept the code of conduct.

3 Replies

Loading