Learning to Classify Contaminated Blood Tests: A Neural Approach

Jason V Mancuso, Peter Short, Edmunds Reineks, Elena Manilich

Feb 12, 2018 (modified: Jun 04, 2018) ICLR 2018 Workshop Submission readers: everyone Show Bibtex
  • Abstract: Automated laboratories analyze thousands of blood tests every day. Some samples can become contaminated by improper collection, producing incorrect results. Identification of such samples is critical and requires a labor intensive expert review. In our previous work, we demonstrated a useful machine learning approach to automated detection, but a severe class imbalance in the dataset led to unstable results. Using a modified data augmentation strategy and an attentional neural architecture, we show significant improvement over previously published methods in all binary classification metrics, with an absolute F1-score improvement of 13%. This work represents the state-of-the-art in automated contamination detection and demonstrates that a machine learning pipeline based on our neural architecture can perform on par with human experts.
  • Keywords: attention, bioinformatics, pathology, laboratory medicine, data augmentation, generative model, bayesian mixture model, deep learning
  • TL;DR: Expert-level identification of contaminated blood tests using attentional neural network