Concordance analysis of CD8 topology with manual histopathology assessment across seven cancer types
Abstract: Abstract
Background
The degree of CD8+ lymphocyte infiltration into the tumor microenvironment, as well as the distribution of lymphocytes within the tumor and surrounding stroma (inflamed, excluded, or desert immunophenotypes), are key determinants for the potential efficacy of immunotherapy. Thus, accurate characterization of the tumor immune microenvironment is essential. However, manual histopathological assessment of CD8 topology is subject to many challenges, including subjectivity and reproducibility. We developed machine learning (ML)-based models for the identification and quantification of CD8+ lymphocytes and CD8-based CD8 topology classifiers across seven cancer types: urothelial carcinoma (UC), head and neck squamous cell carcinoma (HNSCC), non-small cell lung cancer (NSCLC), gastric cancer (GC), colorectal cancer (CRC), pancreatic cancer (PC), and hepatocellular carcinoma (HCC).
Methods
ML algorithms were developed to quantify CD8+ lymphocytes in UC, HNSCC, NSCLC, GC, CRC, PC, and HCC specimens from clinical trials and commercial sources (N=1603) using CD8+ cell detection models trained on digitized whole slide images of CD8 immunohistochemistry (Agilent-Dako clone C8/144b, Agilent). Annotations were provided by the PathAI network of expert pathologists to train algorithms for classifying tissue regions (including parenchyma and stroma) and cell types. To evaluate the performance of the CD8+ cell model, AI-predicted counts were compared to a consensus count from five independent pathologists for representative fields of view (frames) using Pearson correlation. Inter-pathologist agreement was also calculated. Using a distinct cohort of samples from each cancer type (N=1655), a simple, two-parameter CD8 topology classifier was trained using pathologist-provided CD8 topology scores and CD8 model-derived features. Classifier-predicted immunophenotype scores were compared to pathologist-generated scores using unweighted Cohen’s kappa.
Results
ML-based quantitation of CD8+ lymphocytes yielded counts with high concordance to manual pathologist consensus counts. Frames-based validation of CD8+ counts on held-out test sets from each cancer type confirmed this high correlation (table 1). For scoring CD8 topology, the performance of the CD8 topology classifier was comparable to that of individual pathologists (table 2), indicating non-inferiority of our models with pathologist-generated scores. Overall, our models met pre-determined success criteria for all seven cancer types.
Conclusions
ML model-predicted CD8+ cell counts are highly concordant with pathologist-generated counts across seven solid tumor types, and tumor CD8 topologies were predicted with a simple and highly interpretable two-parameter classifier. These data demonstrate the power of AI-powered digital pathology for accurate and reproducible quantitation of CD8+ lymphocytes and automated immunophenotyping in clinical samples, further confirming the potential for AI-based biomarker measurements in immuno-oncology.
Loading