Papers
Topics
Authors
Recent
2000 character limit reached

Going Deeper through the Gleason Scoring Scale: An Automatic end-to-end System for Histology Prostate Grading and Cribriform Pattern Detection (2105.10490v1)

Published 21 May 2021 in eess.IV and cs.CV

Abstract: The Gleason scoring system is the primary diagnostic and prognostic tool for prostate cancer. In recent years, with the development of digitisation devices, the use of computer vision techniques for the analysis of biopsies has increased. However, to the best of the authors' knowledge, the development of algorithms to automatically detect individual cribriform patterns belonging to Gleason grade 4 has not yet been studied in the literature. The objective of the work presented in this paper is to develop a deep-learning-based system able to support pathologists in the daily analysis of prostate biopsies. The methodological core of this work is a patch-wise predictive model based on convolutional neural networks able to determine the presence of cancerous patterns. In particular, we train from scratch a simple self-design architecture. The cribriform pattern is detected by retraining the set of filters of the last convolutional layer in the network. From the reconstructed prediction map, we compute the percentage of each Gleason grade in the tissue to feed a multi-layer perceptron which provides a biopsy-level score.mIn our SICAPv2 database, composed of 182 annotated whole slide images, we obtained a Cohen's quadratic kappa of 0.77 in the test set for the patch-level Gleason grading with the proposed architecture trained from scratch. Our results outperform previous ones reported in the literature. Furthermore, this model reaches the level of fine-tuned state-of-the-art architectures in a patient-based four groups cross validation. In the cribriform pattern detection task, we obtained an area under ROC curve of 0.82. Regarding the biopsy Gleason scoring, we achieved a quadratic Cohen's Kappa of 0.81 in the test subset. Shallow CNN architectures trained from scratch outperform current state-of-the-art methods for Gleason grades classification.

Citations (100)

Summary

  • The paper introduces a novel CNN architecture (FSConv) for patch-level Gleason grading, achieving a quadratic Cohen’s kappa of 0.77.
  • It fine-tunes the FSConv model for cribriform pattern detection in Gleason grade 4 patches, yielding an ROC AUC of 0.82.
  • The system integrates patch-level outputs into whole slide assessments using an MLP, achieving a biopsy scoring kappa of 0.81.

Automatic End-to-End System for Prostate Histology Grading

Introduction

The paper "Going Deeper through the Gleason Scoring Scale: An Automatic end-to-end System for Histology Prostate Grading and Cribriform Pattern Detection" presents an innovative approach to enhance the diagnosis and analysis of prostate cancer—specifically through histology biopsies—with an automatic system. Prostate cancer remains one of the most prevalent malignancies among men worldwide. Accurate grading and scoring are critical for prognosis and subsequent treatment decisions, predominantly relying on Gleason grading. However, the manual analysis of histology samples is labor-intensive, subjective, and prone to variability, imposing significant workloads on pathologists.

Technological advancements in digitization and computer vision now allow for automating some aspects of histology analysis. This paper aims to develop and validate a comprehensive end-to-end system leveraging deep learning techniques to bolster pathologists' analyses, focusing not just on Gleason grading but also on detecting specific patterns within Gleason grade 4—such as the prognostically significant cribriform pattern. Figure 1

Figure 1

Figure 1

Figure 1

Figure 1

Figure 1

Figure 1

Figure 1

Figure 1: Patches of H&E histology samples presenting different Gleason patterns.

Methods

Patch-Level Gleason Grading

The core of the proposed system is a convolutional neural network (CNN) designed to predict Gleason grades at the patch level. The authors introduce a self-designed CNN architecture named FSConv, characterized by three simple convolutional layers and employing max-pooling for dimensional reduction (Figure 2). The system adopts global-max pooling as a top model to mitigate issues related to the variability of tissue pattern location and size within histology patches, thus reducing the model's complexity and minimizing overfitting risks. Figure 2

Figure 2: Flowchart in which the different blocks of our system are presented.

Cribriform Pattern Detection

The detection of cribriform patterns in Gleason grade 4 patches was addressed by fine-tuning the FSConv model initially trained for Gleason grade prediction. Focusing on the complexity of this task and the limited availability of cribriform samples, the fine-tuning approach allowed for optimizing the system without requiring extensive computational resources or adjusting a vast array of parameters.

Whole Slide Image Gleason Scoring

The system reconstructs biopsy-level prediction maps based on patch-level outputs through bi-linear interpolation, which serves to compute the percentage of each Gleason grade within the tissue. A multi-layer perceptron (MLP) is developed to predict the biopsy-level Gleason score, factoring in both the proportion and the severity of the grades present. Figure 3

Figure 3: Proposed Multi-Layer Perceptron (MLP) for the whole slide image Gleason scoring.

Results

The system was rigorously validated on the SICAPv2 database, composed of 182 whole slide images. The FSConv network achieved a strong agreement with pathologist annotations, with a quadratic Cohen’s kappa of 0.77 for patch-level grading, surpassing previous benchmarks. Additionally, the cribriform pattern detection task yielded an ROC curve area of 0.82, affirming the reliability of the model in medical contexts. Figure 4

Figure 4

Figure 4: ROC curves obtained for cribriform pattern detection in samples with Gleason grade 4.

Moreover, the biopsy Gleason scoring achieved a quadratic Cohen’s kappa of 0.81, showcasing the MLP model’s fidelity in simulating pathologists' decision-making processes more accurately than previous simpler models.

Discussion

This system represents a significant advancement in automating prostate biopsy grading, enabling more efficient and accurate assessments which could reduce pathologist workload and variability. Importantly, this is the first paper addressing the automatic detection of cribriform patterns—a critical factor in assessing severe prostate cancer cases. Future work aimed at refining the CNN architecture could further integrate low and high-level features for robust pavilion-style training, potentially extending the system’s capability to predict other individual cancerous patterns.

Conclusion

The work detailed in this paper reflects a substantial step forward in prostate histology analysis. The presented end-to-end system exhibits strong potential for practical deployment, promising to support and streamline pathologist workflows. With continual development and enlargement of databases like SICAPv2, combined with increasingly sophisticated deep learning models, the field is poised to significantly enhance diagnostic accuracy and efficiency in prostate and potentially other cancers.

Whiteboard

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.