3D Deeply Supervised Network for Automatic Liver Segmentation from CT Volumes (1607.00582v1)

Published 3 Jul 2016 in cs.CV

Abstract: Automatic liver segmentation from CT volumes is a crucial prerequisite yet challenging task for computer-aided hepatic disease diagnosis and treatment. In this paper, we present a novel 3D deeply supervised network (3D DSN) to address this challenging task. The proposed 3D DSN takes advantage of a fully convolutional architecture which performs efficient end-to-end learning and inference. More importantly, we introduce a deep supervision mechanism during the learning process to combat potential optimization difficulties, and thus the model can acquire a much faster convergence rate and more powerful discrimination capability. On top of the high-quality score map produced by the 3D DSN, a conditional random field model is further employed to obtain refined segmentation results. We evaluated our framework on the public MICCAI-SLiver07 dataset. Extensive experiments demonstrated that our method achieves competitive segmentation results to state-of-the-art approaches with a much faster processing speed.

Authors (6)

Qi Dou (163 papers)
Hao Chen (1007 papers)
Yueming Jin (71 papers)
Lequan Yu (89 papers)
Jing Qin (145 papers)
Pheng-Ann Heng (197 papers)

Citations (335)

View on Semantic Scholar

Summary

The paper introduces a novel 3D deeply supervised network that enhances liver segmentation accuracy from CT volumes.
It leverages a fully convolutional 3D architecture with deep supervision to overcome issues like low-intensity contrast and inter-patient shape variations.
Post-processing with a CRF refines boundaries, achieving a VOE of 5.42% and an AvgD of 0.79 mm in approximately 1.5 minutes per subject.

Analyzing the 3D Deeply Supervised Network for Automatic Liver Segmentation from CT Volumes

The paper "3D Deeply Supervised Network for Automatic Liver Segmentation from CT Volumes" presents a significant advancement in the field of medical image analysis by introducing a novel approach for automatic liver segmentation. Liver segmentation from CT volumes is a critical task necessary for developing computer-aided systems for hepatic disease diagnosis and treatment planning. The proposed method addresses challenges such as inter-patient shape variation and low-intensity contrast between the liver and adjacent organs by leveraging innovative deep learning techniques.

Methodological Advancements

The core contribution of this work is the introduction of a 3D deeply supervised network (3D DSN). The 3D DSN distinguishes itself from previous methods through several key innovations:

3D Fully Convolutional Architecture: The network utilizes a fully convolutional architecture specifically designed to handle 3D spatial data. This architecture allows for efficient end-to-end learning and inference, providing significant computational efficiency over traditional patch-based 3D CNN approaches.
Deep Supervision: To enhance learning efficiency and accuracy, the paper implements a deep supervision mechanism. This involves injecting supervision directly into hidden layers, thus addressing the problem of vanishing gradients and improving the convergence rate of the model. Deep supervision aids the network in acquiring robust high-level feature representations essential for accurate liver segmentation.
Conditional Random Field (CRF) for Post-processing: The authors further refine segmentation results using a CRF model. This step is critical for improving the precision of liver boundaries, which is particularly challenging in regions with ambiguous contours.

Experimental Validation

The proposed 3D DSN was rigorously evaluated on the public MICCAI-SLiver07 dataset. The combination of the 3D DSN and CRF post-processing demonstrated notable performance, achieving competitive segmentation accuracy compared to existing state-of-the-art methods. Specifically, the method achieved a Volumetric Overlap Error (VOE) of 5.42% and an Average Symmetric Surface Distance (AvgD) of 0.79 mm on the testing set, which are prominent metrics for assessing liver segmentation performance.

Furthermore, the method's efficiency is evidenced by its processing speed, taking approximately 1.5 minutes per subject. This rapid processing time holds particular significance for clinical applications where real-time or near-real-time results are beneficial.

Implications and Future Directions

The results showcased in this paper underscore the potential of deep learning frameworks in advancing medical image segmentation tasks, specifically in handling 3D volumetric data like CT scans. The integration of deep supervision within a fully convolutional 3D network architecture presents a compelling solution to the challenges of accurate and efficient automatic liver segmentation.

Looking ahead, the presented methodology is generalizable and can be adapted to other medical segmentation tasks involving volumetric datasets. Future work might focus on enhancing shape modeling capabilities to further improve metrics such as maximum symmetric surface distance, which is sensitive to outliers. There is also significant potential in refining the CRF post-processing using advanced graphical models that can be more deeply integrated within the network's architecture.

In conclusion, this paper makes a substantial contribution to medical image analysis, providing both theoretical insights and practical solutions that can be extended to broader applications in patient diagnosis and treatment planning. The introduction of the 3D DSN with deep supervision contributes to ongoing efforts in harnessing deep learning to advance the accuracy and speed of medical image processing.

PDF Markdown