Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 134 tok/s
Gemini 2.5 Pro 41 tok/s Pro
GPT-5 Medium 30 tok/s Pro
GPT-5 High 23 tok/s Pro
GPT-4o 99 tok/s Pro
Kimi K2 190 tok/s Pro
GPT OSS 120B 425 tok/s Pro
Claude Sonnet 4.5 37 tok/s Pro
2000 character limit reached

Deep Symmetric Adaptation Network for Cross-modality Medical Image Segmentation (2101.06853v1)

Published 18 Jan 2021 in eess.IV and cs.CV

Abstract: Unsupervised domain adaptation (UDA) methods have shown their promising performance in the cross-modality medical image segmentation tasks. These typical methods usually utilize a translation network to transform images from the source domain to target domain or train the pixel-level classifier merely using translated source images and original target images. However, when there exists a large domain shift between source and target domains, we argue that this asymmetric structure could not fully eliminate the domain gap. In this paper, we present a novel deep symmetric architecture of UDA for medical image segmentation, which consists of a segmentation sub-network, and two symmetric source and target domain translation sub-networks. To be specific, based on two translation sub-networks, we introduce a bidirectional alignment scheme via a shared encoder and private decoders to simultaneously align features 1) from source to target domain and 2) from target to source domain, which helps effectively mitigate the discrepancy between domains. Furthermore, for the segmentation sub-network, we train a pixel-level classifier using not only original target images and translated source images, but also original source images and translated target images, which helps sufficiently leverage the semantic information from the images with different styles. Extensive experiments demonstrate that our method has remarkable advantages compared to the state-of-the-art methods in both cross-modality Cardiac and BraTS segmentation tasks.

Citations (67)

Summary

  • The paper presents a novel unsupervised domain adaptation framework for cross-modality medical image segmentation using a symmetric architecture.
  • DSAN employs bidirectional feature alignment and adversarial losses to extract domain-invariant features and enhance semantic mining.
  • Experimental results on Cardiac and BraTS datasets show improvements in Dice scores and reductions in segmentation errors.

Deep Symmetric Adaptation Network for Cross-modality Medical Image Segmentation

This paper introduces a novel approach to unsupervised domain adaptation for medical image segmentation, focusing on cross-modality tasks such as MRI to CT segmentation. The proposed method, termed Deep Symmetric Adaptation Network (DSAN), leverages symmetric architecture to perform effective feature alignment and semantic mining. The key innovation lies in its bidirectional alignment of features between source and target domains and segmentation network training using multiple image styles generated by adversarial networks.

Method Overview

The DSAN framework is characterized by a completely symmetric architecture incorporating shared and domain-specific components. The network is composed of a common encoder shared across domains, two domain-specific private decoders, and a pixel-wise classifier. The shared encoder and private decoders form translation sub-networks for reconstructing images and generating cross-domain images. A pixel-wise classifier and the encoder form the segmentation sub-network aimed at leveraging semantic information from stylized images derived from adversarial training. Figure 1

Figure 1: An overview of the proposed method, highlighting symmetric architecture for cross-domain adaptation in medical image segmentation.

Translation Sub-networks

The translation sub-networks implement adversarial losses to mitigate domain shifts. They employ a bidirectional approach: translating from source to target domain and vice versa, thereby aligning features across domains. The private decoders specialize in domain-specific tasks, ensuring the encoder focuses solely on domain-invariant features. Adversarial loss encourages the generated cross-domain images to be indistinguishable from real images in the target domain.

Segmentation Sub-network

The segmentation sub-network is trained using images generated from both source and target domains, exploiting semantic information across different styles. It employs deep supervision through additional classifiers for lower feature maps. The semantic mining is enhanced by adversarial losses on prediction maps, aligning segmentation outputs from different domains.

Experimental Results

The efficacy of the DSAN method is demonstrated through experiments on two medical datasets: Cardiac dataset and BraTS dataset. The results show significant improvements over state-of-the-art methods in these tasks, with notable enhancements in Dice scores and reduction in ASD and Hausdorff distances. The use of bidirectional feature alignment and comprehensive semantic mining proves advantageous compared to methods utilizing either image translation or feature alignment independently. Figure 2

Figure 2: Cardiac segmentation results comparing different domain adaptation methods on MRI to CT task.

Figure 3

Figure 3: Brain tumor segmentation showcasing results of various methods in the unsupervised domain adaptation task.

Ablation Study

The ablation studies further highlight the importance of each component within the DSAN. Specific experiments verify the impact of bidirectional feature alignment, semantic mining using adversarial loss, and the architecture choices regarding shared versus private network components. The findings indicate that leveraging all styled images in training further enhances segmentation performance.

Discussion

The DSAN framework’s design choices, such as sharing the encoder across segmentation and translation tasks, demonstrate that effective domain-invariant feature extraction can significantly address domain shifts. Variations in network components, like private decoders and shared discriminators, were discussed, emphasizing their contribution to improved performance. Figure 4

Figure 4

Figure 4: Training progress indicating effect of different components and settings on segmentation performance.

Conclusion

The DSAN model represents a robust method for unsupervised domain adaptation in medical image segmentation. Its symmetric architecture effectively aligns cross-modality features and harnesses diverse semantic information, achieving superior segmentation results. Future directions may focus on enhancing the method by integrating self-training or pseudo-labeling strategies to further utilize unlabeled target domain data. Figure 5

Figure 5: Comparison of initialization strategies indicating training performance benefits of pre-trained initialization.

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube