SIFT-DBT: Self-supervised Initialization and Fine-Tuning for Imbalanced Digital Breast Tomosynthesis Image Classification (2403.13148v1)
Abstract: Digital Breast Tomosynthesis (DBT) is a widely used medical imaging modality for breast cancer screening and diagnosis, offering higher spatial resolution and greater detail through its 3D-like breast volume imaging capability. However, the increased data volume also introduces pronounced data imbalance challenges, where only a small fraction of the volume contains suspicious tissue. This further exacerbates the data imbalance due to the case-level distribution in real-world data and leads to learning a trivial classification model that only predicts the majority class. To address this, we propose a novel method using view-level contrastive Self-supervised Initialization and Fine-Tuning for identifying abnormal DBT images, namely SIFT-DBT. We further introduce a patch-level multi-instance learning method to preserve spatial resolution. The proposed method achieves 92.69% volume-wise AUC on an evaluation of 970 unique studies.
- “Global cancer statistics 2020: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries,” CA: A Cancer Journal for Clinicians, vol. 71, no. 3, pp. 209–249, 2021.
- “Advances in digital breast tomosynthesis,” American Journal of Roentgenology, vol. 208, no. 2, pp. 256–266, 2017.
- “A competition, benchmark, code, and data for using artificial intelligence to detect lesions in digital breast tomosynthesis,” JAMA Network Open, vol. 6, no. 2, pp. e230524–e230524, 2023.
- “Detection of masses and architectural distortions in digital breast tomosynthesis: a publicly available dataset of 5,060 patients and a deep learning model,” arXiv preprint arXiv:2011.07995, 2020.
- “Deep long-tailed learning: A survey,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023.
- “Hierarchical-instance contrastive learning for minority detection on imbalanced medical datasets,” IEEE Transactions on Medical Imaging, 2023.
- “Focal loss for dense object detection,” in Proceedings of the IEEE international conference on computer vision, 2017, pp. 2980–2988.
- “Transformer-based deep neural network for breast cancer classification on digital breast tomosynthesis images,” Radiology: Artificial Intelligence, vol. 5, no. 3, pp. e220159, 2023.
- M. Tardy and D. Mateus, “Trainable summarization to improve breast tomosynthesis classification,” in International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 2021, pp. 140–149.
- “Representation learning with contrastive predictive coding,” arXiv preprint arXiv:1807.03748, 2018.
- “Momentum contrast for unsupervised visual representation learning,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 9729–9738.
- “Medaug: Contrastive learning leveraging patient metadata improves representations for chest x-ray interpretation,” in Machine Learning for Healthcare Conference. PMLR, 2021, pp. 755–769.
- J. Howard and S. Ruder, “Universal language model fine-tuning for text classification,” arXiv preprint arXiv:1801.06146, 2018.
- “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778.
- “Predicting breast cancer by applying deep learning to linked health records and mammograms,” Radiology, vol. 292, no. 2, pp. 331–342, 2019.
- “The cancer imaging archive (tcia): maintaining and operating a public information repository,” Journal of digital imaging, vol. 26, pp. 1045–1057, 2013.
- Yuexi Du (11 papers)
- Regina J. Hooley (1 paper)
- John Lewin (1 paper)
- Nicha C. Dvornek (41 papers)