AiAReSeg: Catheter Detection and Segmentation in Interventional Ultrasound using Transformers (2309.14492v1)
Abstract: To date, endovascular surgeries are performed using the golden standard of Fluoroscopy, which uses ionising radiation to visualise catheters and vasculature. Prolonged Fluoroscopic exposure is harmful for the patient and the clinician, and may lead to severe post-operative sequlae such as the development of cancer. Meanwhile, the use of interventional Ultrasound has gained popularity, due to its well-known benefits of small spatial footprint, fast data acquisition, and higher tissue contrast images. However, ultrasound images are hard to interpret, and it is difficult to localise vessels, catheters, and guidewires within them. This work proposes a solution using an adaptation of a state-of-the-art machine learning transformer architecture to detect and segment catheters in axial interventional Ultrasound image sequences. The network architecture was inspired by the Attention in Attention mechanism, temporal tracking networks, and introduced a novel 3D segmentation head that performs 3D deconvolution across time. In order to facilitate training of such deep learning networks, we introduce a new data synthesis pipeline that used physics-based catheter insertion simulations, along with a convolutional ray-casting ultrasound simulator to produce synthetic ultrasound images of endovascular interventions. The proposed method is validated on a hold-out validation dataset, thus demonstrated robustness to ultrasound noise and a wide range of scanning angles. It was also tested on data collected from silicon-based aorta phantoms, thus demonstrated its potential for translation from sim-to-real. This work represents a significant step towards safer and more efficient endovascular surgery using interventional ultrasound.
- Kaplans cardiac anesthesia: For cardiac and noncardiac surgery, 2017.
- X-ray to mr: the progress of flexible instruments for endovascular navigation. Progress in Biomedical Engineering, 3(3):032004, 2021.
- Clinical practice guidelines for endovascular abdominal aortic aneurysm repair: written by the standards of practice committee for the society of interventional radiology and endorsed by the cardiovascular and interventional radiological society of europe and the canadian interventional radiology association. Journal of Vascular and Interventional Radiology, 21(11):1632–1655, 2010.
- Epidemiology and contemporary management of abdominal aortic aneurysms. Abdominal Radiology, 43(5):1032–1043, 2018.
- Thomas L Szabo. Diagnostic ultrasound imaging: inside out. Academic press, 2004.
- Routine use of intravascular ultrasound for endovascular aneurysm repair: angiography is not necessary. European journal of vascular and endovascular surgery, 23(6):537–542, 2002.
- First experience using intraoperative contrast-enhanced ultrasound during endovascular aneurysm repair for infrarenal aortic aneurysms. Journal of vascular surgery, 51(5):1103–1110, 2010.
- Ultrasound-guided endovascular treatment for vascular access malfunction: results in 4896 cases. The journal of vascular access, 14(3):225–230, 2013.
- Intracardiac echocardiography-guided, anatomically based radiofrequency ablation of focal atrial fibrillation originating from pulmonary veins. Journal of the American College of Cardiology, 39(12):1964–1972, 2002.
- Ultrasound measurement of aortic diameter in a national screening programme. European Journal of Vascular and Endovascular Surgery, 42(2):195–199, 2011.
- Ultrasound physics and technology: how, why and when. Elsevier Health Sciences, 2011.
- Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, 28, 2015.
- Ross Girshick. Fast r-cnn. In Proceedings of the IEEE international conference on computer vision, pages 1440–1448, 2015.
- Attention is all you need. Advances in neural information processing systems, 30, 2017.
- An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
- End-to-end object detection with transformers. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part I 16, pages 213–229. Springer, 2020.
- End-to-end people detection in crowded scenes. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2325–2333, 2016.
- U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pages 234–241. Springer, 2015.
- nnu-net: a self-configuring method for deep learning-based biomedical image segmentation. Nature methods, 18(2):203–211, 2021.
- Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 6881–6890, 2021.
- Aiatrack: Attention in attention for transformer visual tracking. In European Conference on Computer Vision, pages 146–164. Springer, 2022.
- Comparison of real-time ultrasound simulation models using abdominal ct images. In 12th international symposium on medical information processing and analysis, volume 10160, pages 55–63. SPIE, 2017.
- Cactuss: Common anatomical ct-us space for us examinations. In Medical Image Computing and Computer Assisted Intervention–MICCAI 2022: 25th International Conference, Singapore, September 18–22, 2022, Proceedings, Part III, pages 492–501. Springer, 2022.
- Patient-specific 3d ultrasound simulation based on convolutional ray-tracing and appearance optimization. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part II 18, pages 510–518. Springer, 2015.
- Jørgen Arendt Jensen. A new approach to calculating spatial impulse responses. In 1997 IEEE Ultrasonics Symposium Proceedings. An International Symposium (Cat. No. 97CH36118), volume 2, pages 1755–1759. IEEE, 1997.
- Nonlinear ultrasound simulation in an axisymmetric coordinate system using a k-space pseudospectral method. The Journal of the Acoustical Society of America, 148(4):2288–2300, 2020.
- Cathsim: An open-source simulator for endovascular intervention, 2023.
- Mujoco: A physics engine for model-based control. In 2012 IEEE/RSJ international conference on intelligent robots and systems, pages 5026–5033. IEEE, 2012.
- Unet++: A nested u-net architecture for medical image segmentation. In Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: 4th International Workshop, DLMIA 2018, and 8th International Workshop, ML-CDS 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 20, 2018, Proceedings 4, pages 3–11. Springer, 2018.
- Khalid Mammou. V-hacd: Volume hierarchical approximate convex decomposition. https://github.com/kmammou/v-hacd, 2014.
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
- Lasot: A high-quality benchmark for large-scale single object tracking. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5374–5383, 2019.
- Microsoft coco: Common objects in context. In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, pages 740–755. Springer, 2014.
- Detectron2. https://github.com/facebookresearch/detectron2, 2019.
- Monai: An open-source framework for deep learning in healthcare. arXiv preprint arXiv:2211.02701, 2022.