- The paper introduces XTransCT, which uses dual X-ray projections and a Transformer network to achieve 3D CT reconstruction in just 44ms.
- The methodology bypasses traditional 3D convolutions by leveraging a pre-trained ResNet backbone and a voxel space search module within the Transformer architecture.
- Empirical results on diverse datasets show significant improvements in SSIM, PSNR, and Dice metrics, outperforming models like X2CTGAN.
Introduction
The research paper presents XTransCT, a novel framework for ultra-fast volumetric CT reconstruction from only two orthogonal X-ray projections, tailored for image-guided radiation therapy (IGRT). The approach employs a voxel-space-searching Transformer network to improve the efficiency and accuracy of CT image reconstruction, which is crucial in clinical settings where minimizing radiation exposure and maximizing reconstruction speed are paramount.
Methodology
The proposed XTransCT algorithm circumvents the constraints of traditional CT reconstruction by eliminating 3D convolutions, which substantially reduces latency while maintaining strong performance. At the core of the model is the Transformer architecture, originally developed for NLP, repurposed here to map X-ray projection data to a volumetric reconstruction.
Dual X-ray Setup
The study employs a dual X-ray setup, with beams set at 45° and 135° angles to ensure robust data acquisition from sparse projections (Figure 1).
Figure 1: Dual X-ray setup. Two X-rays irradiate the patient, and their real-time fusion with CT scans enables precise patient positioning while accounting for potential misalignments.
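To make the acquisition geometry concrete, below is a minimal sketch of how dual DRR-style projections at the 45° and 135° gantry angles could be simulated from a CT volume. It assumes a simplified parallel-beam geometry and illustrative function names; the paper's actual DRR generator and ray geometry may differ.

```python
import numpy as np
from scipy.ndimage import rotate

def simulate_drr(ct_volume: np.ndarray, gantry_angle_deg: float) -> np.ndarray:
    """Approximate a DRR by rotating the CT volume about its axial (z) axis
    and integrating attenuation along the beam direction.

    ct_volume is assumed to be shaped (z, y, x). This is a simplified
    parallel-beam approximation, not the paper's exact DRR procedure.
    """
    # Rotate the in-plane (y, x) slices so the beam direction aligns with one array axis.
    rotated = rotate(ct_volume, angle=gantry_angle_deg, axes=(1, 2),
                     reshape=False, order=1, mode="constant", cval=ct_volume.min())
    # Sum along the beam axis to approximate line integrals of attenuation.
    projection = rotated.sum(axis=2)
    # Normalize to [0, 1] so both views share a common intensity scale.
    projection -= projection.min()
    return projection / (projection.max() + 1e-8)

ct = np.random.rand(128, 256, 256).astype(np.float32)  # placeholder CT volume
drr_45, drr_135 = simulate_drr(ct, 45.0), simulate_drr(ct, 135.0)
```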
Framework
The framework starts by simulating X-ray images from CT scans using the digitally reconstructed radiograph (DRR) method; these images are fed into a pre-trained ResNet backbone. The extracted features are then processed by a voxel-space search module built on a Transformer, which maps queried 3D voxel coordinates to their corresponding voxel intensities and thereby recreates the full 3D CT volume (Figure 2).
Figure 2: Framework of this study. We employ the DRR method to generate X-ray images simulated from CT scans.
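The following is a minimal sketch of the feature-extraction stage described above: each DRR view passes through a pre-trained ResNet whose spatial feature map is linearly projected and flattened into tokens for the Transformer. The choice of ResNet-18, the 256-dimensional token width, and the 3-channel replication of grayscale DRRs are assumptions for illustration, not the paper's exact configuration.

```python
import torch
import torch.nn as nn
from torchvision.models import resnet18

class DualViewBackbone(nn.Module):
    """Extract feature tokens from the two DRR projections (illustrative sketch)."""
    def __init__(self, embed_dim: int = 256):
        super().__init__()
        backbone = resnet18(weights="IMAGENET1K_V1")
        # Keep everything up to the last convolutional stage (drop avgpool / fc).
        self.features = nn.Sequential(*list(backbone.children())[:-2])
        # 1x1 convolution acts as the linear mapping of features to the token width.
        self.proj = nn.Conv2d(512, embed_dim, kernel_size=1)

    def forward(self, xray_a: torch.Tensor, xray_b: torch.Tensor) -> torch.Tensor:
        # Each view: (B, 3, H, W); grayscale DRRs replicated to 3 channels
        # to match the ImageNet-pretrained stem.
        tokens = []
        for view in (xray_a, xray_b):
            fmap = self.proj(self.features(view))           # (B, C, h, w)
            tokens.append(fmap.flatten(2).transpose(1, 2))  # (B, h*w, C)
        return torch.cat(tokens, dim=1)  # concatenated tokens from both views
```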
Network Architecture
The architecture applies a pre-trained ResNet to the DRR images; the resulting features are linearly projected and fed into a Transformer encoder and decoder (Figure 3 and Figure 4). This design aggregates 2D information into 3D more efficiently and effectively than traditional CNN-based methods.
Figure 3: Structure of the ResNet as a backbone.
Figure 4: Structure of the Transformer.
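Below is a hedged sketch of the voxel-space search idea: queried 3D coordinates are embedded as decoder queries that cross-attend to the X-ray feature tokens, and each query is regressed to a voxel intensity. Layer counts, head counts, and dimensions are assumptions, not the paper's exact settings.

```python
import torch
import torch.nn as nn

class VoxelQueryTransformer(nn.Module):
    """Embed queried voxel coordinates, attend to image tokens, regress intensities."""
    def __init__(self, embed_dim: int = 256, num_layers: int = 4, num_heads: int = 8):
        super().__init__()
        self.coord_embed = nn.Linear(3, embed_dim)  # (x, y, z) -> query token
        self.encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(embed_dim, num_heads, batch_first=True),
            num_layers)
        self.decoder = nn.TransformerDecoder(
            nn.TransformerDecoderLayer(embed_dim, num_heads, batch_first=True),
            num_layers)
        self.to_intensity = nn.Linear(embed_dim, 1)  # query token -> voxel value

    def forward(self, image_tokens: torch.Tensor, coords: torch.Tensor) -> torch.Tensor:
        # image_tokens: (B, N, C) from the dual-view backbone
        # coords:       (B, V, 3) normalized voxel coordinates in [0, 1]
        memory = self.encoder(image_tokens)
        queries = self.coord_embed(coords)
        decoded = self.decoder(queries, memory)
        return self.to_intensity(decoded).squeeze(-1)  # (B, V) voxel intensities
```

In such a scheme, reconstructing a full volume would amount to querying every coordinate of the target grid (e.g., in chunks of a few thousand voxels) and reshaping the predicted intensities back into 3D.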
Results and Discussion
Evaluation of XTransCT on a 50-patient dataset and on the LIDC-IDRI and LNDb datasets shows consistent improvements in SSIM, PSNR, and Dice over existing models such as X2CT and X2CTGAN (Figure 5, Figure 6, and Figure 7).
Figure 5: Comparative experiment on the 50-patient dataset.
Figure 6: Comparative experiments on the LIDC-IDRI Dataset.
Figure 7: Generalization of the method is verified using comparative experiments on the LNDb Dataset.
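For reference, this is one plausible way the reported metrics could be computed on a reconstructed volume against its ground-truth CT; the binarization threshold for Dice and the normalization to [0, 1] are assumptions, and the paper's exact evaluation protocol may differ.

```python
import numpy as np
from skimage.metrics import structural_similarity, peak_signal_noise_ratio

def dice_score(pred: np.ndarray, target: np.ndarray, threshold: float = 0.5) -> float:
    """Dice overlap between binarized volumes; the threshold is an assumption."""
    p, t = pred > threshold, target > threshold
    intersection = np.logical_and(p, t).sum()
    return 2.0 * intersection / (p.sum() + t.sum() + 1e-8)

def evaluate(pred_ct: np.ndarray, gt_ct: np.ndarray) -> dict:
    """pred_ct and gt_ct are 3D volumes normalized to [0, 1]."""
    return {
        "ssim": structural_similarity(gt_ct, pred_ct, data_range=1.0),
        "psnr": peak_signal_noise_ratio(gt_ct, pred_ct, data_range=1.0),
        "dice": dice_score(pred_ct, gt_ct),
    }
```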
The performance advantage is most notable in speed: each 3D image is reconstructed in merely 44 ms, a stark contrast to previous 3D-convolution-dependent algorithms that require significantly more time. This rapid reconstruction meets the latency requirements of real-time IGRT.
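As a side note, a per-volume latency figure of this kind is typically measured along the following lines; the warm-up and run counts here are illustrative, and the model and inputs are placeholders rather than the authors' benchmark code.

```python
import time
import torch

def measure_latency(model: torch.nn.Module, example_inputs: tuple,
                    n_warmup: int = 10, n_runs: int = 100) -> float:
    """Average per-volume inference time in milliseconds (GPU-synchronized)."""
    model.eval()
    with torch.no_grad():
        for _ in range(n_warmup):            # warm-up to stabilize kernels and caches
            model(*example_inputs)
        if torch.cuda.is_available():
            torch.cuda.synchronize()
        start = time.perf_counter()
        for _ in range(n_runs):
            model(*example_inputs)
        if torch.cuda.is_available():
            torch.cuda.synchronize()
    return (time.perf_counter() - start) * 1000.0 / n_runs
```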
Conclusion
XTransCT stands out in medical imaging by substantially reducing reconstruction times while maintaining high image quality and structural fidelity. By leveraging Transformers to interpret sparse X-ray projections, the method offers a promising direction for fast, low-radiation imaging in clinical applications. Future work could extend the approach to other imaging modalities and further refine voxel-space interactions for finer detail recovery.