PoCaPNet: A Novel Approach for Surgical Phase Recognition Using Speech and X-Ray Images (2305.15993v1)

Published 25 May 2023 in cs.HC

Abstract: Surgical phase recognition is a challenging and necessary task for the development of context-aware intelligent systems that can support medical personnel for better patient care and effective operating room management. In this paper, we present a surgical phase recognition framework that employs a Multi-Stage Temporal Convolution Network using speech and X-Ray images for the first time. We evaluate our proposed approach using our dataset that comprises 31 port-catheter placement operations and report 82.56 \% frame-wise accuracy with eight surgical phases. Additionally, we investigate the design choices in the temporal model and solutions for the class-imbalance problem. Our experiments demonstrate that speech and X-Ray data can be effectively utilized for surgical phase recognition, providing a foundation for the development of speech assistants in operating rooms of the future.

Authors (6)

Kubilay Can Demir (6 papers)
Tobias Weise (8 papers)
Matthias May (3 papers)
Axel Schmid (2 papers)
Andreas Maier (394 papers)
Seung Hee Yang (18 papers)

Citations (1)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

PoCaPNet: A Novel Approach for Surgical Phase Recognition Using Speech and X-Ray Images (2305.15993v1)

Summary

Related Papers