Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PoCaPNet: A Novel Approach for Surgical Phase Recognition Using Speech and X-Ray Images (2305.15993v1)

Published 25 May 2023 in cs.HC

Abstract: Surgical phase recognition is a challenging and necessary task for the development of context-aware intelligent systems that can support medical personnel for better patient care and effective operating room management. In this paper, we present a surgical phase recognition framework that employs a Multi-Stage Temporal Convolution Network using speech and X-Ray images for the first time. We evaluate our proposed approach using our dataset that comprises 31 port-catheter placement operations and report 82.56 \% frame-wise accuracy with eight surgical phases. Additionally, we investigate the design choices in the temporal model and solutions for the class-imbalance problem. Our experiments demonstrate that speech and X-Ray data can be effectively utilized for surgical phase recognition, providing a foundation for the development of speech assistants in operating rooms of the future.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Kubilay Can Demir (6 papers)
  2. Tobias Weise (8 papers)
  3. Matthias May (3 papers)
  4. Axel Schmid (2 papers)
  5. Andreas Maier (394 papers)
  6. Seung Hee Yang (18 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.