SegmentOR: Obtaining Efficient Operating Room Semantics through Temporal Propagation.

Technical University of Munich, Germany
*Contributed Equally

SegmentOR enables Surgical Operating Room segmentation through sparse one-click annotations. Temporal consistency significantly improves performance and robustness while maininting low annotation burden.

Abstract

The digitization of surgical operating rooms (OR) has gained significant traction in the scientific and medical communities. However, existing deep-learning methods for operating room recognition tasks still require substantial quantities of annotated data. In this paper, we introduce a method for weakly-supervised semantic segmentation for surgical operating rooms. Our method operates directly on 4D point cloud sequences from multiple ceiling-mounted RGB-D sensors and requires less than 0.01% of annotated data. This is achieved by incorporating a self-supervised temporal prior, enforcing semantic consistency in 4D point cloud video recordings. We show how refining these priors with learned semantic features can increase segmentation mIoU to 10% above existing works, achieving higher segmentation scores than baselines that use four times the number of labels. Furthermore, the 3D semantic predictions from our method can be projected back into 2D images; we establish that these 2D predictions can be used to improve the performance of existing surgical phase recognition methods. Our method shows promise in automating 3D OR segmentation with a 20 times lower annotation cost than existing methods, demonstrating the potential to improve surgical scene understanding systems.

Pipeline

SegmentOR Pipeline
SegmentOR extracts semantic and class-specific features from a sequence of point clouds (a). Sparse labels (in red) are expanded to their nearest supervoxel cluster (b) and used as supervision for learning the segmentation task. In contrast to previous works, we propose to incorporate a prior to establish a temporal consistency between the pooled semantic features in a point cloud sequence (c), enabling spatiotemporal pseudo-label propagation across timesteps.

BibTeX


          @inproceedings{bastian2023segmentor,
            title={SegmentOR: Obtaining Efficient Operating Room Semantics Through Temporal Propagation},
            author={Bastian, Lennart and Derkacz-Bogner, Daniel and Wang, Tony D and Busam, Benjamin and Navab, Nassir},
            booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
            pages={57--67},
            year={2023},
            organization={Springer}
          }
        
This website is hosted on GitHub Pages. GitHub may collect some of your data, such as your IP address and user agent, for analytics and security purposes. The data may be transferred to servers in the United States or other countries outside of the European Economic Area. We do not control GitHub's data collection and processing activities. By using this website, you consent to any data collection and processing by GitHub under their terms and privacy policy. For more information, please read their Privacy Policy.