Academic Library

Question 1

A Multi-View Pipeline and Benchmark Dataset for 3D Hand Pose Estimation in Surgery

Answer

Fischer, V., Magdaleno, A., Calek, A., Cavalcanti, N., Hoffman, N., Germann, C., Wüthrich., J., Krähenmann, M., Farshad, M., Fürnstahl, P. & Calvet, L..

arXiv (2026)

The study presents a robust multi-view pipeline for 3D hand pose estimation in surgical environments, designed to operate without domain-specific fine-tuning and relying solely on off-the-shelf pretrained models. By combining person detection, whole-body pose estimation, 2D hand keypoint prediction, and constrained 3D optimization, the approach addresses challenges such as occlusions, intense lighting, and uniform glove appearance. In addition, a large-scale surgical benchmark dataset with over 68,000 frames and 3,000 annotated hand poses is introduced. Quantitative evaluation shows substantial improvements over baselines, establishing a strong foundation for future research in surgical computer vision.

Academic Library

Scientific Evidence for Surgical Innovation

2026

A Multi-View Pipeline and Benchmark Dataset for 3D Hand Pose Estimation in Surgery

Multi-view Surface Reconstruction Using Normal and Reflectance Cues

RocSync: Millisecond-Accurate Temporal Synchronization for Heterogeneous Camera Systems

UltraBoneUDF: Self-supervised Bone Surface Reconstruction from Ultrasound Based on Neural Unsigned Distance Functions

2025

A Modular Edge Device Network for Surgery Digitalization

A novel augmented reality-based simulator for enhancing orthopedic surgical training

Acquiring submillimeter-accurate multi-task vision datasets for computer-assisted orthopedic surgery

ArthroPhase: a novel dataset and method for phase recognition inarthroscopic video

Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction

Automatic multi-view X-ray/CT registration using bone substructure contours

Intra-, Epidural And Intracranial Pressure Changes During Interlaminar Endoscopy, With and Without Dural Tear

Intraoperative 3D Reconstruction from Sparse, Arbitrarily Posed Real X-rays

Localising under the drape: proprioception in the era of distributed surgical robotic system

NeuralBoneReg: A Novel Self-Supervised Method for Robust and Accurate Multi-Modal Bone Surface Registration

Sound Source Localization for Spatial Mapping of Surgical Actions in Dynamic Scenes

The importance of the posterior osteoligamentous complex of the lumbar spine: dogma changing biomechanical insights

Towards Egocentric Understanding of Surgery

2024

Creating a Digital Twin of Spinal Surgery: A Proof of Concept

Domain adaptation strategies for 3D reconstruction of the lumbar spine using real fluoroscopy data

Spatial context awareness in surgery through sound source localization

Spinal navigation with AI-driven 3D-reconstruction of fluoroscopy images: an ex-vivo feasibility study

The journey of FAROS from technical design to in-vivo animal validation

Virtual reality for immersive education in orthopedic surgery digital twins

2023

Marker-less Multi-view 6DoF Pose Estimation of Surgical Instruments

Translation of medical AR research into clinical practice