Robust Hand Detection Algorithm for Medical Settings: RoHan

Saturday 08 March 2025


The development of hand detection algorithms has been a long-standing challenge in computer vision research, particularly in medical settings where accurate and robust detection is crucial for various applications such as surgical skill assessment and patient safety monitoring. Recent advancements have shown promising results, but existing methods often struggle to generalize across different environments and camera conditions.


Enter RoHan, a novel approach that leverages advanced semi-supervised domain adaptation techniques to tackle the challenges of hand detection in operating rooms (ORs). By introducing artificial gloves into public datasets and incorporating an iterative prediction refinement process, RoHan aims to improve model performance and adaptability across diverse medical settings.


The researchers’ approach begins with pre-training a hand detection model using a combination of publicly available datasets, including EgoHands, OneHand10K, HaDR, and Assembly101. Each dataset is augmented with artificial gloves, which helps bridge the domain gap between surgical and non-surgical environments. The pre-trained model is then fine-tuned on two benchmark datasets: Enterotomy Repair Simulator (ERS) and Saphenous Vein Graft Harvesting (SVGH).


The ERS dataset features simulated bowel enterotomy repair procedures conducted by surgeons, while SVGH captures saphenous vein graft harvesting operations. Both datasets were captured using a camera mounted on a tripod, providing a unique perspective on hand movements during surgical procedures.


RoHan’s iterative prediction refinement process is designed to eliminate potential errors in the initial pseudo-labels generated by the pre-trained model. By leveraging temporal characteristics of video data, the algorithm filters out low-quality predictions and refines the model through multiple cycles. This approach allows RoHan to adapt to the unique conditions of each OR environment.


Experiments demonstrate that RoHan achieves superior performance compared to state-of-the-art hand detection models in both ERS and SVGH datasets. The algorithm’s ability to generalize across different camera angles, lighting conditions, and glove colors sets a new benchmark for hand detection in medical settings.


The potential applications of RoHan are vast, extending beyond surgical skill assessment to other areas such as 3D hand reconstruction, tool detection, and patient safety monitoring. As the field continues to evolve, researchers can build upon RoHan’s innovations to develop more sophisticated computer vision algorithms that improve patient outcomes and enhance medical care.


RoHan’s success highlights the importance of domain adaptation techniques in overcoming the challenges of hand detection in medical settings.


Cite this article: “Robust Hand Detection Algorithm for Medical Settings: RoHan”, The Science Archive, 2025.


Computer Vision, Hand Detection, Domain Adaptation, Semi-Supervised Learning, Medical Settings, Surgical Skill Assessment, Patient Safety Monitoring, 3D Reconstruction, Tool Detection, Artificial Gloves


Reference: Roi Papo, Sapir Gershov, Tom Friedman, Itay Or, Gil Bolotin, Shlomi Laufer, “RoHan: Robust Hand Detection in Operation Room” (2025).


Leave a Reply