logo
ResearchBunny Logo
Development, deployment and scaling of operating room-ready artificial intelligence for real-time surgical decision support

Medicine and Health

Development, deployment and scaling of operating room-ready artificial intelligence for real-time surgical decision support

S. Protserov, J. Hunter, et al.

This groundbreaking research by Sergey Protserov and colleagues tackles the challenges of generalizability and scalability in surgical guidance systems. They present a real-time, equipment-agnostic framework for laparoscopic cholecystectomy that shows promising performance metrics and operates seamlessly even on low-bandwidth connections.

00:00
00:00
~3 min • Beginner • English
Abstract
Deep learning for computer vision can be leveraged for interpreting surgical scenes and providing surgeons with real-time guidance to avoid complications. However, neither generalizability nor scalability of computer-vision-based surgical guidance systems have been demonstrated, especially to geographic locations that lack hardware and infrastructure necessary for real-time inference. We propose a new equipment-agnostic framework for real-time use in operating suites. Using laparoscopic cholecystectomy and semantic segmentation models for predicting safe/dangerous ("Go"/"No-Go") zones of dissection as an example use case, this study aimed to develop and test the performance of a novel data pipeline linked to a web-platform that enables real-time deployment from any edge device. To test this infrastructure and demonstrate its scalability and generalizability, lightweight U-Net and SegFormer models were trained on annotated frames from a large and diverse multicenter dataset from 136 institutions, and then tested on a separate prospectively collected dataset. A web-platform was created to enable real-time inference on any surgical video stream, and performance was tested on and optimized for a range of network speeds. The U-Net and SegFormer models respectively achieved mean Dice scores of 57% and 60%, precision 45% and 53%, and recall 82% and 75% for predicting the Go zone, and mean Dice scores of 76% and 76%, precision 68% and 68%, and recall 92% and 92% for predicting the No-Go zone. After optimization of the client-server interaction over the network, we deliver a prediction stream of at least 60 fps and with a maximum round-trip delay of 70 ms for speeds above 8 Mbps. Clinical deployment of machine learning models for surgical guidance is feasible and cost-effective using a generalizable, scalable and equipment-agnostic framework that lacks dependency on hardware with high computing performance or ultra-fast internet connection speed.
Publisher
npj Digital Medicine
Published On
Sep 03, 2024
Authors
Sergey Protserov, Jaryd Hunter, Haochi Zhang, Pouria Mashouri, Caterina Masino, Michael Brudno, Amin Madani
Tags
surgical guidance
machine learning
real-time inference
computer vision
laparoscopic cholecystectomy
U-Net
SegFormer
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny