logo
ResearchBunny Logo
A vision transformer for decoding surgeon activity from surgical videos

Medicine and Health

A vision transformer for decoding surgeon activity from surgical videos

D. Kiyasseh, R. Ma, et al.

Discover the cutting-edge machine learning system, SAIS, developed by a talented team including Dani Kiyasseh, Runzhuo Ma, and others, which decodes surgical activity from robotic surgery videos with remarkable accuracy. This innovative tool provides vital insights into surgical skills and can potentially transform surgeon feedback and improvement methods.

00:00
00:00
~3 min • Beginner • English
Abstract
The intraoperative activity of a surgeon has substantial impact on postoperative outcomes. However, for most surgical procedures, the details of intraoperative surgical actions, which can vary widely, are not well understood. Here we report a machine learning system leveraging a vision transformer and supervised contrastive learning for the decoding of elements of intraoperative surgical activity from videos commonly collected during robotic surgeries. The system accurately identified surgical steps, actions performed by the surgeon, the quality of these actions and the relative contribution of individual video frames to the decoding of the actions. Through extensive testing on data from three different hospitals located in two different continents, we show that the system generalizes across videos, surgeons, hospitals and surgical procedures, and that it can provide information on surgical gestures and skills from unannotated videos. Decoding intraoperative activity via accurate machine learning systems could be used to provide surgeons with feedback on their operating skills, and may allow for the identification of optimal surgical behaviour and for the study of relationships between intraoperative factors and postoperative outcomes.
Publisher
Nature Biomedical Engineering
Published On
Mar 30, 2023
Authors
Dani Kiyasseh, Runzhuo Ma, Taseen F. Haque, Brian J. Miles, Christian Wagner, Daniel A. Donoho, Animashree Anandkumar, Andrew J. Hung
Tags
machine learning
surgical activity
robotic surgery
vision transformer
contrastive learning
intraoperative insights
surgeon skills
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny