logo
Loading...
Vision-language foundation model for echocardiogram interpretation

Medicine and Health

Vision-language foundation model for echocardiogram interpretation

M. Christensen, M. Vukadinovic, et al.

Discover the groundbreaking research by Matthew Christensen, Milos Vukadinovic, Neal Yuan, and David Ouyang on EchoCLIP, a cutting-edge model for echocardiography that elevates cardiac imaging interpretation. With its remarkable ability to assess cardiac function and identify devices, this model is set to transform how clinicians work with echocardiograms.... show more
Abstract
The development of robust artificial intelligence models for echocardiography has been limited by the availability of annotated clinical data. Here, to address this challenge and improve the performance of cardiac imaging models, we developed EchoCLIP, a vision-language foundation model for echocardiography, that learns the relationship between cardiac ultrasound images and the interpretations of expert cardiologists across a wide range of patients and indications for imaging. After training on 1,032,975 cardiac ultrasound videos and corresponding expert text, EchoCLIP performs well on a diverse range of benchmarks for cardiac image interpretation, despite not having been explicitly trained for individual interpretation tasks. EchoCLIP can assess cardiac function (mean absolute error of 7.1% when predicting left ventricular ejection fraction in an external validation dataset) and identify implanted intracardiac devices (area under the curve (AUC) of 0.84, 0.92 and 0.97 for pacemakers, percutaneous mitral valve repair and artificial aortic valves, respectively). We also developed a long-context variant (EchoCLIP-R) using a custom tokenizer based on common echocardiography concepts. EchoCLIP-R accurately identified unique patients across multiple videos (AUC of 0.86), identified clinical transitions such as heart transplants (AUC of 0.79) and cardiac surgery (AUC 0.77) and enabled robust image-to-text search (mean cross-modal retrieval rank in the top 1% of candidate text reports). These capabilities represent a substantial step toward understanding and applying foundation models in cardiovascular imaging for preliminary interpretation of echocardiographic findings.
Publisher
Nature Medicine
Published On
May 01, 2024
Authors
Matthew Christensen, Milos Vukadinovic, Neal Yuan, David Ouyang
Tags
EchoCLIP
echocardiography
cardiac imaging
foundation model
cardiac ultrasound
image interpretation
clinical application
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny