
Medicine and Health
A framework for human evaluation of large language models in healthcare derived from literature review
T. Y. C. Tam, S. Sivarajkumar, et al.
This study by Thomas Yu Chow Tam, Sonish Sivarajkumar, and colleagues delves into the critical evaluation of Large Language Models in healthcare. It highlights the need for robust human evaluation methodologies and proposes the innovative QUEST framework to bridge existing gaps, ensuring safer and more effective AI applications in health.
Playback language: English
Related Publications
Explore these studies to deepen your understanding of the subject.