Evaluation of large language models on mental health: from knowledge test to illness diagnosis
Medicine and Health | Frontiers in Psychiatry


Y. Xu, Z. Fang, et al.

This study tests large language models on Chinese mental health tasks, evaluating 15 state-of-the-art LLMs (e.g., DeepSeek-R1, GPT-4.1, QwQ) on knowledge and diagnostic benchmarks (Dreaddit, SDCNL, CAS exam). It highlights the top performers and offers guidance for selecting and improving models in sensitive mental health scenarios.