Computer Science
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
W. Chiang, L. Zheng, et al.
Discover Chatbot Arena, an open, crowdsourced platform that evaluates Large Language Models via pairwise human-preference comparisons, powered by over 240K votes and rigorous statistical ranking — research conducted by Wei-Lin Chiang, Lianmin Zheng, Ying Sheng, Anastasios N. Angelopoulos, Tianle Li, Dacheng Li, Banghua Zhu, Hao Zhang, Michael I. Jordan, Joseph E. Gonzalez, and Ion Stoica.
~3 min • Beginner • English
Related Publications
Explore these studies to deepen your understanding of the subject.

